ISSUES REGARDING THE CONVERSION PROCESS

PHP must be built with SSL support in order to use the https protocol; please check this before reporting
problems with pages accessed via https.

Note that slow or unstable sites may sometimes cause part of a page not to be rendered because of a lost connection.

KNOWN PROBLEM: The script sometimes hangs indefinitely (Apache 2, Windows, PHP 4). The bug is not reproducible;
after restarting Apache the same page is processed normally. I doubt that it is a problem in the script itself.

ISSUES REGARDING GENERATED FILES

Generated files should be CLOSE to the pages you view in the browser, but not EXACT, as the font metrics used
in generated files may differ from the ones used by the browser.

PDF generated using PDFLIB may SLIGHTLY differ from the PS/PS2PDF output, as these output methods use
different fonts; nevertheless, serious differences between the generated outputs should be considered a bug.

PS files take a significant amount of time (compared to PDFLIB output) to show up in the viewer
(5-15 seconds on average, up to several minutes for composite files).

KNOWN PROBLEM: PS files generated from HTML pages containing a lot of text render slowly.

KNOWN PROBLEM: sometimes Ghostscript fails to convert a PS file to PDF even though GSView or another PS viewer
can still render it. It is important to distinguish between such files and files which cannot be converted at all.

Note that transparent PNGs using an alpha channel are not supported due to the limitations of Ghostscript/PDFLIB;
the script makes an attempt to draw such images, but the results can still be bad.
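
When a page renders badly, it can help to check whether one of its images falls into the unsupported
alpha-channel category. The following is a minimal sketch in Python, not part of the script itself (which
is written in PHP). It reads the colour type from the PNG IHDR header; the function name
png_has_alpha_channel is hypothetical. Palette images that carry transparency in a tRNS chunk are not
detected by this check.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
# PNG colour types that include a full alpha channel:
# 4 = greyscale with alpha, 6 = truecolour with alpha.
ALPHA_COLOR_TYPES = {4, 6}


def png_has_alpha_channel(data: bytes) -> bool:
    """Return True if the PNG data declares an alpha channel in its IHDR chunk."""
    # Layout: 8-byte signature, 4-byte chunk length, 4-byte chunk type "IHDR",
    # then width (4), height (4), bit depth (1), colour type (1), ...
    if len(data) < 26 or data[:8] != PNG_SIGNATURE or data[12:16] != b"IHDR":
        raise ValueError("not a PNG file (bad signature or missing IHDR)")
    _width, _height, _bit_depth, color_type = struct.unpack(">IIBB", data[16:26])
    return color_type in ALPHA_COLOR_TYPES
```

Only the first 26 bytes of the file are needed, so the check can be run as
png_has_alpha_channel(open(path, "rb").read(26)) without loading the whole image.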

DATA REQUIRED TO DESCRIBE BUGS/PROBLEMS

- OS name/version
- Apache (or other HTTP server) version
- PHP version
- whether the native PHP or the ActiveLink DOM XML extension is used
- GD library version (and whether PHP uses the bundled or a standalone GD)
- PDFLIB version (in case of errors in PDFLIB output)
- Ghostscript version
- settings used to convert the file
- HTML code that caused the error (preferably reduced to the simplest possible case)
- whether the problem also appears with the other output types (PDFLIB output? Plain PS output?)
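
The checklist above could be turned into a small report template that flags anything left out. This is
only an illustrative Python sketch; the field keys and the function name format_bug_report are
assumptions made for the example and are not part of the script.

```python
# (key, label) pairs mirroring the checklist; the keys are hypothetical names.
REQUIRED_FIELDS = [
    ("os", "OS name/version"),
    ("http_server", "HTTP server name/version"),
    ("php", "PHP version"),
    ("dom_xml", "DOM XML extension (native PHP or ActiveLink)"),
    ("gd", "GD library version (bundled or standalone)"),
    ("pdflib", "PDFLIB version (for PDFLIB output errors)"),
    ("ghostscript", "Ghostscript version"),
    ("settings", "Conversion settings"),
    ("html", "HTML code that caused the error"),
    ("other_outputs", "Other output types affected"),
]


def format_bug_report(values: dict):
    """Return (report_text, missing_labels) for the supplied field values."""
    lines = []
    missing = []
    for key, label in REQUIRED_FIELDS:
        value = str(values.get(key, "")).strip()
        if not value:
            # Keep the entry in the report so the gap is visible to the reader.
            missing.append(label)
            value = "(not provided)"
        lines.append(f"- {label}: {value}")
    return "\n".join(lines), missing
```

Calling format_bug_report({"os": "Windows XP", "php": "4.4.0"}) returns the report text together with
the labels of the eight fields that still need to be filled in.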

LIST OF SITES USED DURING THE TESTING PROCESS

Note that some sites use the client IP address when generating the HTML page (a good example is
www.google.com), possibly making the page content and layout different when it is accessed from different servers.

The meaning of the marks below:

COOKIES/REFERRER - the site uses cookies or the 'referrer string' to identify the user; sometimes this may cause
    an infinite redirect loop.
USER-AGENT - the site checks the user-agent string and generates pages depending on its value; the rendered page
    layout may differ from the layout seen in the browser (though it is usually close to it).
JAVASCRIPT - part of the page is generated by JavaScript; as the script does not process JavaScript,
    the layout of the page will differ from the one in the browser; if you want to compare the results,
    disable JavaScript in your browser.
WWW - prepend the 'www.' part to the domain name in order to process the page; without 'www' you'll
    get the "cannot process" message, as not all HTTP redirect types are supported by the script.
N/A - not tested; the site has been unavailable for the last several days.
SLOW - the generated file may take 5-10 minutes to be rendered completely.
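
The WWW workaround can be applied automatically before a URL is handed to the script. The following is a
minimal Python sketch under stated assumptions: the helper name with_www is hypothetical, a missing scheme
is assumed to mean http, and no attempt is made to skip IP addresses or hosts that have no 'www' variant.

```python
from urllib.parse import urlsplit, urlunsplit


def with_www(url: str) -> str:
    """Return url with 'www.' prepended to the host unless it is already there."""
    if "://" not in url:
        url = "http://" + url  # assumption: bare domains default to http
    parts = urlsplit(url)
    # netloc has the form [userinfo@]host[:port]; keep any userinfo untouched.
    userinfo, at, hostport = parts.netloc.rpartition("@")
    if not hostport or hostport.lower().startswith("www."):
        return url
    netloc = userinfo + at + "www." + hostport
    return urlunsplit(parts._replace(netloc=netloc))
```

For the sites marked WWW in the list, with_www("falkag.net") gives "http://www.falkag.net", while a URL
that already starts with 'www.' is returned unchanged.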

247realmedia.com - processed
888.com - processed (COOKIES/REFERRER - processed local copy saved from Firefox)
abetterinternet.com - processed
alphadg.com - processed
aol.com - processed (USER-AGENT)
bbc.co.uk - processed
benews.net - processed
bestoffersnetworks.com - processed
casalemedia.com - processed (JAVASCRIPT)
cnn.com - processed
cra-arc.gc.ca/menu-e.html - processed (JAVASCRIPT)
crux.nu - processed
cs.wisc.edu/~ghost/ - processed
download.com - processed (SLOW)
ebay.com - processed (USER-AGENT)
ewizard.com - processed
exactsearch.net - processed
exitexchange.com - processed
falkag.net - processed (WWW)
geocities.com - processed
go.com - processed
google.com - processed
google.com/about.html - processed
google.com/froogle - processed
google.com/services/ - processed
hamster.sazco.net - processed
internet-optimizer.com - processed
jakpsatweb.cz/css/css-vertical-center-solution.html - processed (SLOW)
johnlewis.com - processed
microsoft.com - processed
msn.com - processed
myblog.de - processed
myway.com - processed
mywebsearch.com - processed (JAVASCRIPT)
net-offers.net - (N/A)
netscape.com - processed (JAVASCRIPT)
netvenda.com - processed
offeroptimizer.com - processed
onet.pl - processed
papajohns.com - processed
partypoker.com - processed
passport.com - (N/A)
php.net - processed (SLOW)
pilger.carlton.com - file_get_contents('http://pilger.carlton.com') returns empty string - PHP problem
python.org/~guido/ - processed
realmedia.com - processed
rentacoder.com - processed
revenue.net - processed
sage.com/local/regionNorthAmerica.aspx - processed
searchscout.com - processed (WWW)
smarty.php.net - processed
stallman.org - processed (SLOW)
thefacebook.com - processed
tickle.com - processed
trafficmp.com - processed
tufat.com - processed
user.it.uu.se/~jan/html2ps.html - processed
vianet.com.pl - (N/A)
whenu.com - processed
whitehouse.gov - processed
yahoo.com - processed (USER-AGENT)
zango.com - processed