Subversion-Projekte lars-tiefland.php_share

Revision

Details | Letzte Änderung | Log anzeigen | RSS feed

Revision Autor Zeilennr. Zeile
1 lars 1
ISSUES REGARDING THE CONVERSION PROCESS
2
 
3
SSL should be supported by PHP in order to use https protocol; please check it before reporting probles with
4
page accessible via https.
5
 
6
Note that slow/unstable sites may sometimes cause part of a page to be not rendered due a connection loss.
7
 
8
KNOWN PROBLEM: Script sometimes hangs indefinitely (Apache 2, Windows, PHP 4). Bug is not replicable,
9
after restarting Apache the same page is processed normally. I doubt is it is the script problem.
10
 
11
ISSUES REGARDING GENERATED FILES
12
 
13
Generated files should be CLOSE to the pages you view in browser, but not EXACT, as fonts metrics used
14
in generated files may differ from ones used by browser.
15
 
16
PDF generated using PDFLIB may SLIGHTLY differ from the PS/PS2PDF output as these output methods use
17
different fonts; nevertheless, serious differences between generated outputs should be considered as a bug.
18
 
19
PS files will take a significiant amount of time (compared to PDFLIB) to show up in the viewer
20
(5-15 seconds in average, up to several minutes on the composite files).
21
 
22
KNOWN PROBLEM: PS files generated from HTML pages containing a lot of text are rendered slow.
23
 
24
KNOWN PROBLEM: sometimes Ghostscript fails to convert PS files to PDF while still can render it using GSView
25
or other PS viewer. It is important to distinguish between such files and files which cannot be converted at all.
26
 
27
Note that transparent PNGs using alpha channel are not supported due the limitations of Ghostscript/PDFLIB;
28
script makes an attempt to draw such images, but still resuls can be bad.
29
 
30
DATA REQUIRED TO DESCRIBE BUGS/PROBLEMS
31
 
32
- OS name/version
33
- Apache (or other HTTP server version)
34
- PHP version
35
- if native PHP or ActiveLink DOM XML extension used
36
- GD library version (and if PHP uses bundled or standalone GD)
37
- PDFLIB version (in case of errors in PDFLIB output)
38
- Ghostscript version
39
- setting used to convert the file
40
- HTML code caused the error (preferrable reduced to the simplest possible case)
41
- if the problem reappear on the other output types (PDFLIB output? Plain PS output?)
42
 
43
LIST OF SITES USED DURING THE TESTING PROCESS
44
 
45
Note that some sites use client IP address information when generating the HTML page (a good example is
46
www.google.com) possibly making the page content and layout different when accessing it from different servers.
47
 
48
The meaning of the marks below:
49
 
50
COOKIES/REFERRER - site uses cookies or 'referrer string' to identify user; sometimes this may cause
51
  an infinite redirect loop.
52
USER-AGENT - site checks the user-agent string and generates pages depending on its value; the rendered page layout
53
  may differ from the page layout viewed in browser (though usually it is close to it)
54
JAVASCRIPT - part of the page is generated by a javascript; as script does not process javascript,
55
  layout of the page will differ from the one in the browser; if you want to compare the results,
56
  disable javascipt in your browser
57
WWW        - prepend www. part to the domain name in order to process page; without 'www' you'll
58
  get the "cannot process" message, as not all HTTP redirect types are supported by the script
59
N/A        - not tested, site was unavailable for the several last days.
60
SLOW       - generated file may take 5-10 minutes to be rendered completely
61
 
62
247realmedia.com                       - processed
63
888.com                                - processed (COOKIES/REFERRER - processed local copy saved from Firefox)
64
abetterinternet.com                    - processed
65
alphadg.com                            - processed
66
aol.com                                - processed (USER-AGENT)
67
bbc.co.uk                              - processed
68
benews.net                             - processed
69
bestoffersnetworks.com                 - processed
70
casalemedia.com                        - processed (JAVASCRIPT)
71
cnn.com                                - processed
72
cra-arc.gc.ca/menu-e.html              - processed (JAVASCRIPT)
73
crux.nu                                - processed
74
cs.wisc.edu/~ghost/                    - processed
75
download.com                           - processed (SLOW)
76
ebay.com                               - processed (USER-AGENT)
77
ewizard.com                            - processed
78
exactsearch.net                        - processed
79
exitexchange.com                       - processed
80
falkag.net                             - processed (WWW)
81
geocities.com                          - processed
82
go.com                                 - processed
83
google.com                             - processed
84
google.com/about.html                  - processed
85
google.com/froogle                     - processed
86
google.com/services/                   - processed
87
hamster.sazco.net                      - processed
88
internet-optimizer.com                 - processed
89
jakpsatweb.cz/css/css-vertical-center-solution.html - processed (SLOW)
90
johnlewis.com                          - processed
91
microsoft.com                          - processed
92
msn.com                                - processed
93
myblog.de                              - processed
94
myway.com                              - processed
95
mywebsearch.com                        - processed (JAVASCRIPT)
96
net-offers.net                         - (N/A)
97
netscape.com                           - processed (JAVASCRIPT)
98
netvenda.com                           - processed
99
offeroptimizer.com                     - processed
100
onet.pl                                - processed
101
papajohns.com                          - processed
102
partypoker.com                         - processed
103
passport.com                           - (N/A)
104
php.net                                - processed (SLOW)
105
pilger.carlton.com                     - file_get_contents('http://pilger.carlton.com') returns empty string - PHP problem
106
python.org/~guido/                     - processed
107
realmedia.com                          - processed
108
rentacoder.com                         - processed
109
revenue.net                            - processed
110
sage.com/local/regionNorthAmerica.aspx - processed
111
searchscout.com                        - processed (WWW)
112
smarty.php.net                         - processed
113
stallman.org                           - processed (SLOW)
114
thefacebook.com                        - processed
115
tickle.com                             - processed
116
trafficmp.com                          - processed
117
tufat.com                              - processed
118
user.it.uu.se/~jan/html2ps.html        - processed
119
vianet.com.pl                          - (N/A)
120
whenu.com                              - processed
121
whitehouse.gov                         - processed
122
yahoo.com                              - processed (USER-AGENT)
123
zango.com                              - processed