[Lynx-dev] Lynx HTML Parsing Bug

From: Isaac Morland
Subject: [Lynx-dev] Lynx HTML Parsing Bug
Date: Wed, 11 Oct 2006 09:37:33 -0400 (EDT)

There is a bug in Lynx's handling of at least some HTML end tags. If whitespace precedes the closing '>' of "</script>" (as in "</script >"), the page is not rendered as expected. See below for the Unix and Lynx versions, the HTML source obtained using -mime_header, and the result of using -dump. If the whitespace is removed, -dump shows the title and line of text as expected.
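To make the expected behaviour concrete, here is a minimal sketch in Python of end-tag matching that tolerates whitespace before the closing '>'. It is not Lynx's code (Lynx is written in C), and the function name strip_script_elements is hypothetical; it only illustrates that "</script >" should close the element just as "</script>" does.

```python
import re

# A script start tag: "<script", optional attributes, then ">".
SCRIPT_START = re.compile(r"<script\b[^>]*>", re.IGNORECASE)
# A script end tag: "</script", optional whitespace (spaces, newlines), then ">".
SCRIPT_END = re.compile(r"</script\s*>", re.IGNORECASE)


def strip_script_elements(html: str) -> str:
    """Return html with every script element removed.

    Hypothetical helper for illustration only; a real parser does far more.
    """
    pieces = []
    pos = 0
    while True:
        start = SCRIPT_START.search(html, pos)
        if start is None:
            # No more script elements: keep the remainder of the document.
            pieces.append(html[pos:])
            break
        pieces.append(html[pos:start.start()])
        end = SCRIPT_END.search(html, start.end())
        if end is None:
            # Unterminated script element: drop everything after the start tag.
            break
        pos = end.end()
    return "".join(pieces)


page = (
    '<script src="/templates/styleswitcher.js" type="text/javascript">\n'
    "</script >\n"
    "<title>Test Page</title>\n"
)
print(strip_script_elements(page))
```

With the test page from this report as input, the title survives because the end tag is recognised despite the space.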

This was detected because I have an HTML-generation library that puts a newline ('\n') before every '>' in order to put the output on multiple lines without ever inserting unwanted whitespace between tags. The W3C validator has no problem with this and I have never observed any other browser to have a problem.
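For readers unfamiliar with the technique, the following is a minimal Python sketch of an emitter that breaks lines before each '>' rather than between tags. It is an assumption about how such a library might work, not the library described above; the function name element and its parameters are hypothetical.

```python
from html import escape


def element(tag, content="", **attrs):
    """Render one element with a newline before every '>'.

    Hypothetical helper: content is assumed to be already-escaped markup.
    The line breaks fall inside the tags, so no whitespace text is ever
    inserted between adjacent elements.
    """
    attr_text = "".join(
        f' {name}="{escape(str(value), quote=True)}"'
        for name, value in attrs.items()
    )
    return f"<{tag}{attr_text}\n>{content}</{tag}\n>"


print(element("h1", "Test Page") + element("p", "This is a test page."))
```

The output spans several lines, yet the end of one element and the start of the next remain directly adjacent.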

Since I'm reporting things I'd like changed in Lynx, I will also mention that I'd really like -mime_header to *always* show exactly what comes back from the server. Currently, if the server response is a 401, the server headers are not shown; the Lynx error message about access being denied is shown instead.

Thanks for putting out Lynx.  It's still useful....

@cpu102[110]% uname -a
SunOS cpu102.cs 5.8 Generic_117350-35 sun4u sparc
@cpu102[111]% lynx -version
Lynx Version 2.8.5dev.7 (06 Jan 2002)
libwww-FM 2.14, SSL-MM 1.4.1, OpenSSL 0.9.7b
Built on solaris2.5 Jul 22 2003 15:32:14

Copyrights held by the University of Kansas, CERN, and other contributors.
Distributed under the GNU General Public License.
See and the online help for more information.

See for information about SSL for Lynx.
See for information about OpenSSL.

@cpu102[112]% lynx -mime_header
HTTP/1.1 200 OK
Date: Wed, 11 Oct 2006 12:44:52 GMT
Server: Apache/1.3.34 Ben-SSL/1.55 (Unix) mod_perl/1.21
Last-Modified: Tue, 10 Oct 2006 12:59:09 GMT
ETag: "c7308d-120-452b991d"
Accept-Ranges: bytes
Content-Length: 288
Connection: close
Content-Type: text/html; charset=utf-8

<script src="/templates/styleswitcher.js" type="text/javascript">
</script >
<title>Test Page</title>

<h1>Test Page</h1>

<p>This is a test page.</p>

@cpu102[113]% lynx -dump


Isaac Morland                   CSCF Web Guru
DC 2554C, x36650                WWW Software Specialist
