[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
bug#37009: EWW Gets Confused on Invalid HTML
From: |
Noam Postavsky |
Subject: |
bug#37009: EWW Gets Confused on Invalid HTML |
Date: |
Tue, 13 Aug 2019 13:55:01 -0400 |
User-agent: |
Gnus/5.13 (Gnus v5.13) Emacs/26.1.92 (windows-nt) |
Nick Daly <nick.m.daly@gmail.com> writes:
> EWW gets confused when facing unescaped less-than-signs.
> Unfortunately, the page does not escape the less-than symbol before "xs"
> on the second line, so the "<-" (and several more characters) aren't
> displayed.
I'm not sure how feasible it will be to fix this at all. Eww relies on
libxml for parsing, and it's not as flexible as a typical web browser:
(with-temp-buffer
(insert "<html>
<body>abc <- xyz<body>
</html>")
(libxml-parse-html-region (point-min) (point-max)))
;=> (html nil (body nil "abc\n"))