[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Lynx-dev] Extract links from html with application/ld+json script
From: |
Super Bonaci |
Subject: |
[Lynx-dev] Extract links from html with application/ld+json script |
Date: |
Sun, 17 Dec 2023 19:31:33 +0000 (UTC) |
Version in use: Lynx Version 2.8.9rel.1 (08 Jul 2018)
Some html pages contain <script type="application/ld+json"> content, for
example:
wget -E 'https://www.twitch.tv/egctv/videos?filter=all&sort=time' -O twitch.html
Wether the html is embedded or not depends on the wget or curl flags which are
used.
The twitch.html sample can be browsed here:
https://controlc.com/9ed7a8bb
https://pastebin.com/87edaepd
Lynx is not able to extract most html links inside the html file.
Since the Lynx version is from 2018 probably that's the cause, being too old
and not supporting new formats.
Could this issue be fixed?
bye.
- [Lynx-dev] Extract links from html with application/ld+json script,
Super Bonaci <=