[Bug-wget] [bug #52705] HTML assets embedding with --page-requisites


From: Darshit Shah
Subject: [Bug-wget] [bug #52705] HTML assets embedding with --page-requisites
Date: Thu, 21 Dec 2017 07:59:46 -0500 (EST)
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:59.0) Gecko/20100101 Firefox/59.0

Follow-up Comment #2, bug #52705 (project wget):

While MHTML was a convenient way to create snapshots of pages, sadly it was
never properly standardized and most popular browsers no longer support it.

WARC has been standardized (as ISO 28500) and is considered the de facto way of
archiving a web page or web site.

Wget supports saving into the WARC format, so you may want to look into using
that.
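
For illustration, here is a minimal sketch of how such an invocation could be assembled. The helper name `warc_command`, the example URL and the output name `snapshot` are assumptions made up for this example; `--page-requisites` and `--warc-file` are existing Wget options. The sketch only builds and prints the command line and does not run Wget.

```python
import shlex


def warc_command(url, warc_name):
    """Build a wget argument list that fetches a page plus its requisites
    and records the traffic in a WARC file (hypothetical helper)."""
    return [
        "wget",
        "--page-requisites",         # also fetch images, CSS, scripts the page needs
        f"--warc-file={warc_name}",  # wget adds the .warc.gz suffix itself
        url,
    ]


# Example values are placeholders, not taken from the bug report.
cmd = warc_command("http://example.com/", "snapshot")
print(shlex.join(cmd))
```

The resulting list could be passed to `subprocess.run` if you want to drive Wget from a script.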

Otherwise, implementing MHTML should not be too hard: it needs just some
postprocessing code in all the places where WARC data is stored. However, none
of the developers currently has time to work on a new feature, so if you could
write a patch, we might review and accept it.

Implementing this as a plugin for Wget would, however, be easier and cleaner.
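
To give a rough idea of what such postprocessing would have to produce, here is a small sketch of the MHTML container itself: a MIME multipart/related message in which every part carries a Content-Location header with its original URL. This is not Wget code (Wget is written in C) and it does not read WARC data; the function name `build_mhtml` and the sample inputs are assumptions for illustration, using only the Python standard library.

```python
from email import encoders
from email.mime.base import MIMEBase
from email.mime.multipart import MIMEMultipart
from email.mime.text import MIMEText


def build_mhtml(page_url, html, assets):
    """Pack an HTML page and its already-downloaded assets into one
    multipart/related message (hypothetical helper).

    assets is a list of (url, mime_type, data_bytes) tuples.
    """
    root = MIMEMultipart("related", type="text/html")

    # The first part is the HTML document itself.
    page = MIMEText(html, "html", "utf-8")
    page["Content-Location"] = page_url
    root.attach(page)

    # Each asset becomes a base64-encoded part labelled with its URL,
    # so references in the HTML can be resolved inside the archive.
    for url, mime_type, data in assets:
        maintype, subtype = mime_type.split("/", 1)
        part = MIMEBase(maintype, subtype)
        part.set_payload(data)
        encoders.encode_base64(part)
        part["Content-Location"] = url
        root.attach(part)

    return root.as_bytes()


# Placeholder inputs; a real tool would take these from the downloaded files.
mhtml = build_mhtml(
    "http://example.com/",
    "<html><body><img src='http://example.com/a.png'></body></html>",
    [("http://example.com/a.png", "image/png", b"\x89PNG")],
)
```

A patch or plugin would need to collect the same three pieces of information for each downloaded requisite: its URL, its content type and its body.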

    _______________________________________________________

Reply to this item at:

  <http://savannah.gnu.org/bugs/?52705>

_______________________________________________
  Message sent via/by Savannah
  http://savannah.gnu.org/



