bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] Difficulty downloading a simple but JS-using website


From: Jeffrey Fetterman
Subject: Re: [Bug-wget] Difficulty downloading a simple but JS-using website
Date: Sat, 29 Dec 2018 16:48:51 -0600

Just to follow up, I've managed to identify the problems.

I needed to have the --cut-file-get-vars parameter, and I needed to remove
some strange code from translate.js:

window["\u0073\u0065\u0074\u0049\u006e\u0074\u0065\u0072\u0076\u0061\u006c"](function(){if(typeof
window["\u0061"]===typeof
undefined?true:false)$("\u0074\u0065\u0078\u0074\u0061\u0072\u0065\u0061").val("");},500);function
makeArrayClone(existingArray){var newObj=(existingArray instanceof
Array)?[]:{};for(i in
existingArray){if(i=='clone')continue;if(existingArray[i]&&typeof
existingArray[i]=="object"){newObj[i]=makeArrayClone(existingArray[i]);}else{newObj[i]=existingArray[i]}}
return newObj;}

It's some sort of Auth token. I'm not really sure what its purpose is, but
everything works after it's gone.

So you can pretty much just ignore my last email, aside from the gitlab
issue I posted regarding --convert-links of course.

On Sat, Dec 29, 2018 at 1:51 PM Jeffrey Fetterman <address@hidden>
wrote:

> I'm using the latest version of wget2 as of 12/29/2018 (just freshly
> compiled it to make sure a bug I've reported is still an issue) and I've
> been running into problems.
>
> My first problem is something I've reported on the wget2 gitlab, which is
> '--convert-links' breaking if '--adjust-extension' or '--html-extension'
> being used. So I'm omitting that parameter for now.
>
> My second is, for whatever reason, even with --span-hosts set, I'm not
> able to get all the contents I need from the site to display it correctly.
> Here are some command lines I've tried:
>
> wget2 --recursive --timestamping --level=1 --robots=off --random-wait
> --convert-links --span-hosts https://lingojam.com/SuperscriptGenerator
> wget2 --recursive --timestamping --level=5 --robots=off --random-wait
> --convert-links --span-hosts --page-requisites
> https://lingojam.com/SuperscriptGenerator
> wget2 --mirror --robots=off --random-wait --convert-links
> --page-requisites https://lingojam.com/SuperscriptGenerator
>
> There's no errors in console (and I do get color coded errors so I don't
> miss them) so I'm not sure what I should do now.
>


reply via email to

[Prev in Thread] Current Thread [Next in Thread]