bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] Faking browser?


From: Gilles
Subject: Re: [Bug-wget] Faking browser?
Date: Tue, 4 Apr 2017 00:29:46 +0200
User-agent: Mozilla/5.0 (Windows NT 6.1; rv:45.0) Gecko/20100101 Thunderbird/45.8.0

Problem solved: The URLs in the homepage pointed to a remote site. I simply needed to add the following to get it work:

--span-hosts -D www.remote.site.com

On 03/04/2017 17:04, Gilles wrote:
Hello

1. Using the following command + wget.ini, wget fails pretending to be a browser: It downloads an empty index.html.tmp.tmp file instead of index.htm, and stops there:

=== wget.ini
hsts=0
robots = off
header = Accept-Language: en-us,en;q=0.5
header = Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
header = Connection: keep-alive
user_agent = Mozilla/5.0 (X11; Fedora; Linux x86_64; rv:40.0) Gecko/20100101 Firefox/40.0
referer = /

=== command
wget.exe --spider --recursive --reject jpg,png,gif --accept htm --output-file=spider.log http://www.acme.com

What else could I try to get wget to pass muster?

Thank you.







reply via email to

[Prev in Thread] Current Thread [Next in Thread]