wget-dev


From: Archit Pandey
Subject: Re: [Wget-dev] wget2 | Restricting domains with host-spanning does not work (#483)
Date: Tue, 22 Oct 2019 06:39:36 +0000



Archit Pandey commented:


Hi @notafunction1!

Thank you for reaching out to the community.

I tried reproducing your results and can confirm that using `-H` and `-D` 
together gives the same results as you've shown.

Upon checking the
[User Manual](https://gnuwget.gitlab.io/wget2/reference/md_wget2.html#Options),
I found the following description for `-D`:
```
-D domain-list, --domains=domain-list

Set domains to be followed. domain-list is a comma-separated list of domains. 
Note that it does not turn on -H.
```
The description for `-H`, on the other hand, is:
```
-H, --span-hosts

Enable spanning across hosts when doing recursive retrieving.
```

I believe that using them together means all hosts are followed, with no 
restriction applied by `-D`.

Hence, I dropped `-H` and used the following command:
```
wget2 -r -w 8 --filter-mime-type="text/html" -a wget_log \
  --domains=*.kedo.gov.cn http://www.kedo.gov.cn
```
This command prevents any domain other than `*.kedo.gov.cn` from being 
followed.
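
If you want to double-check this on your side, one option is to list the distinct hosts that show up in `wget_log` after the run. The snippet below is only a rough sketch: `hosts_in_log` is a hypothetical helper of mine (not part of wget2), and it assumes the log records full `http://` or `https://` URLs, which may not hold for every verbosity setting.

```shell
# Hypothetical helper, not part of wget2: print each distinct host
# that appears in a log file, one per line, sorted.
# Assumption: the log contains full http:// or https:// URLs.
hosts_in_log() {
    # Pull out the scheme and host of every URL, drop the scheme,
    # then collapse duplicates.
    grep -oE 'https?://[A-Za-z0-9.-]+' "$1" \
        | sed -E 's#^https?://##' \
        | sort -u
}
```

Running `hosts_in_log wget_log` should then print only hosts ending in `kedo.gov.cn` if the restriction is working as described.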

Hope that helps! :smile:

-- 
Reply to this email directly or view it on GitLab: 
https://gitlab.com/gnuwget/wget2/issues/483#note_233658236