Re: [Wget-dev] wget2 | Restricting domains with host-spanning does not w
From: Archit Pandey
Subject: Re: [Wget-dev] wget2 | Restricting domains with host-spanning does not work (#483)
Date: Tue, 22 Oct 2019 06:39:36 +0000
Archit Pandey commented:
Hi @notafunction1!
Thank you for reaching out to the community.
I tried reproducing your results and can confirm that using `-H` and `-D`
together gives the same results as you've shown.
Upon checking the [User
Manual](https://gnuwget.gitlab.io/wget2/reference/md_wget2.html#Options) I
found the following description for `-D`:
```
-D domain-list, --domains=domain-list
Set domains to be followed. domain-list is a comma-separated list of domains.
Note that it does not turn on -H.
```
The description for `-H`, on the other hand, reads:
```
-H, --span-hosts
Enable spanning across hosts when doing recursive retrieving.
```
I believe that when the two are used together, `-H` causes all hosts to be
followed and the restriction set by `-D` is not applied.
Hence, I dropped `-H` and used the following command:
```
wget2 -r -w 8 --filter-mime-type="text/html" -a wget_log \
  --domains=*.kedo.gov.cn http://www.kedo.gov.cn
```
Using this command prevents any domains other than `*.kedo.gov.cn` from being
followed.
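To double-check which hosts were actually contacted, one option is to pull the host names out of the log file. The snippet below is only a rough sketch: `list_hosts` is a hypothetical helper (not part of wget2), and it assumes the log written via `-a wget_log` contains full `http://` or `https://` URLs.

```shell
# Hypothetical helper, not part of wget2.
# Prints the unique host names found in http(s) URLs in the given file.
# Assumption: the log contains full URLs such as http://www.kedo.gov.cn/page
list_hosts() {
    # -o prints only the matched part: the scheme plus host, up to the
    # first slash, space, colon, or double quote.
    grep -oE 'https?://[^/ :"]+' "$1" |
        sed -E 's#^https?://##' |   # drop the scheme, keep the host
        sort -u                     # one line per distinct host
}
```

Running `list_hosts wget_log` after the download should then list only hosts under `kedo.gov.cn` if the restriction is working as expected.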
Hope that helps! :smile:
--
Reply to this email directly or view it on GitLab:
https://gitlab.com/gnuwget/wget2/issues/483#note_233658236