bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-wget] Re: Only follow paths with /res/ in them


From: Micah Cowan
Subject: [Bug-wget] Re: Only follow paths with /res/ in them
Date: Wed, 19 Nov 2008 20:57:33 -0800
User-agent: Thunderbird 2.0.0.17 (X11/20080925)

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Oh! Please don't use this list (address@hidden) any more; I'm trying to
get the dotsrc folks to make it go away/forward to bug-wget (I need to
ping 'em on this again). The official list for Wget is now address@hidden

Micah Cowan wrote:
> Brian wrote:
>> I would like to follow all the urls on a site that contain /res/ in the
>> path. I've tried using -I and -A, with values such as res, *res*,
>> */res/*, etc.. Here is an example that downloads pretty much the entire
>> site, rather than what I appear  (to me) to have specified:
> 
>> wget -O- -q http://img.site.org/b/imgboard.html | wget -q -r -l1 -O- -I
>> '*res*' -A '*res*' --force-html -B http://img.site.org/b/ -i-
> 
>> The urls I would like to follow and output to the command line are of
>> the form:
> 
>> http://img.site.org/b/res/97867797.html
> 
> -A isn't useful here: it's applied only against the "filename" portion
> of the URL.
> 
> -I is what you want; the trouble is that the * wildcard doesn't match
> slashes (there's plans to introduce a ** wildcard, probably in 1.13). So
> unfortunately you gotta do -I'res,*/res,*/*/res' etc as needed.
> 

- --
Micah J. Cowan
Programmer, musician, typesetting enthusiast, gamer.
GNU Maintainer: wget, screen, teseq
http://micah.cowan.name/
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iEYEARECAAYFAkkk7j0ACgkQ7M8hyUobTrH+CACbBzcO4vM6qHIumBeDS2ZyAdfq
ONYAnjX7SHAOvEJylkbjjq7IsDXEv+27
=3Hrq
-----END PGP SIGNATURE-----




reply via email to

[Prev in Thread] Current Thread [Next in Thread]