[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-wget] Trouble downloading data from nomads.ncdc.noaa.gov
From: |
Micah Cowan |
Subject: |
Re: [Bug-wget] Trouble downloading data from nomads.ncdc.noaa.gov |
Date: |
Fri, 23 Sep 2011 15:37:46 -0700 |
User-agent: |
Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.21) Gecko/20110831 Thunderbird/3.1.13 |
(09/23/2011 01:14 PM), Kevin Doty wrote:
> I am using the short script below using "wget" to try to download files
> like "narrmonhr-a_221_19790101_2100_000.grb" from the location
> http://nomads.ncdc.noaa.gov/thredds/catalog/narrmonthly/197901/19790101/.
>
> The resulting log file is shown below the script. Apparently it finds the
> files but says they do not meet the accept/reject criteria. I have tried
> with the recursive level and no-parent turned "off" but that did not help.
Sadly, -A and -R do not match the query string portion of a URL
(anything past a "?"). Just the filename portion (which in this case, is
"catalog.html" for every URL you wanted to match against). You can see
this reflected (perhaps a bit cryptically) by the debug messages
explaining the failed match (they all have "(catalog.html)" after they
give the full URL). At some point it is hoped that Wget will have
regular exrpessions support, for matching against the entire URL; for
now I'm afraid there's not much you can do directly with wget to work
around this problem.
--
Micah J. Cowan
http://micah.cowan.name/