[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Bug-wget] non-ascii characters - url conversion
From: |
Miroslav Oujeský |
Subject: |
[Bug-wget] non-ascii characters - url conversion |
Date: |
Fri, 10 Apr 2009 10:11:53 +0200 |
Hello,
I am using wget for recursively downloading whole website (creating
offline version), with the --convert-links option.
I have URL-encoded link which contains non-ascii characters. After the
conversion, the link is not URL-encoded, and resulting file name
contains some "rubbish" characters (I suspect something with UTF-8 =>
iso-8859-1) - this leads to the converted link not working.
The original URL is "/tags/V%C3%BDprodej/"
The converted URL is "../tags/Výprodej/index.html" (in UTF-8)
The name of file (or directory in this case) is shown as "V??prodej"
with ls command
Is there any possibility to let the URL and respective file name be
with %XX encoding after conversion?
(not sure if it is relevant, but I am using wget devel 1.12, because
of the css parsing)
Thanks for any response.
With regards.
Miroslav Oujesky
- [Bug-wget] non-ascii characters - url conversion,
Miroslav Oujeský <=