
[Bug-wget] non-ascii characters - url conversion


From: Miroslav Oujeský
Subject: [Bug-wget] non-ascii characters - url conversion
Date: Fri, 10 Apr 2009 10:11:53 +0200

Hello,

I am using wget to recursively download a whole website (creating an
offline version), with the --convert-links option.

I have a URL-encoded link that contains non-ASCII characters. After the
conversion, the link is no longer URL-encoded, and the resulting file
name contains some "rubbish" characters (I suspect a UTF-8 =>
ISO-8859-1 mix-up), so the converted link does not work.

The original URL is "/tags/V%C3%BDprodej/"
The converted URL is "../tags/Výprodej/index.html" (in UTF-8)
The name of the file (or directory in this case) is shown as "V??prodej"
by the ls command
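
To make the byte-level picture concrete, here is a small Python sketch
(not wget's own code; all variable names are illustrative). It decodes
the original URL and shows that %C3%BD is two bytes: read as UTF-8 they
form the single character "ý", while read as ISO-8859-1 they become two
separate characters. Two non-ASCII bytes would be consistent with ls
printing two "?" characters, although that is only an assumption about
the terminal's locale.

```python
from urllib.parse import quote, unquote_to_bytes

original = "/tags/V%C3%BDprodej/"

# Undo the %XX escapes without assuming any character set.
raw = unquote_to_bytes(original)

# The same bytes interpreted under two different encodings.
as_utf8 = raw.decode("utf-8")          # one character for C3 BD
as_latin1 = raw.decode("iso-8859-1")   # two characters for C3 BD

# Count the bytes outside the ASCII range.
non_ascii_bytes = sum(b > 0x7F for b in raw)

# Percent-encoding the UTF-8 form again restores the original URL.
reencoded = quote(as_utf8)

print(raw)
print(as_utf8)
print(as_latin1)
print(non_ascii_bytes)
print(reencoded)
```

The last step shows that the %XX form can be recovered from the UTF-8
path, so a post-processing pass over the converted links would be one
possible workaround if wget itself cannot keep the escapes.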

Is there any way to keep the URL and the corresponding file name
%XX-encoded after conversion?

(Not sure if it is relevant, but I am using the wget 1.12 development
version because of its CSS parsing.)

Thanks for any response.



With regards.

Miroslav Oujesky



