[Bug-wget] Output format

bug-wget

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-wget] Output format

From:	Gustavo Feijó
Subject:	[Bug-wget] Output format
Date:	Tue, 27 Jan 2009 15:21:57 -0200

Hi there,

I'm new on posting to this list.

I'm building a shell script to search and inform broken links on various domains hosted by my server.

The script is allmost ready, but I'm tied in two points.

It uses the --spider parameter to test links and this is the line wich does all the magic
/usr/bin/wget --header='Accept-Charset: iso-8859-2' -F --base=http://${DOMINIO} --spider -r -nd -nc http://${DOMINIO} -o spider-${DOMINIO}.log --limit-rate=20k --delete-after -b

Now the questions

1) The output file doesn't indicate which page is calling the broken link. Am I missing something?

2) The broken link report shows some hexadecimals characters. Is there a way to prevent wget to decode the originals characters?
Output Exemple: http://${DOMINIO}/images/%22%20+%20imagem[x]%20+%20%22

Thanks in advance and sorry if I wrote to the wrong list.

--
Gustavo Feijó
Linux User #291033
LPI ID #000132840
http://www.gfeijo.eti.br

--
Please consider the protection of the environment
before printing this e-mail.
--

[Prev in Thread]

Current Thread

[Next in Thread]

[Bug-wget] Output format, Gustavo Feijó <=

Prev by Date: Re: [Bug-wget] Kernel 2.6.28.1: [493192.461454] TCP(wget:28098): Application bug, race in MSG_PEEK.
Previous by thread: Re: [Bug-wget] Output filename not same as from browser save...reason?...bug?
Index(es):
- Date
- Thread