[bug-gawk] Use gawk and regex to stip comments from html files regex pro
From: frank ernest
Subject: [bug-gawk] Use gawk and regex to stip comments from html files regex problem
Date: Wed, 29 Oct 2014 22:54:25 +0100
Hello, I'm trying to use the lazy star (*?) to strip comments from HTML files,
but it's not working:
gawk '{ gsub(/<!--.*?-->/, "", $0); print $0}'
I have to use a non-greedy match so that this:
<!-- comment one --> Important text <!-- comment2 -->
does not become an empty line, which is what a greedy match produces because
it removes everything from the first <!-- to the last -->, including the
important text in between. Several documents on extended regexes say that the
lazy star works, but it does not work in gawk. I know that I could use some
other tool, such as lynx, but I wanted to do it with gawk, and I don't see why
a perfectly fine programming language should fail at so simple a task.
Thanks
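
For what it's worth, gawk's regexps follow POSIX extended regular expressions, which have no lazy quantifiers, so in /<!--.*?-->/ the ? most likely just makes the greedy .* optional rather than non-greedy. One possible workaround is to skip the regex and cut comments out with index() and substr(). The sketch below is only an illustration under a few assumptions: the function name strip_comments is made up for this example, comments are assumed to start and end on the same line, and an unterminated comment is dropped through the end of the line.

```shell
# Prefer gawk, but fall back to any POSIX awk (only index/substr are used).
AWK=$(command -v gawk || command -v awk)

# strip_comments: hypothetical helper, reads stdin and writes stdout.
strip_comments() {
    "$AWK" '
    # Remove every <!-- ... --> span from s, shortest span first.
    function strip(s,    start, end, out) {
        out = ""
        while ((start = index(s, "<!--")) > 0) {
            out = out substr(s, 1, start - 1)   # keep text before the comment
            s = substr(s, start + 4)            # skip past "<!--"
            end = index(s, "-->")               # first closing marker wins
            if (end == 0) {                     # unterminated: drop the rest
                s = ""
                break
            }
            s = substr(s, end + 3)              # skip past "-->"
        }
        return out s
    }
    { print strip($0) }
    '
}

# Usage with the line from the message above:
printf '%s\n' '<!-- comment one --> Important text <!-- comment2 -->' | strip_comments
```

With that input the text between the two comments, including its surrounding spaces, should survive. Comments that span several lines would need extra state carried from one record to the next, which this sketch does not attempt.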