nmh-workers
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: mhfixmsg character set conversion


From: Steven Winikoff
Subject: Re: mhfixmsg character set conversion
Date: Wed, 09 Feb 2022 20:27:07 -0500

>> I think Steven says he's running Manjaro which is an Arch Linux spin off, and
>> Archers prefer to pass on upstream code unaltered where possible.
>
>Except that par has been altered?

Not by me, at any rate.


>I use this version, unaltered:
>$ par version
>par 1.53.0

   $ par version
   1.52-i18n.4

   $ pacman -Qi par
   Name            : par
   Version         : 1.52-8
   Description     : Paragraph reformatter
   Architecture    : x86_64
   URL             : http://www.nicemice.net/par/
   Licenses        : custom
   Groups          : None
   Provides        : None
   Depends On      : None
   Optional Deps   : None
   Required By     : None
   Optional For    : None
   Conflicts With  : None
   Replaces        : None
   Installed Size  : 98.90 KiB
   Packager        : Unknown Packager
   Build Date      : Mon 06 Jan 2020 12:53:58 AM
   Install Date    : Mon 06 Jan 2020 12:54:19 AM
   Install Reason  : Explicitly installed
   Install Script  : No
   Validated By    : None


>> > Do you have any idea where the following warning comes from?
>>
>> My money's on par(1) given
>>
>>     
>> https://inbox.vuxu.org/voidlinux-github/20191027084150.NZqC6wHlZkyQJ7AkACI7juvuCp0AD_u_IIwftMlDmKs@z/T/
>
>That sure looks like it.

Perhaps, but it isn't.


>> Steven, to confirm, try
>>
>>     egrep -l 'seems not configured' /usr/bin/par

   $ egrep -l 'seems not configured' /usr/bin/par
   $ echo $?
   1


>Steven, I would try removing par from the end of your mhbuild-convert-text/html
>entry.

The problem with that is that it's not there in the first place:

   $ grep par ~/.mh_profile
   $ echo $?
   1

In fact,

   $ grep mhbuild ~/.mh_profile
   mhbuild:        -maxunencoded 500

   $ grep html ~/.mh_profile
   #: mhshow-show-text/html:       %pmime_helper %F %s %{name}
   mhshow-show-text/html:       html_to_text %F | cat -s
   mhshow_in_browser-show-text/html:  %pmime_helper %F %s "%{name}"
   mhfixmsg-format-text/html:      html_to_text < '%F'

   $ grep -w par ~/bin/html_to_text
   $ echo $?
   1

I'll append the full text of the script in case you'd like to see it, but
I'm pretty sure it's not implicated here.

In fact there are no invocations of par anywhere in my ~/bin directory; the
only occurrences of the word are in some old data files:

   $ grep -lrisw par ~/bin
   /home/smw/bin/mars/reports/data/FMARS/jrn/text/20070729
   /home/smw/bin/mars/reports/data/FMARS/jrn/text/20070718
   /home/smw/bin/mars/reports/data/FMARS/jrn/text/20070719
   /home/smw/bin/mars/reports/data/FMARS/jrn/raw/20070719

...and these files have nothing to do with nmh in any way.

I'm reminded of an old Jackie Mason routine, in which he describes a visit
to a psychiatrist.  After a fair bit of dialog which I won't repeat here,
this snippet occurs:

   psychiatrist:  I see your problem.  You hate your sister.

   Jackie Mason:  I haven't got a sister.

   psychiatrist:  I can't help you if you won't cooperate.

...so I feel a need to apologize for being uncooperative :-/, but I'm at a
loss here.

     - Steven


8<-----------------------------   cut here   ---------------------------->8
#!/bin/sh
#
#  html_to_text -- convert HTML to plain text
#
#  Steven Winikoff
#  2010/04/28
#
#  note:  this script uses links
#         [ http://atrey.karlin.mff.cuni.cz/~clock/twibright/links ]
#         because it seems to be the only program available which
#         renders tables reasonably
# 
#         alternatives (lynx and vilistextum) both show tables one
#         column at a time instead of row by row!
#
#
#         UPDATE, 2018/08/22:
#
#         switched from links to elinks, because links fails when invoked
#         via procmail if the source HTML code contains invalid characters
#         (as in a file in Windows character encoding which isn't labelled
#         as such) -- the symptom is that a properly structured message
#         will be converted into one which has an empty HTML part, which
#         is a problem if (and only if :-) the HTML part needs to be viewed
#         in a graphical browser (see ~smw/bin/view_html_message, as called
#         from ~smw/bin/mhread)
#
#--------------------------------------------------------------------------

if [ ! -z "${1}" ]
then
   html="${1}"
else
   # links (as of April 2010, at least) refuses to read standard 
   # input with -dump

   html="/tmp/html_to_text.`date +%Y%m%d.%H%M%S`.$$"
   trap "rm -f ${html} >/dev/null 2>&1; exit 1" 1 2 3 15
   cat > ${html}
fi

elinks -force-html -dump -dump-charset utf-8  ${html} | sed 's/^   //;s/[   
]*$//'
##     | cat -s

#------------------------------------------------------------------------
#  w3m -I utf8 -T text/html -dump -s -o display_link_number=1 \
#      -o color=1 -graph ${html} | sed 's/^   //;s/[   ]*$//'
#------------------------------------------------------------------------
#  elinks -force-html -dump -dump-charset utf-8  ${html} | \
#         sed 's/^   //;s/[   ]*$//'
#------------------------------------------------------------------------
#  links -force-html -dump -codepage utf-8 ${html} | sed 's/^   //;s/[   ]*$//'
#------------------------------------------------------------------------
#  links -force-html -no-g -width 75 -html-margin 0 -codepage us-ascii \
#        -html-tables 1 -html-table-order 0 -html-frames 1 \
#        -html-numbered-links 1 -dump ${html} | sed 's/[        ]*$//'
#------------------------------------------------------------------------

[ -z "${1}" ] && rm -f ${html} >/dev/null 2>&1
8<-----------------------------   cut here   ---------------------------->8
-- 
___________________________________________________________________________
Steven Winikoff      |
Montreal, QC, Canada | "It's amazing how much 'mature wisdom'
smw@smwonline.ca     |  resembles being too tired."
http://smwonline.ca  |
                     |                      - Robert Heinlein



reply via email to

[Prev in Thread] Current Thread [Next in Thread]