Re: Ispell and unibyte characters

emacs-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Ispell and unibyte characters

From:	Eli Zaretskii
Subject:	Re: Ispell and unibyte characters
Date:	Thu, 29 Mar 2012 20:06:17 +0200

> Date: Wed, 28 Mar 2012 21:18:21 +0200
> From: Agustin Martin <address@hidden>
> 
> > OTHERCHARS are not very important anyway, at least for languages I'm
> > interested in.
> > 
> > > Since currently it is not possible to ask hunspell for installed
> > > dictionaries (hunspell -D does not return control to the console)
> > > no one tried something similar for hunspell.
> > 
> > In what version do you have problems with -D?
> 
> Hunspell 1.3.2. Does not return control until I press ^C. This may be useful
> if someone wants to know about installed hunspell dictionaries and prepare
> something to play with that info, in a way similar to what is currently done
> for aspell in ispell.el.

Well, to be fair to the Hunspell developers, the documentation doesn't
say that -D should exit after displaying the available dictionaries.
And the code really doesn't do that.  However, with a simple 2-liner
(below) I can make it do what you want.

> > Since I wrote that, I found that the problem was due to a bug in
> > hunspell (which I fixed in my copy): it reported byte offsets of the
> > misspelled words, rather than character offsets.  After fixing that
> > bug, there's no issue here anymore and nothing to fix in ispell.el.
> > There's a bug report with a patch about that in the hunspell bug
> > tracker, so there's reason to believe this bug will be fixed in a
> > future release.
> 
> You mean
> 
> http://sourceforge.net/tracker/?func=detail&aid=3178449&group_id=143754&atid=756395

Yes.

> I filed that bug one year ago and received no reply from hunspell
> maintainers. This year I received a followup with a proposed change, but
> there is still no reply to it.

I simply fixed this.  This _is_ Free Software, isn't it?

> There is other problem that mostly hits re-using ispell default entries
> under hunspell
> 
> http://sourceforge.net/tracker/?func=detail&aid=2617130&group_id=143754&atid=756395
> 
> [~ prefixed strings are treated as words in pipe mode]

Another easy fix (the feature is not implemented, so the code should
simply ignore such lines).

> that now stands for three years. I have waited in the hope this is fixed,

It's true that development seems to be slow, but then aspell
development is not exactly vibrant, either: both spellers hadn't a
release in many months.

Anyway, to me, Hunspell is a better tool, because of its support for
multiple dictionaries, which fixes the most annoying inconvenience in
Emacs spell-checking: the need to switch dictionaries according to the
language -- this is really a bad thing when you use Flyspell.

With multiple dictionaries, with very rare exceptions, one needs a
single entry in ispell-dictionary-alist, having all of the
dictionaries for languages one normally uses, [[:alpha:]] as
CASECHARS, and UTF-8 as the encoding.

> but I think I will soon commit to Emacs the same change I use for Debian, 
> making sure extended-character-mode is nil for hunspell.

Probably a good idea.


--- src/tools/hunspell.cxx~0    2011-01-21 19:01:29.000000000 +0200
+++ src/tools/hunspell.cxx      2012-03-21 16:40:31.255690500 +0200
@@ -1756,6 +1763,7 @@ int main(int argc, char** argv)
                fprintf(stderr, gettext("SEARCH PATH:\n%s\n"), path);
                fprintf(stderr, gettext("AVAILABLE DICTIONARIES (path is not 
mandatory for -d option):\n"));
                search(path, NULL, NULL);
+               if (arg_files==-1) exit(0);
        }
 
        if (!privdicname) privdicname = mystrdup(getenv("WORDLIST"));

[Prev in Thread]

Current Thread

[Next in Thread]

Ispell and unibyte characters, Eli Zaretskii, 2012/03/17
- Re: Ispell and unibyte characters, Agustin Martin, 2012/03/26
  - Re: Ispell and unibyte characters, Eli Zaretskii, 2012/03/26
    - Re: Ispell and unibyte characters, Lennart Borgman, 2012/03/26
    - Re: Ispell and unibyte characters, Agustin Martin, 2012/03/28
    - Re: Ispell and unibyte characters, Eli Zaretskii <=
    - Re: Ispell and unibyte characters, Andreas Schwab, 2012/03/29
    - Re: Ispell and unibyte characters, Eli Zaretskii, 2012/03/30

Prev by Date: Re: dired-do-copy may create infinite directory hierarchy, again
Next by Date: Re: /srv/bzr/emacs/trunk r107697: calendar/calendar.el (calendar-exit): Use `quit-windows-on' instead of
Previous by thread: Re: Ispell and unibyte characters
Next by thread: Re: Ispell and unibyte characters
Index(es):
- Date
- Thread