[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [aspell-devel] Problem regarding aspell and hindi dictionary
From: |
Jose Da Silva |
Subject: |
Re: [aspell-devel] Problem regarding aspell and hindi dictionary |
Date: |
Tue, 16 May 2006 11:26:58 -0700 |
User-agent: |
KMail/1.7.2 |
On May 16, 2006 03:45 am, gm ravi sastry wrote:
> Hi all,
> I have installed aspell-0.60 .I have also
> installed the english and hindi dictionaries all from
> http://aspell.sourceforge.net/
>
> The s/w works well while checking english documents
> for misspelt words.However when i use it for spell
> checking for hindi documents it dosent work.It looks
> like it is accepting hindi documents with misspelt
> words.Can anyobody suggest me what i should
> do??Thanks in advance .Looking forwards to a reply
> from your side
> Ravi Sastry
Within the hindi language tar file, you should find a large file that
ends in letters ****.cwl (compressed word list) or ****.pz (prezipped
word list).
Some dictionary files contain more than one of those types of files.
You will want to check that file (or files) for wrong words, but before
you can read it (or them), you need to decompress it (them) into a
format that you can read/understand.
To decompress the file, go to the command line and use the prezip-bin
utility like this (please substitute the ???? with the correct name of
the compressed file):
prezip-bin -d <????.cwl >hindiwordlist.txt
or
prezip-bin -d <????.pz >hindiwordlist.txt
At this point, you have a decompressed text file "hindiwordlist.txt"
Now with a text utility, take a look at the text file and see if you
find the misspelt words within the wordlist.
If you find the misspelt words in the wordlist, then that may be the
cause of the wrong spelling because those words might not belong in
there, and it would be recommended to let the authour of that
particular dictionary know about the errors you found so that the
wordlist can be properly updated.
Another possibility is that they are words that really do belong, but
they are rarely used, in which case they belong in a separate
dictionary, sort of like keeping a common English dictionary and a
rare-english dictionary.... or in Aspell terms, the common or rare
english dictionaries would likely fit in the different "--size"
options.
Please check the README file within the hindi dictionary tar for the
authour.
There also appears to be an [aspell-dict-maintainers] list which may be
more towards solving your problem. Please check here:
http://lists.gnu.org/archive/html/aspell-dict-maintainers/2006-03/index.html
Hopefully that helps.