aramorph-users
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Aramorph-users] XML tables


From: Ahmed El-dawy
Subject: Re: [Aramorph-users] XML tables
Date: Fri, 19 Aug 2005 20:38:00 +0300

Hello,

On 8/19/05, Pierrick Brihaye <address@hidden> wrote:

1) remember that implementing classes should be singletons. The
interface may so have a GetInstance() method.
 
I don't think so. The getInstance() method will not be called from the interface because it is static. It will be called from the implementing class directly.

2) I would like an init() method to pass, for example, a JDBCConnection.
 
Do you mean something like constructor? It will be constructed in the getInstance() method, so this will be useless. Also I don't know which parameter it may receive. If it depends on the implementation, it cannot be added to the interface.

3) we must have a LoadFromFile() method and maybe even a LoadFromXML.
 
Is this different from init()? I think this is implementation specific. LoadFromXML, for example, will not be used with JDBCDictionaryHandler.

4) rather then "translitered" arguments, consider Unicode ones. We may
even have methods for both.
 
Right! And I think we will have to change the InMemoryDictionaryHandler to handle this.

5) A good part of this code wil be common to every handler. That's why
you should consider an AbstractDictionaryHandler (implementing
DictionaryHandler) and then an InMemoryDictionaryHandler, a
JDBCDictionaryHandler...
 
Do you have an idea about any common code?

>     When you process a string from the original dictionary, please give it
>     as as a comment in the code (my fault if this has not been done
>     earlier).
>
> Don't understand.

Give as a Java comment the string which is in the dictionary, e.g. :

Parsing a comment : ; conjunctions
Parsing unvocalized : w
Parsing vocalized : wa
Parsing morphological category : Pref-Wa
Parsing gloss and grammatical category : and <pos>wa/CONJ+</pos>

etc, etc. It will be useful for the bloody GlossPOS ;-)
 
Where will this go? Does this relate to the XML file?
By the way, I have set the transformer to write XML comments in place of dictionary comments.

Oh, one more important thing I would like you to do. To remain ASCII
compliant, use the Java \uXXXXsyntax for arabic characters. Aramorph
must comple even on computers that don't support arabic.
 
OK! I will do it.
 
> I found that some stems do not have any lemmaID,
 
Could you please provide an example ? Since I've planned to write a
LemmaIDAnalyzer, this is bad news :-)
 
The code you have writes error message when finding one. Here's an example:

Found an element without lemma at line: 5811

Found an element without lemma at line: 5812
Found an element without lemma at line: 5813
Found an element without lemma at line: 5814

I am attaching the version which writes comments to XML.

Another question. For which XML dictionary format do I start writing the parser?
--
Regards,
Ahmed Saad

reply via email to

[Prev in Thread] Current Thread [Next in Thread]