[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Emacs as a translator's tool

From: Emanuel Berg
Subject: Re: Emacs as a translator's tool
Date: Tue, 09 Jun 2020 23:13:31 +0200
User-agent: Gnus/5.13 (Gnus v5.13) Emacs/26.1 (gnu/linux)

Jean-Christophe Helary wrote:

> CAT segmentation rules are defined by the SRX
> standard. They are basically a set of cascading
> regex rules (break/don't break).

OK? do (sentence-end) to get, maybe,

"\\([.?!…‽][]\"'”’)}]*\\($\\|[    ]\\)\\|[。.?!]+\\)[    

Because "ll paragraph boundaries also end sentences" ...

    paragraph-separate is a variable defined in
    ‘paragraphs.el’. Its value is "[ ]*$"

      This variable is safe as a file local variable if
      its value satisfies the predicate ‘stringp’.

    Regexp for beginning of a line that separates
    paragraphs. If you change this, you may have to
    change ‘paragraph-start’ also.

So we'll just change it to comply with the SRX
standard :)

> It is possible to fine-tune a translation by
> modifying a rule set ...

It is possible but with Emacs, even more so :)

underground experts united

reply via email to

[Prev in Thread] Current Thread [Next in Thread]