bug#12598: 24.2; utf-8 codepoints in doc-strings and compression of .el and .elc files

bug-gnu-emacs

From:	Stefan Monnier
Subject:	bug#12598: 24.2; utf-8 codepoints in doc-strings and compression of .el and .elc files
Date:	Thu, 31 Jan 2013 13:15:20 -0500
User-agent:	Gnus/5.13 (Gnus v5.13) Emacs/24.3.50 (gnu/linux)

> I've just removed some utf-8 codepoints from docstrings in org-mode
> because when I compress either the source (.el.gz) or the resulting
> byte-compiled file (.elc.gz), the loader fails after the first function

I can't reproduce this problem for the .el.gz case (indeed, I think
it's specific to byte-compiled files).

> So, any codepoint that is more than a single byte will throw the
> byte-compiler off, not just any utf-8 codepoint.  Since this has been in
> Emacs likely ever since unicode strings have been introduced, I'd
> suggest adding a *strong* warning in some prominent place in the
> documentation about this even when it gets fixed in a newer version of
> Emacs. Otherwise it's all too easy to produce libraries that have
> mysterious failures depending on whatever Emacs was used to compile or
> run them.

I think the problem lies between load-with-code-conversion and
eval-buffer, so it dates back to the introduction of
load-with-code-conversion, which IIRC predates the internal use
of Unicode.

Fixing `eval-buffer' so that it skips bytes when it sees #@NN is tricky,
so the best fix is probably to change load-with-code-conversion so that
(if the file is byte-compiled) it saves the buffer to a temp file and
passes that to `load'.
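
To illustrate why a byte count stops lining up once the file has been decoded into a buffer, here is a small Python sketch. It is a toy model, not the real .elc layout or Emacs's reader: the record format, `make_record`, `skip_by_bytes` and `skip_by_chars` are all hypothetical names made up for this example. The only assumption carried over from the discussion above is that the number after `#@` counts bytes, while a decoded buffer is indexed by characters.

```python
# Toy model (assumption, not the actual .elc format): a record is
#   "#@" + <decimal byte count> + <payload of that many bytes> + <next form>
# where the payload is " " + docstring + "\x1f", encoded as UTF-8.

def make_record(doc: str, next_form: str) -> bytes:
    """Build one toy record whose count is the payload length in bytes."""
    payload = (" " + doc + "\x1f").encode("utf-8")
    count = str(len(payload)).encode("ascii")
    return b"#@" + count + payload + next_form.encode("utf-8")

def skip_by_bytes(raw: bytes) -> str:
    """Skip the payload on the raw bytes, as a reader of the file would."""
    assert raw.startswith(b"#@")
    i = 2
    while raw[i:i + 1].isdigit():
        i += 1
    n = int(raw[2:i])
    return raw[i + n:].decode("utf-8")

def skip_by_chars(raw: bytes) -> str:
    """Decode first, then (wrongly) treat the byte count as a character count."""
    text = raw.decode("utf-8")
    assert text.startswith("#@")
    i = 2
    while text[i].isdigit():
        i += 1
    n = int(text[2:i])
    return text[i + n:]

NEXT_FORM = "(defun foo () 1)"
ascii_record = make_record("plain docstring", NEXT_FORM)
utf8_record = make_record("naïve → test", NEXT_FORM)

# With an ASCII-only docstring both strategies agree.
print(skip_by_bytes(ascii_record))  # (defun foo () 1)
print(skip_by_chars(ascii_record))  # (defun foo () 1)

# With multibyte characters the character-based skip overshoots
# by the number of extra bytes (3 here) and eats into the next form.
print(skip_by_bytes(utf8_record))   # (defun foo () 1)
print(skip_by_chars(utf8_record))   # fun foo () 1)
```

In this model the character-based reader resumes in the middle of the next form, which is consistent with the reported symptom of the loader failing right after the first function with a non-ASCII docstring.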


        Stefan
