bug#9747: C-x h TAB and M-x untabify

bug-gnu-emacs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#9747: C-x h TAB and M-x untabify

From:	Juri Linkov
Subject:	bug#9747: C-x h TAB and M-x untabify
Date:	Thu, 20 Oct 2011 02:32:14 +0300
User-agent:	Gnus/5.13 (Gnus v5.13) Emacs/24.0.90 (x86_64-pc-linux-gnu)

> I often use C-x h TAB and M-x untabify to format C, C++, and Java code.
>
> If a document has an errant UTF-8 byte order mark (a UTF-8 BOM is EF
> BB BF), Emacs cannot always format the source file.
>
> For example, the attached Java file (JavaEncryptor.java-backup) has
> 1845 BOMs sprinkled throughout. I'm not sure what editor put them in,
> but Emacs does not properly handle some operations with them present.
> If I strip the errant BOMs with the attached program
> (efbbbf-strip.cpp), Emacs will properly format the file.

"BYTE ORDER MARK" is the old name of the U+FEFF character.
The new name is "ZERO WIDTH NO-BREAK SPACE".

You can add to your .emacs something like:

(eval-after-load "cc-mode"
  '(progn (modify-syntax-entry ?\uFEFF " " java-mode-syntax-table)))

and the most of indentation code will work correctly.

However, in some places in core packages we need to replace such code

  (skip-chars-forward " \t")

with

  (skip-chars-forward " \t\uFEFF")

to take into account other whitespace characters.

[Prev in Thread]

Current Thread

[Next in Thread]

bug#9747: C-x h TAB and M-x untabify, Jeffrey Walton, 2011/10/13
- bug#9747: C-x h TAB and M-x untabify, Juri Linkov <=

Prev by Date: bug#9801: upgrading emacs when emacs is still running
Next by Date: bug#9751: 23.3; Alternative Keyboard Feature/Bug
Previous by thread: bug#9747: C-x h TAB and M-x untabify
Next by thread: bug#9749: 23.2; Compile-mode detection of error locations accounts for tabs incorrectly
Index(es):
- Date
- Thread