help-gnu-emacs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Fwd: How to check whether a character (or one-character string) is a let


From: John Mastro
Subject: Fwd: How to check whether a character (or one-character string) is a letter?
Date: Fri, 3 Oct 2014 19:47:39 -0700

[I first sent this directly to Marcin in error - yeah, I use the email gateway]

Hi Marcin,

Marcin Borkowski <mbork@wmi.amu.edu.pl> wrote:
> Assume that I have a character (taken from some string, which in turn is
> copied from the buffer - so it need not be ASCII).  What is the best way
> to check whether it is a letter within ASCII range?
>
> The reason I'm asking is that I'm writing a function which converts an
> arbitrary string to a valid (and nice) filename (e.g., only letters and
> hyphens) - so basically I want to walk a string character by character
> and convert any space to a hyphen and omit any other non-letter.  Am I
> reinventing the wheel?

There are a bunch of ways to do this, but one reasonable approach is to
use a regular expression. I think this will do what you want:

    (defun reasonable-filename (str)
      (let* ((str (replace-regexp-in-string "[ \t\n\r]" "-" str))
             (str (replace-regexp-in-string "[^a-zA-Z-]" "" str)))
        str))

This is a variation which will also allow the result to contain numbers:

    (defun reasonable-filename (str)
      (let* ((str (replace-regexp-in-string "[ \t\n\r]" "-" str))
             (str (replace-regexp-in-string "[^a-zA-Z0-9-]" "" str)))
        str))

To answer your question about identifying whether a character is an
ASCII letter, the key is that Emacs's characters are really "just"
integers. Wikipedia has some charts[1] that show the numbers associated
with the characters. The letters are conveniently grouped together, so
we can use something like this:

    (defun ascii-letter-p (char)
      (and (characterp char)
           (>= char 65)
           (<= char 122)))

(Of course, this only works if it's really a character, as opposed to a
string of length one. If it's a string of length one you could either
"extract" the character with `aref' or use a regular expression
instead.)

Hope that helps.

[1] https://en.wikipedia.org/wiki/ASCII#ASCII_printable_code_chart

--
john



reply via email to

[Prev in Thread] Current Thread [Next in Thread]