emacs-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Indirect text properties


From: Alan Mackenzie
Subject: Re: Indirect text properties
Date: Mon, 18 Nov 2019 18:06:59 +0000
User-agent: Mutt/1.10.1 (2018-07-13)

Hello, Dmitry.

On Mon, Nov 18, 2019 at 00:55:50 +0200, Dmitry Gutov wrote:
> On 17.11.2019 19:05, Alan Mackenzie wrote:

> > This is an idea I had a couple of years ago, and has recently resurfaced
> > in discussions with Dmitry (Subject: Several major modes).

> > The idea is that there could be several alternative sets of text
> > properties with the same symbol simultaneously in a buffer, the Lisp
> > code selecting which to use by binding a dynamic variable.  This would
> > be most useful for the syntax-table text property.

> Could char-property-alias-alist help?

That is kind of the other way round to indirect properties.  It gives
several names accessing one property, whereas the indirect properties
gives one name accessing several properties.

> > How would this work?  In textprop.c, the code would, on any access to a
> > text property, check its symbol's property 'indirect-text-property, and
> > if that is a non-nil symbol, access it's value (another symbol) and use
> > that as the symbol for the text property instead.  It's easier to say in
> > code, which would look something like:

> >      #define TEXP_PROP_END_NAME(sym) \
> >          !NILP (itp = Fget (sym, Qindirect_text_property)) && SYMPOLP (itp) 
> > \
> >          && !NILP (etp = find_symbol_value (itp)) && SYMBOLP (etp) \
> >          ? etp : sym

> > .  To switch to a different set of, e.g., syntax-table text properties
> > it would suffice to bind the lisp variable i-t-p to, say, the gensym
> > syntax-table-13.  Of course low level caches, e.g. in syntax.c, would
> > have to be kept synchronised, too.

> It's a lot of work with likely some performance overhead even for the 
> default case as well. It sounds like it could be a piece of the puzzle, 
> but let's see if we get the full picture first.

The performance overhead whilst the facility is not in use would be tiny:
a look-up of a symbol's property list which is highly likely to be empty
anyway.  This is in the context of text properties, which involve
traversing trees to find an "interval" containing the place we're looking
up.

But OK, we need firmer proposals.

> Also, I think most (all?) of this proposal could be implemented in Lisp 
> by just setting the 'syntax-table' on the overlays that cover different 
> submode regions. With more overhead when setting but less overhead than 
> accessing the values.

Overlays don't have a syntax-table property.  It could be implemented,
but it would slow down syntactic scanning probably a lot, since
parse-partial-sexp would have to check all the overlays on _each_
character, one by one.  Or, some optimisation which might be brittle.

> > So, what use would it be?  What I have proposed to Dmitry is having a
> > distinct set of syntax-table properties for each major mode chunk of an
> > MMM Mode ("multiple major mode") buffer.  Say syntax-table-13 would be
> > the set for a CC Mode chunk.  Outside of that chunk, every character
> > would be given a space syntax-table-13 text property.  This is the
> > critical thing.

> > Thus all actions dependent upon syntax (and there are a LOT), could be
> > performed by CC Mode in the chunk without the other chunks getting in
> > the way.  It may not even be necessary to narrow to the chunk.

> It doesn't seem like it covers all problematic cases. Maybe not even the 
> majority:

Possibly not.

> - Would this win over "local" syntax-table properties as assigned by 
> syntax-table? By the usual logic of how we implement property 
> priorities, probably not. But it should, for this to work.

I'm not clear what you mean by "local" syntax-table properties, or by
"assigned by syntax-table".  Sorry.

> - Some code can just be looking for certain characters instead of syntax 
> classes with re-search-backward, etc. It wouldn't be fooled either. So 
> this would likely require some "are we still in the same major mode" 
> predicate. At which point we might get by without the space-syntax-table 
> swapping entirely.

Yes, that is true.  So the concept of region boundary could not be done
away with altogether.

> So what are the exact scenarios that your aim is to fix with this?

You mentioned having a C Mode "main" section in MMMM with embedded other
modes, possibly in the middle of CC Mode constructs.  Here, my new scheme
would win, since the other modes would just look like whitespace to CC
Mode whilst it is scanning for strings, comments, braces, etc.  This
seems to be a big problem at the moment with CC Mode + MMM Mode.

The same applies to other major modes, many of which will want to use
syntax-ppss.

> /Cc Vitalie, he could have some ideas, maybe even tell us how Polymode 
> maybe solves this problem already.

-- 
Alan Mackenzie (Nuremberg, Germany).



reply via email to

[Prev in Thread] Current Thread [Next in Thread]