
Undeclared identifiers in tokens assignment? #40

Open

philregier opened this issue Sep 12, 2019 · 5 comments

@philregier

Please pardon my ignorance, but I'm having a very difficult time understanding what happens when I assign the tokens attribute of a Lexer subclass.

I see in the documentation

Token names should be specified using all-caps as shown.

and some testing confirms that any identifier seems to be legal so long as its name is in all caps, but I don't understand what happens in Python when these identifiers are provided, nor do I recognize anything in lex.py that enables this behavior.

What is so special about tokens that allows it to accept previously undeclared names?
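
For reference, the declaration I'm asking about is the one from the introductory example in the SLY documentation, which looks roughly like this:

```python
from sly import Lexer

class CalcLexer(Lexer):
    # The set of token names. None of these identifiers (ID, NUMBER, ...)
    # is defined anywhere before this line, yet the assignment is accepted.
    tokens = { ID, NUMBER, PLUS, MINUS, TIMES, DIVIDE, ASSIGN, LPAREN, RPAREN }

    ignore = ' \t'

    # Regular expressions for each token
    ID      = r'[a-zA-Z_][a-zA-Z0-9_]*'
    NUMBER  = r'\d+'
    PLUS    = r'\+'
    MINUS   = r'-'
    TIMES   = r'\*'
    DIVIDE  = r'/'
    ASSIGN  = r'='
    LPAREN  = r'\('
    RPAREN  = r'\)'
```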

@dabeaz (Owner) commented Sep 12, 2019

Many magical behaviors can be achieved through the questionable use of metaclasses.
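
In concrete terms: a metaclass can return a custom mapping from its __prepare__ hook, and that mapping is used as the namespace while the class body executes, so every name lookup in the body goes through it. A dict subclass that resolves missing ALL-CAPS names to their own name is enough to make the tokens assignment above legal. A minimal sketch of the technique (not the actual sly source):

```python
class MagicDict(dict):
    """Class-body namespace that resolves undefined ALL-CAPS names to their own name."""
    def __missing__(self, key):
        if key.isupper():
            return key               # looking up ID yields the string 'ID'
        raise KeyError(key)

class MagicMeta(type):
    @classmethod
    def __prepare__(mcls, name, bases, **kwds):
        # The mapping returned here becomes the namespace for the class body.
        return MagicDict()

    def __new__(mcls, name, bases, namespace, **kwds):
        # Hand a plain dict to type.__new__ once the body has run.
        return super().__new__(mcls, name, bases, dict(namespace))

class Base(metaclass=MagicMeta):
    pass

class Demo(Base):                    # inherits the metaclass from Base
    tokens = { ID, NUMBER }          # ID and NUMBER were never defined...

print(Demo.tokens)                   # ...yet this prints {'ID', 'NUMBER'} (order may vary)
```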

@philregier (Author)

So if the mechanism itself requires a level of understanding of the Python data model beyond what I can muster, is it reasonable (if imprecise) to say that, where the derived lexer is concerned, the required tokens attribute -- whatever magic may be used to assemble it -- is there to identify which attributes define specific lexer characteristics as the class is built?

For example, is the ID attribute of CalcLexer in the introductory example "special" by virtue of the fact that its name first appeared in the tokens attribute when the derived class was prepared? And if I were to add another attribute, say AMPERSAND = r'&', would it not be special because its name does not appear in tokens?

@dabeaz (Owner) commented Sep 13, 2019

The primary purpose of tokens is to precisely specify the set of terminals needed for constructing parsers. If you were to add an attribute not listed in tokens, there would be no way for the parser to know about it. As for the underlying magic, it's not connected to tokens so much as the entire enclosing environment defined when you inherit from the Lexer base class.
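
To make the parser connection concrete, the two classes are wired together roughly like this in the documentation (a trimmed sketch; the grammar rules here are only illustrative). An attribute defined on the lexer but not listed in tokens, such as the hypothetical AMPERSAND above, would never show up in the terminal set the parser receives:

```python
from sly import Lexer, Parser

class CalcLexer(Lexer):
    tokens = { NAME, NUMBER, PLUS, ASSIGN }
    ignore = ' \t'
    NAME   = r'[a-zA-Z_][a-zA-Z0-9_]*'
    NUMBER = r'\d+'
    PLUS   = r'\+'
    ASSIGN = r'='

class CalcParser(Parser):
    # The parser's set of terminals is taken directly from the lexer's
    # tokens attribute; anything not listed there is invisible here.
    tokens = CalcLexer.tokens

    @_('NAME ASSIGN expr')
    def statement(self, p):
        return ('assign', p.NAME, p.expr)

    @_('expr PLUS term')
    def expr(self, p):
        return p.expr + p.term

    @_('term')
    def expr(self, p):
        return p.term

    @_('NUMBER')
    def term(self, p):
        return int(p.NUMBER)

lexer = CalcLexer()
parser = CalcParser()
print(parser.parse(lexer.tokenize('x = 1 + 2')))   # ('assign', 'x', 3)
```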

@EkremDincel

@dabeaz, could you explain how this works in a bit more detail, please? I read the sly.lex module but couldn't work out which section performs this magic, even though I'm familiar with metaclasses.

@dabeaz (Owner) commented Jun 29, 2020

The mechanism is metaclasses.
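
For anyone who wants to find the relevant section: the magic lives in the metaclass machinery near the top of sly/lex.py (internal names below are from memory and may differ between versions). The metaclass's __prepare__ hook hands the class body a custom dictionary that supplies missing all-caps names, and the effect is easy to observe without reading the source:

```python
# Assumes sly is installed; this only demonstrates the observable effect.
from sly import Lexer

class CalcLexer(Lexer):
    tokens = { NUMBER }          # NUMBER is undeclared at this point
    NUMBER = r'\d+'

print(type(CalcLexer))           # the metaclass behind Lexer subclasses (LexerMeta in sly.lex)
print(CalcLexer.tokens)          # the undeclared name ended up as the string 'NUMBER'
```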
