TOP #45

ojwb · 2020-12-02T21:37:01Z

TOP is a bit of an odd-ball keyword in that it doesn't actually have its own token but is actually tokenised as a TO token followed by a letter P.

The syntax highlighting doesn't seem to know about this quirk, e.g. this actually colours the P brown instead of blue:

PRINT TOP

You can't just colour any P after TO blue though, consider:

F.P=TOP TOP:P.P:N.

There the first TOP is TOP, but the second TOP is actually TO P.

The text was updated successfully, but these errors were encountered:

mattgodbolt · 2020-12-02T21:48:02Z

Oh good grief! I can't believe I'm today years old and I learn TOP isn't real!

ojwb · 2020-12-02T22:20:36Z

The TOP is a lie!

I don't think there are any other pseudo-tokens to worry about at least.

I've long wondered (not very actively admittedly - it doesn't often keep me awake at night) why there isn't a token for TOP - several were spare in BASIC I and &CE was still spare in BASIC II (that became EDIT in later versions, which then had to get more inventive to have tokens beyond that). Maybe it was just easier to have context-sensitive handling at runtime rather than tokenisation time.

Golf tip: If you want to use TOP and you're pre-tokenising, you can save a byte by using LOMEM instead as that is a real token and the two have the same value unless you set them.

mattgodbolt · 2020-12-02T22:58:26Z

So good :) thanks @ojwb

mattgodbolt · 2020-12-13T18:30:38Z

Well, this one's a tricky one and no mistaking. The tokeniser is much too simple to know when to colesce the tokens: it's necessarily not a full parser (and it's based off what the the underlying editor provides us).

Looks like this would need us to "know" we're in a FOR and we're expecting TO, but there must be a simpler way!

mattgodbolt · 2020-12-13T18:32:02Z

Note for self: this test needs to pass:

    it("should handle TOP properly", () => {
        // See #45
        checkTokens(
            ["F.P=TOP TOP"],
            [
                {offset: 0, type: "keyword"}, // FOR
                {offset: 2, type: "variable"}, // P
                {offset: 3, type: "operator"}, // =
                {offset: 4, type: "keyword"}, // = TOP
                {offset: 5, type: "white"}, // space
                {offset: 6, type: "keyword"}, // TO
                {offset: 8, type: "variable"}, // P
            ]
        );
    });

ojwb · 2020-12-14T00:04:03Z

I've wondered if the editor should use the tokenised form as its internal representation, though if the syntax highlighting can only change the styling of the text we couldn't just make the text be the tokenised BASIC as we'd need to show something different to the text (e.g. display "TO" for character \xB8). So the whole idea might be a non-starter.

That wouldn't directly solve the TOP vs TO+P issue, but I think would make it simpler to determine if we're in a place where TOP is TO+P.

Perhaps there's a neat trick to get this right though - I'll see if I can come up with anything.

mattgodbolt · 2020-12-14T00:53:35Z

We are stuck with whatever Monaco does; its internal represntation is text. So making it tokens doesn't help unfortunately. We are limited to whatever it supports, and I don't know quite how the user experience would be trying to edit a token as text. Remember how the Spectrum toekn-based input worked...was super confusing :). Not saying your suggesting that but storing tokens but editing characters sounds tricky even if the editor allowed it.

…

On Sun, Dec 13, 2020, 18:04 Olly Betts ***@***.***> wrote: I've wondered if the editor should use the tokenised form as its internal representation, though if the syntax highlighting can only change the styling of the text we couldn't just make the text be the tokenised BASIC as we'd need to show something different to the text (e.g. display "TO" for character \xB8). So the whole idea might be a non-starter. That wouldn't directly solve the TOP vs TO+P issue, but I think would make it simpler to determine if we're in a place where TOP is TO+P. Perhaps there's a neat trick to get this right though - I'll see if I can come up with anything. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#45 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAE2Y5NHNTSGJIGIBST6PGLSUVI77ANCNFSM4ULCM6BA> .

ojwb · 2025-01-21T19:16:18Z

Note for self: this test needs to pass:

Some of the offsets there are ... offset - they should be:

    it("should handle TOP properly", () => {
        // See #45
        checkTokens(
            ["F.P=TOP TOP"],
            [
                {offset: 0, type: "keyword"}, // FOR
                {offset: 2, type: "variable"}, // P
                {offset: 3, type: "operator"}, // =
                {offset: 4, type: "keyword"}, // TOP
                {offset: 7, type: "white"}, // space
                {offset: 8, type: "keyword"}, // TO
                {offset: 10, type: "variable"}, // P
            ]
        );
    });

I had a little look - it seems it would need to be done by adding more states to the highlighting, but doing it properly seems to require doing the runtime expression parsing which BASIC presumably does when running the code since after the = we need to skip over a complete expression before we treat TO followed by P as TO P rather than TOP - consider e.g.:

FORP=TOP-TOP TOP+TOP/TOP

After = we need to parse an expression (in which TO followed by P is TOP), i.e. TOP-TOP and then we're in the "in a FOR loop expecting TOstate. After we parse aTOwe need to be back in a state whereTOfollowed byPisTOP` for the rest of the line.

Perhaps we can cheat and in a FOR statement handle TO right after a variable or pseudo-variable token or constant or closing parenthesis, or something like that. We might not even have to take nesting of parentheses into account. I haven't spotted a counterexample that this trick fails with yet at least.

mattgodbolt added bug Something isn't working syntax highlighting labels Dec 2, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TOP #45

TOP #45

ojwb commented Dec 2, 2020

mattgodbolt commented Dec 2, 2020

ojwb commented Dec 2, 2020

mattgodbolt commented Dec 2, 2020

mattgodbolt commented Dec 13, 2020

mattgodbolt commented Dec 13, 2020

ojwb commented Dec 14, 2020

mattgodbolt commented Dec 14, 2020 via email

ojwb commented Jan 21, 2025

TOP #45

TOP #45

Comments

ojwb commented Dec 2, 2020

mattgodbolt commented Dec 2, 2020

ojwb commented Dec 2, 2020

mattgodbolt commented Dec 2, 2020

mattgodbolt commented Dec 13, 2020

mattgodbolt commented Dec 13, 2020

ojwb commented Dec 14, 2020

mattgodbolt commented Dec 14, 2020 via email

ojwb commented Jan 21, 2025