Prettyprinter and tests #67

inariksit · 2024-09-04T20:57:14Z

WIP, the Printer module is generated automatically from a throwaway BNFC grammar, and I just kept manually updating it as Lam4 CSTs evolved. I am still totally prepared to throw Printer.hs away.

The same PR also contains testing infra. I have made the following changes:

Move the file reading/parsing code from lam4-cli back into lam4-backend, as inspired by Preparations for proper tests. simala#13 . This so that it's possible to call those functions from tests and not have them in two different places.
Read all files in examples, parse them and prettyprint them, finally comparing the prettyprinted result against a golden file. Indirectly, this test also works as a test for the parser, because if the parser fails, the test will print the error message in the .actual file.

inariksit · 2024-09-04T21:04:20Z

lam4-backend/src/Lam4/Expr/Printer.hs

+    -- Name is actually part of the Predicate.
+    -- doing this quick hack to move it to where it belongs.
+    Rec name (Predicate vars expr ref) -> prPrec i 0 (concatD [prt 0 (Predicate (name:vars) expr ref)])


@ym-han I wonder about the reason to structure the AST like this. Here's an example of a tree with a Predicate:

Rec "`some predicate`" _4 (Predicate [] (BinExpr Gt (FunApp (Var "sumIntegers" _1) [ Lit (IntLit 1) , Lit (IntLit 2) ]) (Lit (IntLit 2))) Nothing)

which comes from this Lam4 expression:

DECIDE `some predicate` IF sumIntegers (1, 2) > 2

The string "`some predicate`" corresponds to the variable name on line 176, and the empty list is the vars. If I rendered name at the time of handling Rec, it would become like this instead:

`some predicate` DECIDE IF sumIntegers (1, 2) > 2

I can of course solve it by moving the name into the list of vars, and in rendering Predicate I know to treat the first element of the list specially. But changing the Haskell datatype sounds like a safer solution 😁

Couple of things here (and let's talk more about this after the meeting today if time permits):

Re Decl: I just remembered that Simala also has a Render / Pretty instance that might be helpful: https://github.com/smucclaw/simala/blob/746a6e0c98923374e55d7590fe30d93b1c9a3bef/src/Simala/Expr/Render.hs#L33
I would adopt a strategy like the one they take, at least for now, since the intent behind my design of the Decl vs Expr is pretty similar to theirs. In particular, the name in the Decl is not in some sense unique to Predicates. A Decl is basically just a Binding, except that it'd typically be a binding for a declaration (and in the case of Lam4, if I recall correctly, a top-level declaration, though this might change in the future).
Sorry, I should have linked you to their Render code earlier.

The Predicate example, though, does remind me that I probably have to augment the concrete syntax in the Haskell backend with more type signature / annotation constructs (and similarly with constructs like the FunDecl), if we really want to be able to support 'feature-complete' prettyprinting. The concrete syntax is a bit too 'abstract' in that sense (my original intent had just been to get something that would be easy to desugar even more and evaluate; I hadn't been aiming for bidirectionality).

I see, I wasn't aware of the Simala AST. That makes total sense, and as I said, I can get the things rendered correctly by just passing some arguments to lower levels. I will in that case just make that more intentional.

Yes, I also noticed that. This example for instance

// placeholder function for illustration Integer => Integer sumIntegers(x, y) = x + y GIVEN () DECIDE `some predicate` IF sumIntegers(1, 2) > 2

becomes this after parsing and prettyprinting:

sumIntegers (x, y) = x + y GIVEN () DECIDE `some predicate` IF sumIntegers (1, 2) > 2

I see, I wasn't aware of the Simala AST. That makes total sense, and as I said, I can get the things rendered correctly by just passing some arguments to lower levels. I will in that case just make that more intentional.

Yes, I also noticed that. This example for instance

// placeholder function for illustration Integer => Integer sumIntegers(x, y) = x + y GIVEN () DECIDE `some predicate` IF sumIntegers(1, 2) > 2

becomes this after parsing and prettyprinting:

sumIntegers (x, y) = x + y GIVEN () DECIDE `some predicate` IF sumIntegers (1, 2) > 2

I could also write a transformation to a 'more concrete' version of the concrete syntax. I just need to think more about this / play around with this a bit / look at your code

In order for the natural language rendering, keeping the Lam4 annotations seem to be the most important thing missing right now. Still working under the assumption that the desired input vs. output is something like this:

Lam4 code
STRUCTURE Lottery { /- description: "how much can be won from the jackpot." -/ total_jackpot: Integer /- description: "whether buying tickets from this lottery is tax deductible." -/ `tax deductible status`: Boolean }

NL rendering:
A Lottery is a type of game. Each Lottery has associated with it information like * its ‘total jackpot’; i.e. how much can be won from the jackpot * its ‘tax deductible status’; i.e. whether buying tickets from this lottery is tax deductible

I'll look into this! Probably the most important annotations / metadata to preserve would be the descriptions, if I'm understanding the point of this example correctly. (The Integer and Boolean type annotations in this example should already be preserved.)

Also, re what / how to render: you can think of that NL rendering I suggested as a 'default' template. But we should give users the ability to control things like whether it gets rendered at all (this would be the Simala-style Transparency), and if so, what 'template' to use for the rendering / how to render it (the latter is obviously going to require more work though, heh). I imagine that for certain audiences (e.g. busy executives), one might not even want to render most, perhaps even all, of the records and data types

I've just added a commit to PR #69 -- 93e9ec7 --- that partially addresses this (and tweaks the convention re metadata in the Langium grammar).

The new record decl metadata convention (this has also changed for function and predicate decls):

/- description: "Info about a lottery." -/ STRUCTURE Lottery { -- how much can be won from the jackpot. total_jackpot: Integer -- whether buying tickets from this lottery is tax deductible. `tax deductible status`: Boolean }

The backend parsing for the /- description: "Info about a lottery." -/ doesn't work yet --- I want to refactor that part of the Langium grammar first. But the inline descriptions do work (though maybe I should move the description up above the rest in the RowTypeDecl...):

[ TypeDecl "Lottery" _1 (RecordDecl [ MkRowTypeDecl { name = "total_jackpot" _3 , typeAnnot = BuiltinType BuiltinTypeInteger , metadata = Just "-- how much can be won from the jackpot." } , MkRowTypeDecl { name = "`tax deductible status`" _4 , typeAnnot = BuiltinType BuiltinTypeBoolean , metadata = Just "-- whether buying tickets from this lottery is tax deductible. " } ] [] RecordDeclMetadata { transparency = Transparent , description = Nothing }) ]

Hopefully this is helpful, and thank you @inariksit for pointing this out!

Makes sense! I just changed the printer based on the new ConcreteSyntax.hs in add-eval branch. As of 93e9ec7 this

TypeDecl (MkName "Lottery" 1) (RecordDecl [ MkRowTypeDecl { name = MkName "total_jackpot" 2 , typeAnnot = BuiltinType BuiltinTypeInteger , metadata = MkRowMetadata (Just "how much can be won from the jackpot") } , MkRowTypeDecl { name = MkName "`tax deductible status`" 4 , typeAnnot = BuiltinType BuiltinTypeBoolean , metadata = MkRowMetadata (Just "whether buying tickets from this lottery is tax deductible") } ] [] RecordDeclMetadata { transparency = Transparent , description = Just "game where you lose money" })

is prettyprinted into this:

/- description: game where you lose money -/ STRUCTURE Lottery { /- description: how much can be won from the jackpot -/ total_jackpot : Integer /- description: whether buying tickets from this lottery is tax deductible -/ `tax deductible status` : Boolean }

Ah there is one little complication: the /- ... -/ is no longer permitted for the fields in the Langium grammar. I.e., it should be

/- description: game where you lose money -/ STRUCTURE Lottery { -- how much can be won from the jackpot total_jackpot : Integer -- whether buying tickets from this lottery is tax deductible `tax deductible status` : Boolean }

But if this new convention doesn't seem like a good idea, please do let me know!

inariksit · 2024-09-19T20:25:06Z

lam4-backend/src/Lam4/Expr/Printer.hs

+-- Function arguments are separated by newlines
+newtype FunArg = FunArg {unFunArg :: String}
+
+mkFunArg :: T.Text -> FunArg
+mkFunArg t = FunArg [I.i|#{t} : TypeMissing|]


I added this, because I couldn't find the information about the argument types in the CST. Here's an example, when printing the file examples/infix_pred_app.l4.

Original:

GIVEN (some_person: Person another_person: Person) DECIDE some_person `has helped` another_person IF True

CST:

Rec "has helped" _2 (Predicate RuleMetadata { originalRuleRef = Nothing , transparency = Transparent , description = Nothing } [ "some_person" _3 , "another_person" _4 ] (Lit (BoolLit True)))

And this is how it's prettyprinted currently.

GIVEN (some_person : TypeMissing another_person : TypeMissing) DECIDE `has helped` IF True

(The CST is missing also the information about how the arguments are used in the body, but I will talk about that in another comment.)

inariksit · 2024-09-19T20:30:08Z

lam4-backend/src/Lam4/Expr/Printer.hs

+        , prt 0 (fmap (\a -> mkFunArg a.name) args) -- custom newtype to print args in a specific way
+        , doc (showString ")")
+        , doc (showString "\nDECIDE")
+        , prt 0 funname


Another example from the file as in the comment above: examples/infix_pred_app.l4.

Original:
GIVEN (person: Person) DECIDE `is fun` IF True

CST:
Rec "is fun" _8 (Predicate RuleMetadata { originalRuleRef = Nothing , transparency = Transparent , description = Nothing } [ "person" _9 ] (Lit (BoolLit True)))

With the current line 300, this is prettyprinted as follows:

GIVEN (person : TypeMissing) DECIDE `is fun` IF True

so it happens to work in this case, except for the TypeMissing part. But for the other cases where the predicate is actually applied to its argument, it's not working.

(pinging @ym-han for this and the comment on lines 138-143)

The actual tests still don't work, TODO fix

inariksit marked this pull request as draft September 4, 2024 20:57

inariksit commented Sep 4, 2024

View reviewed changes

inariksit force-pushed the prettyprinter branch 2 times, most recently from 5124167 to 79b96f7 Compare September 6, 2024 14:44

inariksit force-pushed the prettyprinter branch from 79b96f7 to 95ba895 Compare September 19, 2024 20:18

inariksit commented Sep 19, 2024

View reviewed changes

inariksit force-pushed the prettyprinter branch 2 times, most recently from 00f00f1 to d36eb3e Compare September 23, 2024 11:06

inariksit changed the title ~~Prettyprinter (eventually)~~ Prettyprinter and tests Sep 23, 2024

inariksit marked this pull request as ready for review September 23, 2024 11:12

inariksit marked this pull request as draft September 23, 2024 12:25

inariksit force-pushed the prettyprinter branch from d36eb3e to 97b3553 Compare September 23, 2024 12:46

inariksit marked this pull request as ready for review September 23, 2024 12:47

inariksit added 16 commits October 1, 2024 08:47

export Actions

1b70eec

add v0 of a Printer (doesn't deserve name Pretty yet)

7764db9

update for the new Haskell CST

ebf3c43

another update for changed ConcreteSyntax (more metedata)

febccce

introduce backticks if identifier contains spaces

b53be6f

fixes in metadata

884cf88

improve Predicates (NB. type of args is missing in CST?)

cd9eff2

misc fixes / tweaks

bd79537

fix some warnings

6e8391c

extra newline at printout

e61bdea

WIP: add test framework for prettyprinter

fd4ef3d

WIP: use examples directory as tests

dac21af

The actual tests still don't work, TODO fix

add new constructs (e.g. Foldl/r, StringLit)

8de680c

Move file handling etc. into backend for ease of testing

d9529d4

Test for prettyprinter. (Also tests parser, since it reads the files)

a0ef0cd

new structures + tweaks

01c9066

inariksit added 4 commits October 1, 2024 08:50

misc tweaks, gold standard for 6 files

8606afa

Anon functions, more misc tweaks

634051e

fix formatting

062a91d

remove MkAtom

3b12dce

inariksit force-pushed the prettyprinter branch from d9020fb to 3b12dce Compare October 1, 2024 06:55

ym-han force-pushed the main branch from 53ac9d0 to 9bda7eb Compare October 22, 2024 10:53

ym-han force-pushed the main branch 3 times, most recently from 288a190 to b9c08b3 Compare October 30, 2024 14:24

ym-han force-pushed the main branch from 96a1b4f to f864c48 Compare November 19, 2024 07:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prettyprinter and tests #67

Prettyprinter and tests #67

inariksit commented Sep 4, 2024 •

edited

Loading

inariksit Sep 4, 2024

ym-han Sep 5, 2024 •

edited

Loading

inariksit Sep 5, 2024

ym-han Sep 5, 2024 •

edited

Loading

inariksit Sep 6, 2024

ym-han Sep 6, 2024 •

edited

Loading

ym-han Sep 6, 2024 •

edited

Loading

inariksit Sep 6, 2024 •

edited

Loading

ym-han Sep 6, 2024 •

edited

Loading

inariksit Sep 19, 2024

inariksit Sep 19, 2024

Prettyprinter and tests #67

Are you sure you want to change the base?

Prettyprinter and tests #67

Conversation

inariksit commented Sep 4, 2024 • edited Loading

inariksit Sep 4, 2024

Choose a reason for hiding this comment

ym-han Sep 5, 2024 • edited Loading

Choose a reason for hiding this comment

inariksit Sep 5, 2024

Choose a reason for hiding this comment

ym-han Sep 5, 2024 • edited Loading

Choose a reason for hiding this comment

inariksit Sep 6, 2024

Choose a reason for hiding this comment

ym-han Sep 6, 2024 • edited Loading

Choose a reason for hiding this comment

ym-han Sep 6, 2024 • edited Loading

Choose a reason for hiding this comment

inariksit Sep 6, 2024 • edited Loading

Choose a reason for hiding this comment

ym-han Sep 6, 2024 • edited Loading

Choose a reason for hiding this comment

inariksit Sep 19, 2024

Choose a reason for hiding this comment

inariksit Sep 19, 2024

Choose a reason for hiding this comment

inariksit commented Sep 4, 2024 •

edited

Loading

ym-han Sep 5, 2024 •

edited

Loading

ym-han Sep 5, 2024 •

edited

Loading

ym-han Sep 6, 2024 •

edited

Loading

ym-han Sep 6, 2024 •

edited

Loading

inariksit Sep 6, 2024 •

edited

Loading

ym-han Sep 6, 2024 •

edited

Loading