Allow lazily-evaluated code as argument to TEST #3490

d-torrance · 2024-09-20T03:46:38Z

This is a quick followup to #3419, which changed TEST from a method to a keyword so that we could do a better job of keeping track of the location of tests.

In that PR, I gave it a very low binding strength (the same as elapsedTime, etc.). But this means that it behaves much differently than it did when it was a method, e.g.:

i1 : TEST "assert(1 + 1 " | "== 2)"

i2 : toString tests(0, User)

o2 = assert(1 + 1 == 2)

In M2 1.24.05, when TEST was still a method, we got an error:

i1 : TEST "assert(1 + 1 " | "==2)"
stdio:1:22:(3): error: no method for binary operator | applied to objects:
--            null (of class Nothing)
--      |     "==2)" (of class String)

This is because SPACE has higher binding strength than |.

We increase the binding strength of TEST to just below SPACE so that it behaves essentially the same as it did when it was a method. We keep it a bit below to mimic the right associativity of SPACE, i.e., we want TEST f x to parse as TEST (f x), not (TEST f) x.
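
To make the parsing argument concrete, here is a minimal sketch of a precedence-climbing parser, written in Python as a model rather than in the interpreter's own language. The numeric strengths (BAR = 10, SPACE = 30) and the parse function are assumptions made up for illustration; they are not the values or code used by Macaulay2. The sketch only shows how the strength given to a prefix keyword decides what it swallows.

```python
# Hypothetical binding strengths; these numbers are illustrative only and are
# not the values used by the Macaulay2 interpreter.
BAR = 10    # infix operator |
SPACE = 30  # adjacency, i.e. function application


def parse(tokens, test_strength):
    """Parse a flat token list into nested tuples.

    TEST is treated as a prefix keyword whose operand is parsed with
    binding strength `test_strength`.
    """
    pos = 0

    def expr(min_strength):
        nonlocal pos
        tok = tokens[pos]
        pos += 1
        # A prefix keyword grabs everything that binds at least as tightly
        # as its own strength.
        left = ("TEST", expr(test_strength)) if tok == "TEST" else tok
        while pos < len(tokens):
            if tokens[pos] == "|":
                if BAR < min_strength:
                    break
                pos += 1
                left = ("|", left, expr(BAR + 1))    # left associative
            else:
                if SPACE < min_strength:
                    break
                left = ("SPACE", left, expr(SPACE))  # right associative
        return left

    return expr(0)


bar_example = ["TEST", '"a"', "|", '"b"']
print(parse(bar_example, 1))           # weak keyword: TEST takes the whole | expression
print(parse(bar_example, 29))          # just below SPACE: TEST takes only "a"
print(parse(["TEST", "f", "x"], 29))   # just below SPACE: TEST (f x)
print(parse(["TEST", "f", "x"], 31))   # above SPACE: (TEST f) x
```

In this model, a strength just below SPACE reproduces both behaviors described above: the | operator is applied to the result of TEST, and TEST f x still groups as TEST (f x).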

mahrud · 2024-09-20T08:39:36Z

I understand wanting it to behave the same as before, but is there a reason why the previous behavior was better?

d-torrance · 2024-09-20T10:53:23Z

This is maybe pretty artificial, but this is the current behavior, which seems wrong:

i1 : Nothing + Nothing := x -> "foo";

i2 : TEST "a" + TEST "b"
stdio:2:9:(3): error: no method for binary operator + applied to objects:
            "a" (of class String)
      +     null (of class Nothing)

It seems like we should get "foo" here.

mahrud · 2024-09-20T11:28:48Z

I can't imagine any use for the output of TEST.

I can, however, imagine setting up TEST to act as a delayed execution keyword, so we could do:

TEST 1 + 1 == 2

Without bothering with strings or parentheses.

d-torrance · 2024-09-20T12:18:58Z

You've convinced me! TEST ... should probably behave like () -> ....

I may try and implement writing tests w/o strings like this in this PR...

mahrud · 2024-09-20T17:17:51Z

TEST ... should probably behave like () -> ....

Yes, exactly, construct a TestInput object, store it in a list, and maybe also return it?

By the way, I'd like assert to behave similarly, and in particular I want to be able to sprinkle tons of assertions in my code and run them only depending on the value of debugLevel or assertLevel or something like that. It's possible that, with some work, we could get an interpreter-level assert keyword to also print extra debug information, in the spirit of how you use Equality expressions. I'll just turn this into a new issue ...
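
As a rough illustration of why delayed evaluation matters for this idea, the following Python sketch gates an assertion on a level and never evaluates the condition when the level is too low. The names assert_level and lazy_assert are hypothetical stand-ins chosen for this example; nothing here reflects an existing Macaulay2 API.

```python
# Hypothetical global, standing in for the assertLevel idea mentioned above.
assert_level = 0


def lazy_assert(condition, level=1):
    """Evaluate the zero-argument callable `condition` only when enabled.

    Returns False if the assertion was skipped, True if it ran and passed.
    """
    if assert_level < level:
        return False  # skipped: the condition is never evaluated
    if not condition():
        raise AssertionError("assertion failed")
    return True


calls = []


def expensive_check():
    # Records that it ran, so we can see whether it was evaluated at all.
    calls.append("ran")
    return True


skipped = lazy_assert(expensive_check)  # assert_level is 0, so nothing runs
assert_level = 1
checked = lazy_assert(expensive_check)  # now the check is evaluated once
```

Because the condition is passed unevaluated, a disabled assertion costs only the level comparison.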

d-torrance · 2024-09-21T05:11:09Z

I have a very early draft where TEST ... behaves essentially like () -> ....

TestInput is now a subclass of FunctionClosure. We run it, and if it returns a string, then we treat it like a traditional test. Otherwise, we're done!
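
The run-then-fall-back logic can be modeled in a few lines of Python. This is only a sketch of the idea described above, not the code in the draft: tests, add_test, run_test, and check are hypothetical names, and Python's exec stands in for capturing a traditional string test.

```python
tests = []  # hypothetical registry, standing in for a package's list of tests


def add_test(thunk):
    """Register a zero-argument callable as a test."""
    tests.append(thunk)
    return thunk


def run_test(thunk):
    """Call the test; if it returns a string, treat it as a traditional test."""
    result = thunk()
    if isinstance(result, str):
        exec(result, {})  # evaluate the source text in a fresh namespace
        return "captured"
    return "called"  # the code already ran when we called the thunk


def check():
    return [run_test(t) for t in tests]


# A traditional test: the thunk just returns source text.
add_test(lambda: "assert 1 + 1 == 2")


# A test written directly as code: it runs when the thunk is called.
def direct():
    assert 1 + 1 == 2


add_test(direct)

results = check()
```

A failing assertion in either style surfaces as an exception from run_test; only the string-returning style pays for a second parse at run time.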

Tests defined in this way are very fast:

i1 : TEST "assert true"

i2 : TEST assert true

i3 : check User
 -- capturing check(0, "User")                   -- .0741398s elapsed
 -- calling   check(1, "User")                   -- .000079301s elapsed

mahrud · 2024-09-21T16:14:21Z

Tests defined in this way are very fast:

Interesting ... because we've front-loaded the parsing. That might also make benchmarking easier (maybe benchmark should be a keyword as well?).

We should be careful that locality of symbols doesn't change in a bad way.

TestInput is now a subclass of FunctionClosure.

Then maybe rename it TestClosure?

d-torrance · 2024-09-22T12:37:20Z

We should be careful that locality of symbols doesn't change in a bad way.

It seems to work pretty well -- it's really no different than writing a function in a package.

I adapted the tests in TerraciniLoci as a proof of concept. The big differences were changing ='s to :='s to avoid "unexported mutable symbol" errors when loading the package, adding semicolons, and adding other packages used in the tests to PackageImports instead of calling needsPackage in the test itself since we need those symbols to be available when we first load the package.

The only breaking change to existing tests that I've encountered is the handful of tests that use TEST get(...). We won't call get until runtime, so using currentFileDirectory won't work. But changing that to PackageName#"auxiliary files" does the trick.
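
The breakage comes from when the path expression is evaluated. A small Python sketch of the same effect follows; the variable current_file_directory and the paths are made up for illustration and only mimic a value that changes between the time a package is loaded and the time its tests run.

```python
# Hypothetical stand-in for a value that depends on the file being loaded.
current_file_directory = "/pkg/MyPackage/"

# Eager: the path is computed while the package file is being read.
eager_path = current_file_directory + "test.m2"


# Lazy: the path is computed only when the test is eventually run.
def lazy_path():
    return current_file_directory + "test.m2"


# Later, some other file is being processed, so the value has changed.
current_file_directory = "/somewhere/else/"

path_at_definition = eager_path
path_at_run_time = lazy_path()
```

A lookup that does not depend on load-time state, such as the package's "auxiliary files" key mentioned above, gives the same answer at both moments.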

One thing I haven't quite figured out how to deal with is the desc field in the functionBody object. Currently, dummyDesc works since it specifies 0 arguments, which is what we want, but there's some frame stuff in there that's probably important.

d-torrance · 2024-09-23T02:23:35Z

One strange thing I haven't figured out how to fix yet:

i1 : TEST (f := x -> x^2)

i2 : TEST (f := x -> x^3)
stdio:2:6:(3): warning: local declaration of f shields variable with same name

mahrud · 2024-09-23T06:53:39Z

I think you need to define a local dictionary for each TestClosure.

mahrud · 2024-09-23T06:56:23Z

The only breaking change to existing tests that I've encountered is the handful of tests that use TEST get(...). We won't call get until runtime, so using currentFileDirectory won't work. But changing that to PackageName#"auxiliary files" does the trick.

I think this is not just testing that you can read the file, without loading it, so probably it should be changed to load.

Also, I think now looking at the code of the test just shows the load line rather than the actual contents. Not sure what's the correct solution here, maybe using something other than TEST to add test files?

d-torrance · 2024-09-23T15:29:24Z

I think this is not just testing that you can read the file, without loading it, so probably it should be changed to load.

I don't think we want to call load inside TEST. That would actually load the file, which maybe has undesired consequences. For example, suppose the file foo.m2 contains the single line x = 2. Then:

i1 : TEST load "foo.m2"

o1 = TestClosure[stdio:1:5-1:18]

o1 : TestClosure

i2 : check User
 -- calling   check(0, "User") -- .000138457s elapsed

i3 : x

o3 = 2

But with get, the test closure returns a string, which means we fall back on the traditional behavior:

i1 : TEST get "foo.m2"

o1 = TestClosure[stdio:1:5-1:17]

o1 : TestClosure

i2 : check User
 -- capturing check(0, "User")                   -- .0358661s elapsed

i3 : x

o3 = x

o3 : Symbol
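
The difference between the two transcripts can be modeled in Python as follows. This is a sketch under stated assumptions: session is a hypothetical dictionary standing in for the user's top-level session, SOURCE stands in for the contents of foo.m2, and exec into a throwaway namespace stands in for capturing a traditional test.

```python
SOURCE = "x = 2"  # stands in for the contents of foo.m2 in the example above

session = {}  # hypothetical stand-in for the user's top-level session


def load_like():
    """Models TEST load: the file's code runs in the session itself."""
    exec(SOURCE, session)  # returns None, so there is no fallback


def get_like():
    """Models TEST get: the thunk only returns the file's text."""
    return SOURCE


def run_test(thunk):
    result = thunk()
    if isinstance(result, str):
        exec(result, {})  # traditional test: evaluate in a throwaway namespace


run_test(get_like)
leaked_after_get = "x" in session    # the session was not touched

run_test(load_like)
leaked_after_load = "x" in session   # the assignment leaked into the session
value_after_load = session.get("x")
```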

Also, I think now looking at the code of the test just shows the load line rather than the actual contents. Not sure what's the correct solution here, maybe using something other than TEST to add test files?

That's exactly what addTest(String) does (currently unexported) for the Core tests. Theoretically, any package that has files in its currentLayout#"packagetests" directory will also load tests this way.

However, there's currently not a way to tell a package to put any files there, and the autotools/cmake builds have to manually do this for Core. I've toyed with the idea of implementing this -- maybe a TestFiles option for newPackage with a list/regex of test files to install there?

We call each test, and if it returns a string, then it's a traditional test that we should try to capture. Otherwise, we're done! A couple things:
- Tests run this way don't produce any output, so debugging is harder. In particular, Verbose => true doesn't do anything.
- We print "calling" *after* we're done running the test, since we don't know if it returns a string or not.

d-torrance · 2024-09-29T12:43:35Z

I've added a TestFiles option to newPackage so that files can be added as tests directly without using get or load, and locate points to the right place.

I also figured out how to give each TestClosure object its own local dictionary -- the TEST keyword now behaves much more like -> and is parsed as an Arrow parse tree object.

mahrud · 2024-09-29T18:11:40Z

M2/Macaulay2/packages/Macaulay2Doc/functions/package-doc.m2

@@ -271,6 +271,7 @@ Node
    [newPackage, PackageExports]
    [newPackage, PackageImports]
    [newPackage, Reload]
+    [newPackage, TestFiles]


This is complicating the testing system rather than simplifying it, and I would personally rather not add yet another option to newPackage (e.g. should we also add DocumentationFiles next?)

Fair enough -- I wasn't that excited about this idea either.

Another possibility would be to change currentLayout#"packagetests" to point to a subdirectory of the auxiliary files directory (say test or tests) instead of the current behavior, where it's a subdirectory of the package's documentation directory. Then any files in that directory would automatically get loaded as tests, as currently happens for Core.

Polyhedra/tests might get in the way, but in principle that's fine with me.

mahrud · 2024-09-29T18:23:26Z

M2/Macaulay2/d/evaluate.d

-	seq(eval(c), locate(codePosition(c))));
-    when r is Error do r else nullE);
-setupop(TestS, testfun);
+addTestFromFile(e:Expr):Expr := (


I don't understand why this is better. It's much more complicated than before.

The current behavior is to store a test's location in a hash table, which is easy to do at top level. But with this proposal, tests are becoming functions, and so unless we want all the tests that have been loaded from files to have testing.m2 as their location, we need to fake it somehow in the interpreter. This function was my attempt at doing that. I figured we should try to find out the ending position so that code will work.

mahrud · 2024-09-29T18:25:14Z

M2/Macaulay2/d/parser.d

@@ -518,7 +525,10 @@ export treePosition(e:ParseTree):Position := (
    is s:Parentheses      do combinePositionL(s.left.position,       s.right.position)
    is s:EmptyParentheses do combinePositionL(s.left.position,       s.right.position)
    is a:Adjacent         do combinePositionM(treePosition(a.lhs),   treePosition(a.rhs))
-    is a:Arrow            do combinePositionL(treePosition(a.lhs),   treePosition(a.rhs))
+    is a:Arrow            do (


Treating TEST as -> is kludgy.

Yeah, it is a little strange. We need to call bind when things are still ParseTree objects in order to create a local dictionary, which is why I went this route. And Arrow is already set up for creating functions.

I also considered creating another member of ParseTree. Then we could avoid all the when lhs is dummy cases.

mahrud · 2024-09-29T18:56:22Z

I also figured out how to give each TestClosure object its own local dictionary -- the TEST keyword now behaves much more like -> and is parsed as an Arrow parse tree object.

I don't like this approach, and I don't think it needs to be this complicated.

I think we should keep TEST a unaryop and have it return an object of type:

export TestClosure := {+ frame:Frame, model:functionCode, hash:hash_t };

In particular, the model can be a normal functionCode, without having to change every step in the process.

mahrud · 2024-09-29T19:02:50Z

Could you make a branch on your fork with the last commit before you changed the behavior to act as an arrow? I see the commits before the force pushes, but I can't retrieve them because GitHub doesn't allow fetching orphaned commits. See here. I also can't easily tell which one I want, because I can't view the history of an orphaned commit.

I can try adding a local dictionary in a simpler way from there.

d-torrance · 2024-09-29T19:40:03Z

Sure thing!

https://github.com/d-torrance/M2/tree/test-unary

mahrud · 2024-10-04T22:35:54Z

Remind me, what is wrong with the version on your test-unary branch?

d-torrance · 2024-10-04T22:37:48Z

Remind me, what is wrong with the version on your test-unary branch?

No local dictionaries for the tests, e.g., #3490 (comment)

mahrud · 2024-10-07T13:59:36Z

I think it's going to take me more time than I have before November. Is it okay with you if we postpone this for the next release?

d-torrance · 2024-10-07T14:27:45Z

I think it's going to take me more time than I have before November. Is it okay with you if we postpone this for the next release?

Yes, that's totally fine!

d-torrance requested a review from mahrud September 20, 2024 03:46

d-torrance marked this pull request as draft September 20, 2024 12:19

mahrud mentioned this pull request Sep 20, 2024

Add assertLevel and automatic timer #3211

Open

d-torrance closed this Sep 20, 2024

d-torrance deleted the test branch September 20, 2024 17:27

d-torrance reopened this Sep 20, 2024

d-torrance force-pushed the test branch from e92223c to db6bcc4 Compare September 20, 2024 17:30

d-torrance force-pushed the test branch from e46746d to 7914f54 Compare September 21, 2024 05:21

d-torrance changed the title from "Increase binding strength of TEST keyword" to "Allow lazily-evaluated code as argument to TEST" Sep 22, 2024

d-torrance force-pushed the test branch from 8e44b22 to 064731a Compare September 22, 2024 23:30

d-torrance force-pushed the test branch 3 times, most recently from ab8e968 to bc8ee42 Compare September 28, 2024 19:30

d-torrance added 5 commits September 28, 2024 15:42

Store operator in functionCode objects

692f9e9

This way, we can recognize when a function is a test

Add new type of operator ("thunk") for creating nullary functions

a5f6300

TEST will be one. Parsed as an Arrow with a dummy lhs

Make TEST a thunk operator w/ same precedence as ->

e2e1e1d

Update behavior of TEST keyword

357b426

Now creates a nullary function containing the test's code rather than just computing the location of the string.

Make TestInput (now TestClosure) a child of FunctionClosure

3656742

We get most of the desired methods for free via inheritance. We keep TestInput around as a synonym for now

d-torrance added 4 commits September 28, 2024 16:19

Update addTest to take a function closure

54f727d

Adapt addTest(String) for tests as functions

8448e54

Update capture(TestClosure) for new test behavior

87361bd

Run a test, and if it returns a string (i.e., it's a traditional test), then capture that string.

d-torrance force-pushed the test branch from bc8ee42 to e7d8e1f Compare September 28, 2024 21:26

d-torrance added 9 commits September 29, 2024 08:34

Update check(333, "Core") for new test behavior

c43056b

Adapt packages that use "TEST get(...)" for new test behavior

5772ffc

currentFileDirectory will likely no longer be the correct path to the test files, so we use the packages' "auxiliary files" keys to find them.

Convert TerraciniLoci tests to functions as proof of concept

5be1c7b

Also bump package to version 0.2

Convert RInterface tests to functions

0318cec

Don't capture examples with TEST keyword

00af1af

Otherwise we'll actually add a test to the current package.

Update TEST doc with new behavior

5d46294

Also add "tests" to SeeAlso section

Add TestFiles option to newPackage

44f6447

List of files to install in package test directory when running installPackage.

Use TestFiles option in JSON package

c8efd14

Also bump the version number of JSON

Document the TestFiles option to newPackage

93c1355

d-torrance force-pushed the test branch from e7d8e1f to 93c1355 Compare September 29, 2024 12:35

d-torrance marked this pull request as ready for review September 29, 2024 12:40

mahrud reviewed Sep 29, 2024

d-torrance marked this pull request as draft October 7, 2024 14:27
