
Automated changeset testing #44

Closed
jonathanolson opened this issue Jun 28, 2018 · 11 comments

@jonathanolson
Contributor

There have been many times when I've had to wait for local testing (aqua, unit tests, snapshot comparison) to complete before pushing local commits to master. It's been inconvenient, since it prevents me from starting new code changes until the testing is complete.

I'd like to consider something like the following (very open to modification):

  • Locally I'd identify two sets of SHAs (usually a "before" and an "after" of the changes to be tested), presumably master and commits on a feature branch.
  • There would be some way of automatically sending those up to an external server (bayes?) where processing starts.
  • It would run the relevant tests, and use the snapshot comparison to identify whether the change affected anything visual/interactive. Presumably we'd send the server information about what to test (e.g. a Scenery change would probably run all tests and compare all sims, but an area-model-common change would trigger much more limited testing).
  • Once testing is complete (or while it's in progress), you'd be able to view a report for the change. It would note all tests that fail before/after (generally only caring about things that changed), and it would provide an interface similar to the snapshot comparison I've done locally (able to visually show any difference the change caused in any sim).
  • If the testing is as expected (passing), then I'd merge the branch into master.

I'm not sure how important some of the complexity would be (e.g. "only run tests for area-model sims"), but it would be possible to start with a simple interface and add anything needed.
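For concreteness, the request to the server might look something like the following (a rough sketch; every field name and scope value here is invented for illustration, there's no existing API for this):

```ts
// Hypothetical request body for the proposed changeset-testing server.
// All names/values below are illustrative only.
interface ChangesetTestRequest {
  repo: string;       // repo whose change is under test, e.g. 'scenery'
  beforeSHA: string;  // typically where master is
  afterSHA: string;   // typically the tip of the feature branch
  testScope: 'all-sims' | 'dependent-sims-only';
}

// Example: a Scenery change would test everything (SHAs are placeholders)
const request: ChangesetTestRequest = {
  repo: 'scenery',
  beforeSHA: '<master SHA>',
  afterSHA: '<feature branch SHA>',
  testScope: 'all-sims'
};
```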

Tagging for developer meeting to discuss if this would be helpful for others, priorities, features, etc.

@jbphet
Contributor

jbphet commented Jun 28, 2018

It sounds awesome, and I think I would use it.

@samreid
Member

samreid commented Jul 5, 2018

Assuming that we implemented the functionality in the issue description, would we also want to rewrite/adapt Bayes CT to leverage it? That is, shouldn't our default automated testing leverage some of those features, such as (a) running minimal required tests (based on understanding the dependencies) and (b) identifying visual diffs?
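For (a), a rough sketch of how the test selection could work, assuming each sim lists its PhET repo dependencies in its package.json under phet.phetLibs (the exact field and layout may differ from what the tooling actually uses):

```ts
import * as fs from 'fs';
import * as path from 'path';

// Given the repo that changed, return the sims whose dependency list
// includes it, so CT only runs tests for affected sims. Assumes sibling
// checkouts under rootDir, each with a package.json listing its PhET repo
// dependencies under phet.phetLibs.
function simsAffectedBy( changedRepo: string, rootDir: string, sims: string[] ): string[] {
  return sims.filter( sim => {
    const packageJSON = JSON.parse(
      fs.readFileSync( path.join( rootDir, sim, 'package.json' ), 'utf8' )
    );
    const phetLibs: string[] = ( packageJSON.phet && packageJSON.phet.phetLibs ) || [];

    // A sim is affected by changes to itself or to any repo it depends on
    return sim === changedRepo || phetLibs.includes( changedRepo );
  } );
}
```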

@jonathanolson
Contributor Author

Assuming that we implemented the functionality in the issue description, would we also want to rewrite/adapt Bayes CT to leverage it? That is, shouldn't our default automated testing leverage some of those features, such as (a) running minimal required tests (based on understanding the dependencies)

Yes, presumably we'd share whatever code would be desired.

(b) identifying visual diffs?

Yes, see #22. This would need design for how it should show up when there is a visual difference.
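At its core, the comparison could be as simple as a per-pixel diff of two RGBA snapshots; a minimal sketch (the threshold and return value are arbitrary choices for illustration, not the interface being designed in #22):

```ts
// Compare two same-size RGBA snapshots (e.g. canvas getImageData buffers).
// Returns the fraction of pixels that differ by more than a threshold, so
// the report can flag a sim as "visually changed".
function fractionOfPixelsChanged(
  before: Uint8ClampedArray,
  after: Uint8ClampedArray,
  threshold: number = 0
): number {
  if ( before.length !== after.length ) {
    throw new Error( 'snapshot sizes differ' );
  }
  let changedPixels = 0;
  for ( let i = 0; i < before.length; i += 4 ) {
    const changed =
      Math.abs( before[ i ] - after[ i ] ) > threshold ||         // red
      Math.abs( before[ i + 1 ] - after[ i + 1 ] ) > threshold || // green
      Math.abs( before[ i + 2 ] - after[ i + 2 ] ) > threshold || // blue
      Math.abs( before[ i + 3 ] - after[ i + 3 ] ) > threshold;   // alpha
    if ( changed ) { changedPixels++; }
  }
  return changedPixels / ( before.length / 4 );
}
```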

@pixelzoom
Contributor

pixelzoom commented Jul 11, 2018

Not something that I've encountered a need for. But if it's something that others need, and the benefit/cost ratio is high enough...

@samreid
Member

samreid commented Jul 12, 2018

The proposed solution requires pushing the code to branches--why not check out a 2nd working copy, so you can run tests on the 1st working copy while developing in the 2nd working copy? Then we won't have to work out any client-server protocol, etc. and you can test changes that haven't been pushed at all. This approach will require approximately 0 investment and is something you can use right away. The main disadvantages that I see are (a) could be confusing to switch back and forth between two working copies and (b) it takes more disk space. But I think we could figure out how to deal with (a) after getting a little experience with this as a strategy.
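For example, something like this (a sketch using git worktree so the second copy shares object storage with the first; the paths are illustrative, and PhET's multi-repo checkout would need this per repo):

```ts
import { execSync } from 'child_process';

// Set up a second working copy of a repo at a fixed SHA via git worktree,
// so tests can run against it while development continues in the primary
// checkout.
function createTestCopy( repoPath: string, sha: string ): string {
  const testPath = `${repoPath}-testing`;
  execSync( `git -C ${repoPath} worktree add ${testPath} ${sha}`, { stdio: 'inherit' } );
  return testPath; // run aqua/unit tests against this path
}

// When done: execSync( `git -C ${repoPath} worktree remove ${testPath}` );
```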

@jonathanolson
Contributor Author

why not check out a 2nd working copy, so you can run tests on the 1st working copy while developing in the 2nd working copy?
The main disadvantages that I see are (a) could be confusing to switch back and forth between two working copies and (b) it takes more disk space.

I'm not concerned about disk space. Switching between working copies sounds inconvenient, so much so that I'd prefer pushing to a branch (even locally) so I could check out the "testable" point in my 2nd working copy.

Also offloading the computational load to a server would be nice, since the more "comprehensive" testing that I'd like would take up a lot of time (and depending on the development device, might slow down editors).

It's probably worth implementing this (local) style first, and adding on the server/client bit if it's worth it.

@pixelzoom
Contributor

If we go with the approach of using a branch, +1 to keeping the branch local, or doing whatever is necessary to avoid an explosion of branches.

@jonathanolson
Contributor Author

I'm not sure anything would cause an explosion of branches, because it would typically look like:

  1. Create a branch for a specific commit (say, a some-feature branch in Scenery).
  2. Run the testing/comparison.
  3. Merge it into master and delete the branch.
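
In command form (a sketch; execSync just stands in for however the tooling would drive git, and the repo/branch names are the example from step 1):

```ts
import { execSync } from 'child_process';

// The short-lived branch lifecycle above, for a single repo.
const repo = 'scenery';
const branch = 'some-feature';

// 1. Create a branch at the commit to be tested
execSync( `git -C ${repo} checkout -b ${branch}`, { stdio: 'inherit' } );

// 2. Run the testing/comparison against this branch (not shown)

// 3. Merge into master and delete the branch once testing passes
execSync( `git -C ${repo} checkout master`, { stdio: 'inherit' } );
execSync( `git -C ${repo} merge ${branch}`, { stdio: 'inherit' } );
execSync( `git -C ${repo} branch -d ${branch}`, { stdio: 'inherit' } );
```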

@jonathanolson jonathanolson self-assigned this Jul 12, 2018
@pixelzoom
Contributor

7/12/18 dev meeting:
• start as a "local" feature
• if it proves to be useful, investigate making available on bayes

@pixelzoom
Contributor

@ariel-phet no progress on this since 7/12/2018. How should we proceed?

@ariel-phet

Although this kind of feature would likely be "nice to have", people have been getting by with our current tools. Considering that the issue is basically two years old and we have not had time or a pressing need to work on it, it feels appropriate to close.
