
Use GenAI to make test scenarios #99

Open
sloorush opened this issue Dec 10, 2024 · 3 comments
Labels: enhancement (Enhancements to the project), feature (New features that can be added to the project)

Comments


sloorush commented Dec 10, 2024

Extremely chaotic, but worth considering given how fast the new AI models run.

My idea is to use the OpenAI package on Tatin (or a client for any other LLM) to make a call to OpenAI (or similar) and ask it for data it thinks might break an APL primitive. We should log everything this does.
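To make the proposed flow concrete, here is a minimal, hypothetical sketch in Python: build a prompt asking a model for edge-case expressions for one primitive, parse the reply defensively, and log every exchange. The prompt wording, the JSON reply shape, and all function names are assumptions for illustration; a real run would replace the simulated reply with an actual LLM API call (e.g. via the Tatin OpenAI package mentioned above).

```python
import json
import logging

# Hypothetical sketch of the proposal: ask an LLM for inputs that might
# break an APL primitive, and log everything for reproducibility.
logging.basicConfig(level=logging.INFO)
log = logging.getLogger("genai-tests")

def build_prompt(primitive: str, n: int = 5) -> str:
    """Ask the model for candidate edge-case expressions for one primitive."""
    return (
        f"You are fuzz-testing the APL primitive '{primitive}'. "
        f"Reply with a JSON array of {n} strings, each an APL expression "
        f"you think might crash or misbehave."
    )

def parse_candidates(reply: str) -> list[str]:
    """Parse the model reply; a malformed reply yields no candidates."""
    try:
        data = json.loads(reply)
    except json.JSONDecodeError:
        log.warning("unparseable reply: %r", reply)
        return []
    return [c for c in data if isinstance(c, str)]

# Simulated model reply; a real implementation would call the LLM here.
reply = '["⍳¯1", "0⍴⊂⍬", "1e308+1e308"]'
candidates = parse_candidates(reply)
for c in candidates:
    log.info("candidate expression: %s", c)  # log everything, as proposed
print(candidates)
```

Parsing defensively matters here: since model output is non-deterministic, a malformed reply should produce an empty candidate list (and a log entry) rather than crash the test harness.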

I think it will give a lot of false positives, but non-deterministic testing is a new way of doing things. It's along the same lines as chaos testing.

Pros:

  • Lots of new data
  • Lots of uncommon scenarios that might produce aplcores (or incorrect errors), because the model will often write something that does not work
  • It's a new thing ¯\(⍣)/¯
  • (I think) we need more chaos testing

Cons:

  • Expensive, because it needs a subscription to an AI model (we could use an open-source one like Llama 3; it's pretty good)
  • Time-consuming: each test run will take longer (this can be mitigated by running these only with the slow tests)
sloorush added the enhancement and feature labels on Dec 10, 2024
sloorush (Member, Author) commented:

I got this idea from a project that does non-deterministic testing of GenAI applications; this is the other way round. I would appreciate more opinions on this if you think it will make a major difference...

sloorush (Member, Author) commented:

It comes from the idea that randomly generated deterministic data might not be the best test data, whereas AI-generated random data is more likely to be relevant, since the model tries to generate the best random data it can.

(Considering a world with next to no AI hallucinations.)
(Hallucinations are not necessarily bad either, because they may test for known errors.) (?)

sloorush (Member, Author) commented:

This could be added as an extension to RunVariations: it checks what the primitive can do, then gives it the hardest expression it thinks the primitive can solve.
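One way the integration point above could look, sketched in Python since RunVariations itself is part of the APL test suite: the model-suggested cases are merged into the deterministic variations behind an opt-in flag, so the extra cost only applies to the slow test runs mentioned earlier. The function names `deterministic_variations` and `llm_variations` are assumptions, not existing project code.

```python
# Hypothetical sketch of slotting LLM-suggested cases into a
# RunVariations-style harness; all names here are stand-ins.

def deterministic_variations(primitive: str) -> list[str]:
    # Stand-in for the existing deterministically generated cases.
    return [f"{primitive}⍳3", f"{primitive}0⍴0"]

def llm_variations(primitive: str) -> list[str]:
    # Stand-in for model-suggested "hardest expressions"; a real run
    # would query an LLM here and log the full exchange.
    return [f"{primitive}1e308 1e308"]

def run_variations(primitive: str, use_genai: bool = False) -> list[str]:
    cases = deterministic_variations(primitive)
    if use_genai:  # opt-in, so only the slow test runs pay the cost
        cases += llm_variations(primitive)
    return cases

print(run_variations("+", use_genai=True))
```

Keeping the GenAI cases behind a flag also keeps the default test suite deterministic and reproducible, which addresses part of the time cost listed under Cons.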
