4 different benchmark sets focused around str.replace_all #7
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This pull request introduces four new benchmark sets aimed at evaluating and challenging string solvers on diverse and difficult problem instances focusing on the operator str.replace_all.
Benchmark Sets
1. pcp-3-3-random (1000 benchmarks)
2. pcp-3-4-hard (3170 benchmarks)
Benchmarks encoding PCP[3,4] that were considered hard to solve by the paper "Creating Difficult Instances of the Post Correspondence Problem". The instances still have 3 tiles but strings up to length 4. Now the length is not exact but at most 4.
The benchmarks were originally considered hard; many have later been solved using alternative techniques such as Parikh Automata and Model Checking. However, string solvers have not been extensively tested on these benchmarks. Current experiments show that none of those benchmarks are solved by string solvers.
3. rna-sat and rna-unsat (500+500 benchmarks)
These benchmarks model a reverse transcription process inspired by bioinformatics.
An unknown RNA string
y
is converted into a DNA string by applying a series ofstr.replace_all
operations that simulate nucleotide base pairing.