This repository has been archived by the owner on Aug 1, 2024. It is now read-only.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This is an attempt to fix the error encountered when running predict.py using the MSA Transformer on the example data provided in examples/variant-prediction/data (#458)
The reason for the current error appears to be that the example MSA has an EVMutation-specific usage of lower-case columns to signify low-coverage (nonfocus) columns and not insertions. In this case the desired behaviour is presumably to coerce such columns to upper-case for scoring. This was the original behaviour of predict.py, before the PR #230, which seems to be intended to support the standard usage of lower-case residues to designate insertions in user-supplied a2m/a3m files and inadvertently breaks the example.
This PR supports both options via the lowercase-type flag. The default treats the lower-case columns as EVMutation style nonfocus (i.e. low coverage) columns for compatibility with the example MSA.