Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

matchms spectral library subsetting tool #431

Closed
2 tasks
hechth opened this issue Nov 2, 2023 · 0 comments · Fixed by #437
Closed
2 tasks

matchms spectral library subsetting tool #431

hechth opened this issue Nov 2, 2023 · 0 comments · Fixed by #437
Assignees

Comments

@hechth
Copy link
Member

hechth commented Nov 2, 2023

We should implement a tool which allows extracting spectra from a library given their index or some unique metadata identifier like InChI or SPLASH or something comparable.

The tool should be made to work in 2 directions, (1) removing the spectra matching identifiers given in the file and (2) the opposite direction, only keeping the spectra matching the identifiers in the file.

Input for the Galaxy tool should be the (1) msp file to be filtered, (2) the mode as a select for include and exclude and the (3) list of identifiers in tsv or csv format to use for filtering. The header row in this list of identifiers states the key which will be used to match the identifiers in the list and the spectra.

Future extensions:

  • Include option to filter based on index
  • Automaticalle deduce the key based on the file format, e.g. for SMI files deduce that it is SMILES
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants