This repository has been archived by the owner on Aug 1, 2024. It is now read-only.

extract and save embeddings for multiple sequence alignment based on msa transformer #92

Answered by tomsercu
maticmarin asked this question in Q&A

The internal state of the MSA Transformer has shape M x L x d (MSA size x sequence length x embedding dim).
Typically you want sequence-level features that summarize all of the MSA information. Taking the final layer's embedding of the first (typically the query) sequence gives good results. You could also try a (weighted) average over the whole MSA, but we didn't see much difference.
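
As a minimal sketch of the two options above, assume the final-layer representations are already available as an array of shape M x L x d. The random array below only stands in for real model output, the sizes are arbitrary, and the function names `query_embedding` and `weighted_msa_average` are hypothetical helpers, not part of the library.

```python
import numpy as np


def query_embedding(msa_repr):
    """Return the L x d embedding of the first (query) row of an M x L x d array."""
    return msa_repr[0]


def weighted_msa_average(msa_repr, weights=None):
    """Average an M x L x d array over the MSA axis.

    `weights` holds one non-negative weight per MSA row; uniform if omitted.
    """
    m = msa_repr.shape[0]
    if weights is None:
        weights = np.ones(m)
    weights = np.asarray(weights, dtype=msa_repr.dtype)
    weights = weights / weights.sum()  # normalize so the weights sum to 1
    # Contract the MSA axis of both arrays, leaving an L x d result.
    return np.tensordot(weights, msa_repr, axes=(0, 0))


# Placeholder data standing in for the model's final-layer output (assumption).
rng = np.random.default_rng(0)
M, L, d = 8, 50, 16
msa_repr = rng.standard_normal((M, L, d))

per_residue_query = query_embedding(msa_repr)     # shape (L, d)
per_residue_avg = weighted_msa_average(msa_repr)  # shape (L, d)
```

Either result can then be written to disk, for example with `np.save`, once it has been computed from real representations.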

Answer selected by tomsercu