Skip to content

Is it possible to use a checkpoint (EN) to train another language? #51

Closed Answered by p0p4k
egorsmkv asked this question in Q&A
Discussion options

You must be logged in to vote

Yes;

  1. load the model with random weights and new vocab -- A
  2. load the pretrained model with older vocab (and therefore different TextEncoder embedding layer). -- B
  3. copy the weights from B to A in all the layers that are same size and keep others random.

Replies: 1 comment 6 replies

Comment options

You must be logged in to vote
6 replies
@egorsmkv
Comment options

@egorsmkv
Comment options

@p0p4k
Comment options

@lumpidu
Comment options

@egorsmkv
Comment options

Answer selected by egorsmkv
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants