Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Finetune CLAP on {audio, text} pairs #141

Open
jerpint opened this issue Feb 8, 2024 · 1 comment
Open

Finetune CLAP on {audio, text} pairs #141

jerpint opened this issue Feb 8, 2024 · 1 comment

Comments

@jerpint
Copy link

jerpint commented Feb 8, 2024

Hello!

Suppose I have a dataset of {audio, text} pairs. I would now like to finetune CLAP on this audio subset. Do you have any tips for getting started with such a task? Would continuing the training from a checkpoint with a smaller learning rate be somewhat of a good start? Do you have scripts that allow to do something similar?

Thanks

@lukewys
Copy link
Contributor

lukewys commented Mar 31, 2024

Please see https://github.com/LAION-AI/CLAP?tab=readme-ov-file#dataset-format for details on the dataset format that we trained on. I think you can refer to the training script for fine-tuning, but remember to modify the learning rate and weight initialization.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants