caption text at training #138

TATEXH · 2024-01-10T13:37:42Z

Hi
I am using music_audioset_epoch_15_esc_90.14.pt as a music classifier. I would like to classify the mood and genre of our music files. I am computing the cosine similarity against the text "The mood of this song is (romantic, energetic, etc.)", but I only get about 0.4. I think the value would be better if I used text similar to what you used in training, so could you please tell me what type of text you used?

lukewys · 2024-03-31T14:53:55Z

Hi, for music we used "This audio is a <genre> song." I think the task you are dealing with is also a bit out of distribution relative to the training data. I don't think we included much music with mood labels in the music version of CLAP.

Best,

TATEXH · 2024-04-01T12:57:26Z

Thanks for the reply. I will try it with your text.
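
For anyone trying the same thing, here is a minimal sketch of how the prompt format from the reply above could be used for zero-shot genre ranking. It is an illustration under assumptions, not the maintainers' evaluation code: the helper names (`build_prompts`, `cosine_similarity`, `rank_labels`) and the toy 3-dimensional vectors are hypothetical. In practice the embeddings would come from the CLAP model, for example via the text and audio embedding methods shown in the repository README. The sketch compares candidate labels against each other rather than reading a single absolute similarity value, which is a common way to use this kind of model.

```python
import math

# Prompt format quoted in the reply above; {} is filled with the genre name.
GENRE_TEMPLATE = "This audio is a {} song."


def build_prompts(labels, template=GENRE_TEMPLATE):
    """Turn candidate labels into text prompts in the training-style format."""
    return [template.format(label) for label in labels]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_labels(audio_embedding, text_embeddings, labels):
    """Return (label, similarity) pairs sorted from best to worst match."""
    scores = [cosine_similarity(audio_embedding, t) for t in text_embeddings]
    return sorted(zip(labels, scores), key=lambda pair: pair[1], reverse=True)


# Toy data standing in for real CLAP embeddings (hypothetical values).
labels = ["rock", "jazz", "classical"]
prompts = build_prompts(labels)
audio_embedding = [0.9, 0.1, 0.3]
text_embeddings = [
    [0.2, 0.9, 0.1],  # stands in for the embedding of prompts[0]
    [0.8, 0.2, 0.4],  # stands in for the embedding of prompts[1]
    [0.1, 0.1, 0.9],  # stands in for the embedding of prompts[2]
]

ranking = rank_labels(audio_embedding, text_embeddings, labels)
for label, score in ranking:
    print(f"{label}: {score:.3f}")
```

With real embeddings, the label at the top of `ranking` would be the predicted genre. As noted in the reply, mood labels may still rank poorly because little mood-labelled music was in the training data.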
