Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how preprocessing voxceleb data and acc==0? #5

Open
colinsongf opened this issue May 12, 2018 · 10 comments
Open

how preprocessing voxceleb data and acc==0? #5

colinsongf opened this issue May 12, 2018 · 10 comments

Comments

@colinsongf
Copy link

how process voxceleb data for run trian.py?

why I iter 200 using 2GPU, but acc=0?

@ChristopherLu
Copy link

Hi,

Got the same issue on the voxceleb_v1 dataset. I can see the loss is consistently decreasing from 7.2 to 6.01, but the eval/train accuracy is always 0.
Have you solved this?

@colinsongf
Copy link
Author

i‘m not , sorry

@ChristopherLu
Copy link

Is this because lack of certain pre-process step for voxceleb data?

@colinsongf
Copy link
Author

i think so, but i do not how to pre-process voxceleb data!

@colinsongf
Copy link
Author

how to Voxceleb dataset preprocessing for dropping silence segments

@andabi
Copy link
Owner

andabi commented May 22, 2018

@ChristopherLu @colinsongf I proprocessed voxceleb dataset to be sample rate 16,000 that is my config in default.yaml

@ChristopherLu
Copy link

@andabi

Could you share us the procedure to get the 'voxceleb_norm'? Is it the data after pre-processing? We are confused about the right procedures to run the code for voxceleb, it wold be great if you could share us the recipe or pipeline to achieve this.

Thanks

@andabi
Copy link
Owner

andabi commented May 23, 2018

voxceleb_norm is the processed dataset. The dataset is structured to directories for each celeb. Each directory contains each celeb's wav files which have sample rate 16,000 and format is 'wav'. You need to preprocess above before training.

@colinsongf
Copy link
Author

colinsongf commented May 23, 2018

image
after processed all wav to sample rate 16,000, the result is acc=0, why?

@andabi
Copy link
Owner

andabi commented May 23, 2018

Keep the training at least a few days because voxceleb is huge. I kept training the model a few days using 8 gpu to get over 90% eval accuracy.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants