Skip to content
This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

Data imbalance handling in MXNet Gluon #17237

Discussion options

You must be logged in to vote

I recently updated the split sampler in gluonnlp, such that the number of samplers for each worker will always be the same (with even_size=True). https://gluon-nlp.mxnet.io/master/api/modules/data.html?highlight=splitsampler#gluonnlp.data.SplitSampler

This somewhat avoids the imbalanced data batch problem. If it is useful I can upstream the sampler to mxnet, too

Replies: 5 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by szha
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants
Converted from issue

This discussion was converted from issue #17237 on September 05, 2020 19:33.