Clarification of the purpose of train/val/test datasets in default configurations #5583
-
Hi all, Since I'm currently using MMDetection as CV-framework for my master thesis I wanted to make sure that I properly understand the meaning of the different datasets and their purpose during different steps of the model training for available "default" configurations such as Faster R-CNN. From my understanding the model will try to fit the "train" data during training, by comparing the recieved output with the desired output. While training, an in-between evaluation of the model takes place, e.g. after each epoch, using the "val" data. I understand that the results from these in-between evaluations COULD be used to impact the training process, i.e. by early stopping, but do these results also impact the training when using the available configurations? In the end there still is the "test" data, which can then be used for the evaluation of the final model. Now thats two questions:
I'd be happy if someone could help me out here, as I don't want to get things confused here :) Cheers, |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 3 replies
-
Hi.
|
Beta Was this translation helpful? Give feedback.
Hi.
MS COCO
, theval
dataset is always used to do the ablation study due to the scale(maybe 5K in the 2017version), and you can also quick verify your model.test
(usetest-dev
is better) is always used to compare with the SOTA due to the high fairness(evaluation on the server). So thetest
is different from theval
.Clear?