Enhance train_val_test_split with Flexible Test Dataset Support #59
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
What was changed
Changed function
train_val_test_split
in_bibmon_tools.py
data_test
andstart_test
.Objective
Enable the user to train and validate a model with one dataset and test it with another, allowing flexible partitioning for training, validation, and testing phases.
Importance
The new parameters data_test and start_test improve the function train_val_test_split by allowing the user to train and validate a model with one dataset and test it with another. This flexibility enables testing on data from different distributions or scenarios, enhancing model evaluation. The start_test parameter further allows for precise control over the test set's starting point within the provided test data, making the function more versatile for time-series and other partitioned datasets.
How to use