-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
update model_training process (train and export) #169
Conversation
@@ -180,7 +181,7 @@ function quick_collect() { | |||
} | |||
|
|||
function train() { | |||
train_model stressng_kepler_query,coremark_kepler_query,parsec_kepler_query ${VERSION} | |||
train_model stressng_kepler_query ${VERSION} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this selecting the data only for training?
If it is also for the validation: should we also use the coremark results for testing to verify the accuracy of the model with different workload?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It will be shuffled and use 10% of it for validation.
def normalize_and_split(X_values, y_values, scaler, test_size=0.1): |
We need to refactor the code to have a fixed validation dataset. Should be created for a separate issue.
Signed-off-by: Sunyanan Choochotkaew <[email protected]>
Signed-off-by: Sunyanan Choochotkaew <[email protected]>
@sunya-ch the CI passed now. |
** Rebase from the PR #172 **
Also, refer to archived pipeline that should be merged on kepler-model-db first by sustainable-computing-io/kepler-model-db#16
Need
to be merged first.
This PR includes
Signed-off-by: Sunyanan Choochotkaew [email protected]