4.8 Summary

Notes

General definitions:

Metric: A single number that describes the performance of a model
Accuracy: Fraction of correct answers; sometimes misleading
Precision and recall are less misleading when we have class inbalance
ROC Curve: A way to evaluate the performance at all thresholds; okay to use with imbalance
K-Fold CV: More reliable estimate for performance (mean + std)

In brief, this weeks was about different metrics to evaluate a binary classifier. These measures included accuracy, confusion table, precision, recall, ROC curves(TPR, FRP, random model, and ideal model), and AUROC. Also, we talked about a different way to estimate the performance of the model and make the parameter tuning with cross-validation.

The code of this project is available in this jupyter notebook.

Add notes from the video (PRs are welcome)

⚠️	The notes are written by the community. If you see an error here, please create a PR with a fix.

Navigation

Machine Learning Zoomcamp course
Session 4: Evaluation Metrics for Classification
Previous: Cross-Validation
Next: Explore more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

08-summary.md

08-summary.md

4.8 Summary

Notes

Navigation

Files

08-summary.md

Latest commit

History

08-summary.md

File metadata and controls

4.8 Summary

Notes

Navigation