Skip to content

Commit

Permalink
Update model-selection.ipynb
Browse files Browse the repository at this point in the history
  • Loading branch information
JERRYenSHU503 committed Apr 28, 2024
1 parent 648a421 commit 832e17c
Showing 1 changed file with 3 additions and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -498,7 +498,9 @@
"source": [
"First of all, let's take a look at a plot, this is a simple learning curve using an iris dataset in sklearn.dataset. We can simply notice the two curve we plot fells far apart when we have less examples, and when we enlarge the training examples we can see the two lines are approaching convergence.\n",
"\n",
"This is how we can see the fitting process using learning curve."
"Why? \n",
"\n",
"To train a model, it is necessary to have a sufficient number of samples so that it can generalize patterns from the data. Assuming we have a function y=f(x), essentially, machine learning algorithms summarize and fit the f function based on a large number of (x, y) pairs. Therefore, if you have too few (x, y) pairs, the algorithm will not be able to summarize the function effectively. This is the impact of the sample size on the degree of fitting."
]
},
{
Expand Down

0 comments on commit 832e17c

Please sign in to comment.