Name	Name	Last commit message	Last commit date
parent directory ..
mnist	mnist
MulticlassClassification_mnist.sln	MulticlassClassification_mnist.sln
README.md	README.md

MNIST Classification

ML.NET version	API type	Status	App Type	Data type	Scenario	ML Task	Algorithms
v0.10	Dynamic API	Up-to-date	Console app	.csv files	MNIST classification	Multi-class classification	Sdca Multi-class

In this introductory sample, you'll see how to use ML.NET to classify handwritten digits from 0 to 9 using the MNIST dataset. This is a multiclass classification problem that we will solve using SDCA (Stochastic Dual Coordinate Ascent) algorithm.

Problem

The MNIST data set contains handwritten images of digits, ranging from 0 to 9.

The MNIST dataset we are using contains 65 columns of numbers. The first 64 columns in each row are integer values in the range from 0 to 16. These values are calculated by dividing 32 x 32 bitmaps into non-overlapping blocks of 4 x 4. The number of ON pixels is counted in each of these blocks, which generates an input matrix of 8 x 8. The last column in each row is the number that is represented by the values in the first 64 columns. These first 64 columns are our features and our ML model will use these features to classifiy the testing images. The last column in our training and validation datasets is the label - the actual number that we will predict using our ML model.

the ML model that we will build will return probabilities for a given image of being one of the numbers from 0 to 9 as explained above.

ML task - Multiclass classification

The generalized problem of multiclass classification is to classify items into one of three or more classes. (Classifying items into one of the two classes is called binary classification).

Solution

To solve this problem, first we will build an ML model. Then we will train the model on existing data, evaluate how good it is, and lastly we'll consume the model to predict a number the given image represents.

1. Build model

Building a model includes:

Uploading data (optdigits-train.csv) with (DataReader)
Create an Estimator and transform the data in the first 64 columns to one column so it can be used effectively by an ML algorithm (with Concatenate)
Choosing a learning algorithm (StochasticDualCoordinateAscent).

The initial code is similar to the following:

// STEP 1: Common data loading configuration
var trainData = mLContext.Data.ReadFromTextFile(path: TrainDataPath,
                        columns : new[] 
                        {
                            new TextLoader.Column(nameof(InputData.PixelValues), DataKind.R4, 0, 63),
                            new TextLoader.Column("Number", DataKind.R4, 64)
                        },
                        hasHeader : false,
                        separatorChar : ','
                        );

var testData = mLContext.Data.ReadFromTextFile(path: TestDataPath,
                        columns: new[]
                        {
                            new TextLoader.Column(nameof(InputData.PixelValues), DataKind.R4, 0, 63),
                            new TextLoader.Column("Number", DataKind.R4, 64)
                        },
                        hasHeader: false,
                        separatorChar: ','
                        );						

// STEP 2: Common data process configuration with pipeline data transformations
var dataProcessPipeline = mLContext.Transforms.Concatenate(DefaultColumnNames.Features, nameof(InputData.PixelValues)).AppendCacheCheckpoint(mLContext);

                

// STEP 3: Set the training algorithm, then create and config the modelBuilder
var trainer = mLContext.MulticlassClassification.Trainers.StochasticDualCoordinateAscent(labelColumn: "Number", featureColumn: DefaultColumnNames.Features);
var trainingPipeline = dataProcessPipeline.Append(trainer);

2. Train model

Training the model is a process of running the chosen algorithm on a training data to tune the parameters of the model. Our training data consists of pixel values and the digit they represent. It is implemented in the Fit() method from the Estimator object.

To perform training we just call the method providing the training dataset (iris-train.txt file) in a DataView object.

// STEP 4: Train the model fitting to the DataSet            
ITransformer trainedModel = trainingPipeline.Fit(trainData);

3. Evaluate model

We need this step to conclude how accurate our model operates on new data. To do so, the model from the previous step is run against another dataset that was not used in training (iris-test.txt). This dataset also contains known iris types. MulticlassClassification.Evaluate calculates the difference between known types and values predicted by the model in various metrics.

var predictions = trainedModel.Transform(testData);
var metrics = mLContext.MulticlassClassification.Evaluate(data:predictions, label:"Number", score:DefaultColumnNames.Score);

Common.ConsoleHelper.PrintMultiClassClassificationMetrics(trainer.ToString(), metrics);

To learn more on how to understand the metrics, check out the Machine Learning glossary from the ML.NET Guide or use any available materials on data science and machine learning.

If you are not satisfied with the quality of the model, there are a variety of ways to improve it, which will be covered in the examples category.

4. Consume model

After the model is trained, we can use the Predict() API to predict the probability that this flower belongs to each iris type.

ITransformer trainedModel;
using (var stream = new FileStream(ModelPath, FileMode.Open, FileAccess.Read, FileShare.Read))
{
	trainedModel = mlContext.Model.Load(stream);
}

// Create prediction engine related to the loaded trained model
var predEngine = trainedModel.CreatePredictionEngine<InputData, OutPutNum>(mlContext);

//InputData data1 = SampleMNISTData.MNIST1;
var resultprediction1 = predEngine.Predict(SampleMNISTData.MNIST1);

Console.WriteLine($"Actual: 1     Predicted probability:       zero:  {resultprediction1.Score[0]:0.####}");
Console.WriteLine($"                                           One :  {resultprediction1.Score[1]:0.####}");
Console.WriteLine($"                                           two:   {resultprediction1.Score[2]:0.####}");
Console.WriteLine($"                                           three: {resultprediction1.Score[3]:0.####}");
Console.WriteLine($"                                           four:  {resultprediction1.Score[4]:0.####}");
Console.WriteLine($"                                           five:  {resultprediction1.Score[5]:0.####}");
Console.WriteLine($"                                           six:   {resultprediction1.Score[6]:0.####}");
Console.WriteLine($"                                           seven: {resultprediction1.Score[7]:0.####}");
Console.WriteLine($"                                           eight: {resultprediction1.Score[8]:0.####}");
Console.WriteLine($"                                           nine:  {resultprediction1.Score[9]:0.####}");
Console.WriteLine();

Where SampleMNISTData.MNIST1 stores the pixel values of the digit that want to predict using the ML model.

class SampleMNISTData
{
	internal static readonly InputData MNIST1 = new InputData()
	{
		PixelValues = new float[] { 0, 0, 0, 0, 14, 13, 1, 0, 0, 0, 0, 5, 16, 16, 2, 0, 0, 0, 0, 14, 16, 12, 0, 0, 0, 1, 10, 16, 16, 12, 0, 0, 0, 3, 12, 14, 16, 9, 0, 0, 0, 0, 0, 5, 16, 15, 0, 0, 0, 0, 0, 4, 16, 14, 0, 0, 0, 0, 0, 1, 13, 16, 1, 0 }
	}; //num 1
    (...)
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MulticlassClassification_mnist

MulticlassClassification_mnist

README.md

MNIST Classification

Problem

ML task - Multiclass classification

Solution

1. Build model

2. Train model

3. Evaluate model

4. Consume model

Files

MulticlassClassification_mnist

Directory actions

More options

Directory actions

More options

Latest commit

History

MulticlassClassification_mnist

Folders and files

parent directory

README.md

MNIST Classification

Problem

ML task - Multiclass classification

Solution

1. Build model

2. Train model

3. Evaluate model

4. Consume model