mnist-parser

Parser for MNIST database files.

More information about the MNIST database can be found at [1] and [2].

Basically these are databases of handwritten digits and their labels.

Split into 4 files, 2 (image and label) for training and 2 of testing.

train-images-idx3-ubyte.gz:  training set images (9912422 bytes) 
train-labels-idx1-ubyte.gz:  training set labels (28881 bytes) 
t10k-images-idx3-ubyte.gz:   test set images (1648877 bytes) 
t10k-labels-idx1-ubyte.gz:   test set labels (4542 bytes)

The files are in a very simple format as follows:

TRAINING SET LABEL FILE (train-labels-idx1-ubyte):

[offset] [type]          [value]          [description]
0000     32 bit integer  0x00000801(2049) magic number (MSB first)
0004     32 bit integer  60000            number of items
0008     unsigned byte    5               label
0009     unsigned byte    0               label

The labels values are 0 to 9.

TRAINING SET IMAGE FILE (train-images-idx3-ubyte):

[offset] [type]          [value]          [description]
0000     32 bit integer  0x00000803(2051) magic number
0004     32 bit integer  60000            number of images
0008     32 bit integer  28               number of rows
0012     32 bit integer  28               number of columns
0016     unsigned byte   ??               pixel

Pixels are organized row-wise. Pixel values are 0 to 255. 0 means background (white), 255 means foreground (black).

Decoded Labels and Images

The file train_labels.txt.gz contains the label of image i on the i^{th} line.

The file train_images.txt.gz contains, on each row, the grey scale value of the pixels separated by a space in row major format, i.e. the rows are appended.

--

The parser was written to get back in some shape in C programming.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
hex_lib.c		hex_lib.c
hex_lib.h		hex_lib.h
macros.h		macros.h
mnist_image_parser.c		mnist_image_parser.c
mnist_label_parser.c		mnist_label_parser.c
t10k-images-idx3-ubyte.gz		t10k-images-idx3-ubyte.gz
t10k-labels-idx1-ubyte.gz		t10k-labels-idx1-ubyte.gz
train-images-idx3-ubyte.gz		train-images-idx3-ubyte.gz
train-labels-idx1-ubyte.gz		train-labels-idx1-ubyte.gz
train_images.txt.gz		train_images.txt.gz
train_labels.txt.gz		train_labels.txt.gz

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

mnist-parser

TRAINING SET LABEL FILE (train-labels-idx1-ubyte):

TRAINING SET IMAGE FILE (train-images-idx3-ubyte):

Decoded Labels and Images

About

Uh oh!

Releases

Packages

Languages

License

afrozenator/mnist-parser

Folders and files

Latest commit

History

Repository files navigation

mnist-parser

TRAINING SET LABEL FILE (train-labels-idx1-ubyte):

TRAINING SET IMAGE FILE (train-images-idx3-ubyte):

Decoded Labels and Images

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages