VECTOR QUANTIZATION VIA K-MEANS CLUSTERING

This project demonstrates the use of k-means clustering for image compression.

Image Preprocessing

We start with an image of a car. The image is converted to grayscale, and a 512 x 1024 subsection is shown below:

Original car image

Compression With k-means Clustering

After preprocessing the image, we define a "patch" in the image with dimensions P x P, where P = 2. This value of P works well for the M x N image above since its dimensions are also powers of 2.

In total, there are (M x N)/P^2 patches. We then cluster these patches into 16 clusters using the k-means clustering algorithm, implemented with the KMeans class in scikit-learn. The number of clusters C is determined by the rate R, chosen here as R = 1: each patch of P^2 pixels is represented by R*P^2 bits, so the total number of clusters is C = 2^(R*P^2) = 16.
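A minimal sketch of this patching and clustering step, assuming the grayscale image is already loaded as a NumPy array `img` whose dimensions are divisible by P (variable names are illustrative, not taken from the repository's code):

```python
import numpy as np
from sklearn.cluster import KMeans

P = 2                      # patch side length
R = 1                      # rate in bits per pixel
C = 2 ** (R * P * P)       # number of clusters: 2^(R*P^2) = 16

M, N = img.shape           # img: M x N grayscale array

# Split the image into non-overlapping P x P patches and flatten each
# patch into a length-P^2 vector, giving (M*N)/P^2 rows.
patches = (img.reshape(M // P, P, N // P, P)
              .transpose(0, 2, 1, 3)
              .reshape(-1, P * P))

# Cluster the patch vectors; each cluster centre acts as a codeword
kmeans = KMeans(n_clusters=C, random_state=0).fit(patches)
labels = kmeans.labels_               # cluster index for every patch
codebook = kmeans.cluster_centers_    # C codewords of length P^2
```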

When the k-means algorithm is applied, each patch in the image is assigned to a particular cluster and is replaced by the centroid values of that cluster. We observe that as the number of clusters decreases, the quality of the compressed image deteriorates (with a single cluster, the image is destroyed entirely).

The whole image is now represented using only the cluster indices and the codebook of centroid values. Hence, the image can be represented in fewer bits than before, effectively compressing it. However, this is a lossy compression method: decompressing the compressed image produces an image that is not identical to the original.
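Decompression then amounts to looking up each stored cluster index in the codebook and writing the corresponding centroid back into the patch's position. A sketch continuing from the variables above (again, names are illustrative):

```python
# Replace every patch by the centroid of its assigned cluster
quantized_patches = codebook[labels]          # shape: (num_patches, P*P)

# Reassemble the patches into an M x N image (inverse of the patch split)
compressed_img = (quantized_patches
                  .reshape(M // P, N // P, P, P)
                  .transpose(0, 2, 1, 3)
                  .reshape(M, N))
```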

Below is part of the original image compared to the compressed version:

Original vs. quantized image

Rate vs. Distortion

In this part, we vary the rate R from 0 to 1. As we vary R, the number of clusters C also changes, because C = 2^(R*P^2). For each value of R (and thereby C) we repeat the clustering and vector quantization, and we calculate the distortion as the mean squared error between the original image and the quantized image.
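A sketch of this rate sweep, reusing the patch array from the earlier snippet; when R*P^2 is not a whole number, the sketch simply rounds 2^(R*P^2) to the nearest integer (an assumption, not necessarily what the project does):

```python
rates = [0.2, 0.4, 0.6, 0.8, 1.0]
distortions = []

for R in rates:
    C = int(round(2 ** (R * P * P)))            # C = 2^(R*P^2), rounded
    km = KMeans(n_clusters=C, random_state=0).fit(patches)
    recon = km.cluster_centers_[km.labels_]     # quantized patches
    mse = np.mean((patches - recon) ** 2)       # distortion (MSE)
    distortions.append(mse)
```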

Rate vs Distortion

We observe that as we increase the rate R, the distortion decreases monotonically. This happens because, as we increase R while keeping the patch size constant, the number of clusters C increases. As more clusters are used to represent the same image, the reconstructed image gets closer to the original, and hence the distortion is lower.

Note: R should be chosen such that R*P^2 is an integer. Here we have taken R = [0.2, 0.4, 0.6, 0.8, 1] and P = 2.

Patch Size vs. Distortion

Next, we vary the patch size P. Earlier we used P = 2 to generate 2x2 patches. Now we use P = [2, 4, 8] and R = 0.08 to generate patch sizes of 2x2, 4x4, and 8x8. Below is a plot of the distortion for each patch size.

P vs Distortion

As the patch size increases, the distortion decreases. We might expect the opposite, since a larger patch approximates a larger portion of the image with a single codeword. However, because C = 2^(R*P^2), increasing the patch size at a fixed rate increases the number of clusters at an even faster (exponential) pace, and having more clusters reduces the distortion by lowering the level of approximation of the image.


Better Compression with Entropy Based Coding

Until now we have assumed that every cluster is equally likely to appear in the image. In reality, some clusters appear more often than others, so in practical applications the frequently occurring clusters are represented with fewer bits. Information theory tells us that, by taking this into account, we can achieve an average coding length equal to the entropy of the cluster distribution. In this part we compare the average coding rate under the uniform-probability assumption with the rate under the actual cluster probabilities.

By taking advantage of the fact that some clusters appear more often than others, the coding rate can be reduced. For P = 2 and R = 1, a coding rate of 0.87 bits per pixel can be achieved with entropy coding, a reduction of 13% in coding rate.
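A sketch of how this empirical coding rate can be estimated from the cluster assignments: the Shannon entropy of the label distribution gives the average bits per patch achievable with an ideal entropy coder, and dividing by P^2 converts it to bits per pixel (continuing from the R = 1, P = 2 clustering above; names are illustrative):

```python
# Empirical probability of each cluster over all patches
counts = np.bincount(labels)
probs = counts / counts.sum()
probs = probs[probs > 0]                     # ignore empty clusters

# Shannon entropy: average bits per patch with ideal entropy coding
bits_per_patch = -np.sum(probs * np.log2(probs))

uniform_rate = np.log2(len(codebook)) / (P * P)   # fixed-length code, bits/pixel
entropy_rate = bits_per_patch / (P * P)           # entropy-coded, bits/pixel
```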

Training vs. Test Image Compression

For this part of the project, the k-means clusters were computed from the same training image of the car used in the earlier parts. These clusters were then used to compress a test image of a different car, using P = 2 and R = 1. We expect the distortion to increase because the clusters are computed from a completely different image than the one being compressed. The test image and its quantized (decompressed) version are shown below.

Test car image

Quantized test car image

Visually, it is obvious that using the clusters from the training image on the test image results in much lossier compression than training and compressing on the same image. The distortion for the training image is 50, while the distortion on the test image is 112. As expected, the distortion increases greatly because the k-means algorithm was trained on a completely different image, producing clusters that do not fit the test image well.
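A sketch of quantizing the second image with the codebook trained on the first: the fitted model's predict method assigns each test patch to its nearest training centroid (the test patches are assumed to be extracted with the same patching code as before; names are illustrative):

```python
# test_patches: P*P patch vectors extracted from the second car image,
# using the same patch-extraction code as for the training image
test_labels = kmeans.predict(test_patches)       # nearest training centroid
test_recon = kmeans.cluster_centers_[test_labels]

train_distortion = np.mean((patches - codebook[labels]) ** 2)
test_distortion = np.mean((test_patches - test_recon) ** 2)
```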

k-means vs. Mini-batch k-means

In this part of the project we compare using k-means to mini-batch k-means for image compression. Mini-batch k-means is a variant of the k-means algorithm which uses mini-batches to reduce the computation time, while still attempting to optimise the same objective function. Mini-batches are subsets of the input data, randomly sampled in each training iteration. These mini-batches reduce the amount of computation required to converge to a local solution.

In mini-batch k-means, samples are drawn randomly from the dataset to form a mini-batch and are assigned to the nearest centroid. In the second step, the centroids are updated. In contrast to k-means, this is done on a per-sample basis: for each sample in the mini-batch, the assigned centroid is updated by taking the streaming average of the sample and all previous samples assigned to that centroid.
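A sketch of the timing comparison using scikit-learn's MiniBatchKMeans alongside the standard KMeans, reusing the patch vectors and cluster count from the earlier snippets (the batch size and helper function are illustrative assumptions):

```python
import time
from sklearn.cluster import KMeans, MiniBatchKMeans

def fit_and_score(model, X):
    """Fit a clustering model and return (distortion, runtime in seconds)."""
    start = time.time()
    model.fit(X)
    runtime = time.time() - start
    recon = model.cluster_centers_[model.labels_]   # quantized patches
    return np.mean((X - recon) ** 2), runtime

km_mse, km_time = fit_and_score(KMeans(n_clusters=C, random_state=0), patches)
mb_mse, mb_time = fit_and_score(
    MiniBatchKMeans(n_clusters=C, batch_size=1024, random_state=0), patches)
```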

mini

Method                Distortion   Runtime (seconds)
k-means               55           21
Mini-batch k-means    59           15

As expected, the mini-batch k-means algorithm was faster, at the expense of a slight increase in distortion.
