The following exercise uses OpenAI's **CLIP** model to build the embeddings and Facebook's **FAISS** library for indexing. It uses the **Flickr30k** dataset, available on **Kaggle**.
You need to have the following installed:
- Python 3.7+
- PyTorch 1.7+
- Transformers
- Datasets
- FAISS
- NumPy
- Pandas
- Matplotlib
- Pillow
- tqdm
You can install the required packages with:

```shell
pip install -r requirements.txt
```
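The repository's `requirements.txt` is not reproduced here, but based on the list above it would contain something like the following (the `faiss-cpu` package name and the unpinned versions are assumptions, not taken from the original file):

```text
torch>=1.7
transformers
datasets
faiss-cpu
numpy
pandas
matplotlib
Pillow
tqdm
```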
Once the dataset is downloaded from Kaggle, place the notebook in the directory that contains the image folder, update the paths accordingly, and run it.
The script also includes a demonstration of how to perform a search: the text query "basketball game" is encoded with CLIP and compared against the image embeddings using FAISS to find the most similar images. The paths of the top matches are then used to load and display them.
![Screenshot](basketball result.png)