The Image Extractor

We are building a tool that automatically processes images from books. Our system will:

Segment images from scanned pages
Generate captions using AI
Enhance image understanding with deep learning

Using advanced deep learning techniques, we aim to make historical and printed book images more accessible and usable.

This project will help researchers, libraries, and digital archives extract and analyze visual content effortlessly.

First Module: Image Feature Extractor

This project extracts image features using a Vision Transformer (ViT) model from Timm and provides multiple ways to process images:

✅ Extract features from a single image
✅ Find similar images in a dataset
✅ Generate tags for images

📌 Requirements

Before running the project, make sure you have installed the dependencies:

pip install torch torchvision timm pillow fiftyone requests

🛠️ How to Use

1️⃣ Extract Features from a Single Image

Run the following command to extract features from an image:

python main.py --extract path/to/image.jpg

Extracts features using the Vision Transformer (ViT) model.
Saves the extracted features for further analysis.

2️⃣ Find Similar Images in a Dataset

To find similar images within a dataset, run:

python main.py --find path/to/image.jpg path/to/dataset

Computes image embeddings.
Finds similar images based on feature similarity.

3️⃣ Generate Tags for an Image

To generate tags based on an image:

python main.py --tag path/to/image.jpg

Uses deep learning to assign relevant tags.

🔧 Changing the Model

By default, the script uses vit_base_patch16_224. To use another model, specify it like this:

python main.py --extract path/to/image.jpg --model resnet50

📌 Notes

The extracted features can be saved to a file or used for further analysis.
If the dataset or image path does not exist, the script will display an error.
No FastAPI is needed; all operations run as simple Python scripts.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md
Vision.ipynb		Vision.ipynb
Welcome_to_Colab.ipynb		Welcome_to_Colab.ipynb
environment.yml		environment.yml
feature_extractor.py		feature_extractor.py
find_neighbours.py		find_neighbours.py
main.py		main.py
run.ipynb		run.ipynb
tag_generator.py		tag_generator.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

The Image Extractor

First Module: Image Feature Extractor

📌 Requirements

🛠️ How to Use

1️⃣ Extract Features from a Single Image

2️⃣ Find Similar Images in a Dataset

3️⃣ Generate Tags for an Image

🔧 Changing the Model

📌 Notes

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

gu-gridh/Image-Inspector

Folders and files

Latest commit

History

Repository files navigation

The Image Extractor

First Module: Image Feature Extractor

📌 Requirements

🛠️ How to Use

1️⃣ Extract Features from a Single Image

2️⃣ Find Similar Images in a Dataset

3️⃣ Generate Tags for an Image

🔧 Changing the Model

📌 Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages