SVI Percept

A Python package for modelling human perception of street view images using CLIP features and the K-nearest-neighbour algorithm.

Installation

pip install .

Quick Start

from svi_percept import SVIPerceptPipeline
from PIL import Image

# Initialize pipeline
pipeline = SVIPerceptPipeline()

# Process single image
image = Image.open("example.jpg").convert('RGB')
results = pipeline(image)

or

results = pipeline("example.jpg")

The default model is based on an Amsterdam case study.

Features

K-nearest-neighbour model of human perception
Batch processing support
GPU acceleration

Detailed Usage

Single Image Processing

from svi_percept import SVIPerceptPipeline

pipeline = SVIPerceptPipeline()
output = pipeline("path/to/image.jpg")

for cat in ['walkability', 'bikeability', 'pleasantness', 'greenness', 'safety']:
    print(f'{cat} score = {output['results'][cat]}')

Batch Processing

from torch.utils.data import Dataset

class ImageDataset(Dataset):
    def __init__(self, image_paths):
        self.image_paths = image_paths
    
    def __len__(self):
        return len(self.image_paths)
    
    def __getitem__(self, idx):
        return {"image": self.image_paths[idx]}

# Process multiple images
image_paths = ["image1.jpg", "image2.jpg", "image3.jpg"]
dataset = ImageDataset(image_paths)
pipeline = SVIPerceptPipeline(batch_size=32)
outputs = pipeline(dataset)
for image_path, output in zip(image_paths, outputs):
    for cat in ['walkability', 'bikeability', 'pleasantness', 'greenness', 'safety']:
        print(f'{image_path} {cat} score = {output['results'][cat]}')

You may also use a simpler API if you wish to forgo the full-blown Dataset-derived class. Simply use:

outputs = pipeline(["image1.jpg", "image2.jpg", "image3.jpg"])

Model Details

The package uses:

CLIP ViT-H-14-378-quickgelu for feature extraction
5 specialized scoring matrices for perception analysis
Weighted score computation using exponential scaling and softmax normalization

Requirements

Python 3.8+
PyTorch 1.9+
transformers
Pillow
numpy

Examples

See the examples directory for more detailed usage examples.

License

GPL-3.0

Citation

If you use this package in your research, please cite:

Danish, M., Labib, SM., Ricker, B., and Helbich, M. A citizen science toolkit to collect human perceptions of urban environments using open street view images. Computers, Environment and Urban Systems. Volume 116, Mar 2025, 102207.

This paper is open access.

Support

For issues and feature requests, please use the GitHub issue tracker.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
examples		examples
svi_percept		svi_percept
LICENCE		LICENCE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SVI Percept

Installation

Quick Start

Features

Detailed Usage

Single Image Processing

Batch Processing

Model Details

Requirements

Examples

License

Citation

Support

About

Languages

License

Spatial-Data-Science-and-GEO-AI-Lab/svi_percept

Folders and files

Latest commit

History

Repository files navigation

SVI Percept

Installation

Quick Start

Features

Detailed Usage

Single Image Processing

Batch Processing

Model Details

Requirements

Examples

License

Citation

Support

About

Resources

License

Stars

Watchers

Forks

Languages