Multi-object tracking with ground truth annotations
EMT is a comprehensive dataset for autonomous driving research, containing 57 minutes of diverse urban traffic footage from the Gulf Region. The dataset provides rich semantic annotations across two agent categories: people (two classes: pedestrians and cyclists) and vehicles (seven classes). Each video segment spans 2.5-3 minutes, capturing challenging real-world scenarios:
- Dense Urban Traffic: Complex multi-agent interactions in congested scenarios
- Weather Variations: Clear and rainy conditions
- Visual Challenges: High reflections from road surfaces and adverse weather combinations (rainy nights)
The dataset provides dense annotations for:
- Detection & Tracking: Multi-object tracking with consistent IDs
- Trajectory Prediction: Future motion paths and social interactions
- Intention Prediction: Behavior understanding in complex scenarios
The dataset is validated through benchmarking state-of-the-art models on tracking, trajectory prediction, and intention prediction, with corresponding ground truth annotations provided for each benchmark.
| Aspect | Description |
|---|---|
| Duration | 57 minutes total footage |
| Segments | 2.5-3 minute continuous recordings |
| FPS | 10 fps for annotated frames |
| Agent Classes | 2 person classes and 7 vehicle classes |
- People
  - Pedestrians
  - Cyclists
- Vehicles
  - Motorbike
  - Small motorized vehicle
  - Medium vehicle
  - Large vehicle
  - Car
  - Bus
  - Emergency vehicle
| Category | Count |
|---|---|
| Annotated Frames | 34,386 |
| Bounding Boxes | 626,634 |
| Unique Agents | 9,094 |
| Vehicle Instances | 7,857 |
| Pedestrian Instances | 568 |
| Class | Description | Number of Bounding Boxes | Number of Agents |
|---|---|---|---|
| Pedestrian | An individual walking on foot. | 24,574 | 568 |
| Cyclist | Any bicycle or electric bike rider. | 594 | 14 |
| Motorbike | Includes motorcycles, bikes, and scooters with two or three wheels. | 11,294 | 159 |
| Car | Any standard automobile. | 429,705 | 6,559 |
| Small motorized vehicle | Motorized transport smaller than a car, such as mobility scooters and quad bikes. | 767 | 13 |
| Medium vehicle | Includes vehicles larger than a standard car, such as vans or tractors. | 51,257 | 741 |
| Large vehicle | Refers to vehicles larger than vans, such as lorries, typically with six or more wheels. | 37,757 | 579 |
| Bus | Covers all types of buses, including school buses, single-deck, and double-deck buses. | 19,244 | 200 |
| Emergency vehicle | Emergency response units like ambulances, police cars, and fire trucks, distinguished by red and blue flashing lights. | 1,182 | 9 |
| **Overall** | | 576,374 | 8,842 |
- Download the dataset:

```bash
chmod +x download.sh
./download.sh
```
- To confirm the dataset statistics, run the following command:

```bash
# Statistics
python dataset_statistics.py
```
To use the base models, the dataset should be structured as follows:

```
emt-dataset/
├── data/
│   ├── annotations/
│   │   ├── intention_annotations/   # Agent intention labels
│   │   ├── tracking_annotations/    # Multi-object tracking data
│   │   ├── prediction_annotations/  # Behavior prediction labels
│   │   └── metadata.txt             # Dataset metadata
│   ├── frames/                      # Extracted frames (annotated frames only)
│   └── videos/                      # Raw video sequences
```
NB: Videos are not necessary unless you intend to use visual cues.
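A minimal sketch like the following can sanity-check that layout before running the base models; the `emt-dataset` root path is an assumption, so adjust it to your checkout:

```python
import os

# Expected layout from the tree above; videos/ is optional (visual cues only).
EXPECTED_DIRS = [
    "data/annotations/intention_annotations",
    "data/annotations/tracking_annotations",
    "data/annotations/prediction_annotations",
    "data/frames",
]

def check_layout(root="emt-dataset"):
    """Print any expected dataset directories missing under root."""
    missing = [d for d in EXPECTED_DIRS if not os.path.isdir(os.path.join(root, d))]
    for d in missing:
        print(f"missing: {d}")
    return not missing

if __name__ == "__main__":
    print("layout OK" if check_layout() else "layout incomplete")
```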
```
emt-dataset/
├── data/                   # Contains dataset files and annotations
├── intention/              # Intention prediction scripts
├── prediction/             # Trajectory prediction scripts
├── tracking/               # Tracking scripts
├── visualize/              # Visualization scripts for tracking, prediction, and intention
├── constants.py            # All constants used in the repo, including labels for intention classes
├── dataset_statistics.py   # Prints the statistics reported in the paper
└── utils.py                # Helpers for generating intention/prediction settings (including the k-fold cross-validation setting) and for frame extraction
```
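For orientation, here is a hypothetical sketch of pulling frames from a raw video at the dataset's 10 fps annotation rate with OpenCV; prefer the frame-extraction functions shipped in `utils.py`:

```python
import os
import cv2  # pip install opencv-python

def extract_frames(video_path, out_dir, target_fps=10):
    """Save roughly target_fps frames per second from video_path into out_dir.

    Hypothetical helper for illustration only; prefer the extraction
    functions provided in utils.py.
    """
    os.makedirs(out_dir, exist_ok=True)
    cap = cv2.VideoCapture(video_path)
    src_fps = cap.get(cv2.CAP_PROP_FPS) or target_fps  # fall back if unknown
    step = max(1, round(src_fps / target_fps))         # keep every step-th frame
    idx = saved = 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        if idx % step == 0:
            cv2.imwrite(os.path.join(out_dir, f"{saved:06d}.jpg"), frame)
            saved += 1
        idx += 1
    cap.release()
    return saved
```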
- To visualize detection data, run the following command:

```bash
# Visualize
cd visualize
python tracking_visualize.py
```
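If you want to render tracking boxes yourself, the snippet below shows the general idea with OpenCV. The `(track_id, x, y, w, h)` row layout is assumed for illustration only; consult the files under `tracking_annotations/` for the actual format:

```python
import cv2

def draw_boxes(frame, rows):
    """Draw (track_id, x, y, w, h) boxes on one frame.

    The row layout is an assumption for illustration; check the files
    under tracking_annotations/ for the dataset's actual format.
    """
    for track_id, x, y, w, h in rows:
        cv2.rectangle(frame, (int(x), int(y)), (int(x + w), int(y + h)),
                      (0, 255, 0), 2)
        cv2.putText(frame, str(track_id), (int(x), int(y) - 4),
                    cv2.FONT_HERSHEY_SIMPLEX, 0.5, (0, 255, 0), 1)
    return frame
```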
We benchmark the dataset for the following tasks:
- Multi-class MOT Tracking: Multi-object tracking with consistent IDs
- Trajectory Prediction: Future motion paths and social interactions
- Intention Prediction: Behavior understanding in complex scenarios
The prediction package runs trajectory prediction models built on LSTMs, Graph Neural Networks (GNNs), and Transformer-based architectures. It supports training and evaluation modes, allowing users to load pre-trained models and process trajectory data under different settings.
```
prediction/
├── dataloaders/
│   ├── seq_loader.py
│   └── frame_loader.py
├── evaluation/
│   ├── metric_tracker.py
│   ├── distance_metrics.py
│   └── utils.py
├── models/
│   ├── gat_temporal.py
│   ├── gat.py
│   ├── gcn_temporal.py
│   ├── gcn.py
│   ├── rnn.py
│   ├── transformer_GMM.py
│   └── transformer.py
└── run.py
```
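The `evaluation/distance_metrics.py` module suggests the standard trajectory metrics, Average and Final Displacement Error (ADE/FDE). For reference, here is a minimal NumPy sketch of the two (an illustration, not the repository's implementation):

```python
import numpy as np

def ade(pred, gt):
    """Average Displacement Error: mean L2 distance over all timesteps."""
    return float(np.linalg.norm(pred - gt, axis=-1).mean())

def fde(pred, gt):
    """Final Displacement Error: mean L2 distance at the last timestep."""
    return float(np.linalg.norm(pred[..., -1, :] - gt[..., -1, :], axis=-1).mean())
```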
Switch to the prediction folder (`cd prediction`) and follow the details there for running the trajectory prediction models.
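For readers new to these architectures, below is a minimal PyTorch sketch of an encoder-decoder LSTM forecaster in the spirit of `models/rnn.py`; the hidden size, prediction horizon, and displacement-based decoding are illustrative assumptions, not the repository's configuration:

```python
import torch
import torch.nn as nn

class LSTMPredictor(nn.Module):
    """Minimal encoder-decoder LSTM for 2D trajectory forecasting.

    Illustrative only; the hidden size and horizon are placeholders.
    """
    def __init__(self, hidden=64, horizon=12):
        super().__init__()
        self.horizon = horizon
        self.encoder = nn.LSTM(2, hidden, batch_first=True)
        self.decoder = nn.LSTMCell(2, hidden)
        self.head = nn.Linear(hidden, 2)

    def forward(self, obs):                # obs: (B, T_obs, 2) past positions
        _, (h, c) = self.encoder(obs)
        h, c = h[0], c[0]
        pos, outs = obs[:, -1], []
        for _ in range(self.horizon):      # roll the decoder forward step by step
            h, c = self.decoder(pos, (h, c))
            pos = pos + self.head(h)       # predict a displacement, accumulate
            outs.append(pos)
        return torch.stack(outs, dim=1)    # (B, horizon, 2) future positions
```

Calling `LSTMPredictor()(torch.randn(8, 8, 2))` returns an `(8, 12, 2)` tensor of forecast positions.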
The intention package runs an LSTM-based prediction model in autoregressive and vanilla settings. It supports training and evaluation modes, allowing users to load pre-trained models.
```
intention/
├── dataloaders/
│   └── seq_loader.py
├── evaluation/
│   ├── classification_metrics.py
│   └── distance_metrics.py
├── models/
│   ├── rnn_autoregressive.py
│   └── rnn_vanilla.py
└── run.py
```
Switch to the intention folder (`cd intention`) and follow the details there for running the intention prediction models.
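As a rough illustration of the vanilla setting, the sketch below classifies intention from the observed trajectory in one shot, whereas the autoregressive variant would feed its predictions back into the recurrent state step by step. The 2D input features and `num_classes` are placeholders; see `constants.py` for the actual intention labels:

```python
import torch
import torch.nn as nn

class VanillaIntentionRNN(nn.Module):
    """One-shot intention classifier over an observed 2D trajectory.

    Illustrative sketch; num_classes is a placeholder for the intention
    labels defined in constants.py.
    """
    def __init__(self, num_classes, hidden=64):
        super().__init__()
        self.rnn = nn.LSTM(2, hidden, batch_first=True)
        self.head = nn.Linear(hidden, num_classes)

    def forward(self, obs):        # obs: (B, T, 2) observed positions
        _, (h, _) = self.rnn(obs)  # final hidden state summarizes the sequence
        return self.head(h[0])     # (B, num_classes) logits
```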
Install dependencies using:

```bash
pip install -r requirements.txt
```
- Ensure GPU support (`cuda`) is available if running on a GPU.
- When evaluating a model, a valid checkpoint file must be specified.
- Normalization is recommended for better model performance.
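One common scheme, sketched below, translates each trajectory so the last observed position becomes the origin; this is an assumption about what normalization means here, not necessarily the repository's exact preprocessing:

```python
import numpy as np

def normalize_trajectory(traj):
    """Shift a (T, 2) trajectory so its last observed point is the origin.

    Returns the shifted trajectory and the origin, so predictions can be
    mapped back to image/world coordinates by adding the origin back.
    """
    origin = traj[-1].copy()
    return traj - origin, origin
```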
- Repository: GitHub - AV-Lab/emt-dataset
- Website: EMT Dataset
- HuggingFace: EMT_Dataset
If you use the EMT dataset in your research, please cite our paper:
```bibtex
@article{EMTdataset2025,
  title={EMT: A Visual Multi-Task Benchmark Dataset for Autonomous Driving in the Arab Gulf Region},
  author={Nadya Abdel Madjid and Murad Mebrahtu and Abdelmoamen Nasser and Bilal Hassan and Naoufel Werghi and Jorge Dias and Majid Khonji},
  year={2025},
  eprint={2502.19260},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2502.19260},
}
```
- Murad: [email protected]
- Nadya: [email protected]