Data Organize

This repository contains Python scripts designed to organize and filter datasets containing image and XML annotation pairs. These tools are particularly useful for managing labeled datasets used in machine learning and computer vision projects.

Scripts Overview

The script (main.py) organizes image and XML annotation files into subdirectories named after the label (the <name> tag) found in each XML file.
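As an illustration of the label lookup described above, here is a minimal sketch of reading the <name> tag with Python's standard xml.etree.ElementTree module. The helper name read_label and the sample annotation layout (a Pascal VOC-style file) are assumptions made for this example; the repository's own code may read the tag differently.

```python
import xml.etree.ElementTree as ET
from typing import Optional


def read_label(xml_text: str) -> Optional[str]:
    """Return the text of the first <name> element, or None if there is none.

    Hypothetical helper for illustration; not taken from the repository.
    """
    root = ET.fromstring(xml_text)
    # ".//name" searches the whole tree, so the tag may sit at any depth.
    label = root.findtext(".//name")
    if label is None or not label.strip():
        return None
    return label.strip()


# Assumed Pascal VOC-style annotation, used only as sample input.
sample = """
<annotation>
  <filename>img_001.png</filename>
  <object>
    <name>cat</name>
  </object>
</annotation>
"""

print(read_label(sample))  # prints "cat"
```

If an annotation contains several <name> tags, this sketch uses only the first one; whether that matches the script's behavior is not stated here.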

Features:

Scans a source directory for .png or .jpg images and their corresponding .xml files.
Extracts the <name> tag from each XML file.
Creates subdirectories named after the extracted label.
Moves image and XML files into the appropriate subdirectory.
Prints an informative message when a moved file overwrites an existing file with the same name.

Usage:

Specify the source directory containing the image-XML pairs (root_directory).
Specify the output directory where labeled subdirectories will be created (output_directory).
Run the script.
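The features and usage steps above could be sketched roughly as follows. This is not the repository's main.py: the function name organize, the return value, the rule that an image and its annotation share a base name, and the choice to skip incomplete pairs are all assumptions made for illustration. Only root_directory and output_directory come from the description above.

```python
import shutil
import xml.etree.ElementTree as ET
from pathlib import Path

IMAGE_SUFFIXES = (".png", ".jpg")


def organize(root_directory: str, output_directory: str) -> int:
    """Move image/XML pairs into output_directory/<label>/.

    Returns the number of pairs moved. Hypothetical sketch, not the
    repository's actual implementation.
    """
    root, out = Path(root_directory), Path(output_directory)
    moved = 0
    for xml_path in sorted(root.glob("*.xml")):
        # Assumption: the image has the same base name as its annotation.
        image_path = next(
            (xml_path.with_suffix(s) for s in IMAGE_SUFFIXES
             if xml_path.with_suffix(s).exists()),
            None,
        )
        # Use the first <name> tag found anywhere in the XML tree as the label.
        label = (ET.parse(xml_path).getroot().findtext(".//name") or "").strip()
        if image_path is None or not label:
            # Assumption: incomplete pairs are reported and left in place.
            print(f"Skipping {xml_path.name}: missing image or <name> tag")
            continue
        target_dir = out / label
        target_dir.mkdir(parents=True, exist_ok=True)
        for src in (image_path, xml_path):
            dst = target_dir / src.name
            if dst.exists():
                # Report the overwrite before replacing the existing file.
                print(f"Overwriting existing file: {dst}")
                dst.unlink()
            shutil.move(str(src), str(dst))
        moved += 1
    return moved
```

A call such as organize("path/to/dataset", "path/to/sorted") would then leave each pair under a folder named after its label, for example path/to/sorted/cat/img_001.png next to path/to/sorted/cat/img_001.xml. Both paths are placeholders.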

iremcorak/data_organize
