Open-source datasets and deep learning models for separating sounds.
- Audio from YFCC100M videos for mixture-invariant training (MixIT).
- Audio-visual YFCC100M with annotations for on-screen sound separation with AudioScope.
- Audio-visual YFCC100M with annotations for on-screen sound separation with AudioScopeV2.
- Synthetic AMI for speech separation in meeting room scenarios.
- Free Universal Sound Separation (FUSS) baseline separation model.
- Universal unsupervised separation models using mixture-invariant training (MixIT).
- Unsupervised separation models for birds using mixture-invariant training (MixIT).
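
# Create a Python 3.10 virtual environment in ./venv and update the packaging tools.
python3.10 -m venv venv
venv/bin/python -m pip install --upgrade pip wheel
# Install the dependencies listed in requirements.in.
venv/bin/python -m pip install -r requirements.in
# Record the exact installed versions in requirements.txt.
venv/bin/python -m pip freeze > requirements.txt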