WikiMuTe: A web-sourced dataset of semantic descriptions for music audio

This repository contains auxiliary code for the dataset described in the paper

Weck, B., Kirchhoff, H., Grosche, P., Serra, X. (2024). WikiMuTe: A Web-Sourced Dataset of Semantic Descriptions for Music Audio. In: Rudinac, S., et al. MultiMedia Modeling. MMM 2024. Lecture Notes in Computer Science, vol 14565. Springer, Cham. https://doi.org/10.1007/978-3-031-56435-2_4

Code

We provide a short Python script that can be used to download the audio files from the URLs provided in the dataset. Please update the user agent string in the script before running it. Moreover, the script shows how the dataset can be loaded to get the data in a convenient format.

Dataset

The dataset contains rich text description for music audio files collected from Wikipedia articles. The audio files are available for download through the URLs provided in the dataset.

We provide three variants of the dataset in the data folder. All are described in the paper.

all.csv contains all the data we collected, without any filtering.
filtered_sf.csv contains the data obtained using the self-filtering method.
filtered_mc.csv contains the data obtained using the MusicCaps dataset method.

Downloading the dataset

The dataset is available to download from Zenodo. A download script is provided in download.bash.

bash download.bash

The audio files have to be downloaded separately using the provided Python script.

python3 -m venv venv  # create a virtual environment
source venv/bin/activate  # activate the virtual environment
pip install -r requirements.txt  # install dependencies
python3 wikimute.py  # run the script

File structure

Each CSV file contains the following columns:

file: the name of the audio file
pageid: the ID of the Wikipedia article where the text was collected from
aspects: the short-form (tag) description texts collected from the Wikipedia articles
sentences: the long-form (caption) description texts collected from the Wikipedia articles
audio_url: the URL of the audio file
url: the URL of the Wikipedia article where the text was collected from

Citation

If you use this dataset in your research, please cite the following paper:

@inproceedings{wikimute,
    title = {WikiMuTe: {A} Web-Sourced Dataset of Semantic Descriptions for Music Audio},
    author = {Weck, Benno and Kirchhoff, Holger and Grosche, Peter and Serra, Xavier},
    booktitle = "MultiMedia Modeling",
    year = "2024",
    publisher = "Springer Nature Switzerland",
    address = "Cham",
    pages = "42--56",
    doi = {10.1007/978-3-031-56435-2_4},
    url = {https://doi.org/10.1007/978-3-031-56435-2_4},
}

License

This repository is released under the MIT License. Please see the LICENSE file for more details.

The data is available under the Creative Commons Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0) license. Each entry in the dataset contains a URL linking to the article, where the text data was collected from.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
download.bash		download.bash
requirements.txt		requirements.txt
wikimute.py		wikimute.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WikiMuTe: A web-sourced dataset of semantic descriptions for music audio

Code

Dataset

Downloading the dataset

File structure

Citation

License

About

Releases

Languages

License

Bomme/wikimute

Folders and files

Latest commit

History

Repository files navigation

WikiMuTe: A web-sourced dataset of semantic descriptions for music audio

Code

Dataset

Downloading the dataset

File structure

Citation

License

About

Resources

License

Stars

Watchers

Forks

Releases

Languages