Labelled dataset with phrases retrieved from different digital sources containing diverse messages directed towards the LGBTQIA+ community.
id, phase, class, where class is {LGBTQIAphobic, No LGBTQIAphobic}
Name: LGBTQIAphobia_dataset
Description: 197 phrases classified as {LGBTQIAphobic, No LGBTQIAphobic}
Language: Spanish
Format: CSV (UTF-8)
Sources: X/Twitter, Instagram, WWW, TikTok, Digital newspapers
Size: 20Kb
Ethical considerations: This dataset was created strictly for academic and research purposes. The person who was the target of the hate speech has been anonymized, and there is no intention to harm them in any way, either to them or to the person who delivered the speech. We prioritize the protection of privacy and confidentiality of vulnerable individuals. To safeguard privacy, we carefully remove any identifying details such as user IDs, phone numbers, and addresses before sharing the data with our annotators. All the data we collect is from publicly available sources and does not contain any personal or sensitive information that may jeopardize anyone’s privacy. I request researchers to commit to abiding by ethical guidelines so as not to unnecessarily harm individuals.