A shared repository for storing Andalûh EPA corpus texts, intended for use in machine-learning projects.
The Andalusian varieties of Spanish (Spanish: andaluz) are spoken in Andalusia, Ceuta, Melilla, and Gibraltar. They include perhaps the most distinct of the southern variants of peninsular Spanish, differing in many respects from northern varieties and from Standard Spanish. Further info: https://en.wikipedia.org/wiki/Andalusian_Spanish. As there is no official or standard andaluz spelling, andaluh-py adopts the EPA proposal (Êttandâ Pa'l Andalûh). Further info: https://andaluhepa.wordpress.com | https://andaluh.es
This repository stores Andalûh EPA corpus texts for use in machine-learning projects. It includes:
- Europarl: A Parallel Corpus for Statistical Machine Translation http://www.statmt.org/europarl
- More corpus texts are being added.
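For machine-learning use, a parallel corpus is typically consumed as aligned sentence pairs. The following is a minimal sketch only, assuming the texts are stored as two UTF-8 files with one sentence per line and matching line order (the layout Europarl itself uses). The function name `read_parallel` and the file names in the usage comment are hypothetical; check the actual layout of the files in this repository before relying on it.

```python
from pathlib import Path
from typing import Iterator, Tuple


def read_parallel(src_path: str, tgt_path: str) -> Iterator[Tuple[str, str]]:
    """Yield (source, target) sentence pairs from two line-aligned files.

    Assumption: line N of the source file corresponds to line N of the
    target file. If the files differ in length, iteration stops at the
    end of the shorter one.
    """
    with Path(src_path).open(encoding="utf-8") as src, \
         Path(tgt_path).open(encoding="utf-8") as tgt:
        for src_line, tgt_line in zip(src, tgt):
            # Strip only the trailing newline so inner spacing is preserved.
            yield src_line.rstrip("\n"), tgt_line.rstrip("\n")


# Hypothetical usage (file names are placeholders, not real paths in this repo):
# for spanish, andaluh in read_parallel("corpus.es.txt", "corpus.epa.txt"):
#     print(spanish, "->", andaluh)
```

Reading lazily with a generator keeps memory use flat, which matters for a corpus the size of Europarl.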
Please open an issue for support.
Please contribute using GitHub Flow: create a branch, add commits, and open a pull request.