This corpus contains 39 texts from 13 Spanish authors (2 521 066 tokens). It is the second release of the whole corpus of the PhD thesis of José Calvo Tello, who is part of the young research group CLiGS, at the University of Würzburg, Germany.
See the "metadata.csv" file for information of the publication as well as literary information about the novels like place and period of setting, information about the protagonist, narrator, etcetera.
The TEI schema for the basic and the linguistically annotated TEI files corresponds to the general CLiGS schema which is available in the CLiGS reference repository.
The metadata keywords used in the text classification section of the TEI header are controlled by an external TEI keywords file and a schematron file which are stored in the keywords folder.
- txt_id: simple plain text of the body (File name: id.txt)
- txt_author-title: simple plain text of the body (File name: Author_Title-id.txt)
- annotated: TEI files further annotated with FreeLing and WordNet (keeping teiHeader and the chapter structure of the TEI)
- pdf: Reading versions generated from the tei files
-
The author's copyright of this texts have already expired. This collection is published under Creative Common Attribution 4.0 International.
-
Please provide a reference if you use this data in your teaching or research. The following is a citation suggestion: Corpus de novelas de la Edad de Plata, edited by José Calvo Tello. Würzburg: CLiGS, 2017. https://github.com/cligs/textbox/tree/master/spanish/novela-espanola.