Another source for novels? #2

Open
christofs opened this issue Jan 25, 2019 · 3 comments
Comments

@christofs
Collaborator

christofs commented Jan 25, 2019

The collection at https://github.com/cligs/textbox/tree/master/italian/romanzi includes 21 novels in XML-TEI, with relatively little overlap with what is already online here. Maybe it is worth getting some from there?

@lb42
Collaborator

lb42 commented Jan 26, 2019

Happy to have a go at converting them -- I have a CLIGS-to-ELTEC convertor somewhere -- if you like.
BTW, it doesn't seem to be possible to download just this bit of the CLIGS textbox from GitHub.

@christofs
Collaborator Author

christofs commented Jan 28, 2019

Hi Lou, thanks. Yes, why not import and convert the non-overlapping ones. Unless someone from the Italian team would first like to check whether these novels are useful from the point of view of the sampling criteria.

Right, the individual collections are not separate repositories, so GitHub only lets you download the whole thing. Not ideal, admittedly. I'll mention it to the team.

@lb42
Collaborator

lb42 commented Jan 28, 2019

Shortly after posting this comment, I noticed that I had already hoovered up a copy of the entire CLIGS repo some time ago, so this is not a problem (my internet connexion from France is a bit limited). I ran my convertor on one of the texts with encouraging results, but it needs a few more tweaks.
