Commit

add ismi sample data (see #17)
robcast committed Nov 10, 2022
1 parent c702678 commit 1148c90
Showing 2 changed files with 210 additions and 0 deletions.
24 changes: 24 additions & 0 deletions examples/ismi/README.md
@@ -0,0 +1,24 @@
# Sample data from the ISMI project database

The [Islamic Scientific Manuscript Initiative project](https://ismi.mpwig-berlin.mpg.de) aims to collect information about all Islamic manuscripts in the exact sciences from the 9th to the 19th centuries CE.

The old [ISMI database](https://gitlab.gwdg.de/MPIWG/Department-II/ismi-project), OpenMind (OM4), stores historical dates as JSON objects with the following structure:

- `state`
  - "unknown": no date
  - "not checked": unparsed date in `date_in_text`
  - "known": date or date range entered in specified calendar
- `calendar_type`: calendar the date was entered in
  - "Julian", "Islamic", "Gregorian"
- `input_form`: date type
  - "Year"
    - `from`: first day, `until`: last day of year (dates in Gregorian calendar, ambiguity of ±2 days when entered in Islamic calendar)
  - "Range"
    - `from`: first day, `until`: last day of range (dates in Gregorian calendar, ambiguity of ±2 days when entered in Islamic calendar)
  - "Date"
    - `date`: given day (date in Gregorian calendar, ambiguity of ±2 days when entered in Islamic calendar)
- `additional_info`: textual note with additional information
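
As a rough illustration of how the structure above could be read, the following Python sketch turns one OM4 date object into an (earliest, latest) pair of Gregorian dates. The function names `om4_date_span` and `_to_date` are hypothetical and not part of OM4. Treating `ambiguity` as a number of days to allow on either side is an assumption based on the "±2 days" note, not a documented rule.

```python
from datetime import date, timedelta


def _to_date(day):
    """Convert an OM4 day object (year, month, dayOfMonth) to a datetime.date."""
    return date(day["year"], day["month"], day["dayOfMonth"])


def om4_date_span(obj):
    """Return (earliest, latest) Gregorian dates for an OM4 date object, or None.

    Assumption: `ambiguity` is a day count that widens the span on both sides.
    """
    if obj.get("state") != "known":
        return None  # "unknown" or "not checked": no structured date available
    if obj["input_form"] == "Date":
        first = last = obj["date"]
    else:  # "Year" or "Range" both carry `from` and `until`
        first, last = obj["from"], obj["until"]
    earliest = _to_date(first) - timedelta(days=first.get("ambiguity", 0))
    latest = _to_date(last) + timedelta(days=last.get("ambiguity", 0))
    return earliest, latest


# One "Date" record entered in the Islamic calendar, taken from the sample file.
sample = {
    "calendar_type": "Islamic",
    "date": {"ambiguity": 2, "century": 15, "dayOfMonth": 23,
             "dayOfYear": 54, "month": 2, "year": 1483},
    "input_form": "Date",
    "state": "known",
}
print(om4_date_span(sample))
```

For this record the span runs from 1483-02-21 to 1483-02-25, that is, the stored Gregorian day widened by two days in each direction.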

We plan to extract as much of this data as possible during the migration to the new RDF database with a CIDOC-CRM-based data model.

A sample file with dates of each type can be found in `data/ismi-om4-date-samples.json`.
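
To get a quick overview of which date types a file of this shape contains, a sketch like the following may help. The helper `summarize` is hypothetical. The inline excerpt is abbreviated from the sample file (the `from` and `until` objects are left out) so that the snippet runs on its own.

```python
import json
from collections import Counter


def summarize(records):
    """Count records by (state, calendar_type, input_form); missing keys become None."""
    return Counter(
        (r.get("state"), r.get("calendar_type"), r.get("input_form"))
        for r in records
    )


# Abbreviated inline excerpt. For the real file, load it instead with:
#   with open("data/ismi-om4-date-samples.json", encoding="utf-8") as f:
#       records = json.load(f)
excerpt = json.loads("""
[
  {"state": "unknown"},
  {"date_in_text": "8th/14th century", "state": "not checked"},
  {"calendar_type": "Gregorian", "input_form": "Year", "state": "known", "year": 1564}
]
""")

counts = summarize(excerpt)
# Sort on the string form of the key because keys mix None and str values.
for key, n in sorted(counts.items(), key=lambda item: str(item[0])):
    print(key, n)
```

Grouping on these three fields mirrors the structure described above: `state` decides whether a structured date exists at all, and `calendar_type` and `input_form` are only present for "known" dates.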
186 changes: 186 additions & 0 deletions examples/ismi/data/ismi-om4-date-samples.json
@@ -0,0 +1,186 @@
[
{
"state": "unknown"
},
{
"date_in_text": "8th/14th century",
"state": "not checked"
},
{
"additional_info": "year 1233 in Julian calendar",
"calendar_type": "Julian",
"from": {
"ambiguity": 0,
"century": 13,
"dayOfMonth": 8,
"dayOfYear": 8,
"month": 1,
"year": 1232
},
"input_form": "Year",
"state": "known",
"until": {
"ambiguity": 0,
"century": 13,
"dayOfMonth": 7,
"dayOfYear": 7,
"month": 1,
"year": 1233
},
"year": 1232
},
{
"additional_info": "year 1205 in Islamic calendar",
"calendar_type": "Islamic",
"from": {
"ambiguity": 2,
"century": 18,
"dayOfMonth": 10,
"dayOfYear": 253,
"month": 9,
"year": 1790
},
"input_form": "Year",
"state": "known",
"until": {
"ambiguity": 2,
"century": 18,
"dayOfMonth": 29,
"dayOfYear": 241,
"month": 8,
"year": 1791
},
"year": 1205
},
{
"additional_info": "year 1564 in Gregorian calendar",
"calendar_type": "Gregorian",
"from": {
"ambiguity": 0,
"century": 16,
"dayOfMonth": 1,
"dayOfYear": 1,
"month": 1,
"year": 1564
},
"input_form": "Year",
"state": "known",
"until": {
"ambiguity": 0,
"century": 16,
"dayOfMonth": 31,
"dayOfYear": 366,
"month": 12,
"year": 1564
},
"year": 1564
},
{
"additional_info": "3. Martius(3) 1481 (1481-03-03) in Julian calendar (1481-03-12 Gregorian)",
"calendar_type": "Julian",
"date": {
"ambiguity": 0,
"century": 15,
"dayOfMonth": 12,
"dayOfYear": 71,
"month": 3,
"year": 1481
},
"input_form": "Date",
"state": "known"
},
{
"additional_info": "6. Muḥarram(1) 888 in Islamic calendar (1483-02-23[+-2] Gregorian)",
"calendar_type": "Islamic",
"date": {
"ambiguity": 2,
"century": 15,
"dayOfMonth": 23,
"dayOfYear": 54,
"month": 2,
"year": 1483
},
"input_form": "Date",
"state": "known"
},
{
"additional_info": "1. September(9) 1621 in Gregorian calendar",
"calendar_type": "Gregorian",
"date": {
"ambiguity": 0,
"century": 17,
"dayOfMonth": 1,
"dayOfYear": 244,
"month": 9,
"year": 1621
},
"input_form": "Date",
"state": "known"
},
{
"additional_info": "1. Ianuarius(1) 811 - 31. December(12) 811 in Julian calendar",
"calendar_type": "Julian",
"from": {
"ambiguity": 0,
"century": 9,
"dayOfMonth": 5,
"dayOfYear": 5,
"month": 1,
"year": 811
},
"input_form": "Range",
"state": "known",
"until": {
"ambiguity": 0,
"century": 9,
"dayOfMonth": 4,
"dayOfYear": 4,
"month": 1,
"year": 812
}
},
{
"additional_info": "1. Muḥarram(1) 1000 - 29. Ḏu al-Ḥijjaẗ(12) 1024 in Islamic calendar",
"calendar_type": "Islamic",
"from": {
"ambiguity": 2,
"century": 16,
"dayOfMonth": 19,
"dayOfYear": 292,
"month": 10,
"year": 1591
},
"input_form": "Range",
"state": "known",
"until": {
"ambiguity": 2,
"century": 17,
"dayOfMonth": 19,
"dayOfYear": 19,
"month": 1,
"year": 1616
}
},
{
"additional_info": "1650-01-01 - 1699-01-01 in Gregorian calendar",
"calendar_type": "Gregorian",
"from": {
"ambiguity": 0,
"century": 17,
"dayOfMonth": 1,
"dayOfYear": 1,
"month": 1,
"year": 1650
},
"input_form": "Range",
"state": "known",
"until": {
"ambiguity": 0,
"century": 17,
"dayOfMonth": 1,
"dayOfYear": 1,
"month": 1,
"year": 1699
}
}
]
