Systema Dipterorum (id 1101): test report #127
ISSUES
Culex (Culcio0myia) cheni Dong, Wang & Lu, 2003 -> subgenus corrected as Culiciomyia in CoL
Indetermined, 227 - names with "sp." blocked in CoL
Missing Genus, 3358 - all names blocked in CoL
Names: Acerocnema macrocera (Meigen, 21826) - blocked & Acerocnema macrocera (Meigen, 1826)
Synced with 4 blocked families in the assembly tree, 2021-05-17. The 4 blocked families need to be confirmed after the sync: confirmed, OK.
Reported by @olafbanki 2021-06-16:
In CoL: Archisargidaae "taxon blocked" in assembly tree 2021-06-17. Synced. |
Reported by @olafbanki 2021-06-16:
We agreed on the following action plan: (1) modify the script to take the available values from the Epoch field of Systema Dipterorum V3.1_2021-05-12 and fill the start & end periods in CoL (where necessary) for 4K accepted species; (2) apply the flag “extinct” to species with non-empty values in the Epoch field.
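The two steps of this plan can be sketched in Python. This is only an illustration under stated assumptions: the record is a plain dictionary, the keys `extinct`, `temporalRangeStart` and `temporalRangeEnd` are illustrative names, and a single Epoch value is used for both start and end. The real script and the real format of the Epoch field may differ.

```python
def apply_epoch(species: dict) -> dict:
    """Return a copy of a species record with the extinct flag and periods set.

    Assumption: the source value lives under the key "Epoch"; the output
    keys are illustrative names, not confirmed field names.
    """
    result = dict(species)
    epoch = (species.get("Epoch") or "").strip()
    if not epoch:
        # Blank Epoch: leave the record untouched (no extinct flag).
        return result
    # Step 2 of the plan: non-empty Epoch means the species is flagged extinct.
    result["extinct"] = True
    # Step 1 of the plan: fill start and end periods only where still empty.
    for key in ("temporalRangeStart", "temporalRangeEnd"):
        if not result.get(key):
            result[key] = epoch
    return result
```

For example, `apply_epoch({"name": "Aus bus", "Epoch": "Eocene"})` (a hypothetical record) comes back flagged extinct with both periods set to `Eocene`, while a record with a blank Epoch is returned unchanged.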
V3.1_2021-05-12 synced 2021-06-18. Preview 2021-06-18 (https://preview.catalogueoflife.org/) looks good.
ITIS offers a global checklist for the family Culicidae. Response from SD: keep Culicidae from SD.
A set of correctly assigned species binomials is placed under an incorrect genus in the classification, as a false parent: gbif/checklistbank#187. A simple re-sync did not fix the problem. Two hypotheses (@gdower): (1) The option "Union", which was used for the sector Diptera, may cause the problem. = No
Experiment 1: do not use "Union" in the assembly. Bad news: the above steps did not fix the problem. Experiment 2: fix the "incorrect" name. Result successful: subgenus Culex is correctly placed in genus Culex. So the sync process is misinterpreting the original placement of "homonymic" subgenera.
Subgenus issues fixed in the code. SD synced 2021-09-14. 2021-09-16: it looks like the technical problem is resolved and species are placed in the correct genera.
Version 3.6 received 2022-02-14. Imported to DEV: https://data.dev.catalogueoflife.org/dataset/1101/classification
Checks of the view 2022-02-18
In the source (SPECIES table):
CONCLUSION: a set of species in these families and genera has no parent family in the source file (blank values). CLB INTERPRETATION IS CORRECT.
Checked a few against the source (SPECIES):
Ceratopogoninae have 3 parent families, plus a blank family with Serromyia errata
Chironominae have 2 parent families, plus a blank family with 3 spp.: Nandeva pudens, Parachironomus inageheus, Polypedilum (Polypedilum) xianjuensis
Chironomiinae (is it different from the above? - check with Neal) have 1 parent family, Chironomidae, plus a blank family with species Yaeprimus balteatus
Clitellariinae have 2 parent families, plus a blank family with species Adoxomyia hasbenlii
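A check of this kind could be automated roughly as follows. The sketch assumes each row of the SPECIES table can be reduced to a `(subfamily, family, species)` tuple. That layout is an assumption for illustration, as is the placeholder species `Aus bus`. Only the blank-family species and the family Chironomidae come from the report above.

```python
from collections import defaultdict

def family_report(rows):
    """Group rows by subfamily and separate named parent families from blanks.

    rows: iterable of (subfamily, family, species) tuples (assumed layout).
    Returns {subfamily: (sorted parent families, species with blank family)}.
    """
    families = defaultdict(set)
    orphans = defaultdict(list)
    for subfamily, family, species in rows:
        if family.strip():
            families[subfamily].add(family.strip())
        else:
            # Blank family value in the source: remember the species.
            orphans[subfamily].append(species)
    subfamilies = set(families) | set(orphans)
    return {sub: (sorted(families[sub]), orphans[sub]) for sub in subfamilies}

rows = [
    ("Chironomiinae", "Chironomidae", "Aus bus"),   # placeholder species
    ("Chironomiinae", "", "Yaeprimus balteatus"),   # blank family, from the report
]
report = family_report(rows)
```

Here `report["Chironomiinae"]` holds one parent family and one species with a blank family, which is the pattern described in the checks above.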
I don't think it is a good idea to feed names with genera enclosed in square brackets into CLB to indicate uncertain placement. This is a very specific convention for SD only and not known to anyone nor the system itself. We should try to change those names and rather follow the guidelines of ColDP, where we have discussed this problem and how to deal with it in a consistent way so both CLB and other users understand the data correctly. Looking at the verbatim Taxon record of that example I see various problems:
In the linked Name record I would suggest simply removing the square brackets.
I am comfortable with presenting an original genus in square brackets where a new placement in a genus is not resolved yet. If CLB allows searching for names with square brackets, I'll be happy to mark these accepted names as Provisionally Accepted in the CoL. See: CatalogueOfLife/backend#1112
Curly brackets around genera is nothing we support at this stage. It will be considered bad data and likely has impacts down the line when we assemble COL, e.g. when we make sure to have a genus record for every accepted species. Don't be surprised if you find new genera with brackets in COL. |
Investigating bare names, 8,455: ? ambigua Pankratova, 1950 = ok. However, many names become "bare" for reasons that are not yet clear:
NEW SEARCH OPTION IN Workbench @clb: RegEx Search (Regular Expression Search) |
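A regular expression of the kind this search option accepts could pick out the names with a genus in square brackets discussed earlier in this thread. The sketch below uses Python's `re` module. The exact pattern syntax supported by the Workbench search is not confirmed here, and the sample names are hypothetical placeholders.

```python
import re

# Matches a capitalised genus wrapped in square brackets at the start of a name,
# e.g. "[Aus] bus Author, 1900" (hypothetical name).
BRACKETED_GENUS = re.compile(r"^\[([A-Z][a-z]+)\]\s+")

def has_bracketed_genus(name: str) -> bool:
    """True if the name starts with a genus in square brackets."""
    return BRACKETED_GENUS.match(name) is not None

def without_brackets(name: str) -> str:
    """Return the name with the brackets around the leading genus removed."""
    return BRACKETED_GENUS.sub(r"\1 ", name, count=1)
```

The second helper illustrates the suggestion made above to simply remove the brackets in the Name record. Names without a bracketed genus pass through unchanged.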
Crawl iteration with pre-flagged "prov acc" names imported 2022-05-19 & 20.
Systema Dipterorum 4.2.2, May 2023, received 2023-05-27; imported to prod 2023-05-30
TASKS
Resolved 2023-06-12: Synced 2023-06-12 (without rank subgenus) |
2023-06-15: temporary names such as *FChironominae (starting with *F) were deleted as nodes (“taxa”) in Assembly - Draft. All children were attached to the next parent. The sync is not involved (i.e. such names will be back with the next sync).
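The effect of deleting such a node can be sketched as a small tree operation. The example assumes the tree is held as a child-to-parent mapping, which is an illustrative data structure and not how the assembly stores it. The placement of the two real taxa around `*FChironominae` is also only illustrative.

```python
from typing import Dict, Optional

def drop_temporary_nodes(parent_of: Dict[str, Optional[str]]) -> Dict[str, Optional[str]]:
    """Remove nodes whose name starts with '*F' and reattach their children.

    parent_of maps each node name to its parent name (None for the root).
    Children of a removed node are attached to the next non-temporary ancestor.
    """
    def is_temporary(name: str) -> bool:
        return name.startswith("*F")

    def next_real_parent(name: str) -> Optional[str]:
        parent = parent_of.get(name)
        # Skip over any chain of temporary ancestors.
        while parent is not None and is_temporary(parent):
            parent = parent_of.get(parent)
        return parent

    return {
        name: next_real_parent(name)
        for name in parent_of
        if not is_temporary(name)
    }

tree = {
    "Chironomidae": None,
    "*FChironominae": "Chironomidae",
    "Chironomus": "*FChironominae",
}
cleaned = drop_temporary_nodes(tree)
```

After the call, `Chironomus` hangs directly under `Chironomidae` and the temporary node is gone. Because this edits only the tree, a fresh sync from the source would bring the temporary name back, as noted above.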
Both names blocked in CoL. Reported to Neal. Systema Dipterorum re-synced 2023-11-13. |
Re-synced 2023-11-20 |
Systema Dipterorum 4.2.2, May 2023 re-synced 2023-12-05. After the check of PREVIEW 2023-12-06: Test names: Decision "Ignore" applied instead of "Block". Synced 2023-12-07. That works: only the names were blocked, and the child taxa were synced into CoL.
Remains unresolved. The attempt to block subgenus as a rank (1) removed all subgenera from the tree and from species names, and (2) created "self-synonymy" (identical ACC-SYN). The decision to block subgenus was reversed 2023-12-05. Duplicated subgenera are back (sic! PREVIEW 2023-12-07). The list was sent to Neal 2023-12-05.
Tests of Systema Dipterorum ver. 4.5, 2023-11-16 processed via TW by DD vs data by GO: #244 |
Systema Dipterorum ver. 5.0, 2024-01-08 processed via TW by DD; imported 2024-02-07
ISSUES assessed 2024-02-12
TASKS
Hoplacephala nigriventris (Villeneuve, 1913)
Hoplacephala retroseta (Villeneuve, 1913)
Huttonobesseria verecunda (Hutton, 1901)
Hystricia cuestae (Engel, 1920)
Isomyia pseudolucilia (Malloch, 1928)
plus a few cases of two identical accepted species (full list):
ACC-ACC species (same authors) 0 of 342: https://www.checklistbank.org/dataset/1101/duplicates?authorshipDifferent=false&category=binomial&limit=50&minSize=2&mode=STRICT&offset=0&status=accepted
Resolved 2024-02-12: Synced 2024-02-12
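The duplicates link above can be rebuilt from its query parameters, which makes it easier to repeat the same check for another dataset key. The parameter names and values below are copied from that link. Passing `True` to get the "different authors" view is an assumption, since only the `authorshipDifferent=false` form appears in this report.

```python
from urllib.parse import urlencode

BASE = "https://www.checklistbank.org/dataset/{key}/duplicates"

def duplicates_url(dataset_key: int, authorship_different: bool) -> str:
    """Build the ACC-ACC binomial duplicates link for a dataset."""
    params = {
        "authorshipDifferent": str(authorship_different).lower(),
        "category": "binomial",
        "limit": 50,
        "minSize": 2,
        "mode": "STRICT",
        "offset": 0,
        "status": "accepted",
    }
    # urlencode keeps the insertion order of the dictionary.
    return BASE.format(key=dataset_key) + "?" + urlencode(params)
```

Calling `duplicates_url(1101, False)` reproduces the link quoted in the report.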
TASKS does not detect such cases. |
Systema Dipterorum ver. 5.0, 2024-01-08 processed via TW by DD; second iteration (bring back extinct spp); imported 2024-03-07
METRICS TASKS
It seems the bug is resolved. On 2024-03-11, ACC-ACC species (different authors) 512 of 512:
The same problem: the interface does not show the results of applying decisions, neither in the report nor in the panel. See the comments on the bug (a stopper): CatalogueOfLife/backend#1300 (comment) CatalogueOfLife/backend#1300 (comment). Synced 2024-03-18, probably with sets of unresolved duplicates:
Systema Dipterorum ver. 5.2 of 2024-05-15 (as 0.41.1 / 2024-05-23) processed via TW by GO; 1st iteration; imported 2024-05-23
METRICS
Should be: Acartophthalmidae Czerny, 1928 = FIXED in 2nd iteration
Both Acartophthalmidae & Acartophthalmus coxatus were correct in previous version. |
Systema Dipterorum ver. 5.2 of 2024-05-15 (as 0.41.1 / 2024-06-01) processed via TW by GO; 2nd iteration; imported 2024-06-01
METRICS ISSUES TASKS
Resolved 2024-06-03: Synced 2024-06-03 |
Systema Dipterorum ver. 5.3 of 2024-07-17 (imported as 0.43.1 / 2024-08-12) processed via TW by GO; imported 2024-08-12
METRICS ISSUES assessed 2024-08-21 TASKS
Resolved 2024-08-21,22: Synced 2024-08-22 |
Systema Dipterorum ver. 5.5 of 2024-10-01 (imported as Nov 2024 / 2024-11-12) processed via TW by GO; imported 2024-11-13 (first iteration - no extinct flag); second iteration imported 2024-11-21
METRICS ISSUES - do together with GO to assess the interface functionality TASKS
Resolved 2024-11-25: Synced 2024-11-25 |
Tests of PREVIEW 2024-11-26:
Re-synced 2024-11-26 |
Tests of the PREVIEW 2024-12-16: ACC-ACC same sp same auth: 171 pairs, mainly Systema Dipterorum vs CCW
Systema Dipterorum ver. 5.5 of 2024-10-01 (imported as Nov 2024 / 2024-11-12) processed via TW by GO; imported 2024-11-13 (first iteration - no extinct flag); second iteration imported 2024-11-21;
TASKS |
Sorry Yuri, I don't remember the exact reason. The import wasn't done explicitly, but probably due to restarts or sth. It was the exact same archive as the import by Geoff before, no data has changed as you can see in the history. https://www.checklistbank.org/dataset/1101/diff?attempts=35..36 |
After PREVIEW id 306706 2024-12-19 checks:
As we learned, Decisions Rematch in dataset options broke decisions in SD (see the "problem of December" above) CatalogueOfLife/backend#1382 |
Version 3.1 received 2021-05-12.
Imported to prod: https://data.catalogueoflife.org/dataset/1101/about
(previous reports are in #6)
Metadata: updated
Sector: order Diptera minus 4 families
Cylindrotomidae
Limoniidae
Pediciidae
Tipulidae
As a result, the assembly tree looks like this: