Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ITIS (id 2144): test report #8

Open
yroskov opened this issue Jan 5, 2021 · 229 comments
Open

ITIS (id 2144): test report #8

yroskov opened this issue Jan 5, 2021 · 229 comments

Comments

@yroskov
Copy link

yroskov commented Jan 5, 2021

https://www.checklistbank.org/dataset/2144/

Source of global sectors:

file ITIS_GSDs+Updates_forCoL_2020-03-03.xlsx

From: Nicolson, David
Sent: Tuesday, March 3, 2020 23:01
To: Roskov, Yury
Cc: Orrell, Thomas
Subject: Initial list of ITIS GSDs for addition (or consideration) to CoL

Yuri,
OK, here is my first pass trying to detect ITIS GSDs that should (or could) be added or updated in CoL. It includes GSDs we added or updated in ITIS since the last time CoL was updated for ITIS (mid-2017), as well as a few cases where ITIS loaded a GSD that was not noted to you previously. I left out groups where CoL already has a solid/active source, assuming the source seemed to actually be providing a reasonably complete GSD (vs. an "aspirational" GSD that is not very close to complete).

They are sorted according to their placement in ITIS now, via a hierarchy column. Those with yellow question marks may or may not be used in CoL; a few already have a source for CoL, but I suggest at least considering switching to ITIS due to various issues.

I have included a few things that we will shortly have loaded into ITIS, and a few that we are actively working on now (for inclusion in ITIS later this year, likely before the ITIS CoLdp export is ready).

If we realize we missed anything I will let you know.

Thanks, Dave

@yroskov
Copy link
Author

yroskov commented Jan 5, 2021

file ITIS_GSDs+Updates_forCoL_2020-03-03_yr_log

GSDs added or updated (or about/planned to be added/updated) in ITIS that are available, potentially for CoL use: CoL-yr comment yr2 comment yr3 comment
Hierarchy      
Deuterostomia : Chordata : Vertebrata : Gnathostomata : Tetrapoda : Amphibia 2020-08-07 old sector deleted, new attached, synced    
Deuterostomia : Chordata : Vertebrata : Gnathostomata : Tetrapoda : Aves 2020-08-07    
Deuterostomia : Chordata : Vertebrata : Gnathostomata : Tetrapoda : Mammalia 2020-08-07    
Deuterostomia : Chordata : Vertebrata : Gnathostomata : Tetrapoda : Reptilia : Squamata : Serpentes ReptileDB    
Deuterostomia : Chordata : Vertebrata : Gnathostomata : Tetrapoda : Reptilia : Squamata… ReptileDB    
Protostomia : Ecdysozoa : Arthropoda : Chelicerata : Euchelicerata : Arachnida : CURRENT/NEW HIERARCHY TO FAMILY 2020-08-11 I am going update classification in Arachnida with latest ITIS. To do that, I have Deleted Subtree class Arachnida, then, assembled subclass Arachnida from ITIS, and synced it. Third step should be Delete Sector Arachnida. Expected behavior: full classification in subclass Arachnida will stay in the tree for further assembly of GSD sectors. However, deletion is failed             2020-08-13 Markus fixed problem "manually", for one time only. As result, Arachnida has classification withour sectors as a start poinr for re-assembly. Attention: some taxa have no genera (in source IT IS data - the same). 2020-08-13 ITIS Arachnida classification updated. 10 GSDs re-attached in Arachnida: BdelloideaBase, FADA Halacaridae, OlogamasidBase, PhytoseiidBase, RhodacaridBase, SpmWeb, TenuipalpidBase, The Scorpion Files, TicksBase, WSC 2020-08-13 Doing assembly of ITIS sectors in Arachnida, I have drag&drop orders Amblypygi & Palpigradi from ITIS to assembly tree.
Protostomia : Ecdysozoa : Arthropoda : Chelicerata : Euchelicerata : Arachnida : Amblypygi +Palpigradi +Ricinulei +Schizomida +Solifugae +Uropygi 2020-08-10 old sector deleted, new attached, synced 2020-08-13; re-assembled (replace), re-synced  
Protostomia : Ecdysozoa : Arthropoda : Chelicerata : Euchelicerata : Arachnida : Pseudoscorpionida 2020-08-10 old sector deleted, new attached, synced 2020-08-13; re-assembled, re-synced Expected behavior: both orders will appear in Arachnida - superorder Not assigned - orders Amblypygi & Palpigradi.
Protostomia : Ecdysozoa : Arthropoda : Chelicerata : Euchelicerata : Arachnida : Acariformes : Sarcoptiformes : Endeostigmata : Alicorhagioidea +Alycoidea +Nematalycoidea +Oehserchestoidea +Terpnacaroidea   2020-08-13; re-assembled (replace), re-synced  
Protostomia : Ecdysozoa : Arthropoda : Chelicerata : Euchelicerata : Arachnida : Acariformes : Sarcoptiformes : Oribatida   2020-08-13; re-assembled (replace), re-synced However, result was unexpected: Arachnida - superorder Not assigned - order Araneae - order Amblypygi
Protostomia : Ecdysozoa : Arthropoda : Chelicerata : Euchelicerata : Arachnida : Acariformes : Trombidiformes : Sphaerolichida : Lordalycoidea (Lordalycidae) +Sphaerolichoidea (Sphaerolichidae)   2020-08-13; re-assembled (replace), re-synced and
Protostomia : Ecdysozoa : Arthropoda : Chelicerata : Euchelicerata : Arachnida : Opilioacariformes : Opilioacarida 2020-08-10 old sector deleted, new attached, synced 2020-08-13; re-assembled (replace), re-synced Arachnida - superorder Not assigned - order Opiliones - order Palpigradi
Protostomia : Ecdysozoa : Arthropoda : Chelicerata : Euchelicerata : Arachnida : Parasitiformes : Holothyrida 2020-08-10 new sector attached, synced 2020-08-13; re-assembled (replace), re-synced  
Protostomia : Ecdysozoa : Arthropoda : Chelicerata : Euchelicerata : Arachnida : Parasitiformes : Ixodida : Ixodides : Argasidae + Nuttalliellidae TickBase    
Protostomia : Ecdysozoa : Arthropoda : Chelicerata : Euchelicerata : Arachnida : Parasitiformes : Ixodida : Ixodides : Ixodidae TickBase    
Protostomia : Ecdysozoa : Arthropoda : Crustacea : Branchiopoda : [Phyllopoda] : Notostraca 2020-08-10 new sector attached, synced    
Protostomia : Ecdysozoa : Arthropoda : Crustacea : Branchiopoda : [Sarsostraca] : Anostraca 2020-08-10 old sector deleted, new attached, synced    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Collembola : Collembola : Entomobryomorpha : Actaletoidea : Actaletidae All from Collembola.org    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Collembola : Collembola : Entomobryomorpha : Isotomoidea : Isotomidae : Anurophorinae      
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Collembola : Collembola : Poduromorpha : Hypogastruroidea : Hypogastruridae      
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Holometabola : Coleoptera : Polyphaga : Cucujiformia : Chrysomeloidea : Chrysomelidae : Cassidinae 2020-08-10 new sector (family  Chrysomelidae) attached, synced. 2020-08-18 Deleted sector Chrysomelidae, attachment of subfamily failed; 2020-08-18 subfamily assigned & synced (correct GSD 2144) Merge with overlapped spp needed. Subfam, vs fam. As shortcut decision: whole family taken  
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Holometabola : Coleoptera : Polyphaga : Cucujiformia : Chrysomeloidea : Megalopodidae 2020-08-10 new sector attached, synced    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Holometabola : Coleoptera : Polyphaga : Elateriformia : Byrrhoidea : Elmidae +Protelmidae 2020-08-10 new sector attached, synced    
YR: Byrrhoidea : Limnichidae 2020-08-10old sector deleted, new attached, synced Family Limnichidae was indicated as IT IS global in ac19. I did re-assemble it.  
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Holometabola : Hymenoptera : Apocrita : Aculeata : Apoidea (BEES) : Andrenidae +Apidae +Colletidae +Halictidae +Megachilidae +Melittidae +Stenotritidae Apoidea 2020-08-03 attached, synced    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Holometabola : Hymenoptera : Apocrita : Aculeata : Apoidea (sphecoid wasps) : Crabronidae +Ampulicidae +Heterogynaidae +Sphecidae Temporarily replaced HyMIS Crabronidae - see & compare results    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Cimicomorpha : Cimicoidea : Curaliidae 1 sp; 2020-08-06 attached, synced    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Cimicomorpha : Joppeicoidea 1 sp; 2020-08-06 attached, synced    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Cimicomorpha : Miroidea : Thaumastocoridae 2020-08-06 attached; Sync failed; children in Miroidea incomplete 2020-08-07 bug fixed; synced  
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Cimicomorpha : Reduvoidea : Reduviidae 2020-08-06 attached; Sync failed; children in Miroidea incomplete 2020-08-07 bug fixed; synced  
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Cimicomorpha : Velocipedoidea : Velocipedidae 2020-08-06 created superfamily Velocipedoidea; attached; Sync failed    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Dipsocoromorpha : Dipsocoroidea 2020-08-07 drag&drop infraoder Dipsocoromorpha; (before deleted old superfamily) attached; Sync failed    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Enicocephalomorpha : Enicocephaloidea 2020-08-07 drag&drop infraoder Enicocephalomorpha; (before deleted old superfamily) attached; Sync failed    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Gerromorpha : Gerroidea 2020-08-07 drag&drop infraoder  Gerromorpha; (before deleted old superfamilies from IT IS-Global) attached; Sync failed    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Gerromorpha : Hebroidea      
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Gerromorpha : Hydrometroidea      
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Gerromorpha : Mesovelioidea      
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Leptopodomorpha : Leptopodoidea 2020-08-07 drag&drop infraoder  Leptopodomorpha; (before deleted old superfamilies from IT IS-Global) attached; Synced    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Leptopodomorpha : Saldoidea      
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Nepomorpha : Corixoidea 2020-08-07 drag&drop infraoder  Nepomorpha; (before deleted old superfamilies from IT IS-Global) attached; Synced    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Nepomorpha : Naucoroidea      
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Nepomorpha : Nepoidea      
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Nepomorpha : Notonectoidea      
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Nepomorpha : Ochteroidea      
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Pentatomomorpha : Idiostoloidea : Henicoridae      
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Pentatomomorpha : Idiostoloidea : Idiostolidae 2020-08-07 drag&drop suprfamily Idiostoloidea; (before deleted old families from IT IS-Global) ; Synced    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Pentatomomorpha : Pentatomoidea : Dinidoridae 2020-08-07 synced    
Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Pentatomomorpha : Pyrrhocoroidea : Largidae 2020-08-07 synced    
Protostomia : Ecdysozoa : Arthropoda : Myriapoda : Symphyla from WoRMS Millibase    

@yroskov
Copy link
Author

yroskov commented Jan 5, 2021

file ITIS_GSDs+Updates_forCoL_2020-03-03_yr_log ITIS_ac19

  Archaea          
  Bacteria          
2020-08-14 Protozoa - Apicomplexa - Conoidasida - Eucoccidiorida - Cryptosporidiidae - Cryptosporidium
2020-08-14 assembled & synced Chromista - Ochrophyta - Bacillariophyceae - Chaetocerotales - Chaetocerotaceae
2020-08-14 assembled & synced Chromista - Ochrophyta - Bacillariophyceae - Naviculales - Naviculaceae - Navicula
  Plantae - Tracheophyta - Magnoliopsida - Brassicales - Koeberliniaceae, Limnanthaceae
  Plantae - Tracheophyta - Magnoliopsida - Caryophyllales - Cactaceae, Nepenthaceae, Simmondsiaceae
  Plantae - Tracheophyta - Magnoliopsida - Crossosomatales - Crossosomataceae
  Plantae - Tracheophyta - Magnoliopsida - Cucurbitales - Datiscaceae
  Plantae - Tracheophyta - Magnoliopsida - Huerteales - Gerrardinaceae
  Plantae - Tracheophyta - Magnoliopsida - Lamiales - Gesneriaceae
  Plantae - Tracheophyta - Magnoliopsida - Malpighiales - Lophopyxidaceae
  Plantae - Tracheophyta - Magnoliopsida - Proteales - Nelumbonaceae
  Plantae - Tracheophyta - Magnoliopsida - Saxifragales - Penthoraceae
  Plantae - Tracheophyta - Magnoliopsida - Solanales - Montiniaceae, Sphenocleaceae
2020-08-14 assemled in phylum Kamptozoa (see below Jul 2023): class Entoprocta & Cycliophora; synced Animalia - Acanthocephala, Entoprocta, Hemichordata, Micrognathozoa, Cycliophora, Onychophora, Sipuncula, Tardigrada
2020-08-14 assembled & synced Animalia - Annelida - Clitellata - Branchiobdellida  
2020-08-10 Animalia - Arthropoda - Branchiopoda - Anostraca  
WoRMS Amphipoda Animalia - Arthropoda - Malacostraca - Amphipoda - Crangonyctidae - Stygobromus
2020-08-14 assembled & synced Animalia - Arthropoda - Malacostraca - Mysida  
WoRMS Copepoda Animalia - Arthropoda - Maxillopoda - Calanoida - Aetideidae
  Animalia - Arthropoda - Arachnida - Amblypygi, Opilioacarida, Palpigradi, Pseudoscorpiones, Ricinulei, Schizomida, Solifugae, Uropygi
  Animalia - Arthropoda - Arachnida - Sarcoptiformes - (suborder Oribatida)
2020-08-14 assembled & synced Animalia - Arthropoda - Entognatha - Protura  
  Animalia - Arthropoda - Insecta - Hemiptera - Hebroidea - Macroveliidae, Paraphrynoveliidae    
  Animalia - Arthropoda - Insecta - Hemiptera - Hydrometroidea - Hydrometridae    
  Animalia - Arthropoda - Insecta - Hemiptera - Gerroidea - Hermatobatidae, Veliidae    
  Animalia - Arthropoda - Insecta - Hemiptera - Leptopodoidea, Saldoidea    
  Animalia - Arthropoda - Insecta - Hemiptera - Mesovelioidea - Mesoveliidae    
  Animalia - Arthropoda - Insecta - Hemiptera - Naucoroidea - Aphelocheiridae, Potamocoridae    
2020-08-14 assembled & synced Animalia - Arthropoda - Insecta - Hymenoptera - Ceraphronoidea, Evanioidea, Platygastroidea
2020-08-14 assembled & synced Animalia - Arthropoda - Insecta - Hymenoptera - Cynipoidea - Ibaliidae
2020-08-14 assembled & synced Animalia - Arthropoda - Insecta - Hymenoptera - Vespoidea - Formicidae
2020-08-25 assembled & synced Animalia - Arthropoda - Insecta - Coleoptera - (suborder Archostemata)
2020-08-14 assembled & synced Animalia - Arthropoda - Insecta - Coleoptera - Bostrichidae, Dytiscidae, Histeridae, Nosodendridae
StaphBase Animalia - Arthropoda - Insecta - Coleoptera - Hydraenidae
  Animalia - Arthropoda - Insecta - Coleoptera - Byrrhoidea - Limnichidae
see left Animalia - Arthropoda - Insecta - Coleoptera - Chrysomeloidea - Chrysomelidae - (subfamily Cassidinae)
2020-08-14 assembled & synced Animalia - Arthropoda - Insecta - Coleoptera - Elateroidea - Lampyridae, Phengodidae
2020-08-14 assembled & synced Animalia - Arthropoda - Insecta - Coleoptera - Cucujoidea - Cucujidae - Pediacus
2020-08-14 assembled & synced Animalia - Arthropoda - Insecta - Mecoptera  
2020-08-14 assembled & synced Animalia - Arthropoda - Insecta - Trichoptera  
  Animalia - Chordata - Amphibia, Aves, Mammalia  
             
  Animalia ph cl or sfa fa
2020-08-25 assembled & synced       Coleoptera Not assigned Crowsoniellidae
            Cupedidae
            Jurodidae
            Micromalthidae
            Ommatidae
             
2020-08-25 FIXED: deleted as a sector Animalia - Arthropoda - Malacostraca - Decapoda - Thalassinoidea
             
2021-01-05 assembled Animalia - Arthropoda - Arachnida Opiliones Superfamily Samooidea • 210 living spp•ITIS Global
  Animalia - Arthropoda - Arachnida Opiliones Superfamily Travunioidea • 68 living spp•ITIS Global
  Animalia - Arthropoda - Arachnida Opiliones Superfamily Triaenonychoidea • 437 living spp•ITIS Global
  Animalia - Arthropoda - Arachnida Opiliones Superfamily Zalmoxoidea • 270 living spp•ITIS Global

@yroskov
Copy link
Author

yroskov commented Jan 5, 2021

@gdower set up a timer for automatic import.

Standard day:
time:

@mdoering
Copy link
Member

mdoering commented Jan 5, 2021

we have a built in timer we should use. After stable ids are completed we should try it with all datasets on dev for a while and activate it on prod if no issues pop up. We did use it in the early days already

@yroskov
Copy link
Author

yroskov commented Jan 8, 2021

ITIS of 2020-12-21 is synced on 2021-01-08

@yroskov
Copy link
Author

yroskov commented Feb 1, 2021

@yroskov
Copy link
Author

yroskov commented Feb 1, 2021

@gdower fixed broken sectors 2021-02-01

ITIS of 2020-12-21 is re-synced on 2021-02-01

@yroskov
Copy link
Author

yroskov commented Feb 3, 2021

ITIS of 2021-01-26 was imported (no broken sectors) and synced on 2021-02-03.

@DaveNicolson
Copy link

@yroskov , some newer ITIS GSDs that don't appear to overlap with other sources:

SMALL BUT IMPORTANT FAMILY:
Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Chelicerata : Euchelicerata : Arachnida : Parasitiformes : Mesostigmata : Monogynaspida : Gamasina : Dermanyssoidea : Varroidae

Additional newer ITIS GSDs that appear to be "GAPS" in COL:
Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Cimicomorpha : Cimicoidea : Cimicidae

Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Cimicomorpha : Cimicoidea : Polyctenidae

Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Paraneoptera : Hemiptera : Heteroptera : Pentatomomorpha : Pentatomoidea : Acanthosomatidae

@yroskov
Copy link
Author

yroskov commented Feb 23, 2021

@DaveNicolson thank you!

Families Varroidae, Cimicidae, Polyctenidae & Acanthosomatidae have been assembled in CoL and synced from ITIS of 2021-01-26.

@DaveNicolson
Copy link

@yroskov the February ITIS load was just put online. There are two new GSDs that appear to be gaps in COL now. They are:
Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Crustacea : Malacostraca : Eumalacostraca : Eucarida : Decapoda : Pleocyemata : Infraorder Astacidea [Superfamilies Astacoidea, Parastacoidea, Enoplometopoidea and Nephropoidea]
Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Crustacea : Malacostraca : Eumalacostraca : Eucarida : Decapoda : Pleocyemata : Infraorder Glypheidea [Superfamily Glypheoidea]

The latter was split out of the former. I see COL is not using any ranks between Decapoda and superfamilies in it. I've noted the GSD superfamilies for each of the two infraorder GSDs so you can handle them as you wish.

@gdower
Copy link
Contributor

gdower commented Mar 1, 2021

The import of the ITIS 2021-02-26 release finished: https://data.catalogueoflife.org/dataset/2144/imports

@yroskov
Copy link
Author

yroskov commented Mar 2, 2021

@DaveNicolson thank you!

Infraorders Astacidea and Glypheidea have been assembled in CoL as direct children of Decapoda and synced from ITIS of 2021-02-26.

(Hmm, it's a mess in the Tree: now all superfamilies from WoRMS Brachyura gone under infraorder Not Assigned).

@yroskov
Copy link
Author

yroskov commented Mar 2, 2021

ITIS of 2021-02-26 (id 2144) is synced on 2021-03-02.

@DaveNicolson
Copy link

March 30 load for ITIS is complete, and the full exports are available. A new smaller GSD that is a gap now in COL is Order Lophogastrida (ITIS TSN 89808), which is in the ITIS hierarchy here:
Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Crustacea : Malacostraca : Eumalacostraca : Peracarida : Lophogastrida
The other updated groups are already marked as ITIS sectors in COL, so should update in your next sync of the ITIS data. Thank you!

@yroskov
Copy link
Author

yroskov commented Apr 2, 2021

Thank you Dave! We already processed a new version. Without Lophogastrida yet. I'll add it as a new sector right now in the clearinghouse for a next release.

@yroskov
Copy link
Author

yroskov commented Apr 2, 2021

Order Lophogastrida established as a sector and synced 2021-04-02 (after launch of preview release).

@yroskov
Copy link
Author

yroskov commented Apr 30, 2021

ITIS of 2021-04-27 (id 2144) is imported in CoL & synced on 2021-04-30.
7th update

@DaveNicolson
Copy link

Two (or three, if you prefer) new (for ITIS) GSDs that may or may not be wanted by COL (none of them appears in COL now, I believe):

Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Crustacea : Malacostraca : Eumalacostraca : Eucarida : Decapoda : Pleocyemata : Caridea : Alpheoidea : Alpheidae
https://www.itis.gov/servlet/SingleRpt/SingleRpt?search_topic=TSN&search_value=96600#null

Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Crustacea : Malacostraca : Eumalacostraca : Eucarida : Decapoda : Pleocyemata : Achelata & Polychelida [former "Palinura", now split]
https://www.itis.gov/servlet/SingleRpt/SingleRpt?search_topic=TSN&search_value=1147564#null
https://www.itis.gov/servlet/SingleRpt/SingleRpt?search_topic=TSN&search_value=1147563#null

@yroskov
Copy link
Author

yroskov commented Apr 30, 2021

@DaveNicolson thank you!

Family Alpheidae is assembled now as a direct child of superfamily Alpheoidea. Infraorders Achelata & Polychelida are assembled as children of suborder Pleocyemata. All 3 new sectors synced and will appear in CoL of May.

@DaveNicolson
Copy link

DaveNicolson commented May 28, 2021

New version of ITIS is available, dated 26 May 2021. We have some modest new GSDs that appear to be gaps in COL:

Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Chelicerata : Euchelicerata : Arachnida : Acariformes : Trombidiformes : Prostigmata : Anystina : Erythraeoidea : Smarididae (family) (we are working on the other family of the superfamily, not ready yet though)
https://www.itis.gov/servlet/SingleRpt/SingleRpt?search_topic=TSN&search_value=1118129#null

Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Chelicerata : Euchelicerata : Arachnida : Acariformes : Trombidiformes : Prostigmata : Anystina : Calyptostomatoidea (superfamily)
https://www.itis.gov/servlet/SingleRpt/SingleRpt?search_topic=TSN&search_value=895634#null

Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Crustacea : Malacostraca : Eumalacostraca : Eucarida : Decapoda : Pleocyemata : Stenopodidea (infraorder)
https://www.itis.gov/servlet/SingleRpt/SingleRpt?search_topic=TSN&search_value=97294#null

@gdower
Copy link
Contributor

gdower commented Jun 1, 2021

Thanks @DaveNicolson! I imported the new version, although it won't be assembled into the Catalogue until Yuri returns next week: https://data.catalogueoflife.org/dataset/2144/imports

@yroskov
Copy link
Author

yroskov commented Jun 7, 2021

3 new sectors established as:

  • family Smarididae (direct child of superfamily Erythraeoidea)
  • superfamily Calyptostomatoidea (direct child of infraorder Anystina)
  • infraorder Stenopodidea (direct child of suborder Pleocyemata)

@yroskov
Copy link
Author

yroskov commented Jun 7, 2021

ITIS of 2021-05-26 (id 2144, all sectors) is synced on 2021-06-07.
9th update

@DaveNicolson
Copy link

The June ITIS load is completed, and you're welcome to import it for your use.

As already discussed (in a Spring 2021 Taxonomic Group meeting), COL should consider whether to adopt the new ITIS GSD for the mosquito family, Culicidae, under Order Diptera in COL, in place of Systema Dipterorum's data (if Thomas Pape still agrees, it is really his call, the numbers below are not so different after all).

If you want it, it is in ITIS here:
Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Holometabola : Diptera : Nematocera : Culicomorpha : Culicidae

COL now shows these stats for the family, per Systema Dipterorum:
Subfamily 3
Tribe 11
Subtribe 1
Genus 57
Subgenus 216
Species 3588
Subspecies 4

ITIS now shows these stats for the family:
Subfamily 2
Tribe 11
Genus 41
Subgenus 188
Species 3585
Subspecies 219

The update was based primarily on the 2021 "Mosquitoes of the World" volumes (Wilkerson et al.), but also used the data from Harbach's "Mosquito Taxonomic Inventory" website. We tried to include the alternative combinations in use (in synonymy, where appropriate), especially for known disease vectors, and the full data set includes over 5400 scientific names (all ranks, all usages), which should help link names with various conflicting taxonomic sources from the last ~45 years.

@yroskov
Copy link
Author

yroskov commented Jul 1, 2021

ITIS of 2021-06-29 imported in the checklist bank 2021-07-01.
10th update.

  • Metadata are empty in the checklistbank.
    image

I have have set up a patch, repeating metadata from the portal: https://www.catalogueoflife.org/data/dataset/2144

@mdoering
Copy link
Member

mdoering commented Jul 2, 2021

The new diff UI even works for the larger ITIS changes: https://data.catalogueoflife.org/dataset/2144/diff?attempts=36..37

@mdoering
Copy link
Member

I have deleted the 5 ITIS merge sectors which should have removed all linked data in the project. @yroskov could you do a brief check if you spot sth unusual?

I also deployed a new backend and UI with a new BLOCK_MERGE_SYNC setting. I have activated that for COL, so from now on no merge sectors should ever by merged into the project directly. Neither by accidently hitting a sync button or by doing a sync all (of dataset). Merge sectors should just be skipped. I will test on dev now to make sure

@mdoering
Copy link
Member

@thomasstjerne also changed the UI to ask a user whether he wants a full or partial delete of a sector when you hit the delete. So you can pick if you want to retain higher taxa or delete it all

@mdoering
Copy link
Member

it is working as expected!

Merge sectors blocked in project, skip sync of sector Sector{5, datasetKey=265156, mode=MERGE, subjectDatasetKey=32593, subject=ACCEPTED FAMILY Polyporaceae Fr. ex Corda [60468 parent=60469]}

@mdoering mdoering changed the title ITIS ITIS (id 2144): test report May 17, 2024
@yroskov
Copy link
Author

yroskov commented May 23, 2024

@DaveNicolson, should we expect updates from ITIS at the end of May? (2024 Annual Checklist will be released in June)

@DaveNicolson
Copy link

@yroskov Unfortunately, the next ITIS load will be late June (and it should include an updated classification of the Coccinellidae to subgenus, which should allow you to take the GSDs for genus Rhyzobius and tribe Epilachnini, at last). After that we should return to monthly loads.

@yroskov
Copy link
Author

yroskov commented May 23, 2024

Thanks! Looking forward the end of June for the July release :)

@DaveNicolson
Copy link

ITIS is wrapping up a load that was finalized and dated 26 June 2024 (I emailed Geoff a link to get it since it's not up yet on the ITIS site).

In addition to updates to existing GSDs COL sources from ITIS (particularly in Mammalia), we also finally updated the "ladybird beetle" (Coccinellidae) classification from family to subgenus. This should allow COL to take up the GSDs for two parts of the family that we previously completed (last fall):

  1. Tribe Epilachnini (formerly treated as a subfamily, containing 1109 valid/accepted species) is placed as follows:
    Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Holometabola : Coleoptera : Polyphaga : Cucujiformia : Coccinelloidea : Coccinellidae : Coccinellinae : Epilachnini

  2. Genus Rhyzobius (containing 113 species) is placed as follows:
    Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Holometabola : Coleoptera : Polyphaga : Cucujiformia : Coccinelloidea : Coccinellidae : Coccinellinae : Coccidulini : Rhyzobius
    [NOTE: we accidentally changed the genus from "complete" to "partial" while updating that classification, but it remains complete, we'll get that corrected soon]

The hierarchy within Arthropoda was updated, and there are some changes that could affect at least one GSD... Based on Dr. Zhang's recommendations following the Taxonomy Group meetings, (A) Arachnida is recognized again as a full class, and Class Euchelicerata was dropped (it remains useful for Arachnida+Xuphosura vs. Pycnogonida, but forced the prior 'unpopular' rank shift that we're undoing), (B) [Class Merostomata: Order Xiphosura] replaces [Subclass Xiphosura: Order Xiphosurida], (C) Superorder Opilioacariformes ITIS GSD should become Order Opilioacarida ITIS GSD since Dr. Zhang indicates acarine workers generally consider it to be basal WITHIN Superorder Parasitiformes... Those changes are in-line with recommendations discussed in the Taxonomy Group meeting (and afterward directly with Dr. Zhang via email).

In summary of the Arachnida-related changes, here's what ITIS will now show (once the load is completed and the DB connections stabilized):

Subphylum Chelicerata
..Class Arachnida
......Order Amblypygi
......Order Araneae
......Order Opiliones
......Order Palpigradi
......Order Pseudoscorpiones
......Order Ricinulei
......Order Schizomida
......Order Scorpiones
......Order Solifugae
......Order Uropygi
....Superorder Acariformes
......Order Sarcoptiformes
......Order Trombidiformes
....Superorder Parasitiformes
......Order Holothyrida
......Order Ixodida
......Order Mesostigmata
......Order Opilioacarida
..Class Merostomata
....Order Xiphosura
..Class Pycnogonida
....Order Pantopoda

We also made some hierarchy changes within Crustacea, and it will hopefully make a lot more sense now (even though everyone knows much of it isn't "final").

Of course you'll have your own needs for handling COL's hierarchy, but I hope this is of some use.

@yroskov
Copy link
Author

yroskov commented Jul 3, 2024

@mdoering, very strange happened to ITIS sectors in CLB project 3.

All ITIS taxa appears as CoL sectors in the right window, despite they are not real sectors in the left window (and should not be sectors in the global CoL). Seems, it caused by Extended Catalogue: https://www.checklistbank.org/catalogue/3/assembly?assemblyTaxonKey=c53306f8-8660-4754-940f-b1f0cefc62ea&datasetKey=2144

As result, I am not able to establish tribe Epilachnini and genus Rhyzobius as new GSD sectors, because they are already shown as part of ITIS sectors in CoL:
image

@yroskov
Copy link
Author

yroskov commented Jul 3, 2024

@mdoering, how I can sync only global sectors from ITIS, if I have this situation:

image

@mdoering
Copy link
Member

mdoering commented Jul 8, 2024

There are 2 options.
The first is to select only the union and attach sectors using the sector mode filter. Then you can select all and trigger syncs just for those:

image

@thomasstjerne The sync and rematch all selected buttons are pretty hidden and I can access them only when I hover over the Dataset column.

The second option is just to hit sync all - merge sectors will be blocked as long as the projects Block Merge Syncs option is set to true - which is the case now.

@yroskov
Copy link
Author

yroskov commented Jul 8, 2024

ITIS of 2024-06-26; imported 2024-07-01

  • Imported: 506,052 spp (vs 505,877 spp)
  • Metadata: ok
  • Classification: ok
  • Sectors: ok; classification inside family Coccinellidae was synced, and two new sectors established 2024-07-08: tribe Epilachnini & genus Rhyzobius
  • Extinct taxa flag: OK

Metrics

image

Synced 2024-07-08

@yroskov
Copy link
Author

yroskov commented Aug 6, 2024

ITIS of 2024-07-23; imported 2024-08-05

  • Imported: 507,602 spp (vs 506,052 spp)
  • Metadata: ok
  • Classification: ok
  • Sectors: ok (attached)
  • Extinct taxa flag: OK

Metrics

image

Synced (all "attached", excluding "merged"?) 2024-08-06

@DaveNicolson
Copy link

DaveNicolson commented Aug 27, 2024

The August ITIS export is available for download & processing (load completed on 16th, but re-exported on 20th to resolve an issue). Aside from some updated GSDs and additions of groups COL gets from elsewhere, there are some small new ITIS GSDs that are empty in COL...

Cryptococcidae, found in ITIS here (but in current COL classification it would go under infraorder Coccidomorpha):
Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Acercaria : Hemiptera : Sternorrhyncha : Coccoidea : Cryptococcidae

Coelostomidiidae, found ITIS here (but in current COL classification it would go under infraorder Coccidomorpha):
Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Acercaria : Hemiptera : Sternorrhyncha : Coccoidea : Coelostomidiidae

Forgot another small one:
Callipappidae, found in ITIS here (but in current COL classification it would go under infraorder Coccidomorpha):
Animalia : Bilateria : Protostomia : Ecdysozoa : Arthropoda : Hexapoda : Insecta : Pterygota : Neoptera : Acercaria : Hemiptera : Sternorrhyncha : Coccoidea : Callipappidae

@yroskov
Copy link
Author

yroskov commented Aug 27, 2024

@DaveNicolson, as I can see, children taxa of Cryptococcidae, Coelostomidiidae & Callipappidae are already present in the CoL under different families as a part of global ScaleNet checklist.

Update of entire ScaleNet is in the hands of @gdower. I hope, we'll be able to update all scales in 2024. Keep fingers crossed.

At this moment, we have an awful mess in the classification of scales due to mixture of 2004 ScaleNet data and empty insertions from unnamed source: https://www.catalogueoflife.org/?taxonKey=C22KK (@mdoering, are these artifacts of Extended Catalogue?)

@DaveNicolson
Copy link

OK, great to hear ScaleNet's to be updated in COL!! We're following their current data, and Daniel has exchanged emails with editors who have made edits as a result. I'm really glad to hear it's being update in COL!! I'll omit such updates based on ScaleNet from COL notices in the future.

@yroskov
Copy link
Author

yroskov commented Sep 11, 2024

ITIS of 2024-08-16; imported 2024-09-03

  • Imported: 507,715 spp (vs 507,602 spp)
  • Metadata: version changed according to itis.gov 2024-08-20 / 2024-08-20 --> 2024-08-16 / 2024-08-20
    Geoff: The website is wrong. I would keep it as 2024-08-20. It's the date in the data archive that matters. The 08-16 version had the corrupted hierarchy. Changed back 2024-08-16 / 2024-08-20 --> 2024-08-20 / 2024-08-20
  • Classification: ok
  • Sectors: ok ("attached")
  • Extinct taxa flag: OK

Metrics

image

TASKS

image

  • Manuscript names, 5 of 17 (only synonyms left without decision = checked 2024-09-11)
  • Broken decisions, 31 = deleted

image

Synced 2024-09-11

@DaveNicolson
Copy link

A new version of ITIS, dated 20 Sep 2024, is available in the usual download page. I separately sent Geoff the new list of extinct TSNs from ITIS.

This version includes a number of updates in groups COL takes from other sources (mainly in Coccoidea/Coccomorpha), but no new/updated GSDs for COL.

@yroskov
Copy link
Author

yroskov commented Oct 3, 2024

ITIS of 2024-09-20; imported 2024-10-01

Metrics

image

TASKS

image

  • Manuscript names, 0 of 17 = 5 new accepted flagged as "prov. accepted"
  • Broken decisions, 15 = deleted
  • Outdated decisions, 0

Resolved 2024-10-03:

image

Synced 2024-10-03

Button "Sync all sectors from dataset 2144" has been used. The list showed 5 "merged" sectors despite the option "Include merged sources" not being activated:

image

@DaveNicolson
Copy link

There is a new version of ITIS available for download and use, although the updates added this month are all in groups that COL doesn't take from ITIS. Separately, I emailed Geoff the updated list of TSNs for valid/accepted taxa that are extinct.

@yroskov
Copy link
Author

yroskov commented Nov 1, 2024

  • Sectors: family Formicidae excluded from ITIS sectors (replaced by AncCat) 2024-11-01

@yroskov
Copy link
Author

yroskov commented Nov 6, 2024

ITIS of 2024-10-22; imported 2024-11-04

  • Imported: 509,359 spp (vs 508,181 spp)
  • Metadata: OK
  • Classification: OK
  • Sectors: OK ("attached")
  • Extinct taxa flag: OK (1143 spp)

Metrics

image

TASKS

image

  • Manuscript names, 17 = 5 prov acc = Request failed with status code 400
  • Broken decisions, 21 = deleted
  • Outdated decisions, 1 = no action

Resolved 2024-11-06:

image

Synced 2024-11-06 (see the comment in October)

@DaveNicolson
Copy link

The November 2024 ITIS load is complete, and in addition to updating some bird groups, we have also added a GSD covering the bacterial Class Cyanophyceae (cynaobacteria). ITIS acquired the list via contract with workers at UFlorida, and integrated the existing ITIS data into a clean list.

The parent of the group in ITIS is a new named published recently, Phylum Cyanobacteriota, and this phylum will remain incomplete until we can fold in the other tiny class with its new name Class Vampirovibrionophyceae (which isn't photosynthetic!)... so far it looks like there is just a single species (with genus, family and order), and we'll have to move that from where we placed it back a dozen years ago, but none of those additional modifications can happen at least until the December load.

Class Cyanophyceae now contains 4828 valid/accepted species, and authorship and unacceptability reasons follow the 'botanical' Code, where it has traditionally been treated.

I'm not sure how COL will want to handle this, since it is WITHIN the older GSD for Kingdom Bacteria, but hopefully it won't cause problems!

I'll email Yury and Geoff the new list of extinct TSNs momentarily.

@yroskov
Copy link
Author

yroskov commented Dec 2, 2024

ITIS of 2024-11-19; imported 2024-12-02

  • Imported: 513,610 spp (vs 509,359 spp)
  • Metadata: OK
  • Classification: OK
  • Sectors: OK ("attached").
    New global checklist for the class Cyanophyceae: I haven't heard back @dhobern yet, but entire phylum Cyanobacteriota will be automatically updated in December (incl. Cyanophyceae) without my interference, because ITIS provides whole kingdom Bacteria as a sector.
  • Extinct taxa flag: OK (1157 spp)

Metrics

image

TASKS

image

  • Manuscript names, 0 of 17 = only 4 accepted species in Plantae ("comb. ined.") = no action
  • Broken decisions, 16 = deleted

Resolved 2024-12-02:

image

Synced 2024-12-02

@DaveNicolson
Copy link

New ITIS version is available from the usual place. Updates are all in groups COL gets from ITIS (including birds, Amblypygi, a tiny class (Vampirovibrionophyceae, predatory "cyanobacteria" not covered by the cyanobacteria list ITIS loaded last month) that completes the group containing cyanobacteria, and some tweaks in Mammalia). I sent the new list of TSNs to be marked as "extinct" to Geoff and Yuri.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants