A collection of resources, overviews, links and knowlegde related to Wikibase, collected and curated by KB, national library of the Netherlands.
This page was originally extracted from the slides of the lecture Introduction Wikibase, the basics, for employeees of KB, national library of the Netherlands on 7 September 2023. This (rather long) slidedeck is available on Wikimedia Commons and Zenodo.
This overview is heavily inspired by the Wikibase knowledge graphs, A collection of open source tools and resources related to Wikibase knowledge graphs by Renat Shigapov. I've used his centralized overview many times to improve my understanding of the Wikibase universe, and hope to extend it via the overview below, and help others in a similar way.
Motivation
I created this page for several reasons:
- As a textual summary of my (rather exuberant, visual) presentation of September 7th.
- From my own experience (especially if you are new to Wikibase) I know that it can be very time-consuming to discover and understand the different corners of the Wikibase world, and to find your way in this rather confusing forrest/jungle. I hope this centralized knowledge hub can help others make that journey a little easier/faster.
- I have only been able to acquire most of my Wikibase knowledge thanks to the openness and generosity of the international Wikibase community and their willingness to make knowledge findable, visible and reusable for free. So I think it is no more than 'good reciprocal decency' to contribute to the community this overview of centralized and summarized knowledge.
- As a place to record my "public personal memory of interesting Wikibase stuff".
Contributing to this page
This page is maintained by Olaf Janssen, Wikimedia coordinator of KB. See his Wikidata user page and expert page on kb.nl for contact details.
I plan to improve & expand the overview in the future. If you would like to contribute, please let me know.
Reuse and licensing
This overview can be reused freely and openly, it is available under the CC-BY 4.0 license, so attribution is required. Use something like
Wikibase resources, Olaf Janssen & KB national library of the Netherlands, https://github.com/KBNLwikimedia/Wikibase-resources
Latest updates
Latest update: Version 0.3, 6 December 2023 (added Wikibase projects for medieval manuscripts)
- Recap Wikidata
- What is Wikibase?
- Wikibase courses and tutorials
- Examples of institutions/projects using Wikibase
- Wikibase components & architecture
- Wikibase data model
- Wikibase hosting
- Requesting data from a Wikibase
- Adding data to a Wikibase
- Wikibase community
- Staying updated
- Finding help
Table of contents generated with markdown-toc
This section ia a summary of the lecture Introduction to Wikidata (Zenodo) (in Dutch)
-
Wikidata (d.d. 7 Sept 2023) contains structured descriptions of 107 million things, since October 2012
-
Wikidata items with geo location (June 2023) - Older/other maps - Interactive map
-
What are the principles of Wikidata? (see also the Wikidata introduction)
- Structured descriptions of things, eg. Eiffel Tower
- Central storage (vs. distributed in data silos)
- Multilingual (200+ languages) - Description of the Eiffel Tower in English, Dutch, Portuguese and Japanese etc.
- Linked data
- Things, not strings, no flat text strings, but clicklable links
- Interconcected Wikidata items, eg. Eiffel Tower --> named after --> Gustave Eiffel
- Wikidata is connected to pm 8.200 external databases
- Open & free
- Free, no trackers, no ads, no usage fee.
- No copyright or database rights, all data is available under the CC0 license.
- Everyone can reuse data: query, share, copy , edit, download, sell, etc.
- Anyone can contribute, edit, add, improve, delete, merge data, etc. --> Community
- Community
- International
- Pm. 24K editors
- Under the flag of the Wikimedia Foundation. Wikidata is a sister project of Wikipedia, Wikimedia Commons etc.
- For humans and machines
- Human readable, human writable --> Data available via GUIs in HTML
- Machine readable, machine writable --> Data available via APIs in JSON , XML/RDF, CSV etc.
-
Wikidata is a secondary, general purpose, public knowledge base for the world.
- Secondary
- General purpose
- Wide scope of topics/classes
- Items contain relatively basic data (limited set of properties), Wikidata is not aimed at superspecialistic/deep data
- Public
- Public data
- Without copyright issues
- Without privacy issues (more info)
- Institutional use cases for which Wikidata is not suitable
- Publish domain-specific / specialist / 'esoteric' LOD
- Publish very large LOD sets (e.g. catalogues, thesauri)
- Use of very specific/complex/deep/layered data models
- Control over who can add/change data
- Collaboration with selected partners in a closed, controlled environment
- Recording non-public data
- Own control over hosting / IT infrastructure
- The open-source, free software that powers Wikidata - "Wikibase is essentially a blank copy of Wikidata into which you can put your own structured data" (source),
- Allowing you to build & manage your own LOD knowledge base,
- Without the disadvantages of Wikidata,
- You can create your own data models, as domain-specific/specialist/esoteric as you want/need.
- Large data sets are no problem
- Custom rights management (control over who is allowed to contribute)
- Wikibase instances can be non-public
- You can host your own instance
- With all the benefits of Wikidata
- Focused on collaboration and connection (including Wikidata community)
- For people and machines
- User-friendly GUI for structured data
- Native multilingualism support
- Version history and control, rollbacks
- Clear ontology: Items, Properties, Statements etc.
- Output in various data formats (including JSON, RDF/XML, N3)
- Search via SPARQL
- Well suported and documented MediaWiki API
- Support for tools (including OpenRefine)
- Documentation for Wikidata is in general applicable to Wikibase as well
-
Wikibase: the advantages of Wikidata, without the disadvantages (article in Dutch by Olaf Janssen)
-
Wikibase website: https://wikiba.se/
-
The bigger picture, according to the Wikimedia Foundation: Wikidata-Wikibase joint vision and the Wikibase ecosystem
-
Introduction to Wikibase and Wikibase Cloud by Georgina Burnett (WMDE), 2022 LD4 conference, 12th July 2022
-
History of Wikibase (until Febr 2022) (archived version)
See also Awesome Wikibase tutorials collected by Renat Shigapov.
- Wikibase and Semantic MediaWiki for data-driven semantics, EU Academy, online
In this course, you will learn how community-led tools and platforms like Wikibase, Semantic MediaWiki, and Wikidata can be used to create a data-driven semantic layer in a bottom-up way. - Introduction Wikibase, the basics by KB, national library of the Netherlands. Also available on Wikimedia Commons.
- Wikidata
- Rhizome Artbase
- Rhizome = art organization in NYC
- Artbase = archive of born-digital art 1983-present.
- First Wikibase instance outside of Wikimedia projects.
- Artworks in the ArtBase with more than one artist, visualized as a graph with images.
- Enslaved.org
- LOD platform containing ±1M records (people, events, places, and sources) related to the transatlantic slave trade
- Stories of the Enslaved told using Wikibase, article by Elisabeth Giesemann, 18 February 2021
- Data dumps from the Wikibase
- FactGrid
- Open collaborative international Wikibase knowledge graph for historical research, 312 participants
- FactGrid projects, by era, for example Paris to download, a freely downloadable dataset with geographic coordinates and administrative information from all the houses and streets of Paris in c. 1820. SPARQL: All the houses and streets of Paris c. 1820 (source article by Bruno Belhoste, 14-11-2021)
- The Aviation Safety Network Wikibase
- The ASN provides up-to-date, complete and reliable authoritative information on airliner accidents and safety issues.
- This Wikibase is updated regularly by a large user community and contains descriptions of more than 258,000 accidents involving light aircraft, military aircraft, helicopters, gyroplanes, gliders, hot air balloons and UAVs since 1905.
- EU Knowlegde Graph
- Contains information about 1.9M projects financed by the EU and 700K beneficiaries of European projects
- Wikibase as an Infrastructure for Knowledge Graphs: the EU Knowledge Graph, D. Diefenbach, M. De Wilde and S.Alipio, 30 September 2021 - WikidataCon 2021 video recording.
- Kohesio website is a frontend of the EU Knowledge Graph
- European Commission goes Open Source: New Project Kohesio uses Wikimedia’s Software Wikibase, 17-03-2022
- Explore EU projects in your country/region/neighbourhood, such as 1.259 EU projects in the Dutch province of South-Holland (dd. Sept 2023)
- Kunstmuseum API
- Wikibase by Kunstmuseum Den Haag to import and deliver data for the websites Delftsaardewerk.nl, Van Gogh Worldwide and Aziatischekeramiek.nl from various museums via a central API.
- For instance Object 0400098 in the collection of the Kunstmuseum Den Haag: on Delftsaardewerk.nl and in the Wikibase
- Deutsche Nationalbibliothek (DNB) and GND
- GND = Gemeinsame Normdatei, Integrated authority file for German speaking countries, 8 M authority records on persons, corporate bodies, subject headings, geographical names, works etc.
- Wikibase as a second home for the GND (p. 165) - A Wikibase to collaboratively edit and maintain authority records for the entire GLAM field and digital humanities.
- Could you wikify an authority file? Wikibase has been evaluated for the Integrated Authority File (GND) by Barbara Fischer and Jens Ohlig, 4 March 2020
- See also below, Wikibase projects in libraries
- More Wikibase instances
Wikibase is being evaluated by libraries as a tool to help them store and manage their structured data, as well as connect to the world of linked open data.
- Europe
- National Library of Germany (DNB), "GND meets Wikibase", a cooperation: Part 1 and part 2. See also A Voice in the Orchestra of Opening the GND by Barbara Fischer, DNB (p. 145 onwards)
- German National Library of Science and Technology (TIB): Examining Wikidata and Wikibase in the context of research data management applications (video) and Wikidata and Wikibase as complementary research services for cultural heritage data (March 2022)
- TIB/NFDI (Germany): Linked Open Data Management Services: A Comparison, including Wikibase (15 March 2023)
- National Library of France (BnF) (more info).
- DBN & BnF: Wikibase for Cultural Heritage and Academia, Perceived pros and cons of Wikibase as a solution (slide 7)
- National Library of the Czech Republic (more info)
- National Library of Luxembourg (more info)
- National library of Greece (more info). See also Using alternative vocabularies in Wikibase
- National library of the Netherlands (KB)
- National Library of Wales (Semantic Name Authority Repository Cymru)
- USA
- The Smithsonian Libraries (more info)
- OCLC, Wikibase pilot, a.k.a. Project Passage
- Academic libraries USA, Use cases for institutional Wikibase instances
- GLAM network Luxembourg (more info)
- Europeana EAGLE network - aims to build a multi-lingual online collection of millions of digitised items from European museums, libraries, archives and multi-media collections, which deal with the surviving inscriptions of the Greek-Roman world. The EAGLE Wikibase is designed to give a tool to anyone interested in bridging this gap and contributing translations of inscriptions.
- Fotomuseum Antwerpen (more info)
This section is extracted from the presentation Introduction to Wikibase for medieval manuscripts dd 6 December 2023. See the sections about Digital Scriptorium and Biblissima.
Digital Scriptorium
- Digital Scriptorium (DS) is a growing consortium of American institutions with collections of global premodern manuscripts dedicated to building an online national union catalog for manuscripts in US collections.
- The DS Catalog is the first member-supported national union catalog of medieval and early modern manuscripts in US collections built on LOD principles and practices. It connects researchers to pre- and early modern manuscript books in DS member institutions. Built on Wikibase, the DS Catalog aggregates supplied DS member metadata and enriches it by linking to external authorities and resources for enhanced research in a LOD environment.
- Digital Scriptorium catalog
- Search for Book of hours
- First result is Grolier Club, MS 07 (DS203)
- Metadata of DS203:
- DS ID - Shelfmark - Title - Artist - Place - Date - Language - Physical Description - Former Owner(s) - Note - IIIF Manifest - Holding Institution
- Manuscripts made in France
- Wikidata: Master of the Brotherhood of Ste. Catherine
- TGN: France
- AAT: fifteenth century (dates CE)
- The DS catalog metadata originates from the DS Catalog Wikibase
- Grolier Club, MS 07 (DS203) in the Wikibase (Q931)
- Which fields are used in the DS Wikibase?
- Overview of all properties (Ps). See also the DS datamodel on Github
- For example: described manuscript (P3) - title as recorded (P10) - production date as recorded (P23) - production place as recorded (P27)
- Usage of these properties in Excerpta ex operibus Sancti Augustini...(DS501): P3 - P10 - P23 - P27
- DS SPARQL queries
- DS manuscripts: IDs, Titles, Dates, Places, based on P3, P10, P23 and P27
- DS manuscripts: Names and Roles
- DS manuscripts: Authors and Wikidata linking - Example author: Charles VIII of France
- Portraits of DS authors from Wikidata
- More DS SPARQL queries on Github, such as this example, which can be run in the SPARQL interface.
- DS overview article: Wikibase Model for Premodern Manuscript Metadata Harmonization, Linked Data Integration, and Discovery - Mikko Koho, L. P. Coladangelo, Lynn Ransom, and Doug Emery. 2023. J. Comput. Cult. Herit. 16, 3, Article 56 (September 2023), 25 pages. https://doi.org/10.1145/3594723
- DS Github
Biblissima
- Biblissima+ is a French multi-site digital infrastructure for research and service dedicated to the history of the transmission of ancient texts, from Antiquity to the Renaissance, in the West and in the East.
- Biblissima services: Project site - Portal - IIIF - Toolkit - Documentation - Demos - Wikibase
- Biblissima Wikibase: Biblissima's authority files
- Available entitiy types in this Wikibase. Eg.
- All Biblissima Wikibase properties (295 Ps)
- Links to other databases: External IDs (213 Ps)
- Currently no Biblissima SPARQL query service available!
Diefenbach et al. (2021), Wikibase as an Infrastructure for Knowledge Graphs: the EU Knowledge Graph, see "3.1 Wikibase infrastructure"
- Simplified Wikibase data model
- EU Knowledge Graph: “Amsterdam is the capital of the Netherlands" --> Triple:
- Item = The Netherlands (Q19)
- Property = Captital city (P27)
- Value: Amsterdam (Q43)
- Version history and rollback (for Q43)
- Wikibase conceptual data model
- As explained above Wikibase has its own unique data model, which has its limitations. To what extent can other vocabularies (such as RDA and Schema.org) be included into a Wikibase?
- Literature explaining the limitations of the Wikibase model:
- Analysing and promoting ontology interoperability in Wikibase, D.Dobriy and A. Polleres (2022)
- Wikibase as an Infrastructure for Knowledge Graphs: the EU Knowledge Graph, Diefenbach et al. (2021), see "5. Comparing classical approach vs Wikibase"
- Wikibase, or The search for the unicorn, Bergamin, G. (2022), JLIS.It, 13(3), 49–62.
- Analysis of the problem by Marieke Moolenaar (KB, August 2023)
Wikibase uses a derivative of Blazegraph as its linked data storage. Let's call this the Wikibase-Blazegraph-DB. Blazegraph is a so-called graph DB or triple store, so you should be able to store RDF triples in it, thus also RDA/RDF triples and Schema.org triples. Currently, default Wikibase instances are set up in such a way that only Wikibase Q-P-Q triples can be included in the Wikibase-Blazegraph-DB via an update process from the MediaWiki-MySQL database. Schema.org or RDA/RDF triples can never enter the Wikibase-Blazegraph-DB via that update process. So if you want to get triples with other vocabulary into the KB-Wikibase-Blazegraph-DB, you will have to put them in there via another way than via the Wikibase-MediaWiki-GUI and the MediaWiki-MySQL database upate process. - Workaround by National library of Greece (NLG) - Implementing RDA in Wikibase, C. Bratsas and L. Ioannidis of Open Knowledge Greece
- Summary of the NLG approach written by Marieke Moolenaar (KB, August 2023):
The Greek Wikibase is only used as a graphical interface (front-end) for entering thesaurus data by library staff. They do not use their Wikibase to publish linked data, but only for data entry purposes. The reason they do not use Wikibase for publishing linked data is that in Wikibase you can only use the proprietary Wikibase metadata model, thus external vocabularies (such as RDA/RDF) cannot be used in Wikibase. To be able to enter new thesaurus triples via Wikibase screens/GUI, the NLG has made a translation/mapping from RDA/RDF to Wikibase entities (Ps and Qs). They periodically copy all created thesaurus triples from the Wikibase graph to another triple store (Triply). To do this, they translate their Wikibase triples back to RDA/RDF triples. In Triply they store and publish the real RDA/RDF vocabulary triples, which are not (or cannot be) present in Wikibase. - How to load up schema.org data dumps into Blazegraph by Dan Brickley, August 2016
- Which Wikibase should I choose?
- Wikibase Suite and Wikibase Docker - software that you install and run on your own hardware (typically via Docker). Good for users who want to try out Wikibase on their own hardware and who want to customize their installation. Good for institutions with large datasets.
- Wikibase.cloud - free “Wikibase as a service” platform to create Wikibases quickly and easily managed and maintained by Wikimedia Deutschland.
- KB’s sandbox WB instance on Wikibase.cloud
- KB's experiences and first impressions with unboxing, setting up, configuring and tweaking their Wikibase.cloud instance. (See also here)
- Free Wikibase hosting @ Miraheze
- Commercial hosting & services
Theun de Vries in KB sandbox Wikibase GUI, 4 equivalent URLs
- https://kbtestwikibase.wikibase.cloud/entity/Q29
- https://kbtestwikibase.wikibase.cloud/wiki/Item:Q29
- https://kbtestwikibase.wikibase.cloud/wiki/Special:EntityData/Q29
- https://kbtestwikibase.wikibase.cloud/wiki/Special:EntityData?id=Q29&format=html
- JSON: Special:EntityData/Q29.json or Special:EntityData?id=Q29&format=json
- RDF/XML: Special:EntityData/Q29.rdf or Special:EntityData?id=Q29&format=rdf
- Other formats: Q29.jsonld, Q29.ttl, Q29.n3, Q29.nt and Q29.php
- General API for all Wikimedia projects (Wikidata, Wikipedia, Wikimedia Commons etc.)
- API endpoint for KB sandbox WB: https://kbtestwikibase.wikibase.cloud/w/api.php
- wbgetentities and wbgetclaims modules for requesting data
- Q29 as JSON: https://kbtestwikibase.wikibase.cloud/w/api.php?action=wbgetentities&ids=Q29&format=json
- Q29 as XML: https://kbtestwikibase.wikibase.cloud/w/api.php?action=wbgetentities&ids=Q29&format=xml
- FactGrid: Living addresses of Parisian painters in 1834
- FactGrid: Ages of deceased persons
- EU Knowlegde Graph: Buildings of the EU
- KB Wikibase: Medieval manuscripts of the KB
- Digital Scriptorium:
Useful links:
- https://www.mediawiki.org/wiki/Wikibase/Importing
- https://github.com/shigapov/wikibase-knowledge-graphs#data-import
- https://www.wikibase.consulting/fast-bulk-import-into-wikibase/
Using KB's sandbox WB (login required) we can create a NewItem, resulting into an item about the Dutch poet H.H. ter Balkt
- OpenRefine is a well-known tool for editing, enriching and manipulating data. It is widely used to add data to Wikidata and other Wikibase instances.
- OpenRefine-Wikidata introduction workshop, KB, 4-7-2023 (also on Zenodo)
- Documentation: Connecting OpenRefine to a Wikibase instance, Reconciling with Wikibase and Uploading edits to Wikibase
- Wikibase reconcilation services: Wikidata, FactGrid, Kunstmuseum and more
- Connecting OpenRefine to a Wikibase via a manifests.json file: Wikidata, FactGrid, Kunstmuseum and more
- Files for interaction between OpenRefine and KB Wikibases, for reconciling and uploading data to Wikibases of the KB, using Openfine
- OpenRefine to Wikibase: Data Upload Pipeline
- "From formatted .txt or .csv to Wikibase"
- QuickStatements tool + help
- QuickStatements interface in KB sandbox WB (login required)
- WikibaseImport, a MediaWiki extension
- WikibaseIntegrator, a Python library (docs)
- WikidataIntegrator, a Python library (Zenodo)
- WikibaseSync, a Python library (Tutorial)
- wikibase-edit, a NodeJS library (Howto)
- wikibase-cli, a command-line interface
- VanDerBot, a Python application (Github)
- Pywikibot, a Python library (Github, Wikibase scripts)
- RaiseWikibase, a Python tool
- Wikidata-Toolkit, a Java library (Github)
Its mission is to cultivate Wikibase's development and to encourage like-minded developers and data analysts not only to improve Wikibase's existing tools but also to create new ones.
About the WBCUG - WBCUG history - WBCUG members - WBCUG monthly online meetings - The Wikibase Live sessions - WBCUG Mailing list (archives) - WBCUG Telegram (or here)
Commissions production and maintenance of open source extensions to Wikibase, and documentation for institutions that want to operate and maintain a fully-fledged instance of Wikibase. The group will focus on extensions to Wikibase instead of contributing to Wikibase core.
About the WBSG - WBSG members - WBSG meeting calender - WBSG monthly online meeting minutes - WBSG Loomio - WBSG Mastodon - WBSG Twitter
The aim of this group is to bundle and exchange knowledge & experiences about the use of Wikibase, to learn from each other, and to keep each other informed about the (international) developments and opportunities surrounding Wikibase. Membership is open to everyone in the Netherlands who already works with Wikibase, wants to work with, or is otherwise interested in this software. Mainly for - but certainly not limited to - professionals from Dutch heritage and knowledge institutions, and related organizations and companies.
About the WBGNL - WBGNL meetings (also available here) - WBGNL mailing list - WBGNL Loomio
- The first Federated-Wikibase-Workshop: Antwerp, April 2018 (blog)
- Wikibase Workshop in Berlin, June 2018 (blog)
- The Wikibase Summit: New York, September 2018
- Ghent University Wikidata and Wikibase Workshop: developing a Wikibase instance, July 2019
- Wikidata & Wikibase for National Libraries: the inaugural meeting, Stockholm, August 2019
- Wikibase workshop in Tokyo , September 2019
- Wikibase in Knowledge Graph based Research Data Management (NFDI) Projects, online, 23 February 2021 (report)
- JCDL workshop: Open Refine to Wikibase - A New Data Upload Pipeline, June 2022
- First SEMIC workshop on Wikidata and Wikibase, online, 24 January 2023
- Second SEMIC workshop on Wikidata and Wikibase, Brussel, 23 February 2023
- Third SEMIC workshop on Wikidata and Wikibase, online, 28 March 2023
- First Wikibase lexical data workshop, Wenen, Septemer 2023
- Via the meetings, minutes, presentations, mailing lists, socials etc. of the WBCUG, WBSG and WBGNL. See above.
- Wikibase.cloud: Project updates - Mailing list (archives) - Telegram
- Specifically for libraries:
- Wikibase Working Hours, community discussion of Wikidata & Wikibase with the goal of understanding how the library can contribute to and leverage these as a platform for publishing, linking, and enriching library linked data.
- IFLA Wikidata Working Group. This working group will explore and advocate for the use of and contribution to Wikidata by library and information professionals, the integration of Wikidata and Wikibase with library systems, and alignment of the Wikidata ontology with library metadata formats such as BIBFRAME, RDA, and MARC.
- More:
- Wikidata Weekly Summary, also containing WB updates!
- Wikibase yearly summaries by Envel Le Hir: 2023, 2022, 2021 and 2020
- Blog by Addshore, on Wikibase
- Help by community: use all community channels above.
- Tip: Use the Telegram channels if you want help quickly
- Help by Olaf Janssen: Wikidata user page + Expert page on kb.nl
- Wikibase documentation portal (WMDE) + here
- Low-level Wikibase technical documentation
- Wikibase.cloud documentation portal (WMDE) + WB.cloud issue tracker
- Learning Wikibase, a place to learn and share online resources (e.g. articles, videos, workflows, FAQ’s) that make it easier to install, maintain or customize your Wikibase instance.
- Tip!! Wikibase resources overview by Renat Shigapov
- UCLA Library Research Guide on Wikibase and Wikidata