LinguaLibre
Wikidata
Wikidata.org is a Wikimedia project that stores structured data for Wikimedia projects and for the world at large. The data are released under the CC0 license and are freely accessible through several tools: queries, dumps and APIs. Wikidata and LinguaLibre both run on Wikibase, a software for storing and collaboratively editing structured data. Wikidata also has a lexeme side that could collaborate more closely with LinguaLibre, but doing so requires working with bots and the associated development skills.
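As an illustration of the API access mentioned above, here is a minimal Python sketch that builds a request URL for the Wikidata `wbgetentities` action. The helper name `build_entity_request` and the example identifiers are chosen for illustration only; the sketch only builds the URL and does not send the request.

```python
from urllib.parse import urlencode

# Public endpoint of the Wikidata MediaWiki API.
WIKIDATA_API = "https://www.wikidata.org/w/api.php"


def build_entity_request(entity_ids, api_url=WIKIDATA_API):
    """Return a wbgetentities URL for the given entity ids (hypothetical helper)."""
    params = {
        "action": "wbgetentities",
        "ids": "|".join(entity_ids),  # several ids are separated by "|"
        "format": "json",
    }
    return f"{api_url}?{urlencode(params)}"


# Example: one item (Q42) and one lexeme (L1) in a single request.
url = build_entity_request(["Q42", "L1"])
print(url)
```

The resulting URL can be opened in a browser or fetched with any HTTP client. Since LinguaLibre runs on the same Wikibase software, passing a different `api_url` should work in the same way, although that is an assumption to check against the LinguaLibre instance.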
Wikidata items
- See also Help:SPARQL 2#Notable elements.
Draft
Wikidata's properties
- See also Help:SPARQL 2#Notable elements.
The Wikidata properties most relevant to LinguaLibre are:
- list to complete
LinguaLibre's properties
- See also Special:ListProperties.
LinguaLibre uses Wikibase, the same database system as Wikidata, to crowdsource a large database of multilingual audio recordings. Each audio recording is associated with a few properties, mostly about:
- the speaker
- the language used
- some system information: the word, the URL on Wikimedia Commons, etc.
Wikidata Lexeme
Lexemes are the place (the namespace) where lexicographical data are stored in Wikidata. Lexemes are lexical units, words or expressions, that contain senses and forms.
These forms can store recordings such as those from LinguaLibre. As of February 2021, 44,363 forms use a LinguaLibre file ([1]).
- wikidata:Special:MyLanguage/Wikidata:Lexicographical_data the project for the editorial part
- mw:Extension:WikibaseLexeme/Data Model for the technical underlying model
- wikidata:Wikidata:Tools/Lexicographical_data
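To make the link between forms and recordings more concrete, here is a minimal Python sketch that walks the JSON of one lexeme and collects the audio files attached to its forms. It assumes the recordings are stored with the Wikidata property P443 (pronunciation audio) and that LinguaLibre files can be recognised by an `LL-` file name prefix. The function name, the sample lexeme and the file name are invented for illustration.

```python
def lingualibre_audio_by_form(lexeme, audio_property="P443", prefix="LL-"):
    """Map form ids to the LinguaLibre audio file names found on each form.

    Assumptions: audio is stored under `audio_property`, and LinguaLibre
    uploads have file names starting with `prefix`.
    """
    result = {}
    for form in lexeme.get("forms", []):
        claims = form.get("claims") or {}  # an empty claim set may not be a dict
        files = []
        for claim in claims.get(audio_property, []):
            snak = claim.get("mainsnak", {})
            if snak.get("snaktype") != "value":
                continue  # skip "no value" / "unknown value" statements
            filename = snak["datavalue"]["value"]
            if filename.startswith(prefix):
                files.append(filename)
        if files:
            result[form["id"]] = files
    return result


# Invented sample, trimmed to the fields the function reads.
sample_lexeme = {
    "id": "L0",
    "forms": [
        {
            "id": "L0-F1",
            "claims": {
                "P443": [
                    {
                        "mainsnak": {
                            "snaktype": "value",
                            "datavalue": {
                                "value": "LL-Q150 (fra)-ExampleSpeaker-maison.wav",
                                "type": "string",
                            },
                        }
                    }
                ]
            },
        },
        {"id": "L0-F2", "claims": {}},
    ],
}

found = lingualibre_audio_by_form(sample_lexeme)
print(found)
```

In practice the `lexeme` dictionary would come from the Wikidata API or from a dump rather than from a hand-written sample.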
See also
- LinguaLibre:List_of_languages
- Special:ListProperties
- Wikidata:Pywikibot_-_Python_3_Tutorial/Lexeme on Wikidata.
- https://ordia.toolforge.org/language/ - monitor Wikidata lexical coverage
- Help:Bots
- A Brief Introduction to Wikidata
- Help:SPARQL