Querying Lingua Libre

LinguaLibre audios are structured in a database where all the recordings and their data are described. This page expose the core structure and redirect you to the relevant documentations or help pages.

Core structures

The data model includes 3 core concepts of Lingua Libre : language, speakers, recordings. Items exist for each of these 3 dimensions. Those items have properties, which themselves have various values. This, all together, creates our database.



See also Help:SPARQL#Languages, LinguaLibre:List of languages

This is the language of a speaker or of a recording. This property may point to a language defined on Wikidata (wikidata:Q34770).


See also Help:SPARQL#Speakers, DataViz:Speakers.

The speaker is the person that pronounced one or several words in an audio recording. There are various information on the speaker such as their residence at the time of recording and their native tongue.

Audio recordings

See also Help:SPARQL#Recordings, DataViz:Records.

Every recording created with the Record Wizard is added into the database along with some metadata that includes the date the recording was created, the speaker and the language.

Querying the data

SPARQL end points

SPARQL helpers

API helpers

  • Help:APIs – APIs queries relevant to LinguaLibre, including Commons.
  • Special:ApiSandbox – API queries generator for Lingualibre wikipage and wikibase contents.

Modifying the data

(This section needs an author.)

Further reading

See also

Lingua Libre technical helps
Template {{Speakers category}} • {{Recommended lists}} • {{To iso 639-2}} • {{To iso 639-3}} • {{Userbox-records}} • {{Bot steps}}
Audio files How to create a frequency list?Convert files formatsDenoise files with SoXRename and mass rename
Bots Help:BotsLinguaLibre:BotHelp:Log in to Lingua Libre with PywikibotLingua Libre Bot (gh) • OlafbotPamputtBotDragons Bot (gh)
MediaWiki MediaWiki: Help:Documentation opérationelle MediawikiHelp:Database structureHelp:CSSHelp:RenameHelp:OAuthLinguaLibre:User rights (rate limit) • Module:Lingua Libre record & {{Lingua Libre record}}JS scripts: MediaWiki:Common.jsLastAudios.jsSoundLibrary.jsItemsSugar.jsLexemeQueriesGenerator.js (pad) • Sparql2data.js (pad) • LanguagesGallery.js (pad) • Gadgets: Gadget-LinguaImporter.jsGadget-Demo.jsGadget-RecentNonAudio.js
Queries Help:APIsHelp:SPARQLSPARQL (intermediate) (stub) • SPARQL for lexemes (stub) • SPARQL for maintenanceLingualibre:Wikidata (stub) • Help:SPARQL (HAL)
Reuses Help:Download datasetsHelp:Embed audio in HTML
Unstable & tests Help:SPARQL/test
Categories Category:Technical reports