Difference between revisions of "Querying Lingua Libre"

Latest revision as of 14:51, 20 November 2022

Core structures

The data model covers three core concepts of Lingua Libre: languages, speakers, and recordings. Items exist for each of these three concepts. Those items have properties, which in turn have values. Together, these make up the database.

Language

See also Help:SPARQL#Languages, LinguaLibre:List of languages

This is the language of a speaker or of a recording. This property may point to a language defined on Wikidata (wikidata:Q34770).

Speaker

See also Help:SPARQL#Speakers, DataViz:Speakers.

The speaker is the person who pronounced one or more words in an audio recording. Various details are stored about the speaker, such as their residence at the time of recording and their native tongue.

Audio recordings

See also Help:SPARQL#Recordings, DataViz:Records.

Every recording created with the Record Wizard is added into the database along with some metadata that includes the date the recording was created, the speaker and the language.
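
The three concepts above can be pictured as linked records. The sketch below is only an illustration of how they relate: the class names, field names and item IDs are hypothetical and are not the actual Lingua Libre property names (see Special:ListProperties for those).

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class Language:
    item_id: str                       # hypothetical Lingua Libre item ID
    wikidata_id: Optional[str] = None  # matching Wikidata item, when one is linked


@dataclass(frozen=True)
class Speaker:
    item_id: str
    residence: Optional[str] = None           # residence at the time of recording
    native_tongue: Optional[Language] = None  # the speaker's native language


@dataclass(frozen=True)
class Recording:
    item_id: str
    created: date       # date the recording was created
    speaker: Speaker    # who pronounced the word(s)
    language: Language  # language of the recording


# All values below are invented placeholders, not real items.
language = Language(item_id="Q-language-example")
speaker = Speaker(item_id="Q-speaker-example",
                  residence="Example City",
                  native_tongue=language)
recording = Recording(item_id="Q-recording-example",
                      created=date(2022, 11, 20),
                      speaker=speaker,
                      language=language)
```

Each recording points to exactly one speaker and one language, which mirrors the metadata listed above.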

Querying the data

SPARQL endpoints

Lingua Libre's SPARQL endpoint – to query Wikidata from Lingua Libre, use SERVICE <https://wikidata.org/sparql>.
Wikidata Query Service – to query Lingua Libre from Wikidata, use SERVICE <https://lingualibre.org/sparql>.
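
As a rough sketch of how the SERVICE clause fits into a query, the Python snippet below assembles a federated query string and the URL that would carry it. It sends nothing over the network. The triple patterns are generic placeholders rather than real Lingua Libre properties, the helper names are hypothetical, and passing the query in a `query` URL parameter assumes the endpoint follows the standard SPARQL protocol.

```python
from urllib.parse import urlencode

# Both URLs are taken from the list above.
LINGUALIBRE_ENDPOINT = "https://lingualibre.org/sparql"
WIKIDATA_SERVICE = "https://wikidata.org/sparql"


def federated_query(local_pattern: str, remote_pattern: str,
                    service_url: str, limit: int = 5) -> str:
    """Wrap a remote triple pattern in a SERVICE clause (hypothetical helper)."""
    return (
        "SELECT * WHERE {\n"
        f"  {local_pattern}\n"
        f"  SERVICE <{service_url}> {{\n"
        f"    {remote_pattern}\n"
        "  }\n"
        f"}} LIMIT {limit}"
    )


def request_url(endpoint: str, query: str) -> str:
    """Build a GET URL with the query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


# Placeholder patterns: any local triple whose object is also described remotely.
query = federated_query(
    "?item ?property ?wikidataItem .",
    "?wikidataItem ?wdProperty ?wdValue .",
    WIKIDATA_SERVICE,
)
url = request_url(LINGUALIBRE_ENDPOINT, query)
print(query)
```

To query in the other direction, swap the roles: send the query to the Wikidata Query Service and pass `https://lingualibre.org/sparql` as the service URL.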

SPARQL helpers

Help:SPARQL – examples of SPARQL queries
- Special:ListProperties – list of all properties used on Lingua Libre
Help:SPARQL 2 (stub) – examples of advanced SPARQL queries

API helpers

Help:APIs – API queries relevant to Lingua Libre, including Commons.
Special:ApiSandbox – API query generator for Lingua Libre wiki page and Wikibase content.
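
As an illustration of the kind of request Special:ApiSandbox generates, the sketch below builds a URL for the Wikibase `wbgetentities` action. It only constructs the URL and makes no request. The `api.php` location and the item ID `Q1` are assumptions made for the example; check Help:APIs or the sandbox for the actual values.

```python
from urllib.parse import urlencode

# Assumed API location; verify it with Special:ApiSandbox.
API_ENDPOINT = "https://lingualibre.org/api.php"


def wikibase_entity_url(entity_id: str, api: str = API_ENDPOINT) -> str:
    """Build a wbgetentities request URL for one item (hypothetical helper)."""
    params = {
        "action": "wbgetentities",  # Wikibase action that returns entity data
        "ids": entity_id,           # item ID; "Q1" below is a placeholder
        "format": "json",
    }
    return api + "?" + urlencode(params)


url = wikibase_entity_url("Q1")
print(url)
```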

Modifying the data

(This section needs an author.)

@@ Line 1: / Line 1: @@
-The sound library is structured in a [[wikipedia:Semantic Web|Semantic Web]] database where all the recordings and their data are described.
+{{#Subtitle:LinguaLibre audios are structured in a [[wikipedia:Semantic Web|Semantic Web]] database where all the recordings and their data are described. This page expose the core structure and redirect you to the relevant documentations or help pages.}}
+__TOC__
+== Core structures ==
+The data model includes 3 core concepts of Lingua Libre : language, speakers, recordings. Items exist for each of these 3 dimensions. Those items have [[Special:ListProperties|properties]], which themselves have various values. This, all together, creates our database.
-== Overview ==
+<center>[[File:LinguaLibre - Data Model Overview.svg|center|Overview of the data model of the Lingua Libre database|100%]]</center>
-The data model includes 3 core concepts of Lingua Libre.
+====Language====
+:''See also [[Help:SPARQL#Languages]], [[Special:MyLanguage/LinguaLibre:List of languages|LinguaLibre:List of languages]]''
-[[File:LinguaLibre - Data Model Overview.svg|thumb|Overview of the data model of the Lingua Libre database]]
+This is the language of a speaker or of a recording. This property may point to a language defined on Wikidata ([[wikidata:Q34770]]).
-=== Audio recordings ===
-Every recording created with the Record Wizard is added into the database along with some metadata that includes the date the recording was created, the speaker and the language.
-The list of audio recordings is available on this DataViz: [[DataViz:Records]].
-=== Language ===
+====Speaker====
-This is the language of a speaker or of a recording. This property may point to a language defined on Wikidata ([[wikidata:Q34770]]).
+:''See also [[Help:SPARQL#Speakers]], [[Special:MyLanguage/DataViz:Speakers|DataViz:Speakers]].''
-=== Speaker ===
 The speaker is the person that pronounced one or several words in an audio recording. There are various information on the speaker such as their residence at the time of recording and their native tongue.
-The list of speakers on Lingua Libre is available on this DataViz: [[DataViz:Speakers]].
+====Audio recordings====
+:''See also [[Help:SPARQL#Recordings]], [[Special:MyLanguage/DataViz:Records|DataViz:Records]].''
+Every recording created with the Record Wizard is added into the database along with some metadata that includes the date the recording was created, the speaker and the language.
 == Querying the data ==
+'''SPARQL end points'''
+* [//lingualibre.org/bigdata/#query LinguaLibre's SPARQL endpoint] – to query Wikidata from Lingualibre, use <code><nowiki>SERVICE <https://wikidata.org/sparql></nowiki></code>.
+* [[d:Special:MyLanguage/Wikidata:SPARQL_query_service|Wikidata Query Service]] – to query Lingualibre from Wikidata, use <code><nowiki>SERVICE <https://lingualibre.org/sparql></nowiki></code>.
-The data can be queried using [//lingualibre.fr/bigdata/#query LinguaLibre's SPARQL endpoint].
+'''SPARQL helpers'''
+* [[Special:MyLanguage/Help:SPARQL|Help:SPARQL]] – examples of SPARQL queries
+** [[Special:ListProperties]] – list of all properties used on Lingua Libre
+* [[Special:MyLanguage/Help:SPARQL 2|Help:SPARQL 2]] (stub) – examples of advanced SPARQL queries
-* [[Wikidata:SPARQL query service/A gentle introduction to the Wikidata Query Service]]
+'''API helpers'''
-* [[LinguaLibre:Sparql|Example SPARQL queries]]
+* [[Help:APIs]] – APIs queries relevant to LinguaLibre, including Commons.
+* [[Special:ApiSandbox]] – API queries generator for Lingualibre wikipage and wikibase contents.
 == Modifying the data ==
-In progress
+:''(This section needs an author.)''
 == Further reading ==
-* [[wikidata:Help:Navigating Wikidata]]
+* [[wikidata:Special:MyLanguage/Help:Navigating Wikidata|Help:Navigating Wikidata]] on Wikidata
+== See also ==
+{{technicals}}
+[[Category:Lingua Libre:Help]]

Lingua Libre audio recordings are structured in a Semantic Web database where all the recordings and their data are described. This page presents the core structure and directs you to the relevant documentation and help pages.