
Difference between revisions of "Querying Lingua Libre"

Lingua Libre audios are structured in a database where all the recordings and their data are described. This page presents the core structure and points you to the relevant documentation and help pages.

(→‎Querying the data: adding content)
 
(36 intermediate revisions by 8 users not shown)
Line 1: Line 1:
The sound library is structured in a [[wikipedia:Semantic Web|Semantic Web]] database where all the recordings and their data are described.
+
{{#Subtitle:Lingua Libre audios are structured in a [[wikipedia:Semantic Web|Semantic Web]] database where all the recordings and their data are described. This page presents the core structure and points you to the relevant documentation and help pages.}}
 +
__TOC__
 +
== Core structures ==
 +
The data model includes three core concepts of Lingua Libre: languages, speakers, and recordings. Items exist for each of these three concepts. Those items have [[Special:ListProperties|properties]], which in turn have values. Together, these items, properties, and values make up the database.
  
== Overview ==
+
<center>[[File:LinguaLibre - Data Model Overview.svg|center|Overview of the data model of the Lingua Libre database|100%]]</center>
The data model includes 3 core concepts of Lingua Libre.
+
====Language====
 +
:''See also [[Help:SPARQL#Languages]], [[Special:MyLanguage/LinguaLibre:List of languages|LinguaLibre:List of languages]]''
  
[[File:LinguaLibre - Data Model Overview.svg|thumb|Overview of the data model of the Lingua Libre database]]
+
This is the language of a speaker or of a recording. This property may point to a language defined on Wikidata ([[wikidata:Q34770]]).
 
 
=== Audio recordings ===
 
Every recording created with the Record Wizard is added into the database along with some metadata that includes the date the recording was created, the speaker and the language.
 
 
 
The list of audio recordings is available on this DataViz: [[DataViz:Records]].
 
  
=== Language ===
+
====Speaker====
This is the language of a speaker or of a recording. This property may point to a language defined on Wikidata ([[wikidata:Q34770]]).
+
:''See also [[Help:SPARQL#Speakers]], [[Special:MyLanguage/DataViz:Speakers|DataViz:Speakers]].''
  
=== Speaker ===
 
 
The speaker is the person who pronounced one or several words in an audio recording. Various details are stored about the speaker, such as their residence at the time of recording and their native language.
 
The speaker is the person who pronounced one or several words in an audio recording. Various details are stored about the speaker, such as their residence at the time of recording and their native language.
  
The list of speakers on Lingua Libre is available on this DataViz: [[DataViz:Speakers]].
+
====Audio recordings====
 +
:''See also [[Help:SPARQL#Recordings]], [[Special:MyLanguage/DataViz:Records|DataViz:Records]].''
 +
Every recording created with the Record Wizard is added into the database along with some metadata that includes the date the recording was created, the speaker and the language.
  
 
== Querying the data ==
 
== Querying the data ==
 +
'''SPARQL endpoints'''
 +
* [//lingualibre.org/bigdata/#query LinguaLibre's SPARQL endpoint] – to query Wikidata from Lingualibre, use <code><nowiki>SERVICE <https://wikidata.org/sparql></nowiki></code>.
 +
* [[d:Special:MyLanguage/Wikidata:SPARQL_query_service|Wikidata Query Service]] – to query Lingualibre from Wikidata, use <code><nowiki>SERVICE <https://lingualibre.org/sparql></nowiki></code>.
  
The data can be queried using [//lingualibre.fr/bigdata/#query LinguaLibre's SPARQL endpoint].
+
'''SPARQL helpers'''
 +
* [[Special:MyLanguage/Help:SPARQL|Help:SPARQL]] – examples of SPARQL queries
 +
** [[Special:ListProperties]] – list of all properties used on Lingua Libre
 +
* [[Special:MyLanguage/Help:SPARQL 2|Help:SPARQL 2]] (stub) – examples of advanced SPARQL queries
  
* [[Wikidata:SPARQL query service/A gentle introduction to the Wikidata Query Service]]
+
'''API helpers'''
* [[LinguaLibre:Sparql|Example SPARQL queries]]
+
* [[Help:APIs]] – API queries relevant to Lingua Libre, including Commons.
 +
* [[Special:ApiSandbox]] – API query generator for Lingua Libre wiki page and Wikibase content.
  
 
== Modifying the data ==
 
== Modifying the data ==
In progress
+
:''(This section needs an author.)''
  
 
== Further reading ==
 
== Further reading ==
* [[wikidata:Help:Navigating Wikidata]]
+
* [[wikidata:Special:MyLanguage/Help:Navigating Wikidata|Help:Navigating Wikidata]] on Wikidata
 +
== See also ==
 +
{{technicals}}
 +
[[Category:Lingua Libre:Help]]

Latest revision as of 14:51, 20 November 2022

Core structures

The data model includes three core concepts of Lingua Libre: languages, speakers, and recordings. Items exist for each of these three concepts. Those items have properties, which in turn have values. Together, these items, properties, and values make up the database.

[Figure: Overview of the data model of the Lingua Libre database]

Language

See also Help:SPARQL#Languages, LinguaLibre:List of languages

This is the language of a speaker or of a recording. This property may point to a language defined on Wikidata (wikidata:Q34770).

Speaker

See also Help:SPARQL#Speakers, DataViz:Speakers.

The speaker is the person who pronounced one or several words in an audio recording. Various details are stored about the speaker, such as their residence at the time of recording and their native language.

Audio recordings

See also Help:SPARQL#Recordings, DataViz:Records.

Every recording created with the Record Wizard is added into the database along with some metadata that includes the date the recording was created, the speaker and the language.
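The three concepts above can be pictured as linked records. The following is only an illustrative sketch in Python: the class and field names are hypothetical and are not the actual Lingua Libre item or property identifiers (see Special:ListProperties for those).

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class Language:
    """A language item; may point to a language defined on Wikidata."""
    label: str
    wikidata_id: Optional[str] = None  # hypothetical field name


@dataclass(frozen=True)
class Speaker:
    """The person who pronounced the words in a recording."""
    name: str
    residence: str             # residence at the time of recording
    native_language: Language


@dataclass(frozen=True)
class Recording:
    """One audio recording with the metadata described above."""
    transcription: str         # hypothetical field: the word(s) pronounced
    created: date              # date the recording was created
    speaker: Speaker
    language: Language


# Made-up example values, for illustration only.
french = Language(label="French", wikidata_id="Q150")
speaker = Speaker(name="ExampleSpeaker", residence="Lyon", native_language=french)
recording = Recording("bonjour", date(2022, 11, 20), speaker, french)
```

A recording references both a speaker and a language, and the speaker references a language again, which matches the three linked concepts described in this section.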

Querying the data

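SPARQL endpoints

  • LinguaLibre's SPARQL endpoint – to query Wikidata from Lingua Libre, use SERVICE <https://wikidata.org/sparql>.
  • Wikidata Query Service – to query Lingua Libre from Wikidata, use SERVICE <https://lingualibre.org/sparql>.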

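SPARQL helpers

  • Help:SPARQL – examples of SPARQL queries
      • Special:ListProperties – list of all properties used on Lingua Libre
  • Help:SPARQL 2 (stub) – examples of advanced SPARQL queries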
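As a minimal sketch of how a script might send a query to the endpoint, the Python below builds a standard SPARQL-protocol GET request against https://lingualibre.org/sparql, the address used in the SERVICE clause on this page. The helper names are hypothetical, the query is a generic triple pattern that uses no Lingua Libre property, and the User-Agent string is a placeholder. It is assumed, not confirmed here, that the endpoint returns JSON when asked through the Accept header.

```python
import json
import urllib.parse
import urllib.request

# Endpoint address taken from the SERVICE clause quoted on this page.
LINGUALIBRE_SPARQL = "https://lingualibre.org/sparql"


def build_query(limit: int = 5) -> str:
    """Return a generic query listing a few triples (no specific properties)."""
    return (
        "SELECT ?subject ?predicate ?object "
        f"WHERE {{ ?subject ?predicate ?object }} LIMIT {int(limit)}"
    )


def wrap_in_service(pattern: str, service_url: str = LINGUALIBRE_SPARQL) -> str:
    """Wrap a graph pattern in a federated SERVICE clause."""
    return f"SERVICE <{service_url}> {{ {pattern} }}"


def build_url(query: str, endpoint: str = LINGUALIBRE_SPARQL) -> str:
    """Encode the query as the 'query' parameter of a GET request."""
    return endpoint + "?" + urllib.parse.urlencode({"query": query})


def run_query(query: str, endpoint: str = LINGUALIBRE_SPARQL) -> list:
    """Send the query and return the result bindings (performs a network call)."""
    request = urllib.request.Request(
        build_url(query, endpoint),
        headers={
            "Accept": "application/sparql-results+json",
            "User-Agent": "example-script/0.1",  # placeholder value
        },
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)["results"]["bindings"]
```

Calling run_query(build_query(3)) would perform the actual request. wrap_in_service("?s ?p ?o") produces the kind of SERVICE block that can be embedded in a query run from the other service.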

API helpers

  • Help:APIs – API queries relevant to Lingua Libre, including Commons.
  • Special:ApiSandbox – API query generator for Lingua Libre wiki page and Wikibase content.
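The requests generated by Special:ApiSandbox are ordinary MediaWiki action API URLs. As a hedged sketch, the Python below assembles such a URL. The endpoint path https://lingualibre.org/api.php is an assumption (the usual MediaWiki location, not stated on this page), and build_api_url is a hypothetical helper name. The parameters action=query and meta=siteinfo are standard MediaWiki API parameters.

```python
import urllib.parse

# Assumption: the wiki exposes the MediaWiki action API at the usual path.
API_ENDPOINT = "https://lingualibre.org/api.php"


def build_api_url(params: dict, endpoint: str = API_ENDPOINT) -> str:
    """Return a GET URL for the action API, asking for JSON by default."""
    merged = {"format": "json", **params}
    return endpoint + "?" + urllib.parse.urlencode(merged)


# Example: general site information, a harmless read-only request.
siteinfo_url = build_api_url({"action": "query", "meta": "siteinfo"})
```

The resulting URL can be opened in a browser or fetched with any HTTP client. Special:ApiSandbox remains the simplest way to discover which parameters a given module accepts.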

Modifying the data

(This section needs an author.)

Further reading

  • Help:Navigating Wikidata on Wikidata
