LinguaLibre

Difference between revisions of "Stats"

Lingua Libre Alpha was announced in August 2018. By April 2019, LiLi had reached 100,000 audio recordings in 46 languages from 128 different speakers; 200,000 recordings (82 languages, 268 speakers) followed in January 2020, 500,000 recordings (120 languages, 538 speakers) in June 2021, and 1,000,000 recordings (200 languages, 1,400 speakers) in October 2023. By June 2024, there were 1,250,000 recordings (250 languages, 2,050 speakers).

(+ history query)
 
(75 intermediate revisions by 7 users not shown)
Line 1: Line 1:
<languages/>
+
__NOTOC__
 
+
<languages/><indicator name="stats"></indicator>{{#SUBTITLE:<translate> <!--T:13-->
<translate>  
+
Lingua Libre Alpha was announced in August 2018. By April 2019, ''LiLi'' had reached '''100,000''' audio recordings in 46 languages from 128 different speakers; '''200,000''' recordings (82 languages, 268 speakers) followed in January 2020, '''500,000''' recordings (120 languages, 538 speakers) in June 2021, and '''1,000,000''' recordings (200 languages, 1,400 speakers) in October 2023. By June 2024, there were 1,250,000 recordings (250 languages, 2,050 speakers).
<!--T:13-->
+
</translate>}}{{LinguaLibre:Stats/Menu}}
The LinguaLibre Alpha release was announced in August 2018. As of April 1st, 2019 (8 months later), nearly 100,000 audio recordings had been made in 46 languages, thanks to 128 variously active speakers.
+
<translate>
 
 
 
== Global stats == <!--T:4-->
 
== Global stats == <!--T:4-->
 
+
</translate>
<!--T:5-->
+
<query records="<translate><!--T:30--> Records</translate>" speakers="<translate><!--T:31--> Speakers</translate>" languages="<translate><!--T:32--> Languages</translate>">
<query records="Records" speakers="Speakers" languages="Languages">
 
 
SELECT
 
SELECT
 
(COUNT(DISTINCT ?record) AS ?records)
 
(COUNT(DISTINCT ?record) AS ?records)
 
(COUNT(DISTINCT ?speaker) AS ?speakers)
 
(COUNT(DISTINCT ?speaker) AS ?speakers)
 
(COUNT(DISTINCT ?language) AS ?languages)
 
(COUNT(DISTINCT ?language) AS ?languages)
 +
# see Help:SPARQL_for_maintenance#.E2.9C.85_Languages_.E2.86.92_list_of_values_used_including_redirects
 
WHERE {
 
WHERE {
 
   ?record prop:P2 entity:Q2 .
 
   ?record prop:P2 entity:Q2 .
Line 19: Line 18:
 
}
 
}
 
</query>
 
</query>
 
== History ==
 
<query yearmonth="Date" records="Records" speakers="Speakers" languages="Languages">
 
SELECT
 
?yearmonth
 
(COUNT(DISTINCT ?record) AS ?records)
 
(COUNT(DISTINCT ?speaker) AS ?speakers)
 
(COUNT(DISTINCT ?language) AS ?languages)
 
WHERE {
 
  ?record prop:P2 entity:Q2 .
 
  ?record prop:P6 ?date .
 
  ?record prop:P5 ?speaker .
 
  ?record prop:P4 ?language .
 
 
  BIND( SUBSTR(str(?date), 1, 7) as ?yearmonth )
 
}
 
GROUP BY ?yearmonth
 
ORDER BY ?yearmonth
 
</query>
 
 
== Number of records per language == <!--T:1-->
 
 
<!--T:6-->
 
<query _pagination="10" language="Item" name="Language" nb="Number of records">
 
    select ?language (if( ?language = entity:Q4, '???', ?languageLabel ) as ?name) (COUNT(?record) as ?nb)
 
    where {
 
        ?record prop:P2 entity:Q2 .
 
        ?record prop:P4 ?lang .
 
BIND( IF( isBLANK(?lang), entity:Q4, ?lang ) as ?language ).
 
     
 
        SERVICE wikibase:label {
 
            bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en" .
 
            ?language  rdfs:label ?languageLabel.
 
        }
 
    }
 
    GROUP BY ?language ?languageLabel
 
    ORDER BY DESC(?nb)
 
</query>
 
 
== The most prolific speakers == <!--T:2-->
 
 
<!--T:8-->
 
<query _pagination="10" locutor="Item" locutorLabel="Speaker" nb="Number of records" languages="Spoken languages">
 
    select ?locutor ?locutorLabel (COUNT(?record) as ?nb) (GROUP_CONCAT(DISTINCT ?langLabel;separator=", ") as ?languages)
 
    where {
 
        ?record prop:P2 entity:Q2 .
 
        ?record prop:P5 ?locutor .
 
        ?record prop:P4 ?lang .
 
        #extra:{"type": "wikibase-item", "filter":"Q4", "label": "P4", "multiple": true} ?record prop:P4 entity:[EXTRA] .
 
BIND( IF( isBLANK(?lang), entity:Q4, ?lang ) as ?language ).
 
     
 
        SERVICE wikibase:label {
 
            bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en" .
 
            ?lang rdfs:label ?langLabel .
 
            ?locutor rdfs:label ?locutorLabel .
 
        }
 
    }
 
    GROUP BY ?locutor ?locutorLabel
 
    ORDER BY DESC(?nb)
 
    LIMIT 50
 
</query>
 
 
== The most recorded words == <!--T:3-->
 
 
<!--T:10-->
 
<query _pagination="10" transcription="Transcription" nb="Number of records" languages="Languages">
 
    select ?transcription (COUNT(?record) as ?nb) (GROUP_CONCAT(DISTINCT (if( ?language = entity:Q4, '???', ?languageLabel )); SEPARATOR=", ") AS ?languages)
 
    where {
 
        ?record prop:P2 entity:Q2 .
 
        ?record prop:P4 ?lang .
 
BIND( IF( isBLANK(?lang), entity:Q4, ?lang ) as ?language ).
 
     
 
        ?record prop:P7 ?transcription.
 
SERVICE wikibase:label {
 
            bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en" .
 
            ?language  rdfs:label ?languageLabel.
 
        }
 
    }
 
    GROUP BY ?transcription
 
    ORDER BY DESC(?nb)
 
    LIMIT 50
 
</query>
 
</translate>
 

Latest revision as of 16:32, 27 June 2024
