LinguaLibre

Difference between revisions of "About/en"

(Updating to match new version of source page)
(Updating to match new version of source page)
 
(17 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
<div class="section gap-l">
 +
<div class="columns v-center padded-m">
 +
<div>
 
<languages/>
 
<languages/>
'''Lingua Libre''' is an audio recording tool as well as a sound library designed by Wikimedians to improve several Wikimedia projects (Wiktionaries, Wikipedias, Wikimedia Commons, Wikidata...).
+
'''Lingua Libre''' is a project of the association '''''Wikimédia France''''' which aims to build a collaborative, multilingual, ''audiovisual corpus'' under free licence in order to:
 +
* ''Expand knowledge'' '''''about languages''''' and '''''in languages''''' in an audiovisual way on the web, on Wikimedia projects and outside ;
 +
* ''Support the development'' of '''''online language communities''''' — particularly those of poorly endowed, minority, regional, oral or signed languages — in order to help communities accessing online information and to ensure the vitality of the languages of these communities.
 +
</div>
 +
<div style="text-align: center;">
 +
[[File:Lingua libre illustration - interface.svg|frameless|440px]]
 +
</div>
 +
</div>
 +
</div>
  
'''LinguaLibre.fr''' is a massive open audio recording platform and web application to ease mass recording of wordslists or text into clean, well cut, well named and apps friendly audio files. It is designed from the start to ease the creation of consistent datasets of audio files. We believe it is the best tool available to create dataset from few dozens to several thousands audios files. Recording productivity can reach up to 1000 audio recordings / hour, given a clean words list and an experienced user. Lingua Libre has received a ''Project Grant'' funding from the [https://wikimediafoundation.org/ Wikimedia Foundation] and is hosted by [https://www.wikimedia.fr/ Wikimédia France]. Today, it is actively used by the Wikimedia community and maintained by passionate contributors as an open source project.
+
<div class="section section-blue gap-s">
__NOTOC__
+
<div class="columns v-center">
== Useful links ==
+
<span style="font-size: 35px; line-height: normal;">
 +
Already '''{{formatnum:2000}} members''' and '''{{formatnum:1250000}}+ recordings''' on Lingua Libre, join us
 +
</span>
 +
<div style="margin-top: -5px; text-align: center;">
 +
[[Special:RecordWizard|<span class="mw-ui-button mw-ui-neutral" role="button" aria-disabled="false">Record your voice</span>]]
 +
</div>
 +
</div>
 +
</div>
 +
 
 +
<div class="section gap-l">
 +
<div class="columns v-center padded-m">
 +
<div style="text-align: center;">
 +
[[File:Music-technology-guitar-microphone-studio-amplifier-846852-pxhere (cropped).jpg|frameless|440px|class=shadow]]
 +
</div>
 +
<div>
 +
== How to participate? ==
 +
 
 +
You can use '''Lingua Libre''' by exploring and reusing recordings, contribute to the corpus by recording words, or improve the website itself, in consultation with the community.
 +
 
 +
The '''''Record Wizard''''' tab allows to record short audios (1 word, 1 phrase), to categorize them and to publish them on '''Wikimedia Commons''' from a computer or smartphone. To do so, you will need to '''[https://lingualibre.org/index.php?title=Special:UserLogin&returnto=Special%253AMyLanguage%252FLinguaLibre%253AAbout&returntoquery=title%3DSpecial%253AMyLanguage%252FLinguaLibre%253AAbout log in]''' or create a user account. The user guide is available on the help page.
 +
 
 +
To modify the website pages, simply log in and click on Modify. To add more pages, the process is in two steps: enter the title of the page you wish to create in the search engine, with the prefix "LinguaLibre:". A message will appear inviting you to create the page. For any substantial modification, please consult the community beforehand.
 +
</div>
 +
</div>
 +
</div>
 +
 
 +
<div class="section section-grey gap-m">
 +
<div class="columns padded-m v-center">
 +
<div>
 +
==== Interact with the community ====
  
* IRC chan : <code>#lingualibre</code> on Freenode ([https://kiwiirc.com/client/irc.freenode.net/#lingualibre To join with Kiwiirc from a web browser])
+
Do not hesitate to inform the team of any element that could be improved. To do so, discussions take place in the Chat Room, on the mailing list or on Discord.
* Phabricator : https://phabricator.wikimedia.org/project/profile/3393/ for issues/bugs tracking
+
</div>
* Github : https://github.com/lingua-libre on Github
+
<div style="text-align: right;">
* Twitter : https://twitter.com/LingLibre_WMFr (mainly in French)
+
[https://discord.gg/Bqn3yXCp89 <span class="mw-ui-button mw-ui-neutral" role="button" aria-disabled="false" style="margin-right: 15px; margin-bottom: 11px;">Discord</span>]
 +
[https://meta.wikimedia.org/wiki/Special:MyLanguage/Lingua_Libre <span class="mw-ui-button mw-ui-neutral" role="button" aria-disabled="false" style="margin-bottom: 11px;">Project on Meta</span>]
 +
<br>
 +
[https://lingualibre.org/wiki/LinguaLibre:Chat_room <span class="mw-ui-button mw-ui-neutral" role="button" aria-disabled="false" style="margin-right: 15px;">Chat room</span>]
 +
[https://phabricator.wikimedia.org/tag/lingua_libre/ <span class="mw-ui-button mw-ui-neutral" role="button" aria-disabled="false" style="margin-right: 15px;">Phabricator</span>]
 +
[https://github.com/lingua-libre <span class="mw-ui-button mw-ui-neutral" role="button" aria-disabled="false">Github</span>]
 +
</div>
 +
</div>
 +
</div>
  
 +
<div class="section section-white gap-l">
 +
== Why participate? ==
  
== Project's history ==
+
Lingua Libre comes from the observation of several lacks on Wikimedia projects and on the web in general:
  
* '''Shtooka Recorder''' (2010) by Nicolas Vion - a notable desktop software which had a deep impact on the open audio recording ecosystems. Hundreds of applications use data produced by this software.
+
* Lack of diversity: While the web is in theory open to everyone, its content is far from representing all languages proportionally. More than 50% of websites are in English; only 301 of the world's 7000+ languages have a free encyclopedia <sup>[https://w3techs.com/technologies/overview/content_language/all <nowiki>[1]</nowiki>]</sup>, with a content that is inferior in quality and quantity to those of more endowed languages such as Wikipedia in English<sup>[https://w3techs.com/technologies/overview/content_language/all <nowiki>[1]</nowiki>],[https://athenaeum.libs.uga.edu/handle/10724/37877 <nowiki>[2]</nowiki>]</sup>. In addition, these websites host content that broadly reflects and meets Western standards and needs through the medium of the written word, which explains and helps to perpetuate their lack of linguistic diversity.
  
* '''SWAC Recorder''' (2013) by Nicolas Vion - a revamp of the earlier, lesser known but easier to install, with better user experience.
+
* Lack of orality: Although languages are essentially spoken (only 4,000 of the world's 7,000 languages have a writing system)<sup>[https://www.ethnologue.com/enterprise-faq/how-many-languages-world-are-unwritten-0 <nowiki>[4]</nowiki>]</sup>, knowledge sharing and communication via new information and communication technologies (NICTs) is mainly done in writing, particularly on the web, despite the rich multimedia format it allows. This mediation of the oral through the written word raises many barriers to contribution, such as the use of Unicode characters, the culture of the written word, the orthographic standardisation of the language or the literacy rate of the community.
  
* '''LinguaLibre.fr v1''' (2016) by Nicolas Vion - a cloud variation of the earlier versions, the project was funded by Wikimedia France (Remy Gerbet & [[user:Lyokoï]]), and create with feedbacks from local linguistic academics. The grant is associated with the project to record and preserve dying French minorities languages. In French only, this platform was demoed to the global Wikimedia community, and demonstrated the need for a v2.
+
* These lacks of diversity and orality limit the ability of Internet users to communicate and contribute online to various web platforms where they cannot find content and communities sharing their language. Among the regional minority languages that are oral or signed, they threaten in particular the poorly endowed ones, many of which are currently in danger of extinction and for whom inclusion on the web is a major challenge and opportunity.
  
* '''LinguaLibre.fr v2''' (2018) by [[user:0x010C|0x010C]] - a full rebuild, based on MediaWiki, using Wikibase and OAuth login for a better integration with the Wikimedia ecosystem. Can be used by the whole community thanks to an user interface available in many languages. The clean, sharp, well named audio files produced ease the creation or enhancing of various derivative applications. Both language learning and language preservation are common use cases. About half of the estimated 7000 human languages are endangered, many other are threatened by the raise of few state-sponsored macro languages.
+
* Indeed, of the 7000 languages in existence today, it is estimated that only 2500 will survive to the next century and only 250 (less than 5%!) will make their digital ascent — i.e. be used regularly for communication purposes in the digital space by native speakers who are comfortable on the web — a factor which is yet essential for their vitality<sup>[https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0077056 <nowiki>[5]</nowiki>]</sup>. Current initiatives by linguists and activists to document and share data, resources and content online in the languages to be preserved do not directly contribute to the development of a digitally-ascendant linguistic community of Internet users, and thus remain limited in their impact.
  
== License ==
+
* Lingua Libre aims to make up for this lack of support by placing itself at the service of linguistic communities wishing to insert and promote their language into the digital space by exploring alternative means of communication to the written word, in the hope that this will free up online communication in a growing number of languages. This objective favours by its very nature regional minority languages that are poorly endowed in terms of oral or signed language, but also benefits more endowed languages that wish to highlight their oral and visual aspects. To fulfil its mission, Lingua Libre offers an online solution for mass recording, leading to the publication of a collaborative multilingual audiovisual corpus under free licence, whose vocation is information through consultation, and revitalisation by triggering the contribution of new language communities on Lingua Libre and then outside.
 +
<center>{{2020 Coolest tool award|Lingua Libre|Diversity}}</center>
 +
</div>
  
* All contents under [https://creativecommons.org/licenses/by-sa/4.0/ Creative Commons CC-BY-SA-4.0]
+
<div class="section gap-m">
 +
<h2 style="text-align: center;">Partners</h2>
 +
<gallery mode="packed" heights=180>
 +
File:Ministere Culture soutient.png|link=https://www.culture.gouv.fr/Thematiques/Langue-francaise-et-langues-de-France
 +
</gallery>
 +
<gallery mode="packed" heights=150>
 +
File:lo congres.jpg|link=https://locongres.org/
 +
File:Mdlnc.png|link=https://www.mncparis.fr/
 +
File:olca.png|link=https://www.olcalsace.org/
 +
</gallery>
 +
</div>
 +
__NOTOC__
 +
__NOEDITSECTION__

Latest revision as of 18:43, 11 June 2024

Other languages:
Bahasa Indonesia • ‎Bahasa Melayu • ‎Deutsch • ‎English • ‎Esperanto • ‎Toki Pona • ‎Türkçe • ‎brezhoneg • ‎dansk • ‎español • ‎euskara • ‎français • ‎italiano • ‎norsk bokmål • ‎occitan • ‎polski • ‎português • ‎română • ‎sicilianu • ‎svenska • ‎íslenska • ‎башҡортса • ‎македонски • ‎русский • ‎עברית • ‎অসমীয়া • ‎తెలుగు • ‎ဘာသာ မန် • ‎ၽႃႇသႃႇတႆး  • ‎日本語

Lingua Libre is a project of the association Wikimédia France which aims to build a collaborative, multilingual, audiovisual corpus under free licence in order to:

  • Expand knowledge about languages and in languages in an audiovisual way on the web, on Wikimedia projects and outside ;
  • Support the development of online language communities — particularly those of poorly endowed, minority, regional, oral or signed languages — in order to help communities accessing online information and to ensure the vitality of the languages of these communities.

Lingua libre illustration - interface.svg

Already 2,000 members and 1,250,000+ recordings on Lingua Libre, join us

Music-technology-guitar-microphone-studio-amplifier-846852-pxhere (cropped).jpg

How to participate?

You can use Lingua Libre by exploring and reusing recordings, contribute to the corpus by recording words, or improve the website itself, in consultation with the community.

The Record Wizard tab allows to record short audios (1 word, 1 phrase), to categorize them and to publish them on Wikimedia Commons from a computer or smartphone. To do so, you will need to log in or create a user account. The user guide is available on the help page.

To modify the website pages, simply log in and click on Modify. To add more pages, the process is in two steps: enter the title of the page you wish to create in the search engine, with the prefix "LinguaLibre:". A message will appear inviting you to create the page. For any substantial modification, please consult the community beforehand.

Interact with the community

Do not hesitate to inform the team of any element that could be improved. To do so, discussions take place in the Chat Room, on the mailing list or on Discord.

Why participate?

Lingua Libre comes from the observation of several lacks on Wikimedia projects and on the web in general:

  • Lack of diversity: While the web is in theory open to everyone, its content is far from representing all languages proportionally. More than 50% of websites are in English; only 301 of the world's 7000+ languages have a free encyclopedia [1], with a content that is inferior in quality and quantity to those of more endowed languages such as Wikipedia in English[1],[2]. In addition, these websites host content that broadly reflects and meets Western standards and needs through the medium of the written word, which explains and helps to perpetuate their lack of linguistic diversity.
  • Lack of orality: Although languages are essentially spoken (only 4,000 of the world's 7,000 languages have a writing system)[4], knowledge sharing and communication via new information and communication technologies (NICTs) is mainly done in writing, particularly on the web, despite the rich multimedia format it allows. This mediation of the oral through the written word raises many barriers to contribution, such as the use of Unicode characters, the culture of the written word, the orthographic standardisation of the language or the literacy rate of the community.
  • These lacks of diversity and orality limit the ability of Internet users to communicate and contribute online to various web platforms where they cannot find content and communities sharing their language. Among the regional minority languages that are oral or signed, they threaten in particular the poorly endowed ones, many of which are currently in danger of extinction and for whom inclusion on the web is a major challenge and opportunity.
  • Indeed, of the 7000 languages in existence today, it is estimated that only 2500 will survive to the next century and only 250 (less than 5%!) will make their digital ascent — i.e. be used regularly for communication purposes in the digital space by native speakers who are comfortable on the web — a factor which is yet essential for their vitality[5]. Current initiatives by linguists and activists to document and share data, resources and content online in the languages to be preserved do not directly contribute to the development of a digitally-ascendant linguistic community of Internet users, and thus remain limited in their impact.
  • Lingua Libre aims to make up for this lack of support by placing itself at the service of linguistic communities wishing to insert and promote their language into the digital space by exploring alternative means of communication to the written word, in the hope that this will free up online communication in a growing number of languages. This objective favours by its very nature regional minority languages that are poorly endowed in terms of oral or signed language, but also benefits more endowed languages that wish to highlight their oral and visual aspects. To fulfil its mission, Lingua Libre offers an online solution for mass recording, leading to the publication of a collaborative multilingual audiovisual corpus under free licence, whose vocation is information through consultation, and revitalisation by triggering the contribution of new language communities on Lingua Libre and then outside.
Coolest Tool Award 2020 square logo.svg

Lingua Libre

2020 Coolest Tool
Award Winner

in the category
Diversity

Partners