Lingua Libre is a project of the association Wikimédia France which aims to build a collaborative, multilingual, audiovisual corpus under free licence in order to:
- Expand knowledge about languages and in languages in an audiovisual way on the web, on Wikimedia projects and outside ;
- Support the development of online language communities — particularly those of poorly endowed, minority, regional, oral or signed languages — in order to help communities accessing online information and to ensure the vitality of the languages of these communities.
How to participate?
You can use Lingua Libre by exploring and reusing recordings, contribute to the corpus by recording words, or improve the website itself, in consultation with the community.
The Record Wizard tab allows to record short audios (1 word, 1 phrase), to categorize them and to publish them on Wikimedia Commons from a computer or smartphone. To do so, you will need to log in or create a user account. The user guide is available on the help page.
To modify the website pages, simply log in and click on Modify. To add more pages, the process is in two steps: enter the title of the page you wish to create in the search engine, with the prefix "LinguaLibre:". A message will appear inviting you to create the page. For any substantial modification, please consult the community beforehand.
Lingua Libre comes from the observation of several lacks on Wikimedia projects and on the web in general:
- Lack of diversity: While the web is in theory open to everyone, its content is far from representing all languages proportionally. More than 50% of websites are in English; only 301 of the world's 7000+ languages have a free encyclopedia , with a content that is inferior in quality and quantity to those of more endowed languages such as Wikipedia in English,. In addition, these websites host content that broadly reflects and meets Western standards and needs through the medium of the written word, which explains and helps to perpetuate their lack of linguistic diversity.
- Lack of orality: Although languages are essentially spoken (only 4,000 of the world's 7,000 languages have a writing system), knowledge sharing and communication via new information and communication technologies (NICTs) is mainly done in writing, particularly on the web, despite the rich multimedia format it allows. This mediation of the oral through the written word raises many barriers to contribution, such as the use of Unicode characters, the culture of the written word, the orthographic standardisation of the language or the literacy rate of the community.
- These lacks of diversity and orality limit the ability of Internet users to communicate and contribute online to various web platforms where they cannot find content and communities sharing their language. Among the regional minority languages that are oral or signed, they threaten in particular the poorly endowed ones, many of which are currently in danger of extinction and for whom inclusion on the web is a major challenge and opportunity.
- Indeed, of the 7000 languages in existence today, it is estimated that only 2500 will survive to the next century and only 250 (less than 5%!) will make their digital ascent — i.e. be used regularly for communication purposes in the digital space by native speakers who are comfortable on the web — a factor which is yet essential for their vitality. Current initiatives by linguists and activists to document and share data, resources and content online in the languages to be preserved do not directly contribute to the development of a digitally-ascendant linguistic community of Internet users, and thus remain limited in their impact.
- Lingua Libre aims to make up for this lack of support by placing itself at the service of linguistic communities wishing to insert and promote their language into the digital space by exploring alternative means of communication to the written word, in the hope that this will free up online communication in a growing number of languages. This objective favours by its very nature regional minority languages that are poorly endowed in terms of oral or signed language, but also benefits more endowed languages that wish to highlight their oral and visual aspects. To fulfil its mission, Lingua Libre offers an online solution for mass recording, leading to the publication of a collaborative multilingual audiovisual corpus under free licence, whose vocation is information through consultation, and revitalisation by triggering the contribution of new language communities on Lingua Libre and then outside.