LinguaLibre

Difference between revisions of "Chat room"

Welcome to the Chat room! This is the place to discuss any and all aspects of Lingua Libre: the project itself, discussions of the operations, policy and proposals, technical issues, etc. Other forums include for code-oriented issues, . Feel free to participate in any language you want.

 
__TOC__
 
 
<!-- ****      DO NOT EDIT CONTENT ABOVE    **** -->
 
 
== Datasets out of date ==
 
Hello. It seems that the datasets page, although it claims to run every 2 days, is completely out of date: all the available zips are from April 2020 or November 2019 (and the full zip from May 2019). Is this a known problem? Is there a plan to address it? [[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baley|talk]]) 23:17, 27 August 2020 (UTC)
 
:Indeed, there seems to be an issue with the dataset update. I opened a [[phab:T261519|Phabricator ticket]] about this issue. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 18:24, 28 August 2020 (UTC)
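:A minimal sketch of how such staleness could be spotted automatically, assuming the build date of each archive is known. The archive names, the dates and the <code>stale_archives</code> helper are hypothetical illustrations, not part of the Lingua Libre codebase; only the 2-day interval comes from the report above.

```python
from datetime import date, timedelta


def stale_archives(archive_dates, today, max_age_days=2):
    """Return the names of archives older than the expected refresh interval."""
    limit = timedelta(days=max_age_days)
    return sorted(
        name for name, built in archive_dates.items() if today - built > limit
    )


# Hypothetical archive names and build dates, for illustration only.
archives = {
    "full.zip": date(2019, 5, 1),
    "Q21-fra.zip": date(2020, 4, 1),
    "Q22-eng.zip": date(2020, 8, 26),
}

print(stale_archives(archives, today=date(2020, 8, 27)))
```

:A check like this could run after each scheduled build and raise an alert when the list is not empty.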
 
 
== Publish on Wikimedia Commons ==
 
 
Hello, I just tested, but my recordings are not published on Commons. I tried on Firefox, then on Chrome, with 50 expressions, then with 1, with the CC3.0-BY-SA and CC1.0 licenses. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 06:51, 2 May 2021 (UTC)[[File:LiLi April 2021 - Publish on Wikimedia Commons.png|thumb|Publishing problem on Wikimedia Commons]]
 
:[[phab:T281636]] —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 07:10, 2 May 2021 (UTC)
 
:: I usually have the same problem with the first two recordings in a session. I can then upload them again at the end. Try again with more recordings, and use the "retry failed upload" button. [[User:Poemat|Poemat]] ([[User talk:Poemat|talk]]) 08:07, 2 May 2021 (UTC)
 
::: Yup, I had this bug many times. (I say "had" because I don't remember having encountered it after the fire incident.) Just don't give up and it should be published eventually. [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 11:56, 2 May 2021 (UTC)
 
::::(As of 3 May 2021, as far as I have checked, I'm not aware of any code changes ([https://github.com/lingua-libre/RecordWizard/commits/master history]) which may have affected this. Seb35 made some other code change this same day.) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:47, 3 May 2021 (UTC)
 
I am adding a user who has the same problem: {{u|Le Commissaire}}. —[[User:Eihel-LiLi|Eihel-LiLi]] ([[User talk:Eihel-LiLi|talk]]) 15:33, 6 May 2021 (UTC)
 
:::::Hello {{ping|Seb35}}, we should check with {{u|Le Commissaire}} whether the problem also persists (before closing the Phab ticket). Best regards. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 10:01, 4 June 2021 (UTC)

::::::I left a message for Le Commissaire on their talk page.

::::::The problem you had was specific to your account; it may have happened to other people, but it seems fairly rare. Also, once a user has managed to upload to Commons, it is a different problem from yours ([[:phabricator:T275957|this one, which looks similar but where the error is intermittent]]). More generally, the error message should be explicit rather than requiring a look in the browser console; I will open a Phabricator ticket about this. [[User:Seb35|Seb35]] ([[User talk:Seb35|talk]]) 10:28, 4 June 2021 (UTC)
 
 
== Exclusion lists ==
 
If anyone uses [[user:Olafbot|Olafbot's]] regularly updated lists of wanted words ([[List:Fra/Lemmas-without-audio-sorted-by-number-of-wiktionaries]], etc.) and spots an item that should be removed without being recorded, you can use the brand new exclusion lists to remove it. For example, the list [[List:Fra/Lemmas-without-audio-sorted-by-number-of-wiktionaries]] contained the word "abandonar", which apparently doesn't belong to the contemporary French corpus. Once it has been added to the exclusion list (here: [[user:Olafbot/exclusion list/Fra]]), the bot knows this item should never appear in the French lists it maintains, and [https://lingualibre.org/index.php?title=List:Fra/Lemmas-without-audio-sorted-by-number-of-wiktionaries&diff=619214&oldid=606068 removes it] during the next update.
 
 
Each "Lemmas without audio" list ({{Olafbot-wikt}}) has a corresponding exclusion list ({{Olafbot-exclusion}}). I hope it will help.
 
 
Normally I would add a link to the exclusion list in the description of each lemma list, but unfortunately the Lingua Libre engine doesn't allow adding any kind of comment or description to lists, so this announcement is the only way to spread the word about the new functionality. [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 09:54, 13 September 2021 (UTC)
 
:{{ping|Olaf}} Thank you so much for this useful new function! Indeed, the Record Wizard does not yet understand comments, categories or templates on List pages, but this will be considered for future updates. — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 18:48, 13 September 2021 (UTC)
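:The exclusion mechanism described in this section can be pictured with the following minimal sketch. It is not Olafbot's actual code: the <code>apply_exclusions</code> function and the exact-match comparison are assumptions made for illustration, and the sample words other than "abandonar" are arbitrary.

```python
def apply_exclusions(wanted, excluded):
    """Drop excluded items from a wanted-words list, keeping the original order."""
    blocked = {item.strip() for item in excluded}
    return [word for word in wanted if word.strip() not in blocked]


# "abandonar" is the example from the announcement; the other words are arbitrary.
wanted_words = ["abandonar", "maison", "fuir"]
exclusion_list = ["abandonar"]

print(apply_exclusions(wanted_words, exclusion_list))
```

:Because the filter runs on every update, an excluded item stays out of the list even if the source data would bring it back.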
 
 
== Adding a new language ==
 
 
Hello!


I would like to add the language Q3196953, but when following the [https://lingualibre.org/wiki/Help:Add_a_new_language/fr procedure], I do not see LinguaImporter. Can someone tell me why?


Regards,

BamLifa
 
: {{ping|BamLifa}} that is because you are not an administrator. I have just imported {{Q|646152}}. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 17:16, 13 September 2021 (UTC)

::{{ping|Pamputt}}, thank you very much for the clarification. If this option is reserved for admins only, why mention it in the documentation without saying so? Moreover, given the many languages we have that do not yet exist on Lingua Libre, don't you think you should simplify this task? I have another language to add, Bira (bila). [[User:BamLifa|BamLifa]] ([[User talk:BamLifa|talk]]) 12:41, 20 September 2021 (UTC)

:::{{ping|BamLifa}} it is stated on that page (it is even the title of the section: "Tool for administrators"). I do not remember why it is reserved for admins, but at least it limits vandals who would want to import things that are not languages. Anyway, I have imported {{Q|656403}} and {{Q|656404}}. If these are not the right languages, can you give me the corresponding ISO 639-3 code (or at least the Wikidata identifier)? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 14:06, 20 September 2021 (UTC)

::::{{ping|Pamputt}}, thank you very much. [[User:BamLifa|BamLifa]] ([[User talk:BamLifa|talk]]) 05:34, 22 September 2021 (UTC)
 
 
== Lists still don't work properly ==
 
 
{{Ping|WikiLucas00}} {{Ping|Poslovitch}} It's better than [[LinguaLibre:Chat_room#Lists_stopped_working|before]], but still, sometimes the Record Wizard hangs when a list is chosen.
 
Then I have to reload the page and try again. Usually it starts working on the second or third attempt with the same list.
 
Probably a race condition. [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 09:47, 30 September 2021 (UTC)
 
:{{ping|Olaf}} It also happens to me sometimes, but I think it could be related to the button for removing words you have already recorded. When you load a list of words you have never recorded (typically Olafbot's lists), ticking the button seems to kill the loading. Best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 10:23, 30 September 2021 (UTC)
 
:: Thank you. Indeed, with this switch unchecked everything seems to work. [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 16:02, 1 October 2021 (UTC)
 
 
== List of words to pronounce ==
 
 
Hi! Is there a page where words can be added so that a good Samaritan can pronounce them? [[User:Vivaelcelta|Vivaelcelta]] ([[User talk:Vivaelcelta|talk]]) 11:30, 3 October 2021 (UTC)

:Hello {{u|Vivaelcelta}}, lists are made for that. You can [[Special:MyLanguage/Help:Create_your_own_lists|create your own list]], which anyone can then record. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:50, 3 October 2021 (UTC)

:: Thank you {{u|Pamputt}}. — [[User:Vivaelcelta|Vivaelcelta]] ([[User talk:Vivaelcelta|talk]]) 22:38, 3 October 2021 (UTC)
 
 
== Patrol tools project ==
 
:''See [[LinguaLibre:Events/Patrol assistance tool prototyping project]].''
 
{{LangSwitch
 
|fr=Salut,
 
 
cette semaine commence un projet menés par des étudiants des formations IARF-RODECO de l’Université Toulouse 3 - Paul Sabatier concernant le prototypage d’outils de patrouille. Je suis, assisté par Adélaïde Calais, le superviseur de ce projet. Les étudiants sont en informatique avec une spécialisation en intelligence artificielle. L’idée est de leur faire prototyper (voire développer) des outils pour aider la patrouille de Lingua Libre en détectant automatiquement toutes sortes de problèmes. Nous avons déjà identifier quelques problèmes : clics, grésillements, bruits parasites et mauvaises prononciations (libellés et enregistrements pas raccord).
 
 
Et nous avons besoin de la communauté sur deux points :
 
# y a-t-il d’autres problèmes auxquels vous pensez ?
 
# nous avons besoin, pour que les étudiants puissent travailler, d’enregistrements avec défauts. Si vous les avez réenregistrés, c’est pas grave, Commons a un historique. N’hésitez pas à nous communiquer les enregistrements qui ont ou avaient des défauts !
 
 
Enfin, j’ai créé une page de projet accessible [[Special:MyLanguage/LinguaLibre:Events/Patrol_assistance_tool_prototyping_project|ici]] (page traduite).
 
 
(Si certain·es peuvent traduire ce message en anglais, c’est super cool.)
 
 
À+,
 
|en=Hi,
 
 
This week, a project led by students of the University Toulouse 3 - Paul Sabatier is starting. It is about prototyping patrolling tools. I supervise this project, assisted by Adélaïde Calais. The students study computer science with a specialization in artificial intelligence. The aim is to have them prototype (or even develop) tools to help Lingua Libre's patrol by automatically detecting any kind of mistake or error related to the files. We have already identified a few types of problems: clicks, crackles, pops and labelling issues (wrong label/wrong language).
 
 
We need the community's help on two points:
 
# are there other problems you could think of?
 
# we need some recordings having issues, in order for the students to be able to work. If you already recorded them again, it is not a big deal, Commons has a file history. Don't hesitate to provide us the files that have or had problems.
 
 
Lastly, I created a project page, available [[Special:MyLanguage/LinguaLibre:Events/Patrol_assistance_tool_prototyping_project|here]].
 
 
See you,}}
 
[[User:Lepticed7|Lepticed7]] ([[User talk:Lepticed7|talk]]) 09:19, 19 October 2021 (UTC)
 
:Hello [[User:Lepticed7|Lepticed7]], I have translated the page. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 19:49, 22 October 2021 (UTC)
 
::[[User:Lepticed7|Lepticed7]], [[User:Adélaïde Calais WMFr|Adélaïde]], could you specify the dates for this project?

::Also, were your points 1 and 2 answered by the community somewhere? (If not, I could give it a try.) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 13:19, 15 November 2021 (UTC)
 
::: {{ping|Yug}} Hi, I updated the project page with the dates. And I didn’t get any answers to my questions. [[User:Lepticed7|Lepticed7]] ([[User talk:Lepticed7|talk]]) 11:25, 28 November 2021 (UTC)
 
 
== Rashidun Caliphate ==
 
 
Hello {{ping|Zinou2go}},
 
[https://commons.wikimedia.org/wiki/File:LL-Q13955_(ara)-Zinou2go-الخلافة_الراشدة.wav LL-Q13955 (ara)-Zinou2go-الخلافة الراشدة.wav] is problematic (currently {{Q|Q204439}} on LiLi): it contains several cuts (clicks). I proposed the file for deletion in Commons. The recordings seem to be working better, could you record Rashidun Caliphate again? I didn't check the other records, but they are likely to have "clicks" as well. Also, can an admin delete this item on LiLi, please? Cordially. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 15:31, 12 November 2021 (UTC)
 
:{{ping|Eihel}} Please do not nominate files for deletion before asking the speaker to record them again and waiting a while for their answer. Also, these recordings will come in useful for the team currently working on the audio issues of Lingua Libre, so we'd better not delete them (I thought you had read my messages on Discord about this). — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 15:48, 12 November 2021 (UTC)
 
::{{Ping|WikiLucas00}}, I have removed the deletion request on Commons. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 15:54, 12 November 2021 (UTC)
 
 
== Code of Conduct ==
 
Hi everyone, I just noticed again MediaWiki's [[:mw:Code of Conduct]] (2015) and Wikimedia Foundation's [[:foundation:Universal Code of Conduct]] (2021/02). Back in 2015, 0x010C included the first one as a condition to contribute to [https://github.com/lingua-libre/RecordWizard RecordWizard's codebase]. As far as I know, Lili.org and its community have [https://lingualibre.org/index.php?search=Code+of+conduct no Code of Conduct] so far. We may be ''implicitly'' bound by it or by some Wikimedia France Code of Conduct, but it would be cleaner to ''explicitly'' adopt one and display it here, in writing. We could therefore do the following:

# Short round to confirm we have nothing in place so far.
 
# Vote for 2 months to adopt the most recent [[:foundation:Universal Code of Conduct]] (2021/02)
 
# Copy the text into [[LinguaLibre:Universal Code of Conduct]].
 
[[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:48, 14 November 2021 (UTC)
 
=== Pre-discussion ===
 
Do we already have a Code of Conduct binding LinguaLibre ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:48, 14 November 2021 (UTC)
 
 
=== Vote ===
 
''Are you for or against adopting the [[:foundation:Universal Code of Conduct]] (2021) as a code of conduct for LinguaLibre's community ?''<br>
 
''Possible votes : {{tl|support}} • {{tl|weak support}}  • {{tl|weak oppose}} • {{tl|oppose}}''
 
* {{Support}} (proposer) — better to be explicit, have a framework in place, just to be clear to all on where we stand. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:48, 14 November 2021 (UTC)
 
 
== Lingua Libre website should be more appealing to Language Learners ==
 
:''See also [https://forvo.com Forvo.com].''
 
It would be useful if LinguaLibre followed the example of Forvo to increase the number of language learners interested in the project.


Forvo.com displays information in a way that engages users and makes it very easy to find pronunciations.
 
 
For example, if someone wants to learn how to pronounce "Honoré de Balzac" in French, it would be faster to find the audio on Forvo than on LinguaLibre. Also, Forvo displays the data in a way more appealing to language learners:
 
* https://forvo.com/search/Honoré_de_Balzac/
 
* https://lingualibre.org/index.php?search=Honoré+de+Balzac
 
'''Would it be possible to improve the way that data is displayed on LinguaLibre to make it more appealing to Language Learners ?'''
 
''That way, the number of active users recording audio would increase significantly.'' -- [[User:Marreromarco|Marreromarco]]
 
:Some people have previously reported this "issue". There is a [[phab:T252319|ticket]] on Phabricator to keep this in mind. However, priority is currently given to developing patrol tools for Lingua Libre, and we do not expect major improvements to audio browsing in the coming months (at least if we do not get more external developers). I think it is like this because Lingua Libre was designed to help with recording, not listening; the latter is left to the other Wikimedia projects, mainly the Wiktionaries and Wikidata. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:00, 14 November 2021 (UTC)

::YES! There are oral discussions and proposals in this direction, but since LinguaLibre is a volunteer-based team, we are moving slowly. Forvo is a for-profit entity; it locks the copyright and resale of recordings made on its platform to the speaker-creator and to itself, and then sells those recordings at a profit. It therefore has the money and swift decision-making to sustain its UI/UX efforts. We are shorter on those sides.  --[[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:30, 14 November 2021 (UTC)
 
=== Sound Library's forking and hacking ===
 
'''On the [[LinguaLibre:Explore_the_sound_library|Sound Library]] side''', I was able to duplicate/fork it, which makes it possible to start hacking its CSS. Copy this code into your own namespace:
 
* [[User:Yug/common.js]] → [[Special:MyPage/common.js]]
 
* [[User:Yug/MediaWiki:SoundLibrary.js]]‎ → [[Special:MyPage/MediaWiki:SoundLibrary.js]]
 
* [[User:Yug/LinguaLibre:Explore_the_sound_library]]‎ → [[Special:MyPage/LinguaLibre:Explore_the_sound_library]]
 
In the copied code, you then have to replace all occurrences of "Yug" with your username, and it should work. You can start hacking toward a more elegant interface. Note: the JS copy is in your *personal* JS and has a "stop" condition so the various JS instances won't fight. --[[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:30, 14 November 2021 (UTC)
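:For those who prefer to do the username replacement outside the wiki editor, here is a small sketch. The <code>personalise</code> helper and the sample string are hypothetical; the only thing taken from the message above is that "Yug" must be swapped for your own username. It matches whole words only, so longer words that merely start with "Yug" are left alone.

```python
import re


def personalise(source, username, original="Yug"):
    """Replace whole-word occurrences of the original username in copied code."""
    pattern = re.compile(rf"\b{re.escape(original)}\b")
    # A function is used as the replacement so the username is inserted literally.
    return pattern.sub(lambda _match: username, source)


# Hypothetical snippet standing in for the copied page content.
sample = "User:Yug/common.js and User:Yug/MediaWiki:SoundLibrary.js"
print(personalise(sample, "Example"))
```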
 
 
== Allow recording only in the user's Native Language to avoid passing "mispronunciations" to Wiktionary ==
 
 
I started a discussion on the German Wiktionary because some words on LinguaLibre are not available on the DeWikt. The German community told me that LinguaLibre adds recordings to Commons, but the bot only accepts audio from a “few” trusted users, using a filter.
 
 
The English and German Wiktionaries use a bot called "DerbethBot" to add audios from Commons. However, the English Wiktionary community asked to block Lingua Libre's recordings because there were non-native speakers recording audios and the Bot had no way to differentiate them from Native speakers. After the audios were introduced in the English Wiktionary they had to forbid adding audios from LinguaLibre:
 
 
https://en.wiktionary.org/wiki/Wiktionary:Beer_parlour/2020/July#Labeling_non-native_audio
 
 
I believe it is necessary to avoid giving “mispronunciations” to Wiktionaries. It is similar to vandalism on a Wiktionary if readers do not know that they are hearing a bad pronunciation and believe it comes from a “native speaker”.
 
 
''Some suggestions:''
 
1) Would it be possible to name the audio files so that they specify whether the speaker is native or not? For example, if a French speaker records the word "maison", the file could be named '''"maison-fr-native.ogg"'''. If a language learner records the same word: '''"maison-fr-learner.ogg"'''


2) A radical way to address the issue would be to only allow users to record in their native language. Of course, users could change it, but strong warnings could be added to always remind people to record only in their native language. Forvo seems to take this approach.
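A minimal sketch of the naming scheme in suggestion 1, assuming the speaker's native/learner status is known at upload time. The <code>suggested_filename</code> function is hypothetical and does not reflect how Lingua Libre currently names files.

```python
def suggested_filename(word, lang, is_native, ext="ogg"):
    """Build a file name that states whether the speaker is native or a learner."""
    level = "native" if is_native else "learner"
    return f"{word}-{lang}-{level}.{ext}"


print(suggested_filename("maison", "fr", is_native=True))
print(suggested_filename("maison", "fr", is_native=False))
```

With such names, a bot could tell the two kinds of recordings apart from the file name alone.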
 
 
It might be valuable for linguists to have recordings of non-native speakers to study their accent features in an L2 language. However, in my humble opinion, the pronunciations added to Wiktionary should come only from native speakers, and bots should have a way to tell them apart.
 
 
Link to the German Wiktionary discussion about LinguaLibre:
 
https://de.wiktionary.org/wiki/Wiktionary:Teestube#:~:text=von%20technischer%20seite%20gibt%20es%20keinem%20problem%2C%20zwei%20bots%20auf%20de.wiktionary%20arbeiten%20zu%20lassen.
 
:Hi, this depends on the Wiktionary policy, which can differ from one language to another. Anyway, it is already possible to select only recordings made by native speakers. To do that, the speaker has to fill in the {{P|16}} property with the value {{Q|15}} (see for example {{Q|466}}). Other values for {{P|16}} are given [[Special:WhatLinksHere/Q5|here]]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:38, 16 November 2021 (UTC)
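::To illustrate the selection described above, here is a hedged sketch of a filter that keeps only recordings whose speaker level (property P16) is Q15. The dictionary layout, the speaker names and the level "Q9999" are assumptions for the example; a real bot would read these values from Lingua Libre's data rather than from a hand-written list.

```python
LEVEL_PROPERTY = "P16"  # property cited above
NATIVE_LEVEL = "Q15"    # value cited above for native speakers


def native_only(recordings):
    """Keep only the recordings whose P16 value is the native-speaker item."""
    return [rec for rec in recordings if rec.get(LEVEL_PROPERTY) == NATIVE_LEVEL]


# Hypothetical records; "Q9999" stands in for any non-native level.
sample_recordings = [
    {"word": "maison", "speaker": "SpeakerA", "P16": "Q15"},
    {"word": "maison", "speaker": "SpeakerB", "P16": "Q9999"},
    {"word": "fuir", "speaker": "SpeakerC"},
]

print([rec["speaker"] for rec in native_only(sample_recordings)])
```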
 
 
 
== Sursilvan ==
 
:{{done}}
 
[[Special:Contributions/Franz.Roos.1955|User:Franz.Roos.1955]] made 2 recordings in [[:en:wp:Sursilvan]]: rauna ([[Q689785]]), tschitta ([[Q689786]]). Sursilvan has no ISO code. Do we have a procedure for such languages? (I forget whether this case has come up before.) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:37, 17 November 2021 (UTC)

:There is no issue. It simply uses the Wikidata identifier when there is no ISO code. See for example {{Q|1186}}. To record in such languages, we have to create an item for the language/dialect on Lingua Libre, and this is already done for {{Q|74905}}. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 21:59, 17 November 2021 (UTC)

::Thanks, Pamputt, for the clarification. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 23:12, 17 November 2021 (UTC)
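:::The fallback rule described above (ISO code when available, Wikidata identifier otherwise) can be sketched as follows. The <code>language_tag</code> function is hypothetical and "Q123456" is a made-up identifier; "ara" and "Q13955" are taken from a file name quoted elsewhere on this page.

```python
def language_tag(iso_code, wikidata_id):
    """Return the ISO code if the language has one, else its Wikidata identifier."""
    if iso_code:
        return iso_code
    return wikidata_id


print(language_tag("ara", "Q13955"))  # language with an ISO code
print(language_tag(None, "Q123456"))  # made-up identifier, no ISO code
```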
 
 
== [[commons:commons:structured data]] ==
 
 
I've been very pleased with LL's tooling, that does so much of the process of uploading to Commons, sensible naming, description-writing, and categorisation for me; however, I have an idea for an additional step LL could automate. This is in Commons' no-longer-so-new structured data section, which manifests (among other ways) as a tab on the file page.
 
 
As an example of what could be automatically added to a file's datastore, there is a property called 'audio transcription' which serves a similar role to Commons' TimedText subtitle functionality (silly example: [[commons:TimedText:051226-kakapo-billbooming.ogg.en.srt]]) but for shorter clips -- in other words, seemingly designed with applications like LinguaLibre in mind.
 
 
Since these are of the so-called 'monolingual text' datatype, the source language can be specified (or where not part of the main set of languages Wikimedia uses, the special code 'mis' is used and 'language of work or name' used as a qualifier) at the same time as the actual text that is being spoken, which LL has access to since the audio file started out as a text prompt!
 
 
What think y'all? [[User:Arlo Barnes|Arlo Barnes]] ([[User talk:Arlo Barnes|talk]]) 04:25, 19 November 2021 (UTC)
 
:Hi {{u|Arlo Barnes}}, there is a [[phab:T239272|Phabricator ticket]] about this topic. Wikidata does not yet have all the properties needed to match every Lingua Libre property. For example, I [[d:Wikidata:Property proposal/language level|proposed to create]] a property for the language level of a speaker, but it did not get enough support. So I guess we should first list all the properties we would like to add to SDC. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:18, 19 November 2021 (UTC)
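::As a rough illustration of the 'audio transcription' idea in this thread, the sketch below builds a monolingual-text value (a text plus a language code) and falls back to the special code 'mis' when the language is not in the supported set, in which case a 'language of work or name' qualifier would be needed. The function name, the supported-code set and the language code "xyz" are assumptions; no real property identifiers or API calls are used.

```python
def audio_transcription(text, lang_code, supported_codes):
    """Return (monolingual text value, whether a language qualifier is needed)."""
    if lang_code in supported_codes:
        return {"text": text, "language": lang_code}, False
    # Unsupported language: use 'mis' and signal that a qualifier must be added.
    return {"text": text, "language": "mis"}, True


# Hypothetical subset of supported language codes.
supported = {"fr", "en", "de"}

print(audio_transcription("maison", "fr", supported))
print(audio_transcription("rauna", "xyz", supported))
```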
 
 
==  [Feature Request] Play next sound automatically while checking recordings ==
 
 
After recording sounds it is important to check them to verify their quality. However, it is very tiring to record 380 words and afterwards have to click 380 times on the ''“Next button”'' while checking them.
 
 
'''After recording, would it be possible to add a button to "Play next sound automatically" ?''' [https://i.imgur.com/XwC34pj.png Screenshot Here]  [[User:Marreromarco|Marreromarco]] ([[User talk:Marreromarco|talk]]) 04:09, 20 November 2021 (UTC)
 
:Agreed, it is already [[phab:T218372|tracked on Phabricator]]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 09:45, 20 November 2021 (UTC)
 
 
== "How to use Lingua Libre for your language learning" ==
 
 
I recently found a "new" way to benefit from the sounds on Lingua Libre. I would suggest that it could be advertised on the Lingua Libre main website and on the Wikipedia in French/English:
 
* [[:en:wp:GoldenDict|GoldenDict]] is a FOSS Dictionary application very valuable for language learners.
 
 
One way to benefit from Lingua Libre recordings is to download the datasets, unzip them and "load" the sounds into GoldenDict as Sound Directories ([https://i.imgur.com/9avJDgS.png screenshot here]). That way, users easily get an offline "Pronunciation Dictionary". It is very easy to do. Here is a [https://i.imgur.com/axRHruk.png screenshot] of how the French word "fuir" looks in GoldenDict. Another example is [https://i.imgur.com/Rq0nQCt.png here].
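The unzip step can also be scripted. The sketch below assumes a dataset archive has already been downloaded; the file and folder names in the commented usage line are hypothetical, and the resulting folder still has to be added by hand in GoldenDict as a Sound Directory.

```python
import zipfile
from pathlib import Path


def extract_dataset(zip_path, target_dir):
    """Unzip a downloaded dataset and return the sorted names of extracted files."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(target)
    return sorted(path.name for path in target.rglob("*") if path.is_file())


# Example call with hypothetical paths:
# extract_dataset("lingualibre-fra.zip", "GoldenDictSounds/fra")
```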
 
 
Lingua Libre sounds can be used with GoldenDict OFFLINE. That is a huge advantage in developing countries, where language learners often do not have a reliable internet connection.
 
 
''It would be valuable to create a description on the Lingua Libre website about'' '''"How to use Lingua Libre sounds for your language learning"''' .
 
 
There it would be possible to describe how to use the audio files offline with GoldenDict, etc. If more methods are developed (an Anki add-on, a better GUI, an Android app, etc.), they could be explained there.--[[User:Marreromarco|Marreromarco]] ([[User talk:Marreromarco|talk]]) 04:41, 20 November 2021 (UTC)
 
:1) '''Reuse of datasets:''' Yes! Dataset download and reuse must be showcased and strengthened. I think a "Reuses gallery" page could be created, with screenshots and a minimal how-to for GoldenDict, Anki and others.

:2) '''Anki:''' You are the 4th or 5th contributor to raise the need for an Anki add-on. We need to do something on this side, yes. It is more than 1~2 days of work and too big for volunteer work, so we need to apply for a grant. I am looking into and mapping our options at the moment ({{tl|Grants table}}). At some point we have to jump in and design a project, yes.

:3) For an '''e-learning app''', I designed a 5k€ project a year ago. The local regional government declined to fund it, but the proposal could easily be refreshed.

:We have to redesign some projects and apply in early 2022. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:28, 23 November 2021 (UTC)
 
::The core question is human resources.

::'''*Daily routines*''' keep WikiLucas, Pamputt, Poslovitch and myself (the community-side contributors) busy maintaining the place, welcoming and guiding new users, cleaning pages, etc. We are now quite smooth, successful and stable on this side.

::To '''*push forward*''' on developments, UI, tools, e-learning, communication and grants, we each have one or two side projects in mind, and we are pushing them <u>''slowly''</u>. But as always in FOSS projects, the task ahead is much larger and we could achieve much more with more human resources.

::'''Overall''', we may be at a turning point right now. As things are stable, with road maps available, '''we just need 1 to 3 new coordinators and communication-minded contributors to tip the dynamic into forward-offensive mode''': more communication, and therefore new arrivals, new speakers, new devs and new coordinators, to really push forward with new events/workshops, funds and SMART features.

::@[[User:Marreromarco|Marreromarco]], I am currently writing structured "community how-to" pages to make it easier for new contributors to jump in (see [[LinguaLibre:Roles]], [[LinguaLibre:Workshops]], {{tl|Grants table}}). You are doing a nice push on communication (It's FOSS) and with your questions you are mapping out Lili's needs. Pamputt and WikiLucas are following our progress. All this is pretty interesting. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:48, 23 November 2021 (UTC)
 
 
:I would like to work on the "Public Relations" Department of LinguaLibre! - EDIT (28th Nov. 2021) : '''Any PR campaign would fail miserably if there is no search function.''' I explain the reasons at the end of this section:  [[LinguaLibre:Events/Winter 2021-2022 Public Relations Campaign]]
 
 
[[User:Marreromarco|Marreromarco]] ([[User talk:Marreromarco|talk]]) 23:49, 23 November 2021 (UTC)
 
::Sounds good :) Your outreach to YouTubers and popular FOSS blogs is spot on.
 
::I am back from a wikibreak, I am cleaning up some last pages, then since the maintenance side is stable I would like to focus my energy on projects design –recording rare languages, technology, PR campaign– and associated grant requests to secure funding and the actual realization of those visions. We can collaborate. You lead on the PR : design your campaign. I can review and help it to fit some Grants formats. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:00, 24 November 2021 (UTC)
 
 
I created a new wiki page in the "events" section for a "PR Campaign for 2022". Please visit [[LinguaLibre:Events/Winter 2021-2022 Public Relations Campaign]] and participate in the discussion with new ideas. EDIT (28th Nov. 2021): I will NOT contribute to a PR campaign anymore. The reasons are explained in a comment in the relevant section. [[User:Marreromarco|Marreromarco]] ([[User talk:Marreromarco|talk]]) 21:20, 25 November 2021 (UTC)
 
 
== Creating a LL category for a dialect ==
 
 
I would be grateful if someone could tell me whether it's possible to create a LL category for a dialect.


We're working in Konkani, which has its own (but small) Wikipedia at http://gom.wikipedia.org. Several dialects are spoken within Konkani, and the pronunciation of one can differ from that of another.


We would like to create a category for Saxtti (the Salcete dialect of Konkani). This will ensure that readings don't get overwritten by other dialects. It would also allow for the recordings of many others, which might already have been done in Konkani as a whole.
 
 
Question: How do we create space for the dialects of a language?
 
 
Thanks very much, in advance! --[[User:Fredericknoronha|Fredericknoronha]] ([[User talk:Fredericknoronha|talk]]) 13:34, 27 November 2021 (UTC)
 
:Hello {{ping|Fredericknoronha}} and welcome to Lingua Libre. I imported {{Q|700683}} (gom) as it was not on Lingua Libre yet. On Lingua Libre, dialects are treated the same way as languages. You can create an element for your dialect on Wikidata (example for the [https://www.wikidata.org/wiki/Q35359 auvergnat dialect]) and tell us once it is ready, so that we can import it into Lingua Libre with an admin tool. You can also directly create an element for your dialect on Lingua Libre, following the steps described at [[Special:MyLanguage/Help:Add_a_new_language|Help:Add a new language]] and using {{Q|1186}} as an example. Don't hesitate to ping an admin if you have any questions.
 
:All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 15:35, 27 November 2021 (UTC)
 
::''« there are some dialects spoken, the pronunciation of one can be different from the other. […] This will ensure that readings don't get overwritten by other dialects. »''
 
::If the spelling is the same and only the pronunciation differs depending on where the speaker comes from, it sounds like different accents.

::Recordings are specific to a word, a language and a speaker. This means that when I record the word "bonjour" in French, it is one audio file on Lili. WikiLucas can record the same word "bonjour" in French, and that creates another audio file on Lili. My recordings, since I come from the South West, will carry the southern accent. Recordings by WikiLucas, who lives 700km east of me, will carry the Lyon area accent. Lingualibre will store 2 recordings, one per user. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 21:59, 27 November 2021 (UTC)
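::A toy model of this rule: if each recording is keyed by the triple (word, language, speaker), two speakers recording the same word never overwrite each other. The storage dictionary, the <code>add_recording</code> function and the file names are illustrative assumptions, not Lingua Libre's actual data model.

```python
def add_recording(store, word, language, speaker, filename):
    """Register a recording under its (word, language, speaker) key."""
    store[(word, language, speaker)] = filename
    return store


recordings = {}
# Hypothetical file names for the "bonjour" example above.
add_recording(recordings, "bonjour", "fra", "Yug", "bonjour-Yug.wav")
add_recording(recordings, "bonjour", "fra", "WikiLucas00", "bonjour-WikiLucas00.wav")

print(len(recordings))
```

::Only a second recording of the same word, in the same language, by the same speaker would share a key with an existing entry.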
 
: Hello {{u|Fredericknoronha}}, I have imported {{Q|701734}} so that you can now record words in that dialect. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 17:21, 28 November 2021 (UTC)
 
 
== Feedback about Lingua Libre by Professor Carol Genetti, PhD ==
 
 
'''Dear Members of Lingua Libre,'''
 
I am pleased to share a message from Professor [https://en.wikipedia.org/wiki/Carol_Genetti Carol Genetti], a linguist and leading expert in endangered languages.  Professor Genetti is author of one of the best books in the field of Linguistics called "How Languages Work". Her vast knowledge and experience are extremely valuable and after reviewing Lingua Libre she said:
 
 
''Thank you for contacting me and letting me know about this initiative. It is an interesting idea. I especially like the multilingual menus -- very helpful.''
 
 
''Are you aware of [https://www.endangeredlanguages.com/ this website], hosted by the University of Hawaii (and, I believe, funded by Google)? One thing that occurs to me is the proliferation of such sites. How will people in an endangered-language community find out about their options, and then make an informed choice about which of these online resources will be best over time for their communities? Should such efforts cross-reference each other?''
 
 
''My second thought has to do with longevity. It takes a significant commitment to support a site like this over time. The challenge is having someone who can keep such sites funded, working, organized, relevant, and engaging users over time. How will you make sure that the data will be available in 10, 50, 150 years? Maybe you get that automatically by being associated with Wikipedia. If so, state that. Also, there should be a clear statement of how such data might be used, and by whom, so speakers know that if they record a wordlist, someone might use it for some purpose without their permission (is that right?).''
 
''I'm sorry to have to bring a down-to-earth message to the inspiration and passion for endangered languages that has clearly fueled this work, but having seen other initiatives stumble in this way, I wanted to be sure that you are thinking about this. Speakers will be entrusting you with such valuable pieces of their lives and their cultures. How will you safeguard this over time? Let people know.''
 
''Those issues aside, here are a couple of other comments:''
 
 
* ''There should be a statement targeted for speakers of endangered languages - why would they want to do this? What is the value for them and their communities? What will happen to the recordings? etc.''

* ''Will you provide speakers with suggestions for what vocabulary to record, e.g. greetings, colors, verb forms?''

* ''It would be helpful if it was clear from the large list of languages which ones have recordings. Maybe put those in a different color font?''

* ''It would be helpful to include translations of the words into one of the world's major languages or the national language. Otherwise, someone's grandkids coming to this in 30 years will not know what the words mean.''

* ''Do you want to move beyond single words to a piece of connected discourse, such as a short poem or story, a song, or the reading of some common text (such as a sentence from the UN Declaration for Linguistic Rights)?''

* ''Should there be a means to flag inappropriate content?''
 
 
''I hope that you find this helpful. And I'm so glad you liked my book! It is lovely to hear that people have found it helpful.''
 
 
''Carol Genetti''
 
''Vice Provost for Graduate and Postdoctoral Programs''
 
''NYU Abu Dhabi''
 
''(she/her/hers)''
 
 
[[User:Marreromarco|Marreromarco]] ([[User talk:Marreromarco|talk]]) 09:23, 4 December 2021 (UTC)
 
:Hey, this is some interesting feedback.
 
:* "What will happen to the recordings?": Our homepage lacks such important information. We should plan a redesign for 2022 (inspired by the homepage of [https://commonvoice.mozilla.org/ Common Voice]?) so that we finally have a homepage that properly explains what Lingua Libre is and can do.
 
:* "Suggestions of things to record?": This already exists. They're called Lists. We have some pending improvements on that matter (easier to find and contribute to, etc.)
 
:* "Show which languages have recordings": The datasets page could help, but I guess it would be interesting to put that on an easy-to-find page (again, like [https://commonvoice.mozilla.org/fr/languages Common Voice's languages page]?)
 
:* "Include translations of the words into one of the world's major languages or the national language": we only support "transcription" for now.
 
:** How could we even "link" the recordings to translations? (Lexemes? Plain text?)
 
:** Who would have to do that? (the locutor? a dedicated team of contributors?)
 
:** Where would it be done? (in the RecordWizard?)
 
:** -> That's an interesting thing to think about, but might be slightly out of scope right now
 
:* "Sentences, stories, songs...?": Yes, indeed. The Record Wizard is already able to do that (with some config tweaks that have to be done by the locutor), but it would be great to streamline this further. Dedicated UI, ability to record an audiobook (or Wikipedia, Wikisource, Wikinews article) as a mixture of sentences that can be stored locally before being all merged together into one audio file sent to Commons, ability for multiple contributors to work on the same book/article... That's something we should also discuss with the [https://librivox.org/ Librivox] folks: they use Audacity so far, but they might be interested in a tool that's better suited to their needs.
 
:* "flag inappropriate content?": My insight is focused on technical stuff. This sounds more like some editorial guidelines that would have to be debated by the community.
 
:* "'''longevity'''?": Should Lingua Libre vanish tomorrow, the audio recordings would not be lost. They're all stored on Wikimedia Commons, and that makes them as "immortal" as files stored on hard disks, SSDs, CDs or magnetic tapes and mirrored half a dozen times around the world can be. However, I can't say much about our Wikibase, which, at the current time, '''is the only place where all the recording and locutor-related metadata are stored'''. That's a serious single point of failure. There are no dumps and therefore no mirroring. We'll definitely have to discuss it with Wikimedia France and the Tech Team.
 
:Hopefully my answers are clear and comprehensible. I'm pleased to have received feedback from Pr. Genetti. Now it's our turn to take matters in our hands! --[[User:Poslovitch|Poslovitch]] ([[User talk:Poslovitch|talk]]) 13:13, 5 December 2021 (UTC)
 
 
== How to delete lists? ==
 
:{{Done}}
 
Hello, I recently completed some lists. Now everything is done and those lists are no longer needed. Is there a way to delete lists? Greetings --[[User:Onkel Tomm|Onkel Tomm]] ([[User talk:Onkel Tomm|talk]]) 10:02, 10 December 2021 (UTC)
 
:{{Ping|Onkel Tomm}} hello, admins can delete those lists. The lists you created are [https://lingualibre.org/index.php?target=Onkel+Tomm&namespace=142&tagfilter=&newOnly=1&start=&end=&limit=50&title=Special%3AContributions here]. Which ones should I delete ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:25, 10 December 2021 (UTC)
 
::Hello Yug, please delete all 8 lists, because they are all finally finished. Thanks. --[[User:Onkel Tomm|Onkel Tomm]] ([[User talk:Onkel Tomm|talk]]) 13:44, 10 December 2021 (UTC)
 
:{{Ping|Onkel Tomm}} We are clean! Thanks for asking, it keeps the place tidy :) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:10, 10 December 2021 (UTC)
 
 
== Case study ==
 
Hello all, I noticed a file upload that brings together several interesting use cases.
 
 
{| class="wikitable"
 
! Item || Label || Speaker || Account || Filename || Category
 
|-
 
| [[Q709231]] ([https://lingualibre.org/index.php?title=Q709231&oldid=689510 arch.]) || "Ingenieur" || [[Q674858]] 'fleur' || User:Beat_Ruest || [[:File:LL-Q150_(fra)-fleur_(Beat_Ruest)-Ingenieur.wav]] || [[:commons:Category:Lingua Libre pronunciation by Beat Ruest]]
 
|-
 
| — || Misspelling of "Ingénieur" || – || – || Carries the misspelling || Category page was not created, therefore virtually "lost" to Wikimedia Commons and [[:commons:Category:Lingua_Libre_pronunciation_by_user]].
 
|}
 
 
Questions:
 
* Question 1: How do we handle misspellings? I assume by renaming ALL THREE: the label of [[Q709231]], its Property:P3 'recording' value, AND the Wikimedia Commons file [[:File:LL-Q150_(fra)-fleur_(Beat_Ruest)-Ingenieur.wav]]. Is that OK, or will it break something?

* Question 2: The category should be created automatically. How do we go about this? I assume a request on [[LinguaLibre:Bot]].

* Question 3: What about the category by *speaker/voice* ([[Q674858]] 'fleur'), which currently doesn't exist, and which can have multiple speakers with the same name 'fleur'?
 
[[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:39, 10 December 2021 (UTC)
 
: Question 1: it is a good start. I guess, we need to fix it both on Lingua Libre and on Wikimedia Commons
 
: Question 2: you speak about categories on Wikimedia Commons? If so, I guess a bot can do it (Lingua Libre Bot or another one).
 
: Question 3: actually the speaker is identified as "fleur (Beat Ruest)". Only one locutor of Beat Ruest can use the nickname "fleur".
 
: [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 11:23, 20 December 2021 (UTC)
 
::Q1, Q2 agree.
 
::Q3 : {{ping|Pamputt}} check the categories on [[:commons:File:LL-Q150_(fra)-fleur_(Beat_Ruest)-Ingenieur.wav]]. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:56, 20 December 2021 (UTC)
 
:::{{ping|Yug}} you mean the problem is [[:c:File:LL-Q150_(fra)-fleur_(Beat_Ruest)-Ingenieur.wav]] is categorized in "Category:Lingua Libre pronunciation by Beat Ruest" and not in "Category:Lingua Libre pronunciation by fleur (Beat Ruest)" or similar name? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:57, 5 January 2022 (UTC)
 
::::Yes, we don't have categorization by '''speaker''' "Fleur (Beat Ruest)". Low importance, but it could be a feature request. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:01, 5 January 2022 (UTC)
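For readers following this thread, the file names quoted above appear to follow the pattern <code>LL-<language QID>_(<ISO code>)-<speaker>-<transcription>.<extension></code>. The Python sketch below splits a name into those parts. It is only an illustration: <code>parse_ll_filename</code> is a hypothetical helper, not part of any Lingua Libre tool, and it assumes the speaker part contains no hyphen (the first hyphen after the language block is taken as the separator).

```python
import re

# Assumed pattern, inferred from the two file names quoted on this page.
LL_FILENAME = re.compile(
    r"^LL-(?P<language_qid>Q\d+)_\((?P<iso>[^)]+)\)"
    r"-(?P<speaker>.+?)-(?P<transcription>.+)\.(?P<ext>\w+)$"
)


def parse_ll_filename(filename):
    """Split a Lingua Libre file name into its parts, or return None."""
    match = LL_FILENAME.match(filename)
    if match is None:
        return None
    parts = match.groupdict()
    # Underscores in file names stand for spaces in the labels.
    parts["speaker"] = parts["speaker"].replace("_", " ")
    parts["transcription"] = parts["transcription"].replace("_", " ")
    return parts


info = parse_ll_filename("LL-Q150_(fra)-fleur_(Beat_Ruest)-Ingenieur.wav")
print(info["speaker"], "|", info["transcription"])
```

The speaker part comes out as "fleur (Beat Ruest)", which is the identifier mentioned in the answer to Question 3, so a per-speaker category name could in principle be derived from the file name alone.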
 
 
== Handling of duplicates ==
 
:''See also [[Help:Homographs]]'' (new, needs review!)
 
 
Good evening!


Is there any handling of duplicates in LL for words of the same language? [[User:BamLifa|BamLifa]] ([[User talk:BamLifa|talk]]) 13:45, 18 December 2021 (UTC)

:Hello [[User:BamLifa|BamLifa]], if the same speaker records the same word again, the previous recording is overwritten (a given speaker can record a given word only once). On the other hand, nothing prevents the same word from being recorded by several different speakers. That is even one of the goals of Lingua Libre: to highlight the diversity of pronunciations. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 11:19, 20 December 2021 (UTC)

::@[[User:Pamputt|Pamputt]]: How are homographs that are not homophones handled, then? ^^ [[User:Totodu74|Totodu74]] ([[User talk:Totodu74|talk]]) 00:03, 5 January 2022 (UTC)


:::Hello [[User:Totodu74|Totodu74]], it is possible to add a note in parentheses (this information is stored using {{P|18}}). See for example {{Q|1685}} and {{Q|1686}}. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:55, 5 January 2022 (UTC)<br>


:::@[[User:Totodu74|Totodu74]], hi, the question of homographs is partly solved in our African languages, which are essentially tonal languages. --[[User:Rçag|Rçag]] ([[User talk:Rçag|talk]]) 11:18, 9 January 2022 (UTC)
 
:Rçag, could you explain your solution a bit so that we can learn from it?
 
:{{Ping|BamLifa|Rçag|Pamputt|Totodu74}} the page [[Help:Homographs]] is there to gather best practices. It's new, review and edits welcome. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:05, 12 January 2022 (UTC)
 
 
== How to change username ==
 
 
Hello, on Wikimedia projects my username is Manjiro91 (formerly GamissimoYT). How do I change my username here?

[[User:GamissimoYT|GamissimoYT]] ([[User talk:GamissimoYT|talk]]) 17:13, 11 January 2022 (UTC)

:Hello {{u|GamissimoYT}}. Lingua Libre uses the same username as the one in use on Wikimedia Commons. So if you want to use the username Manjiro91, log out of Lingua Libre, then out of Wikimedia Commons. Then log in to Commons with the username Manjiro91 and finally log back in to Lingua Libre. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 21:05, 11 January 2022 (UTC)

{{Notif|Pamputt}} My Wikimedia Commons username is Manjiro91 (formerly GamissimoYT), but the username change does not take effect on LiLi. [[User:GamissimoYT|GamissimoYT]] ([[User talk:GamissimoYT|talk]]) 13:38, 12 January 2022 (UTC)

:{{ping|GamissimoYT}}, did you do the logins and logouts in the order I indicated? If you are sure that you are logged in as Manjiro91 on Wikimedia Commons, then you can try logging out of Lingua Libre and logging back in right away. Clearing the browser cache may also help. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:37, 13 January 2022 (UTC)
 
 
== Merging of items about languages ==
 
:''See also [[Help:SPARQL]] and [[Help:SPARQL for maintenance]].''
 
Hi y'all,
 
 
For the record, I just merged a couple of items about the same language:
 
* {{Q|52071}} in {{Q|73}}
 
* {{Q|139228}} in {{Q|183}}
 
* {{Q|170137}} in {{Q|359}}
 
* {{Q|683869}} in {{Q|418}}
 
* {{Q|646169}} in {{Q|6714}}
 
* {{Q|570518}} in {{Q|52069}}
 
* {{Q|538624}} in {{Q|84030}}
 
* {{Q|646173}} in {{Q|390314}}
 
* {{Q|646161}} in {{Q|502754}}
 
* {{Q|570510}} in {{Q|489393}}
 
 
I detected them with this SPARQL query:
 
 
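<syntaxhighlight lang="sparql">
# Group language items (prop:P2 entity:Q4) by their Wikidata ID (prop:P12)
# and keep only the IDs that are used by more than one item.
SELECT ?idWD (COUNT(?item) AS ?compte) (GROUP_CONCAT(?item) AS ?items) WHERE {
  ?item prop:P2 entity:Q4 ; prop:P12 ?idWD .
}
GROUP BY ?idWD
HAVING ( ?compte > 1 )
</syntaxhighlight>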
 
 
Ping {{ping|WikiLucas00}} it seems you are responsible for some of them...
 
 
Cheers, [[User:VIGNERON|VIGNERON]] ([[User talk:VIGNERON|talk]]) 09:29, 19 February 2022 (UTC)
 
:Thanks VIGNERON for finding them and cleaning them up. Now, what should we do with recording items that use the duplicate language item (for example with [[Special:WhatLinksHere/Q52071|Duala]])? I think we must modify {{P|4}} for all these recording items so that languages are not counted twice, and also to clean up the database (there are also transcription problems for items listed in the Duala example). [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:16, 19 February 2022 (UTC)
 
::Thank you {{ping|VIGNERON}} for pointing these out. As you can see, most of them were not created manually but using the tool (the pages weighed about 4 kB, with labels in many languages). It seems that the Lingua Importer tool has (or had?) a problem, but I could not reproduce it (trying to import languages that are already in the LL Wikibase).<br/> During last summer's hackathon we talked a bit about languages in our Wikibase, but I can't remember why we need to have language elements in our Wikibase, and not just use the existing base of Wikidata 🤔 — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 23:23, 19 February 2022 (UTC)
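To follow up on the {{P|4}} cleanup mentioned above, here is a minimal Python sketch that builds a SPARQL query listing the recordings that still point at a given language item. It reuses the <code>prop:</code> and <code>entity:</code> prefixes and the <code>prop:P2 entity:Q2</code> pattern seen in other queries on this page. The function name <code>recordings_using_language</code> is made up for this example, and how merged (redirected) items behave in the query service is not something this sketch checks.

```python
import re


def recordings_using_language(language_qid):
    """Build a SPARQL query listing recordings whose language (P4) is the given item."""
    # Reject anything that is not a plain QID, so it cannot alter the query.
    if not re.fullmatch(r"Q[1-9][0-9]*", language_qid):
        raise ValueError(f"not a valid QID: {language_qid!r}")
    return (
        "SELECT ?record WHERE {\n"
        "  ?record prop:P2 entity:Q2 ;\n"
        f"          prop:P4 entity:{language_qid} .\n"
        "}"
    )


# Example: recordings still attached to the old Duala item.
print(recordings_using_language("Q52071"))
```

The resulting list could then be handed to a bot or processed by hand to repoint {{P|4}} at the surviving item.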
 
 
== MediaWiki customizations of LinguaLibre ==
 
 
I love the MediaWiki skin of LinguaLibre and I am curious about the skin and the customizations made. Who are the authors? (I cannot see any credits.) --[[User:Zblace|Zblace]] ([[User talk:Zblace|talk]]) 10:15, 19 February 2022 (UTC)
 
:The skin is known as BlueLL. The source code is available on [https://github.com/lingua-libre/BlueLL GitHub]. It was developed by Wikimedia France in 2020. That said, it is true that there is no licence or credits on GitHub. I will ask {{u|Adélaïde Calais WMFr}} whether she remembers anything so that I can add the missing information. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:58, 19 February 2022 (UTC)
 
::Hi {{ping|Zblace}}, this skin's author is [[User:0x010C]], and it's open source. It can be reused freely. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 22:45, 22 May 2022 (UTC)
 
 
== New property: translation ==
 
Hello, I've created {{P|38}} to be used in case there is no writing in the recording language but instead a translation in the vehicular language. See for example what I did [https://lingualibre.org/index.php?title=Q212431&type=revision&diff=743039&oldid=191330 here] and [https://lingualibre.org/index.php?title=Q58994&type=revision&diff=743044&oldid=580313 there]. Do you agree with that? Any comment? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:33, 19 February 2022 (UTC)
 
:It's a good idea! Many users tend to add a translation as they find it important for other people to have. It will also be handy for cases like your second example, where we only have the translation but not the transcription of the source language: we will be able to query the base to see all audios of a language that have a translation. — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 23:28, 19 February 2022 (UTC)
 
::I am thinking about a way to populate this property automatically via the Record Wizard. Currently, it seems that the Record Wizard populates {{P|18}} when something is written between brackets (see {{Q|1685}} for example, but I have not checked recently). So, if we modify the Record Wizard code, it should be possible to recognize that the bracketed text is a translation in another language and populate {{P|38}} instead. But I would like to be sure we propose the best way to do it before asking for such development. The idea is for this to be handled automatically (or at least not completely manually). [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 00:18, 20 February 2022 (UTC)
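As an illustration of the bracket convention discussed above, the Python sketch below splits an entry such as "word (note)" into the text to record and the bracketed note. This is not the Record Wizard's actual code: <code>split_transcription</code> and the example strings are invented for this sketch, and deciding whether the note should go to {{P|18}} or {{P|38}} is exactly the open question of this thread.

```python
import re

# Text followed by one trailing "(...)" group without nested parentheses.
BRACKET_NOTE = re.compile(r"^(?P<text>.*?)\s*\((?P<note>[^()]*)\)\s*$")


def split_transcription(raw):
    """Return (text, note); note is None when there is no trailing bracket."""
    match = BRACKET_NOTE.match(raw)
    if match is None:
        return raw.strip(), None
    return match.group("text").strip(), match.group("note").strip()


# Made-up examples, only to show the shape of the result.
print(split_transcription("fils (enfant)"))
print(split_transcription("bonjour"))
```

A real implementation would also need a way for the speaker to say whether the note is a disambiguation hint or a translation, for instance through a separate input field rather than by guessing from the text.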
 
 
== Lingua Libre Wishlist for 2022-2023 ==
 
 
Hi everyone !
 
<br/>This week, Wikimedia France is preparing its budget for the fiscal year to come : July 2022 to June 2023. If there are things you would like to see done or to do with our help on Lingua Libre, please share it on this page : https://lingualibre.org/wiki/LinguaLibre:2022-2023_projection
 
<br/>Have a great week-end ! --[[User:Adélaïde Calais WMFr|Adélaïde Calais WMFr]] ([[User talk:Adélaïde Calais WMFr|talk]]) 17:23, 11 March 2022 (UTC)
 
: {{u|marreromarco}} Thank you for your suggestions. However, I have some reservations about "Add function to "Request" a Pronunciation to Native Speakers" at this stage, for two reasons. First, this will require quite a bit of moderation to correct the grammar and spelling of requests (e.g. HASBAND) as well as to remove terrible requests. This will place a large burden on a few users and can easily lead to questionable decisions by moderators. Second, Forvo is flooded with requests that are overly specific (e.g. "He came back from abyss and won the tie.") and, therefore, likely benefit only one user. IMHO, Rdrg109's proposal to focus on providing pronunciations for entries on the various Wiktionaries is a better approach to building up LL at this point. It will provide a solid foundation for users to find any word in LL. It might be a better time to open up LL to general requests once this project is completed and the community has grown. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 15:49, 21 May 2022 (UTC)
 
 
== How to get the city and country label in SPARQL ==
 
:''See also [[Help:SPARQL]].''
 
I'm working on an Anki extension for LL, but I'm having a little trouble writing the SPARQL query. In short, I want to be able to get the city and country for a recording in LL. However, when I query P14, I get the item ID rather than its label: 'residence': {'type': 'literal', 'value': 'Q142'} or 'residence': {'type': 'literal', 'value': 'Q142'}. Instead, I hope to get city:"" and country:"France" for the first query, and city:"Paris" and country:"France" for the second one. Any ideas? [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 20:23, 19 May 2022 (UTC)
 
:Hi {{u|Languageseeker}}, thanks for your work on an Anki extension. Could you post here the query you have now? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:58, 20 May 2022 (UTC)
 
::Hi {{u|Pamputt}} . The query that I'm using is a very lightly modified version of the bot query.
 
 
:: <syntaxhighlight lang="python">ENDPOINT = "https://lingualibre.org/bigdata/namespace/wdq/sparql"
API = "https://lingualibre.org/api.php"
BASEQUERY = """
SELECT DISTINCT
    ?record ?file ?transcription ?recorded
    ?languageIso ?languageQid ?languageWMCode
    ?residence ?learningPlace ?languageLevel
    ?speaker ?linkeduser
WHERE {
  ?record prop:P2 entity:Q2 .
  ?record prop:P3 ?file .
  ?record prop:P4 ?language .
  ?record prop:P5 ?speaker .
  ?record prop:P6 ?recorded .
  ?record prop:P7 ?transcription .
  ?language prop:P13 ?languageIso.
  ?speakerLanguagesStatement llq:P16 ?languageLevel .
  ?speaker prop:P11 ?linkeduser .
  ?speaker prop:P14 ?residence .
  ?speaker llp:P4 ?speakerLanguagesStatement .
  ?speakerLanguagesStatement llv:P4 ?speakerLanguages .
  OPTIONAL { ?speakerLanguagesStatement llq:P16 ?languageLevel . }
  FILTER( ?speakerLanguages = ?language) .
  SERVICE wikibase:label {
    bd:serviceParam wikibase:language "en" .
  }
  #filters
}"""</syntaxhighlight>
 
 
:: Currently, I'm running it with filters = "" because it seems that a query for a single term takes around 70s, while fetching a single transcription takes about 145 seconds. My plan is to group the results by transcription and then write that into a json file to avoid the costly query. Basically, I need the speaker name, the term, their country, their city, the ISO code of the language, date created, and the filename, languageLevel.
 
 
:: For example, for the term un chien, the json would look like:
 
:: { "term": {"un chien": {"speaker": "Julien Baley", "language": "fra", "city": "", "country": "France", "recorded": "2020-11-27", "filename": "LL-Q150_(fra)-Julien_Baley-un_chien.wav", "languageLevel": "Q15"}}} [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 23:17, 20 May 2022 (UTC)
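Since the residence comes back as a Wikidata ID rather than a label, one possible approach is a second, separate lookup on Wikidata. The Python sketch below only builds that query and turns a result into the city/country shape requested above; it does not send any request. Everything named here is an assumption for illustration: the helper names, the use of <code>wdt:P17</code> (country) on the Wikidata query service, and the rule that a place whose country is itself (as for Q142 in the example) is treated as a country with an empty city.

```python
import re

QID_PATTERN = re.compile(r"^Q[1-9][0-9]*$")


def build_place_query(qids, language="en"):
    """Build a Wikidata SPARQL query for the labels and countries of some places."""
    # Keep only well-formed QIDs; values such as None or "Strasbourg" are skipped.
    valid = sorted({q for q in qids if isinstance(q, str) and QID_PATTERN.match(q)})
    values = " ".join(f"wd:{q}" for q in valid)
    return (
        "SELECT ?place ?placeLabel ?country ?countryLabel WHERE {\n"
        f"  VALUES ?place {{ {values} }}\n"
        "  OPTIONAL { ?place wdt:P17 ?country . }\n"
        f'  SERVICE wikibase:label {{ bd:serviceParam wikibase:language "{language}" . }}\n'
        "}"
    )


def parse_places(results):
    """Map each place QID to {'city': ..., 'country': ...} from SPARQL JSON results."""
    places = {}
    for row in results["results"]["bindings"]:
        qid = row["place"]["value"].rsplit("/", 1)[-1]
        label = row.get("placeLabel", {}).get("value", "")
        country_qid = row.get("country", {}).get("value", "").rsplit("/", 1)[-1]
        country_label = row.get("countryLabel", {}).get("value", "")
        if country_qid == qid:
            # The place is itself a country: leave the city empty.
            places[qid] = {"city": "", "country": label}
        else:
            places[qid] = {"city": label, "country": country_label}
    return places


print(build_place_query(["Q142", "Q90"]))
```

Places without a country statement come out with an empty country, so they can be spotted and reviewed separately. Caching the parsed result in the JSON file mentioned above would avoid repeating the lookup for every term.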
 
 
== Contribution: Python program to download all files created by a specific user ==
 
:''See also [[Help:Download datasets]].''
 
I wrote a [https://github.com/rkosov/Lingua-Libre-User-Audio-Downloader python program] that downloads all the files created by one user. For video files, it downloads the full webm. For audio files, the default is to download the wave file. However, for audio files, you can optionally choose either mp3 or ogg files. Currently, the configuration requires a minor modification of lluad.py. If there is strong demand, I will write a command line parser for it. Please report any bugs or errors on the github page. Feature requests are welcome. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 02:28, 20 May 2022 (UTC)
 
:{{Ping|Languageseeker}} please add your tool to [[Help:Download datasets]]. It lists several tools with different specifics, your tool is welcome and may help some Python users as well. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 22:41, 22 May 2022 (UTC)
 
 
== Garbage Values in prop:P14  ==
 
:''See also [[Help:SPARQL for maintenance]] and [[Help:SPARQL_for_maintenance#.E2.9C.85_Speakers_.E2.86.92_Undefined_place_of_residence]].''
 
As part of my Anki project, I queried the entire LL database and I'm trying to parse the output of ?speaker prop:P14 ?residence. I've noticed that there are a number of garbage values provided for P14, such as Q1, Q2, Q103962887, Q6099648, Strasbourg. There seem to be three cases.
 
# Users wishing to enter an extremely vague place such as Earth or the Universe. These should be set to None.
 
# Users accidentally linking to a disambiguation page. These require correction.
 
# Users not entering a Wikidata item at all. These require manual correction.
 
 
To solve the root of the problem, I propose that P14 should be restricted to only Wikidata items that exist and have P17. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 21:22, 25 May 2022 (UTC)
 
:{{Ping|Languageseeker}} it's a good find. If you still have that SPARQL query at hand, please add it to [[Help:SPARQL for maintenance]]. Yes, it's something we should clean up, I think. There may be a few cases where the speaker doesn't want to share their location, but in 95% of cases I think we can go ahead and correct it or ask them to correct it. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 12:39, 26 May 2022 (UTC)
 
:I noticed that when creating a new speaker, place of learning is optional. Not cool. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 21:32, 27 May 2022 (UTC)
 
:: {{ping|YUG}} For the life of me, I can't get the federated query to work, but I have a separate query to get the location and country labels from Wikidata. These are the problematic ones. Note that Q20 is on the list because Q20 "Norway" is missing P17.
 
 
* ['MichaelSchoenitzer', None]
 
* ['D.Muralidharan', None]
 
* ['Kaderousse', None]
 
* ['Krokus', None]
 
* ['विदुला टोकेकर', 'Q103962887']
 
* ['DoctorandusManhattan', 'Q2']
 
* ['Justforoc', 'Q2']
 
* ['Student16 de', None]
 
* ['Didierwiki', 'Q6099648']
 
* ['Sarah2149', None]
 
* ['DomesticFrog', 'Q1']
 
* ['Drkanchi', None]
 
* ['Satdeep Gill', None]
 
* ['Iwan.Aucamp', 'Q20']
 
* ['Skimel', 'Q2']
 
* ['Abeɣzan', None]
 
* ['Gibraltar Rocks', None]
 
* ['Bomdapatrick', None]
 
* ['Ibtissam RAHMOUNI', None]
 
* ['Trabelsiismail', None]
 
* ['Ziko', 'Q2']
 
* ['Youcefelallali', None]
 
* ['Foxxipeter7', None]
 
* ['Celevra089', None]
 
* ['Bodhisattwa', None]
 
* ['Atudu', None]
 
* ['KageyamaxNishinoya', 'Q30915818']
 
* ['Darkdadaah', None]
 
* ['JayashreeVI', None]
 
* ['रश्मीमहेश', 'Q103962887']
 
* ['गीता गोविंद नेने', 'Q103893785']
 
* ['Awangba Mangang', None]
 
* ['Abigaljo', None]
 
* ['FaelDaug', 'Q29423162']
 
[[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 02:16, 30 May 2022 (UTC)
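The three cases described at the top of this thread can be separated mechanically before any manual fix. The Python sketch below is a rough triage under stated assumptions: the function names are invented, the input is a list of <code>[username, value]</code> pairs shaped like the list above, and only Q1 (Universe) and Q2 (Earth) are treated as "too vague". Disambiguation pages and missing P17 cannot be detected without querying Wikidata, so every other QID is simply flagged for a later check.

```python
import re

QID_PATTERN = re.compile(r"^Q[1-9][0-9]*$")
TOO_VAGUE = {"Q1", "Q2"}  # Universe and Earth on Wikidata


def classify_residence(value):
    """Return a coarse label for a raw P14 value."""
    if value is None:
        return "missing"
    if not QID_PATTERN.match(value):
        return "not_a_qid"  # e.g. free text such as "Strasbourg"
    if value in TOO_VAGUE:
        return "too_vague"
    return "needs_wikidata_check"  # disambiguation page, missing P17, etc.


def group_by_problem(rows):
    """Group usernames by the label of their P14 value."""
    groups = {}
    for username, value in rows:
        groups.setdefault(classify_residence(value), []).append(username)
    return groups


sample = [["Ziko", "Q2"], ["Krokus", None], ["Didierwiki", "Q6099648"]]
print(group_by_problem(sample))
```

The "needs_wikidata_check" group is the one where the proposed restriction (item exists and has P17) would have to be verified against Wikidata itself.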
 
 
== Anki Extension Release ==
 
 
I just released [https://ankiweb.net/shared/info/124265771 Lingua Libre and Forvo Addon]. It has a number of advanced options to improve search results and can run either as a batch operation or on an individual note.
 
 
By default, it first checks Lingua Libre and, if there are no results on Lingua Libre, it then checks Forvo.  To run as a pure Lingua Libre extension, you will need to set "disable_Forvo" to <code>True</code> in your configuration section.
 
 
Please report bugs, issues, and ideas on [https://github.com/rkosov/Lingua-Libre-and-Forvo-Audio-Downloader GitHub]. I would love any feedback. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 02:23, 31 May 2022 (UTC)
 
  
 
== Results of Coverage Test of French Lemma and Non-Lemma forms in English Wiktionary ==
 
 
:PS: Should I answer you in French? I get the feeling you are French or learning it. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:07, 1 June 2022 (UTC)
 
:: {{Ping|YUG}} Hi, Yug. Yes, I am learning French. As we discussed during our meeting, it is difficult to define the limits of a language. As I see it, lemma forms are not enough. I am currently building an Olafbot on steroids for French. My plan is to write a Python program that can parse the templates used on Wiktionary. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 15:48, 7 June 2022 (UTC)

:Hi {{ping|Languageseeker}}. I'm sorry, I had not visited the Chat Room in a long time and missed your report. Very interesting, good job! I remember a request I made to [[User:Olaf|Olaf]] some time ago: it would be interesting to have a list similar to the one Olafbot is updating, but containing only lemmas of the target language (to quickly have nearly all lemmas of a dictionary illustrated with an audio pronunciation). Also, I suggest you use the categories of the French version of Wiktionary when you plan to work on French (and some other languages that are more extensively described there). As you can see [[:fr:wikt:Catégorie:Lemmes_en_français|here]], the category gathering French lemmas is more than 3 times more complete on the fr. version than on the en. version of Wiktionary. As you mentioned, these numbers are exciting, let's keep up the good work! All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 15:47, 26 November 2022 (UTC)

::: {{Ping|WikiLucas00}} Sorry, I totally forgot about your request. The list is now ready for French: [[List:Fra/Filtered-lemmas-without-audio-sorted-by-number-of-wiktionaries]]. It's produced like the other lists, but it's limited to words from Catégorie:Lemmes_en_français. The list will be refreshed together with the rest. [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 16:54, 14 May 2023 (UTC)

:::: Hello {{ping|Olaf}}! Thank you so much for this list, it's going to be very useful for sure! Let's cover 100% of Lemmas 😎 I'll tell the French contributors on Discord about it 😉 All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 22:18, 20 May 2023 (UTC)
  
 
== How to create user page ==
 
 
== Manually-coded languages ==
 
I came across [[:meta:Lingua Libre/SignIt]] recently (via betawiki) and was wondering if manually-coded languages would be appropriate for this as well? These are languages in sign modality, but strongly tied to a spoken/written language; they usually adopt the grammar of the nonmanual language, choosing instead to simply transpose the vocabulary. This means they are most often used in application-specific and pidgin contexts (Pidgin Sign for English and diver's signs are examples). In particular, I am interested in ''toki pona luka'', a manual form of {{q|338540}}. Since the vocab is the same as spoken/written toki pona, there are a minimal number of lexemes overall, so having a complete set of signs is easily achievable. Manually-coded languages including ''toki pona luka'' are generally not given a separate ISO 639 code since they are in effect equivalent to scripts. Would this cause a problem for the infrastructure as currently designed? [[User:Arlo Barnes|Arlo Barnes]] ([[User talk:Arlo Barnes|talk]]) 05:56, 17 August 2022 (UTC)
  
 
----
 
  
 
For the electoral commission, Mathis B, 22:00, 12 septembre 2022 (CEST)
 
== Is there a way to exclude username from Wikimedia Commons upload file name? ==
 +
:''See also [[Help:Renaming]].''
 +
This seems redundant and takes up a lot of space --[[User:Middle river exports|Middle river exports]] ([[User talk:Middle river exports|talk]]) 20:22, 9 October 2022 (UTC)
 +
:{{ping|Middle river exports}} Welcome MRE,
 +
:You could name your speaker with a single character I guess.
 +
:But keeping the name is voluntary. Each speaker has his/her own voice, which we want to document. If, outside of Wikimedia, you want to remove part of the filename, we have a technical tutorial to do so. See [[Help:Download datasets]] and [[Help:Renaming]]. Ping us back if your dataset is not up to date. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 13:16, 10 October 2022 (UTC)
 +
::I have solved this now by just changing my username to something shorter. This way I can upload English as Usmaan (عثمان) for example where instead of just repeating the username it shows two scripts which is more useful. (Apparently few enough people have Arabic script usernames that short common words are mostly available.) --[[User:Middle river exports|عثمان]] ([[User talk:Middle river exports|talk]]) 20:23, 10 October 2022 (UTC)
 +
:::All Unicode characters should be ok, in words and usernames ;) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 19:46, 11 October 2022 (UTC)
 +
 +
== Username update request ==
 +
 +
I realised my username on Mediawiki didn't carry over here when I changed it. On this site, could I please have it changed to: عُثمان
 +
--[[User:Middle river exports|عثمان]] ([[User talk:Middle river exports|talk]]) 08:45, 10 November 2022 (UTC)
 +
 +
== Data on LinguaLibre:Stats isn't consistent with Wikimedia Commons's Category ==
 +
 +
On the Stats page, French has 254,387 records
 +
 +
https://lingualibre.org/wiki/LinguaLibre:Stats/Languages
 +
 +
Meanwhile, the Category on commons.wikimedia.org has 253,464 records
 +
 +
https://commons.wikimedia.org/wiki/Category:Lingua_Libre_pronunciation-fra
 +
 +
The stats display more records. This data inconsistency is strange. -- [[User:Shenlebantongying]], 10:36, 23 december 2022.
 +
:This means some item pages exist here, but the corresponding audio files are not on Commons.
 +
:Item creation here and upload are done at step 5 of the recording, nearly simultaneously.
 +
:So I don't know what is going on. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 17:41, 26 December 2022 (UTC)
 +
 +
== [[c:Category:Lingua Libre pronunciation-bxg]] ==
 +
 +
All files in this category are tagged with wrong language. I have requested moves for files in the category, but what's more to be done?--[[User:GZWDer|GZWDer]] ([[User talk:GZWDer|talk]]) 13:05, 12 January 2023 (UTC)
 +
: Thanks for reporting. Actually all these items are erroneous (see [[Special:WhatLinksHere/Q590228]]):
 +
:* {{Q|798236}} (wrong language code)
 +
:* {{Q|802994}} (wrong language code)
 +
:* {{Q|802995}} (useless)
 +
:* {{Q|802996}} (useless)
 +
:* {{Q|802998}} (useless)
 +
:* {{Q|802999}} (useless)
 +
:* {{Q|803000}} (useless)
 +
:* {{Q|803001}} (useless)
 +
:* {{Q|803002}} (useless)
 +
:I have not checked yet if corresponding recordings are still on Commons. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:11, 13 January 2023 (UTC)
 +
 +
== I can not publish my records recorded via Lingua Libre. ==
 +
 +
Dear Colleagues,
 +
 +
It records, but when I press the button to publish it on Wikimedia Commons, it does not work. It returns "Retry failed upload". Any idea? Thank you. [[User:Key Mîrza|Key Mîrza]] ([[User talk:Key Mîrza|talk]]) 05:09, 28 January 2023 (UTC)
 +
:Is it happening for all your recordings or only some of them? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 08:49, 28 January 2023 (UTC)
 +
:: It was all good until a month ago. These days I am on vacation in another city and trying to log in to my account and make some more recordings. I can log in to my account and I can create recordings, but I cannot publish them. I am stuck at the publishing stage. None of my recordings get published. I even tried to record via my cell phone, and even there nothing gets published. By the way, I just saw your previous message welcoming me. Thank you for your kind wish. Best wishes... [[User:Key Mîrza|Key Mîrza]] ([[User talk:Key Mîrza|talk]]) 09:57, 28 January 2023 (UTC)
 +
:::Hmmm, I do not know what to say. Sometimes some recordings do not upload while others do. When no recording uploads at all, I do not know what the cause could be. Could you try with another web browser (Firefox or Chrome)? To go further, I think we would need a JavaScript expert who could offer some hints. {{ping|Poslovitch|Lepticed7}} maybe? Another question: how many words are you trying to record? If it is a lot, could you try with only a few (fewer than 10, for example)? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 15:42, 28 January 2023 (UTC)
 +
:::: I tried 11 words together, then even 1 word only for testing purposes. Nothing worked. You said Java. Do I need Java to be able to work with the application? If so, then I need to install Java, because I formatted my PC and it may not be installed. Thank you. [[User:Key Mîrza|Key Mîrza]] ([[User talk:Key Mîrza|talk]]) 17:06, 28 January 2023 (UTC)
 +
:::::Java is different from JavaScript. JavaScript is a language supported by the web browser, so you do not need to install anything other than a web browser to record pronunciations on Lingua Libre. Unfortunately, I cannot dig further in this direction because I know almost nothing about JavaScript. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 21:18, 28 January 2023 (UTC)
 +
:::::: Thank you, anyway. [[User:Key Mîrza|Key Mîrza]] ([[User talk:Key Mîrza|talk]]) 22:38, 28 January 2023 (UTC)
 +
:[[User:Key Mîrza|Key Mîrza]], thanks a lot for your voice, it makes us discover new languages. Please be aware that Lili works best on solid desktop computers. Also, you likely have a limit of 380 uploads per 72 minutes, so you may need to leave your tab open and click "retry" after that. You can expand those rights by making a request on Commons. See [[LinguaLibre:User rights]]. Contact us if you think it may be that. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:07, 5 February 2023 (UTC)
 +
::It's [https://commons.m.wikimedia.org/w/index.php?title=Special%3AUserRights&user=Key+M%C3%AErza confirmed]: like all new contributors, you are limited to 380 uploads per 72 minutes. You can get more user rights by requesting those rights on Commons. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:15, 5 February 2023 (UTC)
 +
 +
== Late 2022-2023 Winter report ==
 +
Hello all, allow me to share few overall news from the various recent, ongoing, or near-future efforts.
 +
* 🤖 User:Pamputt has taken over Lingualibre Bot and added support for the Kurdish wiktionary. See github.
 +
* 🌏 Melody (WMFr intern) and myself made a mini-editathon on writing template emails for outreach. See Lingualibre:Events.
 +
* ⚡ User:Elfix and I are collaborating on SPARQL requests (me) and their optimization (Elfix). We aim to create a languages gallery this spring.
 +
* 🔴 Wikimedia France's freelance on the record wizard is back on track, delivery of fixes should occur around May-June.
 +
* 🙋‍♀️ Adelaide (WMFr) mentioned the wish of a second intern on Lingualibre outreach this summer, to reuse Melody's assets, expand actions and geographic diversity.
 +
* 🫱🏼‍🫲🏽 Wikimedia France yearly strategic meetup is this week, and is expected to strengthen its (linguistic) diversity and metrics axes, for which Lingualibre is one of their champions.
 +
* 🧓 Eve and myself (likely) will be present at Toulouse's ''Forom des Langues'', in May, where ~60+ languages associations are present.
 +
 +
For specific deadlines and events coming soon, please also check [[Lingualibre:Events/Program]]. We always welcome contributors. When necessary, WMFr may refund transportation costs. Worth a try ! [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:07, 5 February 2023 (UTC)
 +
 +
== Edit your nickname ==
 +
 +
Good evening, I would like to change my nickname because it did not update when I was renamed Manjiro91 then Manjiro5 instead of GamissimoYT on Wikimedia projects. Thanks in advance Regards '''[[User:GamissimoYT|<span style="color:#fc3">manȷıro</span>]]<sup><small>[[User talk:GamissimoYT|<span style="font-variant:small-caps; color:#000">💬</span>]]</small></sup>''' 22:53, 23 February 2023 (UTC)
 +
 +
== Tool to prepare words for Lingua Libre ==
 +
 +
Preparing words to be used in Lingua Libre has always been challenging. But I think this is a shared challenge. Crawling text from different sources and creating a clean list of words is very important. I've used [[User:Titodutta/Bengali_words_from_pages|Tito's]] instructions in the past, but using multiple tabs and multiple tools is not the best user experience. So, I thought I'd create something that is functional for me and simple enough to be tweaked. Introducing [[User:Psubhashish/tools/Prepare words for Lingua Libre|"Prepare words for Lingua Libre"]]. The tool is currently set for Odia but can be easily tweaked for other languages using non-Latin scripts. I'd request Lingua Libre core team to incorporate the tool into Lingua Libre so that users can use the platform to create a wordlist. Extracting words from any random text is always hard, especially new contributors. --[[User:Psubhashish|Subhashish]] ([[User talk:Psubhashish|talk]]) 03:44, 14 March 2023 (UTC)
 +
:Hi [[User:Psubhashish|Psubhashish]]. This is really nice. Do you think it would be easy to adapt it to create a [[Help:Create_a_new_generator|new generator]]? Generators can be used by anyone after they import them in their common.js. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 06:44, 14 March 2023 (UTC)
 +
:: Thanks [[User:Pamputt]]. That would be fantastic, but I probably don't have the right knowhow for doing that. I did take ChatGPT's help to create a [[User:Psubhashish/common.js|.js version]] from the [[User:Psubhashish/tools/Prepare words for Lingua Libre|HTML code]] I had shared earlier but would appreciate any help. I think having a tool inside Lingua Libre would be great so really liked the idea of new generators. Common users would like things well packaged rather than jumping from one platform to another. --[[User:Psubhashish|Subhashish]] ([[User talk:Psubhashish|talk]]) 13:09, 14 March 2023 (UTC)
 +
 +
== Problem publishing recordings ==
 +
 +
Hello, a few years ago I renamed my account GamissimoYT to Manjiro91. Later, I renamed it Manjiro5. The problem is that the renaming of my global Wikimedia account was not carried over to Lingua Libre. As a result, I cannot publish the audio I record on LinguaLibre, and the recordings do not appear on Commons either. Could you help me? '''[[User:GamissimoYT|<span style="color:#fc3">manȷıro</span>]]<sup><small>[[User talk:GamissimoYT|<span style="font-variant:small-caps; color:#000">💬</span>]]</small></sup>''' 08:41, 26 April 2023 (UTC)
 +
 +
== Rename a dialect to a language ==
 +
 +
Hello,
 +
 +
I requested the addition of "Teochew dialect" a few years ago during my first trials. However, it seems more appropriate to simply keep "teochew" on its own, without the word "dialect". Would it be possible to make this change?
 +
 +
[[User:Assassas77|Assassas77]] ([[User talk:Assassas77|talk]]) 19:41, 7 May 2023 (UTC)
 +
:{{Done}} Solved [https://lingualibre.org/index.php?title=Q4465&type=revision&diff=912499&oldid=477865 here] by [[User:Assassas77]] ! It's a wiki :) [[User:Yug|Yug]] ([[User talk:Yug|talk]])
 +
 +
== MediaWiki:Lang/* ==
 +
 +
What are the MediaWiki:Lang/* messages for? For example, [[MediaWiki:Lang/awa]]? It looks like they mostly just repeat the language code in the content. --[[User:Amire80|Amir E. Aharoni]] ([[User talk:Amire80|talk]]) 07:21, 24 May 2023 (UTC)

Revision as of 07:21, 24 May 2023

Chat rooms in various languages:
English · 🌐

Chatroom FAQ

How to download all audios of one language? By speaker?

Datasets are available here. A script updates the datasets every 2 days, using CommonsDownloadTool. For more, see Help:Download datasets.

How to add missing languages?

Administrators can add new languages on demand, usually within a few days. Please provide your language's ISO 639-3 code and/or its Wikidata ID. For more, see Help:Add a new language.

How to keep my wikimedia project up to date?

Contact Poslovitch, the master of Lingua Libre Bot. For more info, check out Help:Bots and LinguaLibre:Bot.

What IRL events are coming? When? Where?

Please see LinguaLibre:Events.

How to translate LinguaLibre User Interface into a new language?

Go to translatewiki.net. For more, see Help:Translate.

How to archive sections which have been answered?

After reviewing the section, add {{done}} ~~~~ to the top of the section. After few days to 2 weeks, move the section's code to [[LinguaLibre:Chat_room/Archives/year]].

Archives
2022 · 2021 · 2020 · 2019 · 2018

Results of Coverage Test of French Lemma and Non-Lemma forms in English Wiktionary

While playing around with generating lists for pronunciation from Wiktionary, I decided to run a few tests on the current coverage of French lemma and non-lemma forms in English Wiktionary. I chose French because it is the largest dataset in LL.

Current Coverage of French in Lingua Libre

  • Total French Entries in Lingua Libre by a native speaker: 233 982
  • Unique French Entries in Lingua Libre by a native speaker: 154 358
  • Percentage of overlap: 34%
  • Term with the greatest number of pronunciations: "blanc" with 40

Current Coverage of Category:French lemmas

  • Total entries in Category:French lemmas: 84 482
  • Entries with pronunciation: 50 917
  • Entries without pronunciation: 33 565
  • Coverage Percentage: 60.27%

Current Coverage of Category:French non-lemma forms

  • Total entries in Category:French non-lemma forms: 291 225
  • Entries with pronunciation: 26 791
  • Entries without pronunciation: 264 434
  • Coverage Percentage: 9.20%
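
The coverage figures above can be re-derived with a few lines of Python. This is only a minimal sketch: it assumes that "pronounced entries" means entries that already have at least one recording, and the function name `coverage` is made up for this example.

```python
def coverage(total, pronounced):
    """Return (entries still missing audio, percent of entries covered)."""
    missing = total - pronounced
    percent = round(100 * pronounced / total, 2)
    return missing, percent


# Figures quoted in the report (English Wiktionary categories).
lemmas = coverage(84482, 50917)
non_lemmas = coverage(291225, 26791)

# Entries that still need a pronunciation across both categories.
total_missing = lemmas[0] + non_lemmas[0]

print("lemmas:", lemmas)
print("non-lemma forms:", non_lemmas)
print("still missing:", total_missing)
```

The two "missing" counts add up to the 297 999 figure quoted in the lessons below, which is a quick way to check that the category totals were read correctly.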

For me, there are several lessons to be drawn.

  1. First, there has been amazing growth on LL. Covering 60.27% of the French lemmas is a real achievement.
  2. The overlap percentage is quite small overall.
  3. There needs to be a clearer sense of when LL should stop requesting pronunciations for a certain term because 40 pronunciations of "blanc" seems a bit excessive.
  4. A need exists to continue pro-actively targeting entries in Wiktionary that are not in Lingua Libre. Currently, 297 999 French lemma and non-lemma forms require pronunciations.
  5. Generating lists from Wiktionary and checking coverage is not as hard as I thought.
  6. Lingua Libre has almost caught up with Forvo in the number of French pronunciations (233 982 vs 254 703). Overall, Lingua Libre has shown amazing and healthy progress in a very short period of time. I'm excited about these results. Languageseeker (talk) 03:07, 1 June 2022 (UTC)
@Languageseeker This investigation is pretty cool. (I'm not sure I understand all your numbers yet, but I will read again when back on my PC.) It's quite nice to see we are reaching Forvo's level for our lead language. It's possible we have more unique words than Forvo, since we have user:Olafbot actively guiding and pushing us on that path.
On Lili we have chosen to be a learning AND linguistic diversity audio database. When you account for gender, regional accents, age, and voice type, having 40 French audios for a word is still 400+ voices short.
Also, not all contributors are able to contribute perfect audio files, due to various shortcomings (hardware, no recording room, no noise-cancelling system, etc.). We lack a proper rating and review system. It's on our [slow] roadmap tho. 😉
PS: Should I answer you in French? I get the feeling you are French or learning it. Yug (talk) 15:07, 1 June 2022 (UTC)
@YUG Hi, Yug. Yes, I am learning French. As we discussed during our meeting, it is difficult to define the limits of a language. As I see it, lemma forms are not enough. Right now, I am building an Olafbot on steroids for French. My plan is to write a Python program that can analyze the templates used on Wiktionary. Languageseeker (talk) 15:48, 7 June 2022 (UTC)
Hi @Languageseeker . I'm sorry I did not visit the Chat Room in a long time, and missed your report. Very interesting, good job! I remember a request I made to Olaf some time ago: it would be interesting to have a list similar to the one Olafbot is updating, but containing only lemmas of the target language (to quickly have nearly all lemmas of a dictionary illustrated with an audio pron). Also, I suggest you to use the categories of the French version of Wiktionary when you plan to work on French (and some other languages, that are more extensively described there). As you can see here, the category gathering French lemmas is more than 3 times more complete on the fr. version than on the en. version of Wiktionary. As you mentioned, these numbers are exciting, let's keep up the good work! All the best — WikiLucas (🖋️) 15:47, 26 November 2022 (UTC)
@WikiLucas00 Sorry, I totally forgot about your request. The list is now ready for French: List:Fra/Filtered-lemmas-without-audio-sorted-by-number-of-wiktionaries. It's produced like the other lists, but it's limited to words from Catégorie:Lemmes_en_français. The list will be refreshed together with the rest. Olaf (talk) 16:54, 14 May 2023 (UTC)
Hello @Olaf ! Thank you so much for this list, it's going to be very useful for sure! Let's cover 100% of Lemmas 😎 I'll tell the French contributors on Discord about it 😉 All the best — WikiLucas (🖋️) 22:18, 20 May 2023 (UTC)

How to create user page

Hello, my user name is Ngangaesther, from Kenya. I am still stuck on how I am supposed to create my user page. Kindly help. Regards, Esther

Odia language missing from Stats/Languages

Hi there, for some reason, the Odia-language stats are missing from the Stats/Languages page. Also, "The most prolific speakers for the current month " section in the Stats/Speakers page is not loading at all since the time I checked last (about 10 days). I have tried on Chromium and Firefox and the result is the same even after clearing cache. --Subhashish (talk) 19:40, 28 July 2022 (UTC)

Hello Subhashish, it should be back online. We had a hackathon to put it back. We are calling for devs to push forwards. Yug (talk) 11:07, 10 August 2022 (UTC)
Thank you for the update, Yug. --Subhashish (talk) 14:00, 10 August 2022 (UTC)

Manually-coded languages

I came across meta:Lingua Libre/SignIt recently (via betawiki) and was wondering if manually-coded languages would be appropriate for this as well? These are languages in sign modality, but strongly tied to a spoken/written language; they usually adopt the grammar of the nonmanual language, choosing instead to simply transpose the vocabulary. This means they are most often used in application-specific and pidgin contexts (Pidgin Sign for English and diver's signs are examples). In particular, I am interested in toki pona luka, a manual form of toki pona (Q338540). Since the vocab is the same as spoken/written toki pona, there are a minimal number of lexemes overall, so having a complete set of signs is easily achievable. Manually-coded languages including toki pona luka are generally not given a separate ISO 639 code since they are in effect equivalent to scripts. Would this cause a problem for the infrastructure as currently designed? Arlo Barnes (talk) 05:56, 17 August 2022 (UTC)


Hello Arlo Barnes,

I understand "manually coded languages" as synonymous to "signed languages", am I correct?
If there is no distinct ISO for the signed language, we could still:

  • Create a new wikidata item without ISO, which will be used as identifier by LinguaLibre infrastructure
  • Use the spoken/written language ISO code, and create lists of words all suffixed by (signed).

Either of those solutions could work.

If you have some knowledge of signed toki pona luka please let me know. We are adding features on Lingualibre and SignIt in order to be able to record video of signed words by late 2022. We are almost there. If you would like to record some basic signed words to share with the world, then let me know. Yug (talk) 20:58, 17 August 2022 (UTC)

Signed languages and manually-coded languages share similarities (the manual modality) and differences (since sign languages are 'native' to the signed modality, they use it more fully, having complete deixis and time-reference systems, use of handshape classifiers, etc.) -- 'luka' means 'hand'/'five', so that's the part of the name that indicates the manual modality, but otherwise it's just garden-variety toki pona. I am interested in using SignIt to record this vocab, yes. The '(signed)' suffix seems like a good way to do it. Arlo Barnes (talk) 13:16, 19 August 2022 (UTC)
Arlo Barnes: We increasingly have tools to update and correct sign language recordings, so if the suffix (signed), or whichever solution we choose, turns out to be incorrect, we can still correct it later using a bot.
I would encourage you to first train yourself and learn that manually-coded language over the coming months. Indeed, we still have one last bug in our video recording chain, which makes valid videos appear as audio on Commons. We expect to solve this last issue this fall (September or October?). So for now, I encourage you to rest well and recharge, to get ready to record later this year. Maybe identify a suitable place near you with an elegant monochrome wall to film against, or consider building yourself a low-cost recording studio, etc. We can discuss how to keep it low-cost and effective if you are interested, as I'm also looking for such walls and/or considering building one for myself.
See also : Minimal Sign Language Studio guideline. Yug (talk) 22:30, 19 August 2022 (UTC)

Update my username

I have changed my Wikimedia username but the previous name still appears in Lingua Libre. I know it's not included in unified logins. Anyway, please update my username to Aishik Rehman. Hirok Raja (talk) 15:14, 1 September 2022 (UTC)

Hi Hirok Raja, would you have an example of what you would like to see changed? I think you are talking about the filename but I am not sure, so with one example it would be clearer. Pamputt (talk)
@Pamputt
1. Top menubar of lingualibre.org showing 'Hirok Raja' as my profile name.
2. After uploading when I try to check my uploads in Commons, it takes me to https://commons.m.wikimedia.org/wiki/Special:ListFiles/Hirok_Raja page.
3. 'Hirok Raja' being used as Default recorder in the file names and description
4. Changing the speaker name to 'Aishik Rehman' every time while recording is quite annoying to me.
5. Even here 'Hirok Raja' is showing as my signature by default ): Hirok Raja (talk) 19:16, 2 September 2022 (UTC)
I suspect this is due to long-lived cookies. It would be worth clearing your login cookies for Lingualibre; this will log you out, then you can log back in here. On Firefox:
Open about:preferences#privacy > Go to "Cookies and Site Data" > Click "Manage Data" > Search "Lingualibre" > Remove selected. Yug (talk) 21:10, 2 September 2022 (UTC)

Siège communautaire de Wikimédia France – ouverture du vote / Community representative to Wikimédia France’s board - votes are opened

(English version below. Do not hesitate to correct my English translation.)

(Message copié depuis le bistro du jour par Lepticed7 (talk))

Bonjour,

En tant que président de la commission électorale pour l'élection du siège communautaire au conseil d'administration de Wikimédia France, je vous annonce que le vote ouvre aujourd'hui (13 septembre) à 0h CEST. Il se terminera le 26 septembre à 23h59 CEST.

Comme il y a trois ans, le scrutin est public sur Meta. Les pages de votes sont disponibles dans la catégorie correspondante ou en lien sur la page principale. C'est un scrutin par approbation, le candidat qui aura le plus grand nombre de voix sera donc déclaré élu. Vous pouvez voter pour autant de candidats que vous le souhaitez.

Si vous avez des questions, vous pouvez les poser sur la page de discussion ou par courriel à election@wikimedia.fr.

Pour la commission électorale, Mathis B, le 12 septembre 2022 à 22:00 (CEST)


(Message copied from the French Wikipedia Bistro by Lepticed7 (talk))

Hello,

As the chairman of the electoral commission for the election of the community representative to Wikimédia France’s board, I announce that voting opens today (13 September) at 0:00 CEST. It will close on 26 September at 23:59 CEST.

As was the case three years ago, voting is public on Meta. Voting pages are available in the corresponding category or as links on the main page. This is approval voting, so the candidate with the most votes will be declared elected. You can vote for as many candidates as you wish.

If you have any questions, you can ask them on the Talk page on Meta, or by email at election@wikimedia.fr.

For the electoral commission, Mathis B, 22:00, 12 septembre 2022 (CEST)

Is there a way to exclude username from Wikimedia Commons upload file name?

See also Help:Renaming.

This seems redundant and takes up a lot of space --Middle river exports (talk) 20:22, 9 October 2022 (UTC)

@Middle river exports Welcome MRE,
You could name your speaker with a single character I guess.
But keeping the name is intentional. Each speaker has his/her own voice, which we want to document. If, outside of Wikimedia, you want to remove part of the filename, we have a technical tutorial to do so. See Help:Download datasets and Help:Renaming. Ping us back if your dataset is not up to date. Yug (talk) 13:16, 10 October 2022 (UTC)
I have solved this now by just changing my username to something shorter. This way I can upload English as Usmaan (عثمان) for example where instead of just repeating the username it shows two scripts which is more useful. (Apparently few enough people have Arabic script usernames that short common words are mostly available.) --عثمان (talk) 20:23, 10 October 2022 (UTC)
All Unicode characters should be ok, in words and usernames ;) Yug (talk) 19:46, 11 October 2022 (UTC)

Username update request

I realised my username on Mediawiki didn't carry over here when I changed it. On this site, could I please have it changed to: عُثمان --عثمان (talk) 08:45, 10 November 2022 (UTC)

Data on LinguaLibre:Stats isn't consistent with Wikimedia Commons's Category

On the Stats page, French has 254,387 records

https://lingualibre.org/wiki/LinguaLibre:Stats/Languages

Meanwhile, the Category on commons.wikimedia.org has 253,464 records

https://commons.wikimedia.org/wiki/Category:Lingua_Libre_pronunciation-fra

The stats display more records. This data inconsistency is strange. -- User:Shenlebantongying, 10:36, 23 December 2022.

This means some item pages exist here, but the corresponding audio files are not on Commons.
Item creation here and upload are done at step 5 of the recording, nearly simultaneously.
So I don't know what is going on. Yug (talk) 17:41, 26 December 2022 (UTC)
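
One way to track this gap over time is to ask the Commons API for the category's file count and compare it with the figure on the Stats page. The sketch below is only an illustration: it assumes the standard MediaWiki `action=query&prop=categoryinfo` request and its default JSON layout, and the helper names (`categoryinfo_url`, `file_count`) are made up for this example. It builds the URL and parses a response, but does not perform the network call itself.

```python
from urllib.parse import urlencode

COMMONS_API = "https://commons.wikimedia.org/w/api.php"


def categoryinfo_url(category):
    """Build a MediaWiki API URL asking for a category's member counts."""
    params = {
        "action": "query",
        "prop": "categoryinfo",
        "titles": category,
        "format": "json",
    }
    return COMMONS_API + "?" + urlencode(params)


def file_count(response):
    """Extract the number of files from a decoded categoryinfo response."""
    pages = response["query"]["pages"]
    page = next(iter(pages.values()))  # a single title was requested
    return page["categoryinfo"]["files"]


url = categoryinfo_url("Category:Lingua Libre pronunciation-fra")

# Hand-written response using the numbers reported above, for illustration.
sample = {"query": {"pages": {"1": {"categoryinfo": {"files": 253464}}}}}
gap = 254387 - file_count(sample)
print(url)
print("items without a matching file in the category:", gap)
```

With the two numbers quoted in this thread, the gap is 923 records. Fetching `url` with any HTTP client and passing the decoded JSON to `file_count` would give the live figure.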

c:Category:Lingua Libre pronunciation-bxg

All files in this category are tagged with wrong language. I have requested moves for files in the category, but what's more to be done?--GZWDer (talk) 13:05, 12 January 2023 (UTC)

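Thanks for reporting. Actually all these items are erroneous (see Special:WhatLinksHere/Q590228):

  • Q798236 (wrong language code)
  • Q802994 (wrong language code)
  • Q802995 (useless)
  • Q802996 (useless)
  • Q802998 (useless)
  • Q802999 (useless)
  • Q803000 (useless)
  • Q803001 (useless)
  • Q803002 (useless)

I have not checked yet whether the corresponding recordings are still on Commons. Pamputt (talk) 16:11, 13 January 2023 (UTC)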

I can not publish my records recorded via Lingua Libre.

Dear Colleagues,

It records, but when I press the button to publish it on Wikimedia Commons, it does not work. It returns "Retry failed upload". Any idea? Thank you. Key Mîrza (talk) 05:09, 28 January 2023 (UTC)

Is it happening for all your recordings or only some of them? Pamputt (talk) 08:49, 28 January 2023 (UTC)
It was all good until a month ago. These days I am on vacation in another city and trying to log in to my account and make some more recordings. I can log in to my account and I can create recordings, but I cannot publish them. I am stuck at the publishing stage. None of my recordings get published. I even tried to record via my cell phone, and even there nothing gets published. By the way, I just saw your previous message welcoming me. Thank you for your kind wish. Best wishes... Key Mîrza (talk) 09:57, 28 January 2023 (UTC)
Hmmm, I do not know what to say. Sometimes some recordings do not upload while others do. When no recording uploads at all, I do not know what the cause could be. Could you try with another web browser (Firefox or Chrome)? To go further, I think we would need a JavaScript expert who could offer some hints. @Poslovitch & Lepticed7 maybe? Another question: how many words are you trying to record? If it is a lot, could you try with only a few (fewer than 10, for example)? Pamputt (talk) 15:42, 28 January 2023 (UTC)
I tried 11 words together, then even 1 word only for testing purposes. Nothing worked. You said Java. Do I need Java to be able to work with the application? If so, then I need to install Java, because I formatted my PC and it may not be installed. Thank you. Key Mîrza (talk) 17:06, 28 January 2023 (UTC)
Java is different from JavaScript. JavaScript is a language supported by the web browser, so you do not need to install anything other than a web browser to record pronunciations on Lingua Libre. Unfortunately, I cannot dig further in this direction because I know almost nothing about JavaScript. Pamputt (talk) 21:18, 28 January 2023 (UTC)
Thank you, anyway. Key Mîrza (talk) 22:38, 28 January 2023 (UTC)
Key Mîrza, thanks a lot for your voice, it makes us discover new languages. Please be aware that Lili works best on solid desktop computers. Also, you likely have a limit of 380 uploads per 72 minutes, so you may need to leave your tab open and click "retry" after that. You can expand those rights by making a request on Commons. See LinguaLibre:User rights. Contact us if you think it may be that. Yug (talk) 15:07, 5 February 2023 (UTC)
It's confirmed: like all new contributors, you are limited to 380 uploads per 72 minutes. You can get more user rights by requesting those rights on Commons. Yug (talk) 15:15, 5 February 2023 (UTC)
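
To get a feel for what such a limit means for a long recording session, here is a small sketch that splits a number of recordings into upload batches. It assumes the figure of 380 uploads per 72 minutes mentioned in this thread and treats the limit as a simple fixed window, which is a simplification; the function `plan_batches` is hypothetical and not part of Lingua Libre.

```python
def plan_batches(n_recordings, limit=380, window_minutes=72):
    """Split uploads into batches that each fit within one rate-limit window.

    Returns a list of (start_minute, batch_size) tuples.
    """
    batches = []
    start = 0
    remaining = n_recordings
    while remaining > 0:
        size = min(limit, remaining)
        batches.append((start, size))
        remaining -= size
        start += window_minutes  # wait for the next window before retrying
    return batches


for start, size in plan_batches(1000):
    print(f"minute {start}: upload {size} recordings")
```

Under these assumptions, a session of 11 words fits in a single batch, so a failure on such a small session points to a cause other than the upload limit.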

Late 2022-2023 Winter report

Hello all, allow me to share a few news items from the various recent, ongoing, or near-future efforts.

  • 🤖 User:Pamputt has taken over Lingualibre Bot and added support for the Kurdish wiktionary. See github.
  • 🌏 Melody (WMFr intern) and myself made a mini-editathon on writing template emails for outreach. See Lingualibre:Events.
  • ⚡ User:Elfix and I are collaborating on SPARQL requests (me) and their optimization (Elfix). We aim to create a languages gallery this spring.
  • 🔴 Wikimedia France's freelance on the record wizard is back on track, delivery of fixes should occur around May-June.
  • 🙋‍♀️ Adelaide (WMFr) mentioned the wish of a second intern on Lingualibre outreach this summer, to reuse Melody's assets, expand actions and geographic diversity.
  • 🫱🏼‍🫲🏽 Wikimedia France yearly strategic meetup is this week, and is expected to strengthen its (linguistic) diversity and metrics axes, for which Lingualibre is one of their champions.
  • 🧓 Eve and I will (likely) be present at Toulouse's Forom des Langues, in May, where ~60+ language associations are present.

For specific deadlines and events coming soon, please also check Lingualibre:Events/Program. We always welcome contributors. When necessary, WMFr may refund transportation costs. Worth a try ! Yug (talk) 15:07, 5 February 2023 (UTC)

Edit your nickname

Good evening, I would like to change my nickname because it did not update when I was renamed from GamissimoYT to Manjiro91 and then Manjiro5 on Wikimedia projects. Thanks in advance. Regards, manȷıro💬 22:53, 23 February 2023 (UTC)

Tool to prepare words for Lingua Libre

Preparing words to be used in Lingua Libre has always been challenging, and I think this is a shared challenge. Crawling text from different sources and creating a clean list of words is very important. I've used Tito's instructions in the past, but using multiple tabs and multiple tools is not the best user experience. So, I thought I'd create something that is functional for me and simple enough to be tweaked. Introducing "Prepare words for Lingua Libre". The tool is currently set for Odia but can be easily tweaked for other languages using non-Latin scripts. I'd request the Lingua Libre core team to incorporate the tool into Lingua Libre so that users can use the platform to create a wordlist. Extracting words from any random text is always hard, especially for new contributors. --Subhashish (talk) 03:44, 14 March 2023 (UTC)

Hi Psubhashish. This is really nice. Do you think it would be easy to adapt it to create a new generator? Generators can be used by anyone after they import them in their common.js. Pamputt (talk) 06:44, 14 March 2023 (UTC)
Thanks User:Pamputt. That would be fantastic, but I probably don't have the right knowhow for doing that. I did take ChatGPT's help to create a .js version from the HTML code I had shared earlier but would appreciate any help. I think having a tool inside Lingua Libre would be great so really liked the idea of new generators. Common users would like things well packaged rather than jumping from one platform to another. --Subhashish (talk) 13:09, 14 March 2023 (UTC)
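
For readers wondering what "creating a clean list of words" involves, here is a minimal Python sketch of the idea. It is not the code of the tool linked above: the function `extract_words` is hypothetical, and it assumes that a word is any run of Unicode letters and combining marks, which is what keeps the vowel signs of scripts such as Odia attached to their consonants.

```python
import unicodedata

# Zero-width joiners are kept because several Indic scripts use them in words.
JOINERS = "\u200c\u200d"


def extract_words(text):
    """Return the unique words of `text`, in order of first appearance."""
    cleaned = "".join(
        ch if unicodedata.category(ch)[0] in ("L", "M") or ch in JOINERS else " "
        for ch in text
    )
    seen = set()
    words = []
    for word in cleaned.split():
        if word not in seen:  # drop duplicates, keep the first occurrence
            seen.add(word)
            words.append(word)
    return words


print(extract_words("hello, world! hello 123"))
```

Digits, punctuation (including the danda used in Odia), and symbols are treated as separators. A list produced this way would still need a human review before recording, since it does not filter out foreign words or misspellings.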

Problem publishing recordings

Hello, a few years ago I renamed my account GamissimoYT to Manjiro91. Later, I renamed it Manjiro5. The problem is that the renaming of my global Wikimedia account was not carried over to Lingua Libre. As a result, I cannot publish the audio I record on LinguaLibre, and the recordings do not appear on Commons either. Could you help me? manȷıro💬 08:41, 26 April 2023 (UTC)

Rename a dialect to a language

Hello,

I requested the addition of "Teochew dialect" a few years ago during my first trials. However, it seems more appropriate to simply keep "teochew" on its own, without the word "dialect". Would it be possible to make this change?

Assassas77 (talk) 19:41, 7 May 2023 (UTC)

Done. Solved here by User:Assassas77! It's a wiki :) Yug (talk)

MediaWiki:Lang/*

What are the MediaWiki:Lang/* messages for? For example, MediaWiki:Lang/awa? It looks like they mostly just repeat the language code in the content. --Amir E. Aharoni (talk) 07:21, 24 May 2023 (UTC)