LinguaLibre

Difference between revisions of "Chat room"

Welcome to the Chat room! Place used to discuss any and all aspects of Lingua Libre: the project itself, discussions of the operations, policy and proposals, technical issues, etc. Other forums include for code-oriented issues, . Feel free to participate in any language you want to.

 
(341 intermediate revisions by 49 users not shown)
Line 2: Line 2:
 
{{Lang-CR}}
 
{{Lang-CR}}
 
<indicator name="talk"></indicator>
 
<indicator name="talk"></indicator>
 +
{{LL:Chat room/FAQ}}
 
__TOC__
 
__TOC__
 +
<!-- ****      DO NOT EDIT CONTENT ABOVE    **** -->
  
== Chatroom FAQ ==
+
== Results of Coverage Test of French Lemma and Non-Lemma forms is English Wiktionary ==
* '''How to download all audios of one language ? By speaker ?'''
 
** Languages are there [https://lingualibre.fr/datasets/ https://lingualibre.fr/datasets/]. A short server-side script is auto-ran every 2 days, itself using [https://github.com/lingua-libre/CommonsDownloadTool lingua-libre/CommonsDownloadTool]. For more, see [[Help:Download from LinguaLibre]].
 
  
* '''How to add missing languages ?'''
+
While playing around with generating lists for pronunciation from Wiktionary, I decided to run a few tests on the current coverage of French lemma and non-lemma forms in English Wiktionary. I choose French because it is the largest datasets in LL.
** Administrators can add new languages, they do so within few days. For users, please provide your language's [[:wikipedia:iso-639-3|iso-639-3]] code + link to the en.wikipedia.org's article. Optional infos are the common English name and wikidata IQ. For more, see [[Help:Add a new language]].
 
  
* '''How to keep my wikimedia project up to date ?'''
+
Current Coverage of French in Lingua Libre
** Contact [[User talk:Poslovitch|User:Poslovitch]], the botmaster of Lingua Libre Bot. For more info, check out [[Help:Bots]] and [[LinguaLibre:Bot]].
+
* Total French Entries in Lingua Libre by a native speaker: 233 982
 +
* Unique French Entries in Lingua Libre by a native speaker: 154 358
 +
* Percentage of overlap: 34%
 +
* Term with the greatest number of pronunciations: "blanc" with 40
  
* '''What IRL event.s are coming ? When ? Where ?'''
+
Current Coverage of [https://en.wiktionary.org/wiki/Category:French_lemmas Category:French lemmas]
** Nothing coming. For more, see [[LinguaLibre:Events]].
+
* Total entries in Category:French lemmas: 84 482
 +
* Pronounced entries: 50 917
 +
* Entries with pronunciation: 33 565
 +
* Coverage Percentage: 60.27%
  
* '''How to translate LinguaLibre User Interface into a new language ?'''
+
Current Coverage of [https://en.wiktionary.org/wiki/Category:French_non-lemma_forms Category:French non-lemma forms]
** Go to [https://translatewiki.net/w/i.php?title=Special:Translate&group=mwgithub-recordwizard&language=fr&filter=%21translated&action=translate translatewiki.net], change the url part <code>fr</code> into your language's [[:en:List_of_ISO_639-2_codes|ISO 639-2 code]]. For more, see [[Help:Translate]].
+
* Total entries in Category:French non-lemma forms: 29 1225
 +
* pronounced entries: 26 791
 +
* Entries with pronunciation: 264 434
 +
* Coverage Percentage: : 9.20%
  
* '''How to archive sections which have been answered ?'''
+
For me, there are several lessons to be drawn.
** After reviewing the section, add '<code><nowiki>{{done}} -- can be closed ~~~~</nowiki></code>' to the top of the section. After few days to 2 weeks, move the section's code to <code><nowiki>[[LinguaLibre:Chat_room/Archives/year]]</nowiki></code>.
+
# First, there has been amazing growth on LL. Covering 60.27% percent is a real achievement.
=== Archives ===
+
# The overlap percentage is quite small overall.
<!-- {{Colapse|1=Archives|2= Archives by year:}}
+
# There needs to be a clearer sense of when LL should stop requesting pronunciations for a certain term because 40 pronunciations of "blanc" seems a bit excessive.
<br/> -->
+
#  A need exists to continue pro-actively targeting entries in Wiktionary that are not in Lingua Libre. Currently, 297 999 French lemma and non-lemma forms  require pronunciations.
* [[/Archives/2021|2021]]
+
# Generating lists from Wiktionary and checking coverage is not as hard as I thought.
* [[/Archives/2020|2020]]
+
# Lingua Libre has almost caught up with Forvo in the number of French pronunciations (233 982 vs 254, 703). Overall, Lingua Libre has shown amazing and healthy progress in a very short period of time. I'm excited about these results. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 03:07, 1 June 2022 (UTC)
* [[/Archives/2019|2019]]
+
:{{Ping|Languageseeker}} This investigation is pretty cool. (I'm not sure i understand all your numbers yet, but i will read again when back on my PC). Its quite nice to see we are reaching Forvo level for our lead language. It's possible we have more unique words than forvo since we have [[user:Olafbot]] actively guiding and pushing us on that path.
* [[/Archives/2018|2018]]
+
:On Lili we have chosen to be a learning AND linguistic diversity audio database. When you account for gender, regional accents, age, voice type, having 40 french audios for a word is still 400+ voices short.
<!-- ****      DO NOT ARCHIVE CONTENT ABOVE    **** -->
+
:Also, all contributors are not able to contribute audio perfect files due to various shortcomings (hardware, no recording room, no noose cancelling system, etc). We lack proper rating and review system. It's on our [slow] roadmap tho. 😉
 +
:PS: Should i answer to you in French i get a feeling you are French or learning it. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:07, 1 June 2022 (UTC)
 +
:: {{Ping|YUG}} Salut, Yug. Oui, je suis en train d'apprendre le français. Comme nous avons discutez pendant notre reunion, c'est difficile de definer les limits d'une language. Comme je le vois, les formes lemma ne suffit pas. Maintenant, je suis en train de crée un Olafbot sur steroid pour francais. Mon plan est de réaliser un program python qui peux analyser les modèle utilizer sur Wiktionary. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 15:48, 7 June 2022 (UTC)
 +
:Hi {{ping|Languageseeker}}. I'm sorry I did not visit the Chat Room in a long time, and missed your report. Very interesting, good job! I remember a request I made to [[User:Olaf|Olaf]] some time ago: it would be interesting to have a list similar to the one Olafbot is updating, but containing only lemmas of the target language (to quickly have nearly all lemmas of a dictionary illustrated with an audio pron). Also, I suggest you to use the categories of the French version of Wiktionary when you plan to work on French (and some other languages, that are more extensively described there). As you can see [[:fr:wikt:Catégorie:Lemmes_en_français|here]], the category gathering French lemmas is more than 3 times more complete on the fr. version than on the en. version of Wiktionary. As you mentioned, these numbers are exciting, let's keep up the good work! All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 15:47, 26 November 2022 (UTC)
 +
::: {{Ping|WikiLucas00}} Sorry, I totally forgot about your request. The list is now ready for French: [[List:Fra/Filtered-lemmas-without-audio-sorted-by-number-of-wiktionaries]]. It's produced like the other lists, but it's limited to words from Catégorie:Lemmes_en_français. The list will be refreshed together with the rest. [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 16:54, 14 May 2023 (UTC)
 +
:::: Hello {{ping|Olaf}}! Thank you so much for this list, it's going to be very useful for sure! Let's cover 100% of Lemmas 😎 I'll tell the French contributors on Discord about it 😉 All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 22:18, 20 May 2023 (UTC)
  
== Datasets out of date ==
+
== How to create user page ==
Hello. It seems that the datasets page, although it claims to run every 2 days, is completely out of date: all the available zips are from April 2020 or November 2019 (and the full zip from May 2019). Is this a known problem? Is there a plan to address it? [[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baley|talk]]) 23:17, 27 August 2020 (UTC)
 
:Indeed, it seems to have an issue with the dataset updating. I opened a [[phab:T261519|Phabricator ticket]] about this issue. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 18:24, 28 August 2020 (UTC)
 
  
== About the exclusion of already recorded words ==
+
Hello, my user name is Ngangaesther from Kenya. I am still  stuck on how am supposed to create my user page kindly help
Hi, I think the option to exclude words that I have already recorded is broken. This morning, I start a recording session and LL proposes me words that I registered two days ago. For example, I already registered [https://commons.wikimedia.org/wiki/File:LL-Q143_(epo)-Lepticed7-Belorusino.wav Belorusino] two days ago, but it does not disappear when I click exclude words already recorded. And notice the two versions of the file, which I already re-recorded it. Can someone fix this? [[User:Lepticed7|Lepticed7]] ([[User talk:Lepticed7|talk]]) 10:07, 15 November 2020 (UTC)
+
regards Esther
:I have opened a [[phab:T267876|Phabricator ticket]]. It may be fixed in the coming months but not sure. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 20:05, 15 November 2020 (UTC)
 
  
== Reminder : Grants ==
+
== Odia language missing from [[LinguaLibre:Stats/Languages|Stats/Languages]] ==
Hello all, I'am monitoring grants these days and there is a summary table available here [[LinguaLibre:Grants]]
 
  
I think both rapid grants mechanisms could be of help to us now, to reach out to local community via small scale events, training, hardware, food, transportation costs, flyers' designs, etc. By example, [[:meta:Wikimedia_France/Micro-financement/Demande/µFi-2020-10-421441|This WM-France micro-fi's request]] organizes 4 evenings of contribution, getting 100€ for each evening. The same user has been welcome to do several Grant requests.<br> Heavier, the R&D Grant could surely be used for something. I have an idea on this, but we can trust Indian contributors to come up with relevant technical ideas and teams as well. {{ping|Titodutta}} [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 01:20, 8 February 2021 (UTC)
+
Hi there, for some reason, the Odia-language stats are missing from the [[LinguaLibre:Stats/Languages|Stats/Languages]] page. Also, "The most prolific speakers for the current month
 +
" section in the [[LinguaLibre:Stats/Speakers|Stats/Speakers]]  page is not loading at all since the time I checked last (about 10 days). I have tried on Chromium and Firefox and the result is the same even after clearing cache. --[[User:Psubhashish|Subhashish]] ([[User talk:Psubhashish|talk]]) 19:40, 28 July 2022 (UTC)
 +
:Hello [[User:Psubhashish|Subhashish]], it should be back online. We had a hackathon to put it back. We are calling for devs to push forwards. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 11:07, 10 August 2022 (UTC)
 +
:: Thank you for the update, [[User:Yug|Yug]]. --[[User:Psubhashish|Subhashish]] ([[User talk:Psubhashish|talk]]) 14:00, 10 August 2022 (UTC)
  
== LinguaLibre Bot and Wikidata ==
+
== Manually-coded languages ==
{{Move|LinguaLibre:Technical board|type=section}}
 
I have not checked the bot's contrib on Wikidata for quite some time. Yesterday I uploaded ~100 Bangal film names from Bangla Wikipedia. It looks like [https://www.wikidata.org/w/index.php?target=Lingua+Libre+Bot&namespace=all&tagfilter=&start=&end=&limit=50&title=Special%3AContributions the bot is not] active, unless I am missing something. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 18:10, 13 February 2021 (UTC)
 
  
== Update and technical improvements ==
+
I came across [[:meta:Lingua Libre/SignIt]] recently (via betawiki) and was wondering if manually-coded languages would be appropriate for this as well? These are languages in sign modality, but strongly tied to a spoken/written language; they usually adopt the grammar of the nonmanual language, choosing instead to simply transpose the vocabulary. This means they are most often used in application-specific and pidgin contexts (Pidgin Sign for English and diver's signs are examples). In particular, I am interested in ''toki pona luka'', a manual form of {{q|338540}}. Since the vocab is the same as spoken/written toki pona, there are a minimal number of lexemes overall, so having a complete set of signs is easily achievable. Manually-coded languages including ''toki pona luka'' are generally not given a separate ISO 639 code since they are in effect equivalent to scripts. Would this cause a problem for the infrastructure as currently designed? [[User:Arlo Barnes|Arlo Barnes]] ([[User talk:Arlo Barnes|talk]]) 05:56, 17 August 2022 (UTC)
  
Hi all,
+
----
  
Full information and full disclosure, I'm working now with WikiValley and Wikimédia France in a paid capacity to help improve Lingua Libre technical structure (see [https://www.wikimedia.fr/emploi-wikimedia-france/nos-appels-doffres/appel-doffres-developpement-et-amelioration-de-loutil-web-lingua-libre/ this] - in French - for the scope of our intervention).
+
Hello [[User:Arlo Barnes|Arlo Barnes]],
  
One of our first action last Thursday was to restart the Blazegraph updater. A lot of tools are depending on this "fundamental brick" (including but not limited to): the SPARQL endpoint (and pages using it) and bots. Now, you can see that pages like [[Special:MyLanguage/LinguaLibre:Stats]] are up-to-date again and the bots should also restart soon (you can see more technical info on this on [[LinguaLibre:Technical board]])).
+
I understand "manually coded languages" as synonymous to "signed languages", am I correct?<br />If there is no distinct ISO for the signed language, we could still:
 +
* Create a new wikidata item without ISO, which will be used as identifier by LinguaLibre infrastructure
 +
* Use the spoken/write language ISO, and create lists of words all suffixed by <kbd>(signed)</kbd>.
 +
Either of those solutions could work.
  
The next big step will be to update this Mediawiki from 1.31 to 1.35 and moving it to a new server.
+
If you have some knowledge of signed ''toki pona luka'' please let me know. We are adding features on Lingualibre and SignIt in order to be able to record video of signed words by late 2022. We are almost there. If you would like to record some basic signed words to share with the world, then let me know. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:58, 17 August 2022 (UTC)
  
If you see something or anything wrong or strange, don't hesitate to let me know. I'm also available for any question.
+
: Signed languages and manually-coded languages share similarities (the manual modality) and differences (since sign languages are 'native' to the signed modality, they use it more fully, having complete deixis and time-reference systems, use of handshape classifiers, etc.) -- 'luka' means 'hand'/'five', so that's the part of the name that indicates the manual modality, but otherwise it's just garden-variety toki pona. I am interested in using SignIt to record this vocab, yes. The '(signed)' suffix seems like a good way to do it. [[User:Arlo Barnes|Arlo Barnes]] ([[User talk:Arlo Barnes|talk]]) 13:16, 19 August 2022 (UTC)
 +
::[[User:Arlo Barnes|Arlo Barnes]]: We increasingly have [[:commons:Commons:Bots/Requests/Dragons_Bot_(2)|tools]] to update and correct sign language recordings, so the suffix <code>(signed)</code> or the solution we choose appears incorrect, we still can correct it later using that bot.
 +
::I would encourage you to first train yourself and learn that manually-coded language over the coming months. Indeed, we still have a very last bug within our video recording chain, which makes rightful videos appears as audio on Commons. We expect to solve this last issue this fall (September or October ?). So for now, I encourage you to rest well, reload energy, to get ready to record later this year. Maybe identify near you some suitable place with elegant monochrome wall to film over or consider building yourself a low-cost recording studio,. Etc.  We can discuss it to keep it low cost and effective if you are interested, as I'm also looking for such walls and/or considering building one for myself.
 +
::See also : [https://github.com/lingua-libre/SignIt/issues/18 Minimal Sign Language Studio guideline]. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 22:30, 19 August 2022 (UTC)
  
Cheers, [[User:VIGNERON|VIGNERON]] ([[User talk:VIGNERON|talk]]) 08:56, 15 February 2021 (UTC)
+
== Update my username ==
:Nice ! Happy to see you folks jumping in. Thank you for the Stats ! We can witness our passage over 400,000 audios shortly. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:27, 15 February 2021 (UTC)
 
  
== 400,000 ==
+
I have changed my Wikimedia username but the previous name still appears in Lingua Libre. I know it's not included in unified logins. Anyway, please update my username to Aishik Rehman. [[User:Hirok Raja|Hirok Raja]] ([[User talk:Hirok Raja|talk]]) 15:14, 1 September 2022 (UTC)
 +
: Hi Hirok Raja¸would you have an example of what you would like to see to be changed? I think you are talking about the filename but I am not sure, so with one example, it would be clearer. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]])
 +
::{{ping|Pamputt}} <br/> 1. Top menubar of lingualibre.org showing 'Hirok Raja' as my profile name. <br/> 2. After uploading when I try to check my uploads in Commons, it takes me to https://commons.m.wikimedia.org/wiki/Special:ListFiles/Hirok_Raja page. <br/> 3. 'Hirok Raja' being used as Default recorder in the file names and description <br/> 4. Change speaker name to 'Aishik Rehman' every time while recording is quite annoying to me. <br/> 5. Even here 'Hirok Raja' is showing as my signature by default ): [[User:Hirok Raja|Hirok Raja]] ([[User talk:Hirok Raja|talk]]) 19:16, 2 September 2022 (UTC)
 +
:::I suspect this is due to long term cookies. Would be interesting to push a clean up for your connection cookies for Lingualibre, it will log you out, then come back here. [https://support.mozilla.org/en-US/kb/storage?as=u&utm_source=inproduct&redirectslug=permission-store-data&redirectlocale=en-US On firefox].
 +
:::Open <code>about:preferences#privacy</code> > Go to "Cookies and Site Data"> Click "Manage Data" > Search "Lingualibre" > Remove selected. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 21:10, 2 September 2022 (UTC)
  
The total amount of recordings on Lingua Libre reached '''400,000''' a few hours ago. February is already the second most fruitful month since the beginning of the project, even though we are only halfway through. LiLi is growing faster and faster, and this is only the beginning!<br/>Congratulations and thanks to everyone who gives some time to record voices and to spread the project around the world.<br/>
+
== Siège communautaire de Wikimédia France – ouverture du vote / Community representative to Wikimédia France’s board - votes are opened ==
All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 18:10, 16 February 2021 (UTC)
 
:And another milestone broken ! Big thanks to the [[user:Titodutta|Titodutta]] and Marathi effects, too ! [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 21:24, 16 February 2021 (UTC)
 
::[[User:Yug|Yug]], [[User:WikiLucas00|WikiLucas]] and [[user:Titodutta|Titodutta]]- thanks for the support! Marathi community had decided to gift minimum 5000 records on the occasion of [[:en:Marathi Language Day|Marathi Language Day]] to be celebrated on 27 February. We have crossed 6000 records as of now. All credit goes to community members. [[User:सुबोध कुलकर्णी|सुबोध कुलकर्णी]] ([[User talk:सुबोध कुलकर्णी|talk]]) 05:22, 26 February 2021 (UTC)
 
::See also [[:Commons:Category:Lingua_Libre_pronunciation-mar]]
 
:::Congratulation to the Marathi community ! It's nice to see you contributes this way :) [[User:Yug|Yug]] ([[User talk:Yug|talk]])
 
  
== Chat room in your language ==
+
(English version below. Do not hesitate to correct my English translation.)
  
Hi all. I've created [[Template:Lang-CR]] in order to list all the chat rooms. I think it would be interesting for people to discuss in their native language. The main discussion should remain on this chat room in English in order to be understood by most of the contributors. So feel free to create a village pump/chat room in your mother tongue. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 20:21, 16 February 2021 (UTC)
+
(Message copié depuis le bistro du jour par [[User:Lepticed7|Lepticed7]] ([[User talk:Lepticed7|talk]]))
:It is welcome move. We need to discuss many local issues, policies, approaches, ideas etc. in own language. I have created Mar page [[LinguaLibre:संवाद-चर्चा दालन|संवाद-चर्चा दालन]]. Let me know whether the process is right. I will start engaging speakers here. [[User:सुबोध कुलकर्णी|सुबोध कुलकर्णी]] ([[User talk:सुबोध कुलकर्णी|talk]]) 05:36, 26 February 2021 (UTC)
 
::{{ping|सुबोध कुलकर्णी}} that's perfect. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 06:40, 26 February 2021 (UTC)
 
  
== New batch of lists available ! (1,000 languages) ==
+
Bonjour,
:''Please, remember to tag the list_talk's page with {{tl|UNILEX license}}.''
 
Greetings!<br>Thanks to [[:commons:user:Tshrinivasan|Tshrinivasan]] with who we discussed recent Indic (Marathi!) activity and lack of lists, I bumped again into UNILEX (GNU-like license), which is a Google-led Unicode Consortium project listing vocabulary for 999 languages. Data seems clean as far as I can tell. The two main maintainers are Google folks. So I suspect UNILEX uses Google's best scrappers and NLP cleaners. Within this data are tab-separated frequency lists as <code>{item}  {number_of_occurences}</code>. I forked their github, and made a script to convert their format into Lili's <code>List:*</code> format such as <code># {item}</code>. See:
 
* [https://github.com/lingua-libre/unilex/ github.com/lingua-libre/unilex]/[https://github.com/lingua-libre/unilex/tree/master/data/frequency-sorted-hash data/frequency-sorted-hash]/[https://github.com/lingua-libre/unilex/tree/master/data/frequency/ig.txt ig.txt] – frequency
 
* [https://github.com/lingua-libre/unilex/ github.com/lingua-libre/unilex]/[https://github.com/lingua-libre/unilex/tree/master/data/frequency-sorted-hash data/frequency-sorted-count]/[https://github.com/lingua-libre/unilex/tree/master/data/frequency-sorted-count/ig.txt ig.txt] – sorted
 
* [https://github.com/lingua-libre/unilex/ github.com/lingua-libre/unilex]/[https://github.com/lingua-libre/unilex/tree/master/data/frequency-sorted-hash data/frequency-sorted-hash]/[https://github.com/lingua-libre/unilex/tree/master/data/frequency-sorted-hash/ig.txt ig.txt] – Lili's List format
 
You can check if there is your own language among the 999 available. For Marathi, replace <code>ig</code> by <code>mr</code>. I therefor created 2 local lists to test this approach :
 
* [[List:Mar/words-by-frequency-00001-to-01000]] – starts soft
 
* [[List:Mar/words-by-frequency-01001-to-05000]] – then I jumps to multiples of 5,000 : 01001-05000, 05001-10000, 10001-15000, etc.
 
'''<span style="color:green">Right now, 1000 lists are already formated in Lili's syntax within the [https://github.com/lingua-libre/unilex/tree/master/data/frequency-sorted-hash /data/frequency-sorted-hash] directory.'''</span> If any community lacks wordlists on Lili's there you have them : copy, paste, done, situation unlocked ! [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:40, 24 February 2021 (UTC)
 
:{{ping|Titodutta}} hi! This may interest your community. There are dozen(s) Indic languages :) It could also help you. You already recorded most of those words for your language (ben), together with the "ignore already recorded words" functions, these lists can fill some gaps :) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:48, 24 February 2021 (UTC)
 
::* I love this. I'll inform the Marathi folks. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 17:16, 24 February 2021 (UTC)
 
::* This is just amazing. You don't know how much delighted I am feeling at this moment. I checked the Bengali list, a very few random words have typos, but that should not be more than 1% I guess. Over-all this will an extremely helpful resource for the communities. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 17:24, 24 February 2021 (UTC)
 
:::* I share your enthusiasm ! It's bot created I'am pretty sure, the clean up is likely just statistical. Now that those lists are technically available, ideal next step would be human review by local communities. Maybe groups of 2~3 users for copyedit sprints ? :D But this is optional IMHO. Also, the corpora coming from online documents, IRL objects like `chair`, `car`, `walk`, may be further down on these lists. But they must be there in the first 20,000 items. The best is the linguistic diversity of this set. Amazing. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:10, 24 February 2021 (UTC)
 
::::*It's a good resource indeed. Thanks! The Marathi words in the list are grammatically correct also, with nearly no typos. We have started discussion about this in our community. Currently, we have started working on Lexemes first, the recordings of the lists thus created will be done simultaneously. The community thinks this approach is more useful in long run. The separate group of speakers may adopt these lists. But then we have to devise way to avoid repetitions. We will definitely discuss more on this resource utilisation and let you know.[[User:सुबोध कुलकर्णी|सुबोध कुलकर्णी]] ([[User talk:सुबोध कुलकर्णी|talk]]) 05:14, 26 February 2021 (UTC)
 
[[:commons:user:Tshrinivasan|Tshrinivasan]], [[User:Yug|Yug]] - Marathi community plans to work on these lists. But [https://github.com/lingua-libre/unilex/tree/master/data/frequency-sorted-hash] giving 404 error. Please help. [[User:सुबोध कुलकर्णी|सुबोध कुलकर्णी]] ([[User talk:सुबोध कुलकर्णी|talk]]) 05:54, 5 March 2021 (UTC)
 
:[[:commons:user:Tshrinivasan|Tshrinivasan]], [[User:सुबोध कुलकर्णी|सुबोध कुलकर्णी]] : It's in active developements these days so I made few changes.
 
:* Currently at: [https://github.com/hugolpz/unilex-extended/tree/master/frequency-sorted-hash /hugolpz/unilex-extended/frequency-sorted-hash] which uses UNILEX as a git submodule to respect each project's scope.
 
:* I just ran the script for Marathi, so the lists are now local. When picking a list, type <code>List:Mar/M</code>:
 
:See also section below. My apologize for the changes. Hope it didn't affected you too much. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 07:47, 5 March 2021 (UTC)
 
=== Pause before running ===
 
[[File:Long_tail.svg|thumb|Long tail curves likely applies to languages ranked by number of speakers. Since macro-languages such Mandarin, English, Spanish, Hindi, etc are certain to be soon audio documented by the sheer force of demography, our effort-strategy should progressively shift toward the right, and increasingly rare languages. The rarer the languages and speakers, the more listening we should become and the more custom assistances we will have to provide.]]
 
[[User:Dragons Bot|Dragons Bot]] has been created, coded, tested, and is ready to import UNILEX's lists to LinguaLibre's <code>List:{iso}/{title}</code> namespaces. Given 1,000 pages and associated talk page will be create, I would like to pause few days to consider about this large list import / creation and why.
 
* Lili > Languages > existing breath: We reached 110 languages on LinguaLibre so far.
 
* Lili > Lists > non-sorted by usefulness : Sparql queries provides lists for all languages, but without prioritization on words' usefulness.
 
* Lili > Lists > sorted by usefulness :
 
** Hand picked frequency lists are present for about 7 languages : eng, mar, por, pol, tam, ron, kur. With optimal relevance for teaching/learning.
 
** {{u|Olafbot}}'s <code>List:*/Lemmas-without-audio-sorted-by-number-of-wiktionaries</code> for 72 languages, updated daily, with optimal relevance for wiktionaries.
 
** UNILEX can provide frequency lists for 1,000 languages. About 10 times our current language coverage. UNILEX plugs itself upon [https://github.com/Google/Corpuscrawler Github.com/Google/Corpuscrawler], and open source project which plan to support more languages. I dived into these chain and it's an 'easy' NLP pipeline to contribute too. The wikimedia comunity can use it and expand it.
 
'''Core issue:''' the core issue from online arrival of users is to increase retention of minority and semi-rare languages by smoothing their speakers work.  By example an user of [[:en:Wayuu language|Wayuu language]] arrived today. We local (frequency) list was available today. But UNILEX + Dragons Bot can provide a local Wayuu frequency list of 8000 items, ready to record.<br>
 
Since we don't know which semi-rare languages will come next, having 1,000 languages ready is a safe yet not so excessive bet. Assuming a [[:en:Zipf's law]]/[[:en:Long tail]] curve for languages and their speakers we can still predict that at least one out of 10~20 new language's speaker will miss a local wordlist. But together with OlafBot's lists, we move from 6% toward 90% of our languages habing a solid, '''usefulness-based roadmap''' to walk forward. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:21, 3 March 2021 (UTC)
 
: Well, I believe the idea to import Unilex lists is very good. One of the things a new user needs most is an idea of what to record. The Unilex lists suit this function, especially in the case of new languages, where there is no other list available, and no words have been already recorded. The only question I see is how to import the Unilex lists. Perhaps the best idea is to import 1000 most frequent words from each list. It would be even better if the recorded words were automatically removed from the lists and replaced by new ones (like in the case of Olafbot-managed lists), but even a static list is good as bait if the goal is just to attract more speakers of rare languages.
 
: One remark: you should translate the file names from Unilex to match LiLi's language codes (or perhaps you did it, I don't know, I didn't examine the code). It's not always the same, for example, Polish is "pl" in Unilex, and "Pol" in Lili. If you leave the old codes, the list won't be automatically found when a new user presses the "Local List" button. Anyway, the newbies are likely not to notice the lists at all regardless of all our efforts. [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 00:55, 4 March 2021 (UTC)
 
  
== jQuery.Deferred exception: this.pastRecords is undefined ==
+
En tant que président de la commission électorale pour [[:m:Wikimédia France/Gouvernance/Siège communautaire|l'élection du siège communautaire au conseil d'administration de Wikimédia France]], je vous annonce que le vote ouvre aujourd'hui (13 septembre) à 0h CEST. Il se terminera le 26 septembre à 23h59 CEST.
:''This discussion may be moved to [[LinguaLibre:Technical board]].''
 
Hello, there.
 
  
When I try to load a list of words to record from the FR wiktionary, the modal does not disappear when I click "Done" and seems blocked trying to load the words. During this time, the JS console complains that "jQuery.Deferred exception: this.pastRecords is undefined", and the last resource loaded is, in cURL format:
+
Comme il y a trois ans, le scrutin est public sur Meta. Les pages de votes sont disponibles dans [[:m:Category:Wikimédia France/Gouvernance/Siège communautaire/2022/Votes|la catégorie correspondante]] ou en lien sur la page principale. C'est un scrutin par approbation, le candidat qui aura le plus grand nombre de voix sera donc déclaré élu. Vous pouvez voter pour autant de candidats que vous le souhaitez.
curl 'https://fr.wiktionary.org/w/api.php?action=query&format=json&origin=*&formatversion=2&prop=pageterms&wbptterms=label&generator=categorymembers&gcmnamespace=0&gcmtitle=%3ACat%C3%A9gorie%3ALocutions%20verbales%20en%20fran%C3%A7ais&gcmtype=page&gcmlimit=max' -H 'User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:85.0) Gecko/20100101 Firefox/85.0' -H 'Accept: application/json, text/javascript, */*; q=0.01' -H 'Accept-Language: de,en-US;q=0.7,en;q=0.3' --compressed -H 'Origin: https://lingualibre.org' -H 'DNT: 1' -H 'Connection: keep-alive' -H 'Referer: https://lingualibre.org/' -H 'TE: Trailers'
 
  
Looks like there is a bug…
+
Si vous avez des questions, vous pouvez les poser sur la page de discussion ou par courriel à election@wikimedia.fr.
  
Regards.
+
Pour la commission électorale, Mathis B, le 12 septembre 2022 à 22:00 (CEST)
[[User:LoquaxFR|LoquaxFR]] ([[User talk:LoquaxFR|talk]]) 17:21, 24 February 2021 (UTC)
 
:Salut {{u|LoquaxFR}}, peux-tu décrire précisément ce que tu fais lorsque tu écris "when I try to load a list of words to record from the FR wiktionary" ? Comment charges-tu la liste de mots, le fais tu en utilisant en utalisant l'option « Catégorie Wikimedia » sur la droite ou bien en créant toi-même la liste de mots un par un ? Si tu utilises « Catégorie Wikimedia », peux-tu nous donner la catégorie que tu veux utiliser ? Est ce que tu arrives à reproduire le problème quelle que soit la catégorie avec laquelle tu veux travailler ? Merci d'avance pour ces renseignements qui je l'espère pourront permettre de cerner le problème le plus précisément possible. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 17:58, 24 February 2021 (UTC)
 
:: En français, ce sera plus simple, en effet. Le problème se reproduit systématiquement lorsque j’essaye d’utiliser une catégorie Wikimédia (celle du wiktionnaire français en l’occurrence); je n’utilise que cette possibilité pour charger des mots, et le problème apparaît pour toutes les catégories que j’essaye d’utiliser, que j’aie déjà enregistré presque tous les mots ou celles pour lesquelles je n’ai fait qu’une petite partie des milliers de termes. Le problème se produit en navigation privée également, donc ça ne semble pas être le cache ou les cookies. Si besoin de plus d’infos, n’hésite pas. [[User:LoquaxFR|LoquaxFR]] ([[User talk:LoquaxFR|talk]]) 18:08, 24 February 2021 (UTC)
 
:::Merci pour les infos supplémentaireS. Je viens de tester avec Firefox 78.7 et je ne rencontre pas ce problème. Peux-tu essayer avec un autre navigateur (Chromium ou autre) pour voir si le problème est inhérent à ton firefox (y compris en navigation privée). Ca peut par exemple venir d'un gadget que tu aurais installé. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 18:40, 24 February 2021 (UTC)
 
::::Addons Firefox qui casse le JS ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:57, 24 February 2021 (UTC)
 
::::: Chrome et Safari me donnent le même résultat ; j’ai également essayé depuis une autre bécane et un autre OS, sans mieux : l’erreur JS se montre toujours et rien ne se passe au moment de la validation de la modale. Est-ce que j’aurai enregistré trop de mots, faisant bugger le JS lorsqu’il essaye de retirer ceux déjà enregistrés ? Vu qu’on n’est que quelques-uns à en avoir enregistré autant, ça se pourrait. J’avais déjà remarqué que le chargement de listes depuis le Wiktionnaire mettait de plus en plus de temps pour moi (relativement, hein : quelques secondes d’attente au plus).  Est-ce un autre problème lié à mon compte ? [[User:LoquaxFR|LoquaxFR]] ([[User talk:LoquaxFR|talk]]) 06:30, 25 February 2021 (UTC)
 
::::::Merci pour les compléments d'info. J'ai ouvert [[phab:T275734|T275734]]. Faudrait voir avec {{u|Lepticed7}} et {{u|WikiLucas00}}, qui ont sensiblement le même nombre d'enregistrements que toi, pour tester si ils rencontrent aussi le même problème. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 06:54, 25 February 2021 (UTC)
 
::::::: Salut, perso, je sais pas si c’est lié, mais il y a certains enregistrements que le Record Wizard ne retire pas quand je veux retirer les mots déjà enregistrés. En atteste [https://commons.wikimedia.org/w/index.php?title=File%3ALL-Q143_(epo)-Lepticed7-aprilo.wav ce fichier], que j’ai enregistré trois fois. [[User:Lepticed7|Lepticed7]] ([[User talk:Lepticed7|talk]]) 10:45, 28 February 2021 (UTC)
 
  
== 50,000 ==
+
----
February 2021. This month. We have seen 50,000 pronunciation in a month (see [[LinguaLibre:Statistics]]). This is for the first time we saw 50,000 entries in a month. This is great. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 08:51, 28 February 2021 (UTC)
 
:That's really amazing. The same month we passed 400k recordings! AND the shortest month in the year! I'm going to prepare a small News to be published every month (inspired by what you did in September if I remember correctly), I think February is a very good month to start with! I'll publish it on your talk page if you'd like 🙂 All the best ! — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 16:11, 28 February 2021 (UTC)
 
:* We can actually officially start a bi-monthly [[LinguaLibre:Newsletter]] to published on 1 March, 1 May, 1 July and so on. What do you think? I am also requesting [[User:Pamputt]], [[User:Yug]], [[User:Lyokoï]], [[User:Lepticed7]] to comment. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 17:40, 28 February 2021 (UTC)
 
:::I would say, why not but I cannot lead for such project so if you are motivated to write and lead such newsletter, go ahead. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 18:39, 28 February 2021 (UTC)
 
::::On the [[LinguaLibre:Technical board/intro]] Poslovitch has started a [[LinguaLibre:Technical board/News|/News]] section which keeps log of important milestones. It's an interesting idea because it's minimalist, therefor low maintenance.
 
::::I'am also interested by a Newsletter for both external and internal purpose. I would help around yes. Editorial line would gain to be clarified: who are the expected readers, writing stuly, overall length, major sections, sections lenghts, etc. But this can "appears" with the first few issues :) Please keep a balance so the writing workload stays modest. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:57, 28 February 2021 (UTC)
 
:::::The /News of the technical board is mostly about technical news. '''I fully agree to the idea of a Newsletter, yet quarterly'''. We could grab some ideas from the French Wiktionary's ''[https://fr.wiktionary.org/wiki/Wiktionnaire:Actualit%C3%A9s Actualités]''. --[[User:Poslovitch|Poslovitch]] ([[User talk:Poslovitch|talk]]) 20:33, 28 February 2021 (UTC)
 
:::::* Salut, let's start with the newsletter of March. I'll add the stories I know such as 400,000 audios, 50,000 this month, the Wikimedia Wikimeet India, upcoming France-India call, French Wiktionary missed recording work etc. I'll start the draft tomorrow and ping you here.<br>In future we will need [[:mw:Extension:MassMessage]] to send newsletter to subscribers' talk page. A system admin is needed with access to the server and localsettings.php etc pages. I understand this will take time, so it can wait. Kind regards. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 21:24, 28 February 2021 (UTC)
 
:{{Ping|Titodutta}} hi, We are having on the mailing list another discussion about networking, cooperations and outward communications. I think the [[LinguaLibre:Newsletter]] page can be modeled upon Technical board and [[LinguaLibre:Bot]], a kind of hub for a subgroup of active users dedicated to a common goal. In this case <u>Communication</u>. The bimonthly Newsletter could be a core, founding element. But other discussion about outreach could take place there. We have so much to push in this direction : academic outreach, rare languages and under-represented countries, partner institutions, calling for new wikimedians, reminding far-away Wikimedian chapter of Lingualibre, etc. Having a hub dedicated to writing elegant co-edited texts, defining targets and leading the call for communication campaign would be a strong plus. I'am still focused on codes but I could help in few weeks. You seems to love it as well. Do we have other users interested to join such efforts ? Would be good to have few more folks. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:39, 2 March 2021 (UTC)
 
  
=== Newsletter : March 2021 review ? ===
+
(Message copied from the French Wikipedia Bistro by [[User:Lepticed7|Lepticed7]] ([[User talk:Lepticed7|talk]]))
:''You can co-edit this text. PS Titodutta: a rough summary of past months and emerging directions based on a message to an ex-contributor.''
 
In January and February, the « Lili » community has taken back control of the technical stack (access to servers, GitHub codes, bots, etc.) and made a call for more diverse speakers. The Indian community started to show up, with key Indic languages being Bengali (50,000) and Marathi (~10,000). Romanian, Polish, Ukrainian are also on the rise around 20,000 audios each. We continue to have some dozen smaller languages showing up but no powerful push yet.
 
  
Right now, an external software company is upgrading our MediaWiki and its modules thanks to Wikimedia France's funding. The volunteer dev team is also strong and internal organization is increasing. We now have [[LinguaLibre:Technical board]] as a tech hub, [[LinguaLibre:Bot]] as a bot hub, [[LinguaLibre:Events]] as an IRL/Online event hub.
+
Hello,
When the main software upgrade settles down in a month we plan a [yet to create] [[LinguaLibre:Newsletter|LinguaLibre:Newsletter/room]] as an inward and outward communication hub.
 
  
In that last dimension, we could reach out to « relay users » on other wikis, who can share our news about LinguaLibre with communities of wiktionaries, wiksources, wikipedias, wikidata. We equally consider formally reaching out to non-Wikimedia groups such as Common Voice, Unicode, governmental and NGO agencies, research centers. Possibly in the form of group work and/or an online editathon when we gather to spread the news. This hub, summarizing the community's discussions, will therefore also clarify goals and strategies. We are looking for help with this matter.
+
as the chairman of the electoral commission for [[:m:Wikimédia France/Gouvernance/Siège communautaire|the election of the community representative to Wikimédia France’s board]], I announce that votes open today (13th september) at 0:00 CEST. They will be closed on 26th september at 23:59 CEST.
  
This current forward dynamic is thanks to the early Autumn 2020's efforts. We weren't able to immediately convert those into actions but it still injected energy and vision into LinguaLibre which helped snowball the current dynamic. Also, many thanks to all those who got involved in this journey! [[User:Yug|Yug]] [[User talk:Yug|<small><font style="color:green;">(talk)</font></small>]] 07:20, 3 March 2021 (UTC)
+
Like it was the case three years ago, voting is on Meta. Voting pages are available in [[:m:Category:Wikimédia France/Gouvernance/Siège communautaire/2022/Votes|the corresponding category]] or as links in the main page. The elected candidate will be the one with the most approbation votes. You can vote for as many candidates as you wish.
:Also, I just found out Commons grows at a speed of [https://stats.wikimedia.org/#/commons.wikimedia.org/content/pages-to-date/normal|line|2-year|page_type~content*non-content|monthly about 1 millions files per month]. So with 50,000 audios last month, Lili makes up to 5% of Commons' new files. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:57, 3 March 2021 (UTC)
 
* Made a minor change, I'll get back to this. Sorry for the delay, something kept me really busy for the last two days. Regards. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 20:20, 3 March 2021 (UTC)
 
  
== Marathi women speakers celebrate 'Women's Day' & 'Women History Month' on Lingua Libre ==
+
If you have any questions, you can ask them on the Talk page on Meta, or by email at election@wikimedia.fr.
  
Greetings of coming World Women's day!<br>
+
For the electoral commission, Mathis B, 22:00, 12 septembre 2022 (CEST)
Glad to share this news. Marathi language community in Maharashtra State of India has taken initiative to record their language from the last 2 months. Out of total 26 speakers, @24 are women from 4 different places in the state. The group has decided to reach 10,000 recording mark to celebrate 'Women's Day' and 15,000 mark in March. As of now 8600+ recordings are uploaded. A small group of women have also started working on Lexicographical data, the recordings of which would be done simultaneously. The activity is being coordinated by institutional partner Jnana Prabodhini, Pune and facilitated by CIS-A2K, affiliate of WMF in India. The community needs support from all of you. Thanks, [[User:सुबोध कुलकर्णी|सुबोध कुलकर्णी]] ([[User talk:सुबोध कुलकर्णी|talk]]) 06:28, 5 March 2021 (UTC)
 
  
:Greeting  [[User:सुबोध कुलकर्णी|सुबोध कुलकर्णी]], nice to witness this enthusiasm.
+
== Is there a way to exclude username from Wikimedia Commons upload file name? ==
:I imported UNILEX lists for Marathi. When in [[Special:RecordWizard|RecordWizard]]'s Step 3 as you pick a list, go for <code>Local list</code>, then <code>mar/M</code> and you will see lists of the most used words. I proposed a '''gentle ramp approach''' : first list has just 200 words, see [[List:Mar/Most_used_words,_UNILEX_1:_words_00001_to_00200]]. Given my experience it will allows better on-the-ground session with new users. 200 is gently ambitious, allows to pass the ''uncanny valley of the first 20 words'', and move to the ''joyful Lingualibre flow of rapid recording''. Perfect for demo and on-boarding. :)
+
:''See also [[Help:Renaming]].''
:Following lists are for motivated users who chose to return. To consolidate skills, list 2 has 800 words while list 3 has 1000. At this state a nice 2,000 audio have been recorded by the speaker, while this words likely make up for 90% of daily conversations.
+
This seems redundant and takes up a lot of space --[[User:Middle river exports|Middle river exports]] ([[User talk:Middle river exports|talk]]) 20:22, 9 October 2022 (UTC)
:It then moves into committed users. List 4 has 3000, the following ones 5,000 words each. These lists are not expected to be done in one strike but over several session of one hour or less, during a dedicated day or along a week or so.  
+
:{{ping|Middle river exports}} Welcome MRE,
:I hope these may help your language community to better on-board interested contributors :)
+
:You could name your speaker with a single character I guess.
:We also encourage development of women speakers networks, so thanks a lot for your lead. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 08:57, 5 March 2021 (UTC)
+
:But keeping the name is voluntary. Each speaker has his/her own voice, which we want to document. If, outside of Wikimedia, you want to remove part of the filename, we have a technical tutorial to do so. See [[Help:Download datasets]] and [[Help:Renaming]]. Ping us back if your dataset is not up to date. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 13:16, 10 October 2022 (UTC)
:Added Marathi lists :
+
::I have solved this now by just changing my username to something shorter. This way I can upload English as Usmaan (عثمان) for example where instead of just repeating the username it shows two scripts which is more useful. (Apparently few enough people have Arabic script usernames that short common words are mostly available.) --[[User:Middle river exports|عثمان]] ([[User talk:Middle river exports|talk]]) 20:23, 10 October 2022 (UTC)
:* [[List:Mar/Most_used_words,_UNILEX_1:_words_00001_to_00200]]
+
:::All Unicode characters should be ok, in words and usernames ;) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 19:46, 11 October 2022 (UTC)
:* [[List:Mar/Most_used_words,_UNILEX_2:_words_00201_to_01000]]
 
:* [[List:Mar/Most_used_words,_UNILEX_3:_words_01001_to_02000]]
 
:* [[List:Mar/Most_used_words,_UNILEX_4:_words_02001_to_05000]]
 
:* [[List:Mar/Most_used_words,_UNILEX_5:_words_05001_to_10000]]
 
:* [[List:Mar/Most_used_words,_UNILEX_6:_words_10001_to_15000]]
 
:* [[List:Mar/Most_used_words,_UNILEX_7:_words_15001_to_20000]]
 
:* [[List:Mar/Most_used_words,_UNILEX_8:_words_20001_to_25000]]
 
:* [[List:Mar/Most_used_words,_UNILEX_9:_words_25001_to_30000]]
 
:[[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:01, 5 March 2021 (UTC)
 
  
::Many thanks [[User:Yug|Yug]] for detailed explanation. These are useful to start with. Our group has taken lexicographical approach now to develop lists. So we need alphabetical lists to get forms of words. For example we create list like this - शरीर, शरीरभर, शरीराकडून, शरीराकडे, शरीराचं, शरीराचा, शरीराची, शरीराचे, शरीराच्या, शरीरात...etc. The members distribute work according to letters. Therefore it will be good if we can get modified lists. - [[User:सुबोध कुलकर्णी|सुबोध कुलकर्णी]] ([[User talk:सुबोध कुलकर्णी|talk]]) 11:22, 5 March 2021 (UTC)
+
== Username update request ==
:::I see. [[User:सुबोध कुलकर्णी|सुबोध कुलकर्णी]], you could use [https://github.com/hugolpz/unilex-extended/tree/main/frequency-sorted-count/mr.txt frequency-sorted-count/mr.txt], keep the 30,000 most frequent, then sort alphabetically and split by hand on each letter. See [[Help:How_to_create_a_frequency_list%3F#UNILEX.27s_lists]]. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 11:53, 5 March 2021 (UTC)
 
:::I tried to pushed it forward but it's a bit more complex than I anticipated. Ideally, you would 1) add a prefix so <code>औ.txt</code> becomes <code>/Marathi_words_starting_with_औ.txt</code>, 2) merge the rarest letters together. I must refocus on non-wiki projects, can you call for help from local wiki-developers ?
 
<pre>
 
# Define language
 
iso=mr
 
# get file, cut out meta, sort by 2nd column (frequency), keep 50000, keep only word, sort by 1st column, alphabetically, save to .txt file
 
curl https://raw.githubusercontent.com/unicode-org/unilex/master/data/frequency/${iso}.txt | tail -n +6 | sort -k 2,2 -n -r | head -n 50000 | cut -d$'\t' -f1 | sort -k 1,1 > ${iso}.txt
 
# get mr.txt content, for all line starting with alpha-num, convert first letter to lowercase, then print in files depending on first symbol
 
cat mr.txt | awk '{file = (/^[[:alnum:]]/ ? tolower(substr($0,1,1)) : "symbol") ".txt"; print >> file; close(file)}'
 
# Remove a to z files
 
find . -regex './[a-z].txt' -delete
 
# Convert to wiki lists format `# {item}
 
sed -i -E 's/^/# /g' `find . -type f -name "?.txt"`
 
# See line counts, sorted numerically descendant
 
wc -l * | sort -n -r
 
# See lines count, if n<200 then print filename, add file to merged.txt
 
wc -l * | awk '$1 < 200 {print $2}' | xargs cat >> merged.txt
 
</pre>
 
:::This already provides the lists by letters. It should put you solidly on the way. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 12:52, 5 March 2021 (UTC)
 
{| class="wikitable"
 
! Without merge (50 files) || With merging (32 files)
 
|- valign="top"
 
|
 
<pre>
 
  99860 total
 
  50000 mr.txt
 
  4976 स.txt
 
  4462 प.txt
 
  3745 म.txt
 
  3545 क.txt
 
  3195 व.txt
 
  2201 न.txt
 
  2183 ब.txt
 
  2134 अ.txt
 
  1789 र.txt
 
  1666 द.txt
 
  1623 आ.txt
 
  1568 ग.txt
 
  1524 ज.txt
 
  1507 त.txt
 
  1376 श.txt
 
  1132 ल.txt
 
  1102 ह.txt
 
  1089 च.txt
 
  1076 उ.txt
 
  1025 भ.txt
 
    809 य.txt
 
    791 फ.txt
 
    766 ख.txt
 
    652 ट.txt
 
    645 घ.txt
 
    480 ए.txt
 
    456 इ.txt
 
    446 ध.txt
 
    420 ड.txt
 
    318 ठ.txt
 
    273 झ.txt
 
    182 थ.txt
 
    163 ओ.txt
 
    118 छ.txt
 
    115 ऑ.txt
 
    64 ऐ.txt
 
    55 ढ.txt
 
    44 औ.txt
 
    29 २.txt
 
    26 ई.txt
 
    20 ष.txt
 
    20 ऊ.txt
 
    20 १.txt
 
    14 ऋ.txt
 
      6 ऱ.txt
 
      4 ३.txt
 
      2 ९.txt
 
      2 ८.txt
 
      1 ॐ.txt
 
      1 ४.txt
 
</pre>
 
|
 
<pre>
 
  4976 स.txt
 
  4462 प.txt
 
  3745 म.txt
 
  3545 क.txt
 
  3195 व.txt
 
  2201 न.txt
 
  2183 ब.txt
 
  2134 अ.txt
 
  1789 र.txt
 
  1666 द.txt
 
  1623 आ.txt
 
  1568 ग.txt
 
  1524 ज.txt
 
  1507 त.txt
 
  1376 श.txt
 
  1132 ल.txt
 
  1102 ह.txt
 
  1089 च.txt
 
  1076 उ.txt
 
  1025 भ.txt
 
    886 merged.txt
 
    809 य.txt
 
    791 फ.txt
 
    766 ख.txt
 
    652 ट.txt
 
    645 घ.txt
 
    480 ए.txt
 
    456 इ.txt
 
    446 ध.txt
 
    420 ड.txt
 
    318 ठ.txt
 
    273 झ.txt
 
</pre>
 
|}
 
: There is also a list [[List:Mar/Lemmas-without-audio-sorted-by-number-of-wiktionaries]] which is updated every day by a bot, so it should be always fresh. The list consists of words that are present in one or more Wiktionaries, but have no recording in Commons. At the top of the list, there are words with the largest number of Wiktionaries. You could probably give it a try too, [[User:सुबोध कुलकर्णी|सुबोध कुलकर्णी]]. [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 16:34, 5 March 2021 (UTC)
 
  
== Automatically updated lists of unrecorded audio ==
+
I realised my username on Mediawiki didn't carry over here when I changed it. On thus site could I please have it changed to: عُثمان
 +
--[[User:Middle river exports|عثمان]] ([[User talk:Middle river exports|talk]]) 08:45, 10 November 2022 (UTC)
  
Not everybody here is probably aware that there are lists of unrecorded words available for 72 languages. The lists are sorted by the number of the language versions of Wiktionary where a corresponding word is described, with the most popular words at the top, so the lists should maximize in a way the usefulness of the recording. Words with audio recordings present in Commons are removed automatically from the lists every night. In this way, the lists should be always fresh. The lists have always a title in the form of <code><language code>/Lemmas-without-audio-sorted-by-number-of-wiktionaries</code>: {{olafbot-wikt}}. [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 16:51, 5 March 2021 (UTC)
+
== Data on LinguaLibre:Stats isn't consistant with Wikipedia Commons's Category ==
:This is game changer. Welcoming new contributors of 72 languages will no more be a tricking question of providing relevant lists. More lists coming. We can refocus on outreach and calling for new contributors to audio document their voices, their languages, their cultures. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:15, 5 March 2021 (UTC)
 
  
== Outreach ==
+
On the Stats page, the French have 254,387 records
[[File:Catalan_dialects-en.png|thumb|Dialects of Catalan.]]
 
I used the opportunity of bumping into a currently inactive user to go to his wikipedia (Catalan), ask him where I could announce we now have a {{wikt-list|cat}} list, and went to make [https://ca.wikipedia.org/wiki/Tema:W4oh9xw4ndkefh91 a gentle announcement]. I don't expect it to pay off soon, but by several pings, we should have some folks landing back here on Lingualibre. I didn't contact the ca:wikt community but you see the idea : leaving small many announcements here and there so people know our name. Smaller pings are ok. ''"Sorry all, i've been busy on LinguaLibre project those days"'', this would be helpful too. I tried to emphasis what service Lili provides to them (not sure I was good on that, but it's just a ping :) ). Please when you have the opportunity, reach out to local communities. Especially those not currently active. We have nice lists in 72+ languagea now. Let the wiki folks know and record more. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 08:24, 7 March 2021 (UTC)
 
:{{Ping|Pamputt|}} hi, they started a light conversation-description of Catalan about cat valencia, cat central, cat balearic and cat Western (? not sure it was 3 or 4 different) pronunciations. Do you have any understanding on this Catalan issue ? Is this like Marseille French VS Paris French accents or something else ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:25, 7 March 2021 (UTC)
 
::I do not precisely know how different are these Catalan varieties but they are more different than French from Paris and French from Marseille because theses varieties are considered as different dialects. So it is something like {{Q|930}} and {{Q|1186}} for the Occitan language. So we could start to import this dialect in Lingua Libre to be able to record in these dialects. At least, we should import the main dialects here, namely Northwestern Catalan, Valencian, Central Catalan, Balearic, Rossellonese and Alguerese. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 18:58, 7 March 2021 (UTC)
 
:::It seems to be the wish expressed by [[:ca:User:Vriullop|User:Vriullop]] too, and on another discussion I got. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 19:22, 7 March 2021 (UTC)
 
::::{{Q|518078}}, {{Q|518079}}, {{Q|518087}}, {{Q|518106}}, {{Q|518118}}, {{Q|518128}} are now available, so we can record right now words in these dialects. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 20:09, 7 March 2021 (UTC)
 
  
== License ? ==
+
https://lingualibre.org/wiki/LinguaLibre:Stats/Languages
{{done}}<br/>
 
I bumped again into [[User:Evist0/RecordWizard.json|cc-by-sa]] license for contributions. Aren't we supposed to contribute it all under CC-0 so it's Wikidata compatible ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 21:39, 8 March 2021 (UTC)
 
:The licence is up to the user's choice. --[[User:Poslovitch|Poslovitch]] ([[User talk:Poslovitch|talk]]) 21:54, 8 March 2021 (UTC)
 
::Then what do we do on wikidata ? Ooohhh... It's just a link toward Commons, no a copy of the audio file.... [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 22:53, 8 March 2021 (UTC)
 
  
== Metrics > [https://lingualibre.org/wiki/Special:ListUsers?username=&group=&creationSort=1&desc=1&wpsubmit=&wpFormIdentifier=mw-listusers-form&limit=500 Accounts creations] ==
+
Meanwhile, the Category on commons.wikimedia.org has 253,464 records
Hi everyone !<br>
 
We got about 5 times more account creations this January 2021 (~60) compare to January 2020 (~12).<br>
 
Welcoming is largely done by hand these days. Having a bot for that may help.<br>
 
And, given that we are all overloaded, maybe would be wise to outreach for help. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 23:19, 8 March 2021 (UTC)
 
  
== Help - to delete word  ==
+
https://commons.wikimedia.org/wiki/Category:Lingua_Libre_pronunciation-fra
  
Hi, please guide me how i can delete recorded word from lili. already uploaded on wikimedia commons by mistake. Recorded Marathi word is 'कालका', which i want to delete. Thanks in advance.
+
The stats display more records. This data inconsistency is strange. -- [[User:Shenlebantongying]], 10:36, 23 december 2022.
 +
:This means some item page exist here, but no audio are on Commons.
 +
:Item creation here and upload are done at step 5 of the recording, nearly simultaneously.
 +
:So I don't know what is going on. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 17:41, 26 December 2022 (UTC)
  
:Hi {{u|Aparna Gondhalekar}}, there are two options depending whether "कालका" exists. If "कालका" exists but you record badly, then you just need to record it again and the new recording will replace the previous recording. Or if "कालका" does not exist, we need to delete the file directly on [[c:File:LL-Q1571 (mar)-Aparna Gondhalekar-कालका.wav|Wikimedia Commons]]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 21:18, 9 March 2021 (UTC)
+
== [[c:Category:Lingua Libre pronunciation-bxg]] ==
  
== Wikimania 2021 ==
+
All files in this category are tagged with wrong language. I have requested moves for files in the category, but what's more to be done?--[[User:GZWDer|GZWDer]] ([[User talk:GZWDer|talk]]) 13:05, 12 January 2023 (UTC)
It's not a big surprise, but it have been confirmed : [[:meta:Wikimania 2021|Wikimania_2021]] will be online only. It will limit our outreach. We used to go there and record 10~20 languages, 5-mins demoing to 30 people, and doing workshop to 40+ others. Also got plenty of small chats (100+) raising awareness about Lili and connecting with devs for fast discussions. Will need to find other way this year too. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 21:34, 9 March 2021 (UTC)
+
: Thanks for reporting. Actually all these items are erroneous (see [[Special:WhatLinksHere/Q590228]]):
 +
:* {{Q|798236}} (wrong language code)
 +
:* {{Q|802994}} (wrong language code)
 +
:* {{Q|802995}} (useless)
 +
:* {{Q|802996}} (useless)
 +
:* {{Q|802998}} (useless)
 +
:* {{Q|802999}} (useless)
 +
:* {{Q|803000}} (useless)
 +
:* {{Q|803001}} (useless)
 +
:* {{Q|803002}} (useless)
 +
:I have not checked yet if corresponding recordings are still on Commons. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:11, 13 January 2023 (UTC)
  
== Return with Return ==
+
== I can not publish my records recorded via Lingua Libre. ==
  
So, we are back. Almost after 50 days, we are back to work. Thanks to [[User:VIGNERON]], [[User:Yug]], [[User:Pamputt]] etc who were around. Let's make some noise.
+
Dear Colleagues,  
  
'''Idea:''' I have an idea, can you record the word "Return" or "Come back" (or something similar) in your language and put it in the gallery below? Please mention the language name, and meaning in the caption. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 02:09, 23 April 2021 (UTC)
+
It records, but when I press the button to publish it on Wikimedia Commons. It does not work. It returns as "Retry failed upload" Any idea? Thank you. [[User:Key Mîrza|Key Mîrza]] ([[User talk:Key Mîrza|talk]]) 05:09, 28 January 2023 (UTC)
:"Return/Come back" as in "LinguaLibre is back", [https://en.m.wikipedia.org/wiki/The_Lord_of_the_Rings:_The_Return_of_the_King#/languages :en:The Lord of the Rings: The Return of the King]] (70 languages) or [https://en.m.wikipedia.org/wiki/Return_of_the_Jedi#/languages en:Return of the Jedi] (63), right ? [[User:Titodutta|Titodutta]], please provide some examples / context. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 04:58, 23 April 2021 (UTC)
+
:Is it happening for all your recordings or only some of them? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 08:49, 28 January 2023 (UTC)
* Yes you are right. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 19:30, 23 April 2021 (UTC)
+
:: It was all good until a month ago. Nowadays I am on a vacation in another city and trying to enter to my accout and make some more records. I can enter into my account and I can create records, but I can not publish them. I stuck at publishing stage. Nothing publishing. None of my records publishing. I even tried to record via my cell phone, even there nothig publishing. By the way, I just saw your previous message wecoming me. Thank you, for your kind wish. Best wishes... [[User:Key Mîrza|Key Mîrza]] ([[User talk:Key Mîrza|talk]]) 09:57, 28 January 2023 (UTC)
 +
:::Hmmm, I do not know what to say. Sometimes some recordings do not upload but they other do. When none recording uploads, I do not know what could be the origin. Could you try with another webbrowser (firefox or Chrome)? To go further, I think we would need a Javascript expert that could have some hints. {{ping|Poslovitch|Lepticed7}} maybe ? Another question, how many words do you try to record? If this is a lot, could you try with only a few (less than 10 for example). [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 15:42, 28 January 2023 (UTC)
 +
:::: I tried 11 words together, then even 1 word only for testing purpose. Nothing worked. You said Java. Do I need java to be able to work with the application? If so, that I need to install Java. Because I formatted my PC. May be it is not installed. Thank you. [[User:Key Mîrza|Key Mîrza]] ([[User talk:Key Mîrza|talk]]) 17:06, 28 January 2023 (UTC)
 +
:::::Java is different than Javascript. Javascript is language supported by the webbrowser so you do not need to install anything else than a webbrowser to record pronunciations on Lingua Libre. Unfortunately, I cannot dig further in this direction because I almost know nothing about Javascript. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 21:18, 28 January 2023 (UTC)
 +
:::::: Thank you, anyway. [[User:Key Mîrza|Key Mîrza]] ([[User talk:Key Mîrza|talk]]) 22:38, 28 January 2023 (UTC)
 +
:[[User:Key Mîrza|Key Mîrza]], thank you a lot for your voice, it make us discover new languages. Please be aware Lili works best on solid desktop computers. Also, you likely have a limit of 380 records uploads per 72 minutes. So you may need to leave your tab open, and click "retry" after that. You can expand those right by making a demand on Commons. See [[LinguaLibre:User rights]]. Contact us if you think it may be that. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:07, 5 February 2023 (UTC)
 +
::It's [https://commons.m.wikimedia.org/w/index.php?title=Special%3AUserRights&user=Key+M%C3%AErza confirmed], as all new contributor you are limited to 380 uploads per 72h. You can get more userrights by requesting those rights on Commons. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:15, 5 February 2023 (UTC)
  
=== Return Gallery ===
+
== Late 2022-2023 Winter report ==  
<gallery>
+
Hello all, allow me to share few overall news from the various recent, ongoing, or near-future efforts.
File:LL-Q9610 (ben)-Titodutta-প্রত্যাবর্তন.wav|প্রত্যাবর্তন (''Protyaborton'' in Bangla, means "Return")
+
* 🤖 User:Pamputt has taken over Lingualibre Bot and added support for the Kurdish wiktionary. See github.
File:LL-Q150_(fra)-Yug-retour.wav|Retour (French)
+
* 🌏 Melody (WMFr intern) and myself made a mini-editathon on writing template emails for outreach. See Lingualibre:Events.
</gallery>
+
* ⚡ User:Elfix and myself will attend are collaborating for sparql requests (me) optimization (Elfix). We aim to create and languages gallery this spring.
 +
* 🔴 Wikimedia France's freelance on the record wizard is back on track, delivery of fixes should occur around May-June.
 +
* 🙋‍♀️ Adelaide (WMFr) mentioned the wish of a second intern on Lingualibre outreach this summer, to reuse Melody's assets, expand actions and geographic diversity.
 +
* 🫱🏼‍🫲🏽 Wikimedia France yearly strategic meetup is this week, and is expected to strengthen its (linguistic) diversity and metrics axes, for which Lingualibre is one of their champions.
 +
* 🧓 Eve and myself (likely) will be present at Toulouse's ''Forom des Langues'', in May, where ~60+ languages associations are present.
  
== Translate doesn't seem to work ==
+
For specific deadlines and events coming soon, please also check [[Lingualibre:Events/Program]]. We always welcome contributors. When necessary, WMFr may refund transportation costs. Worth a try ! [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:07, 5 February 2023 (UTC)
  
I can't seem to be able to translate pages, is this an error on my behalf or are there something wrong with the servers? --[[User:Sabelöga|Sabelöga]] ([[User talk:Sabelöga|talk]]) 17:01, 23 April 2021 (UTC)
+
== Edit your nickname ==
:Indeed, something is broken. There is a [[phab:T280972|Phabricator ticket]] to track this issue. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 18:30, 23 April 2021 (UTC)
 
::Okay, thank you. --[[User:Sabelöga|Sabelöga]] ([[User talk:Sabelöga|talk]]) 22:01, 23 April 2021 (UTC)
 
:::Hello {{u|Pamputt}}, I tried to translate several pages from the Wiki directly, to test, taking inspiration from the T:xx translation markers (example: https://lingualibre.org/wiki/Translations:Help:Main/14/fr). An error occurs, always the same. I added a line in your [[phab:T280972|task]], notifying Tgr who may be interested. He may add the tag of the "OAuthAuthentication" project. Cordially. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 14:31, 25 April 2021 (UTC)
 
[[File:Translation error in Lingua Libre.png|600px|erreur de traduction]]
 
:Translations are back. Thanks. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 18:54, 27 April 2021 (UTC)
 
::I still can't seem to be able to translate :( {{ping|Pamputt|Eihel}} --[[User:Sabelöga|Sabelöga]] ([[User talk:Sabelöga|talk]]) 22:12, 28 April 2021 (UTC)
 
::{{u|Sabelöga}} can you describe precisely (or post a screenshot) when you want to [https://lingualibre.org/index.php?title=Special:Translate&group=page-LinguaLibre%3AMain+Page%2Ftext translate the main page]? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 08:29, 29 April 2021 (UTC)
 
:::{{u|Pamputt}} When I click translate it looks like this, and nothing else happens. https://imgur.com/a/fgY1sSl --[[User:Sabelöga|Sabelöga]] ([[User talk:Sabelöga|talk]]) 15:42, 29 April 2021 (UTC)
 
::::{{u|Sabelöga}} Indeed, it is the same behaviour as before. Could it be a problem of cache? Could you try to clear it (see [[w:Wikipedia:Bypass_your_cache|Wikipedia:Bypass_your_cache]] to know how to bypass it if needed). {{u|Seb35}} and {{u|VIGNERON}}, do you have any idea? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 17:26, 29 April 2021 (UTC)
 
:::::{{u|Pamputt}} I've tried to clear cache, to log in on different devices, edit on computer and mobile and translate uninlogged in incognito mode and when I tried to manualy create [[Translations:Help:Configure_your_microphone/1/sv]] this error appeared:
 
<pre>Internt fel
 
[1738fa8dc0b56f3d0f41bed6] /index.php?title=Translations:Help:Configure_your_microphone/1/sv&action=submit Error from line 294 of /opt/mediawiki/1.35/extensions/OAuthAuthentication/auth/OAuthPrimaryAuthenticationProvider.php: Class 'MediaWiki\Extensions\OAuthAuthentication\AuthBlacklist' not found
 
  
Backtrace:
+
Good evening, I would like to change my nickname because it did not update when I was renamed Manjiro91 then Manjiro5 instead of GamissimoYT on Wikimedia projects. Thanks in advance Regards '''[[User:GamissimoYT|<span style="color:#fc3">manȷıro</span>]]<sup><small>[[User talk:GamissimoYT|<span style="font-variant:small-caps; color:#000">💬</span>]]</small></sup>''' 22:53, 23 February 2023 (UTC)
  
#0 /opt/mediawiki/1.35/includes/auth/AuthManager.php(2470): MediaWiki\Extensions\OAuthAuthentication\OAuthPrimaryAuthenticationProvider->providerRevokeAccessForUser()
+
== Tool to prepare words for Lingua Libre ==
#1 /opt/mediawiki/1.35/includes/auth/AuthManager.php(864): MediaWiki\Auth\AuthManager->callMethodOnProviders()
 
#2 /opt/mediawiki/1.35/includes/user/User.php(848): MediaWiki\Auth\AuthManager->revokeAccessForUser()
 
#3 /opt/mediawiki/1.35/extensions/Translate/src/SystemUsers/FuzzyBot.php(17): User::newSystemUser()
 
#4 /opt/mediawiki/1.35/extensions/Translate/TranslateHooks.php(1095): MediaWiki\Extensions\Translate\SystemUsers\FuzzyBot::getUser()
 
#5 /opt/mediawiki/1.35/includes/HookContainer/HookContainer.php(321): TranslateHooks::validateMessage()
 
#6 /opt/mediawiki/1.35/includes/HookContainer/HookContainer.php(132): MediaWiki\HookContainer\HookContainer->callLegacyHook()
 
#7 /opt/mediawiki/1.35/includes/HookContainer/HookRunner.php(1529): MediaWiki\HookContainer\HookContainer->run()
 
#8 /opt/mediawiki/1.35/includes/EditPage.php(1904): MediaWiki\HookContainer\HookRunner->onEditFilterMergedContent()
 
#9 /opt/mediawiki/1.35/includes/EditPage.php(2232): EditPage->runPostMergeFilters()
 
#10 /opt/mediawiki/1.35/includes/EditPage.php(1724): EditPage->internalAttemptSave()
 
#11 /opt/mediawiki/1.35/includes/EditPage.php(680): EditPage->attemptSave()
 
#12 /opt/mediawiki/1.35/includes/actions/EditAction.php(71): EditPage->edit()
 
#13 /opt/mediawiki/1.35/includes/actions/SubmitAction.php(38): EditAction->show()
 
#14 /opt/mediawiki/1.35/includes/MediaWiki.php(527): SubmitAction->show()
 
#15 /opt/mediawiki/1.35/includes/MediaWiki.php(313): MediaWiki->performAction()
 
#16 /opt/mediawiki/1.35/includes/MediaWiki.php(940): MediaWiki->performRequest()
 
#17 /opt/mediawiki/1.35/includes/MediaWiki.php(543): MediaWiki->main()
 
#18 /opt/mediawiki/1.35/index.php(53): MediaWiki->run()
 
#19 /opt/mediawiki/1.35/index.php(46): wfIndexMain()
 
#20 {main}</pre>
 
--[[User:Sabelöga|Sabelöga]] ([[User talk:Sabelöga|talk]]) 21:46, 29 April 2021 (UTC)
 
:Hello [[User:Pamputt|Pamputt]] and [[User:Sabelöga|Sabelöga]], I admit that I didn't search deeply, but I don't understand the change from ''status'' to resolved from [[phab:T280972|T280972 (''Translating does not work anymore'')]]. I still cannot access the Translate pages. Also, the translation wiki pages (page/xxx/''code_language'') are accessible via Translate, so I am willing to believe that the problem is unrelated, but I am confused. A translation page on the wiki is created and read for translation from Translate, is there no cause link? If these pages are blocked, can FuzzyBot update them? Removing the caches does not solve anything. See also [[phab:T281289]]. Why add an old extension version that does not work on MW 1.35 by adding a patch instead of adding what is recommended? Cordially. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 11:10, 30 April 2021 (UTC)
 
::Resolved —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 17:31, 30 April 2021 (UTC)
 
:::It works now, thanks! --[[User:Sabelöga|Sabelöga]] ([[User talk:Sabelöga|talk]]) 20:06, 30 April 2021 (UTC)
 
  
== HIGH PRIORITY: Audio recordings have dust and clicks ==
+
Preparing words to be used in Lingua Libre has always been challenging. But I think this is a shared challenge. Crawling text from different sources and creating a clean list of words is very important. I've used [[User:Titodutta/Bengali_words_from_pages|Tito's]] instructions in the past, but using multiple tabs and multiple tools is not the best user experience. So, I thought I'd create something that is functional for me and simple enough to be tweaked. Introducing [[User:Psubhashish/tools/Prepare words for Lingua Libre|"Prepare words for Lingua Libre"]]. The tool is currently set for Odia but can be easily tweaked for other languages using non-Latin scripts. I'd request Lingua Libre core team to incorporate the tool into Lingua Libre so that users can use the platform to create a wordlist. Extracting words from any random text is always hard, especially new contributors. --[[User:Psubhashish|Subhashish]] ([[User talk:Psubhashish|talk]]) 03:44, 14 March 2023 (UTC)
:''Under investigation: Some users experience parasitic saturation (“Pock!”) or dust while other don't. This irregular occurrence reminds of earlier, non-solved “speed up bug”.''
+
:Hi [[User:Psubhashish|Psubhashish]]. This is really nice. Do you think it would be easy to adapt it to create a [[Help:Create_a_new_generator|new generator]]? Generators can be used by anyone after they import them in their common.js. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 06:44, 14 March 2023 (UTC)
 +
:: Thanks [[User:Pamputt]]. That would be fantastic, but I probably don't have the right knowhow for doing that. I did take ChatGPT's help to create a [[User:Psubhashish/common.js|.js version]] from the [[User:Psubhashish/tools/Prepare words for Lingua Libre|HTML code]] I had shared earlier but would appreciate any help. I think having a tool inside Lingua Libre would be great so really liked the idea of new generators. Common users would like things well packaged rather than jumping from one platform to another. --[[User:Psubhashish|Subhashish]] ([[User talk:Psubhashish|talk]]) 13:09, 14 March 2023 (UTC)
  
I've had friends record German and Romanian lists. They're using separate hardware, and have recorded thousands of words before, so I know their hardware is fine. The recordings they've done today suffer from loud clicks on half the recordings, so there seems to be a problem with the recording studio. I clearly have no idea what the problem is or how to fix it, but I hope someone else will!
+
== Problème de publication des enregistrements  ==
  
Here are examples:
+
Bonjour, il y a quelques années, j'ai renommé mon compte GamissimoYT en Manjiro91. Plus tard, je l'ai renommé Manjiro5. Le problème est que le renommage de mon compte global Wikimedia ne s'est pas fait sur Lingua Libre. Je ne peux donc pas publier les audios que j'enregistre sur LinguaLibre et n'apparaissent pas non plus sur Commons. Pourriez-vous m'aider ? '''[[User:GamissimoYT|<span style="color:#fc3">manȷıro</span>]]<sup><small>[[User talk:GamissimoYT|<span style="font-variant:small-caps; color:#000">💬</span>]]</small></sup>''' 08:41, 26 April 2023 (UTC)
* [[File:LL-Q188_(deu)-Natschoba-der_Wunsch.wav]] — LL-Q188_(deu)-Natschoba-der_Wunsch.wav
 
* [[File:LL-Q7913_(ron)-Andreea_Teodoraa-muscă.wav]] — LL-Q7913_(ron)-Andreea_Teodoraa-muscă.wav
 
* [[File:LL-Q150 (fra)-Hélène (Hsarrazin)-corné.wav]] — LL-Q150 (fra)-Hélène (Hsarrazin)-corné.wav
 
[[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baleytalk]]) 16:24, 24 April 2021 (UTC)
 
  
: J'ai le même souci. [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 17:49, 24 April 2021 (UTC)
+
== Renommer un dialecte en langue ==
::Hmm, very annoying.I 've opened a [[phab:T281041|Phabricator ticket]]. I hope the issue will be fixed soon. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 18:38, 24 April 2021 (UTC)
 
::HIGH priority. No idea who can fix it. Can someone refine the diagnosis ? Can more people test with their configuration and report here ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:33, 25 April 2021 (UTC)
 
:::I notified Mr. Vion, the original coder of the JS recorder. He may have some insights. I suspect it's a bug with either :
 
:::* [https://github.com/lingua-libre/RecordWizard RecordWizard (studio)], the mw extension interfacing the user speaking and the audio processing layers. It got recent changes due to migration to mw 1.35.
 
:::* [https://github.com/lingua-libre/LinguaRecorder LinguaRecorder JS], the core JS library processing audio signal. No changes in past week.
 
:::Recent changes may have affected how the audio cuts are done. Either mw extension or the JS could need a fix.
 
:::This is a core bug preventing LinguaLibre core mission. Any insight is welcome. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:43, 25 April 2021 (UTC)
 
::::So {{Q|522922}} (deu:der_Wunsch), {{Q|522753}} (ron:muscă) and {{Q|523386}} (fra:corné). —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 17:26, 25 April 2021 (UTC)
 
:::::{{ping|Eihel}} the 1st and 3rd ones sounds good to me. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:38, 25 April 2021 (UTC)
 
::::::{{ping|Yug}} the 1st and 3rd ones do not sound good to me, there's a clear click on the "der" and "cor". If you have populated the table below, perhaps your numbers are too optimistic (if we have a different judgement on these three). [[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baley|talk]]) 12:56, 26 April 2021 (UTC)
 
:::::{{ping|Julien Baley|DSwissK|Eihel}}
 
:::::I reviewed recent recordings of 4 users.
 
:::::* Two contributors have perfect audios (100% good on 8 audios checked for each user).
 
:::::* Two new users have the bug (30% of audios with saturation).
 
:::::I first though it could be new users not using their hardware properly : microphone must not be overly sensitive, we should not let them vibrate, etc. It's a know-how we are transmitting when doing IRL workshops and that tech-friendly people fix quickly. Autodidact users have not been warned of this.
 
:::::But it does not explain why experienced users such as DSwissK and Julien's friend have such noise. So I'am confused.
 
:::::DSwissK, did you tried alternative microphone settings, with lower volume ? That you are not recently speaking louder or a changes you did not notice previously ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 22:02, 25 April 2021 (UTC)
 
:Hello {{u|Yug}}, I concede that the difference may be minimal on some records. You have to listen carefully, it's like "a diamond on a vinyl which jumps on a dust". Some files are more affected than others (depending on the vocal intonation), but all of the ones I have cited are problematic. To fully understand, you can try recording with Schtooka (former LiLi), then immediately redo the same recording on LiLi. As I said to Hélène, you can also compare with an existing recording {{Q|499309}}. Cordially. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 15:12, 26 April 2021 (UTC)
 
::{{ping|Eihel|Julien Baley}} I'am officially deaf from one ear so I'am not the best judge on audios. I pushed the review as far as I can do bu could other users help to review more audios so Mr. Vion can attack this investigation with clean clues and ratios. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:15, 26 April 2021 (UTC)
 
:::{{ping|Yug}} I'm very happy to help review some recordings, if you want; could you suggest a list of users? (I don't know how to find users that have recently recorded). [[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baley|talk]]) 17:41, 26 April 2021 (UTC)
 
::::{{Ping|Julien Bale}} process added below. Thank you ! Note: the user I review (all those below) may have higher noise ratio since don't have a musical ear. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:56, 26 April 2021 (UTC)
 
:::::{{Ping|Yug}}; I've checked the entire table and added a few people (Hsarazin has only 1 recent recording, so I've amended the "14" that was shown). Some people have 0% problem, some close to 100%... the problems are very characteristic. [[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baley|talk]]) 19:25, 26 April 2021 (UTC)
 
::{{Ping|Pamputt|DSwissK}} & others, I really need help on this one. We need to review and report 10+ recording for each user uploading audios to Commons and likely to send a custom message to each affected user, on their talk page and on their Commons' talk page (ex [[User_talk:Andreea_Teodoraa|msg]], ex [https://commons.wikimedia.org/w/index.php?title=User_talk%3AAndreea_Teodoraa&type=revision&diff=555617601&oldid=468099121 ping]). [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:36, 26 April 2021 (UTC)
 
:::{{ping|Yug}} not fully helpful but I added a section on [[LinguaLibre:Stats#The most prolific speakers for the current month]], it may help to narrow down to who did recent recordings. Cheers, [[User:VIGNERON|VIGNERON]] ([[User talk:VIGNERON|talk]]) 07:20, 27 April 2021 (UTC)
 
'''/!\''' The dust bug issue is confirmed as core and relatively widespread. I sent an email this morning to Wikimedia France (Adelaide, Remy, Michael) with suggested solutions : immediate, restoring a sitenotice ribon to inform our users ; short term, hiring Vion for analysis and possibly a fix. We should not be claiming to be back online and on our feet when we arent. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:09, 27 April 2021 (UTC)
 
:Good. The CSS fixes have been deployed. → Sitenotice is back. → Indentation is back. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:11, 27 April 2021 (UTC)
 
:{{Ping|WikiLucas00|DSwissK}} hi,
 
:Given you are the two active users having this issue we need you most.
 
:Could you record 15~30 other audios with another Web browser, such as Firefox or else. Then report the result with this ?
 
:If you have any other hypothesis to test I'am interested. (Changing microphones, etc.) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:23, 27 April 2021 (UTC)
 
::I had the impression (and DSwissK confirmed on Discord) that using Firefox slightly reduces the amount of problems encountered. — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 19:53, 27 April 2021 (UTC)
 
:::Yup, I installed Firefox and could finally send some more audios (me and my daughter), with internal microphone on my laptop. Please review. [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 00:45, 28 April 2021 (UTC)
 
:::{{ping|Yug}} I checked with Andreea_Teodoraa and Natschoba what browser they're using: Chrome and Safari. I asked Andreea_Teodoraa to try Firefox, she did 22 recordings (https://commons.wikimedia.org/wiki/Special:ListFiles?limit=20&user=Andreea+Teodoraa) and 20 are clearly perfect, and 2 (însene and "pe scurt" I feel I hear a problem, but cannot see anything in Audacity). Considering we were on 75% bug on Chrome, this seems to be a move in the right direction. [[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baley|talk]]) 02:33, 30 April 2021 (UTC)
 
:::{{ping|Yug}} Have tried with another friend (https://commons.wikimedia.org/w/index.php?title=Special:ListFiles&limit=100&user=LangPao) and everything sounds bug-free, both on Chrome and Firefox; Firefox is the most recent 10). [[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baley|talk]]) 13:11, 30 April 2021 (UTC)
 
::::(Answered below on 15:16, 4 May 2021 [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:48, 4 May 2021 (UTC))
 
I think that could raise your interest : same smartphone, same internal microphone, same list (1 word). The only difference is [https://upload.wikimedia.org/wikipedia/commons/archive/f/fb/20210501160233%21LL-Q150_%28fra%29-DSwissK-g%C3%A9n%C3%A9ralement.wav using Chrome] and [https://upload.wikimedia.org/wikipedia/commons/f/fb/LL-Q150_%28fra%29-DSwissK-g%C3%A9n%C3%A9ralement.wav Firefox version]. [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 19:20, 1 May 2021 (UTC)
 
:{{Ping|Julien Baley|DSwissK}} thank to you both. The recent [[A/B testing]] where only one parameter is changed is what we look for. Testing same users with different browser seems fruitful. Thanks also to Julien for your audacity inspections, our dev will eventually have to dig into that.
 
:@DSwissK, from your 2 example i see mainly a difference in volume (dB). It may be nothing, but when reviewing audios I also noticed that many seemed to be low dB. Could it be that Chrome changed it's default audio recording levels, which increase the presence of noise ? In that cases other projects like [https://forvo.com/ Forvo] (fake open license) and others should also be affected.
 
:Anyway, if a recent Chrome version was corrupted, maybe we could recommend to use Firefox for a while. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:16, 4 May 2021 (UTC)
 
::@Yug there is indeed a difference in volume but the problem is not the noise but the clicks. There is more noise in the Firefox version, but it isn't disturbing. At least, not as much as these clicks... [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 18:29, 4 May 2021 (UTC)
 
:::Is there any chance it is related to the versions of Firefox or Chrome? I guess people upgraded their browser versions in the recent months – if I understand correctly there were a few issues before the OVH fire; perhaps more people upgraded since. (Personnally I hardly hear the issue except when there is a loud click, I don’t have an ear as developed as others here.) [[User:Seb35|Seb35]] ([[User talk:Seb35|talk]]) 21:05, 4 May 2021 (UTC)
 
  
I reinstalled the LinguaRecorder demo on '''https://lingualibre.org/demo/sandbox.html''' with the settings identical to the RecordWizard extension (on the gear on the 'Studio' (4th) step and [https://github.com/lingua-libre/RecordWizard/blob/master/modules/vue/rw.vue.studio.js#L23-L33 here in the PHP+JS code]). You can play with the settings, perhaps there is something to move around the saturation? (You have to click on "Apply new options" then "start" when you change one, and the "ready" counter should be incremented.) [[User:Seb35|Seb35]] ([[User talk:Seb35|talk]]) 20:54, 4 May 2021 (UTC)
+
Bonjour,
  
=== Limiting the number of words to record ===
+
J'avais fait la demande pour l'ajout de "Teochew dialect" il y a quelques années lors de mes premiers essais. Cependant, il paraît plus pertinent de juste laisser "teochew" tout court sans le mot dialecte. Serait-il possible de faire ce changement.
{{Ping|Yug|DSwissK|VIGNERON|Seb35|Pamputt|Titodutta}} I think that one important cause of the bugs is related to the RAM. Thus, loading a long list into the Record Wizard results in a maximum amount of bugs in the recordings (the length of this list -- its weight -- may vary, depending on the user's hardware and software).
 
  
I think we should try limiting (to 100 or 200 maximum) the possible number of words to be put into the Record Wizard, at least temporarily. There is no point in loading into the RW lists that are 1000-words long; taking a little break during the recording is never wrong, and it could help reducing the amount of bugs for the moment, while we try to find the source of the issue.<br/>Best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 19:53, 27 April 2021 (UTC)
+
[[User:Assassas77|Assassas77]] ([[User talk:Assassas77|talk]]) 19:41, 7 May 2023 (UTC)
:We have to test this hypothesis. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 21:35, 27 April 2021 (UTC)
+
:{{Done}} Solved [https://lingualibre.org/index.php?title=Q4465&type=revision&diff=912499&oldid=477865 here] by [[User:Assassas77]] ! It's a wiki :) [[User:Yug|Yug]] ([[User talk:Yug|talk]])
::Tested and reporting : I used very small lists (less than 10 words) and still have the same issue. I encounter that bug on my smartphone, both my computers (desktop and laptop) under Chrome (latest version). Using internal or external microphone doesn't change anything. [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 00:42, 28 April 2021 (UTC)
 
:::{{ping|DSwissK}} thank you. This is helpful. Seems clearly software issue. I contacted Wikimedia France and Vion requesting them to jump in.
 
:::We need people with audio software skills to inspect those audios and people with JS+audio skills to review the audio input chains. Mr. Vion has both skills. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:52, 28 April 2021 (UTC)
 
:::I do not think it's RAM related.
 
:::Even with 1000 words we are dealing with 1000 words x 7KB per file = 7 MB.
 
:::Let's admit the browser stores the words in a very, very details-rich way, so the files are 1000 times heavier. We still are 7GB.
 
:::Most computers have 8~16GB of RAM by now.
 
:::I also recorded small list and apparently add the issue.
 
:::Most (all?) users affected had recorded few dozens words. Worst affected users: Natschoba → 149, Andreea Teodoraa → 247, WikiLucas00 → 64.
 
:::All but 3 users [[LinguaLibre:Stats#The most prolific speakers for the current month|this month]] have recorded less than 300 words. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 11:02, 28 April 2021 (UTC)
 
:Folks, I inspected our Github codes:
 
:* [https://github.com/lingua-libre/RecordWizard/commits/master RecordWizard MediaWiki extension (php)] – some recent non-audio-stream changes.
 
:* [https://github.com/lingua-libre/LinguaRecorder/commits/master LinguaRecorderJS] – no changes this past year.
 
:I can't find a clear recent change which could have affected our audios recording stream.
 
:{{Ping|VIGNERON|Seb35}} are you aware of any (environmental) change which could have had affected the audio stream of RecordWizard recently ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 07:57, 29 April 2021 (UTC)
 
::I am still in the process of properly publishing code from the server to Github and Gerrit for the various extensions, but there is indeed '''no change related to audio'''.
 
::Specifically the [https://github.com/lingua-libre/LinguaRecorder/commits/master LinguaRecorderJS] is very exactly what was installed in 1.31 and in 1.35, no change here (on the server there is only a micro-instruction to register the LinguaRecorderJS in MediaWiki environment)
 
::For the RecordWizard, main changes are maintenance, a technical thing about serialization of Wikibase items, and related to interface (vue.js, which changed from 2.6.11 to 2.6.12, which is mainly a [https://github.com/vuejs/vue/compare/v2.6.11...v2.6.12 security release]).
 
::[[User:Seb35|Seb35]] ([[User talk:Seb35|talk]]) 19:46, 4 May 2021 (UTC)
 
  
{{ping|VIGNERON|Seb35|Pamputt|Yug|Poslovitch}}<br/>
+
== MediaWiki:Lang/* ==
'''Update''': Another user ({{u|Le Commissaire}}) reported an audio bug (on WMFr Discord server). This was not the "click"/"pop" bug, but the speeding-up bug, ''but'' the user told that the bug occurred when loading a list of 1000 words into the RW. I suggested him to try loading a shorter list, he tried with 250 words and it worked fine, no issue. This constitutes another clue that RAM is important/long lists are a problem for several users in the RW.<br/>
 
In addition to a ''potential limitation of the RW to 350 words'' (for example), see this related ticket:
 
*[[phab:T276014|T276014]], Feature request to be able to load parts of lists in RW (only possible for Categories at the moment)
 
<br/>Best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 15:09, 6 May 2021 (UTC)
 
:Worth investigating. I made assumption of 7kB per word, but the audio strean could be completly different from my assumption. Natural path would requires to call back Mr. Vion or User:0x010C to investigate (none currently active), or to dive into [https://github.com/lingua-libre/LinguaRecorder/commits/master LinguaRecorderJS], the navigator's memory, and Ram. Maybe more. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:41, 6 May 2021 (UTC)
 
  
=== Review process ===
+
What are the MediaWiki:Lang/* messages for? For example, [[MediaWiki:Lang/awa]]? It looks like they mostly just repeat the language code in the content. --[[User:Amire80|Amir E. Aharoni]] ([[User talk:Amire80|talk]]) 07:21, 24 May 2023 (UTC)
{{collapse|1=
 
'''To review recordings by another user :'''
 
# Go to [[Special:RecentChanges]] > Find recent recordings > Pick an user which is not already in the table below
 
# Open 10~20 of this user's recent recordings > Listen each > Count how many have unusual audio artifacts
 
# Add this user to the table below with its associated results and your comment
 
# If you feel necessary, please notify the user on Lili (ex [[User_talk:Andreea_Teodoraa|msg]]) and ping the user on Commons (ex [https://commons.wikimedia.org/w/index.php?title=User_talk%3AAndreea_Teodoraa&type=revision&diff=555617601&oldid=468099121 ping])
 
  
'''To be reviewed :'''
+
== Where are the Greek recordings? ==
# With your usual web browser, go to [[Special:RecordWizard|Record Wizard (studio)]] > Step 3, enter your web browser name then 15 words in your language > Record, publish.
+
According to the statistics page there are 130 recordings of the Greek language (Q205, ISO: gre). However there is no category [[commons:category:Lingua Libre pronunciation-gre]] defined or any recordings added to this category. There is a category [[commons:category:Lingua Libre pronunciation-ell]], but it is empty. What happened to the 130 Greek recordings? [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 20:16, 9 June 2023 (UTC)
# Come on [[LinguaLibre:Chat room#Reviews-ready]] > Post a message with your web browser, its version [optional], and your OS.
+
:Hi {{u|Olaf}}, for unclear reason (probably historical reason), it seems that all Greek recordings are categorized in [[c:Category:Lingua Libre pronunciation-other (Q9129)|Category:Lingua Libre pronunciation-other]]. We have to move all these recordings in the [[c:category:Lingua Libre pronunciation-gre|good catagory]] (I do not know if Commons has a some automatic tool for such job). And also redirect [[commons:category:Lingua Libre pronunciation-ell]] to [[c:category:Lingua Libre pronunciation-gre]]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:24, 10 June 2023 (UTC)
 +
::Hi {{u|Pamputt}}. This happened because in [[wikidata:Q9129#P220]] both ISO 639-3 codes are deprecated, and [https://doc.wikimedia.org/Wikibase/master/php/docs_topics_lua.html#mw_wikibase_entity_getBestStatements entity:getBestStatements] function, used in [[commons:Module:Lingua Libre record#L-46]], doesn't accept deprecated entries, so the module can't get the language code and falls back to "other" category. We could change the Wikidata entry and the files would be moved automatically. However code "gre" must stay deprecated, because it is unclear if it refers to ancient or modern Greek. It would be better to promote "ell" to normal entry. Then changes in [[Q205]] would be also needed. It looks like bulk moving Lingua Libre recordings around doesn't require admin rights, so I can fix this issue if you agree to change the Greek language code to "ell" instead of "gre". [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 08:46, 10 June 2023 (UTC)
 +
:::Hi {{u|Olaf}} thank you for your investigation. So, I have modified {{Q|205}} to fix the issue on the Lingua Libre side. For Wikimedia Commons, you can go ahead. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 08:11, 18 June 2023 (UTC)
 +
::::Thanks, {{u|Pamputt}}. It's not as easy, as I thought. Setting Greek ISO 639-3 code to normal from obsolete creates constraint validation with Modern Greek with the same code. In fact, LinguaLibre shouldn't record Greek words as Greek ([[Wikidata:Q9129|Q9129]]) but rather as Modern Greek ([[Wikidata:Q36510|Q36510]]). In fact Modern Greek is also defined in LinguaLibre: [[Q279]]. [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 13:26, 18 June 2023 (UTC)
 +
:::::If I understand correctly, the easiest way to manage this case would be to delete {{Q|205}}, so that no one can record in "this language" and thus select only {{Q|279}}. If so, I would require to replace all Lingua Libre statements that use {{Q|205}} by {{Q|279}}. There is currently [https://lingualibre.org/index.php?title=Special:WhatLinksHere/Q205&namespace=0&limit=500 137 items] that use {{Q|205}}, so I think it is manageable by hand. {{u|Olaf}}, what do you think about this "workaround"? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:48, 18 June 2023 (UTC)
 +
::::::This would be perfect, it also requires renaming the 137 recordings in Commons, but it can be done. What about the [https://lingualibre.org/datasets/ datasets] to be downloaded from LinguaLibre, will they change automatically? [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 21:08, 18 June 2023 (UTC)
 +
:[[User:Olaf|Olaf]], [[User:Pamputt|Pamputt]], I had nearly similar case with Chinese ISOs zho vs cmn. I have about 186 zho items (see [[Help:SPARQL_for_maintenance#.E2.9C.85_Recordings_.E2.86.92_With_ISO-639-3_.60zho.60_to_change_to_.60cmn.60|Help:SPARQL for maintenance]])]] which have the wrong iso. My plan is :
 +
:* to delete those audios, very simply, on both Lingualibre and Commons. The alternative would be to edit them all on both sites.
 +
:* to [https://lingualibre.org/index.php?title=Q130&type=revision&diff=691521&oldid=444378 discourage recording] or delete that Lili Qid.
 +
:so I may work on those audio, some day... [[User:Hugo en résidence|Hugo en résidence]] ([[User talk:Hugo en résidence|talk]]) 17:36, 18 June 2023 (UTC)
 +
::I don't like deleting good recordings as a way of dealing with wrong categorization. Moreover some of them are probably in use, because Olafbot might have added them to Polish Wiktionary. If there is no other option, just leave them where they are in Commons, and remove Greek from Lingua Libre alone in favor of Modern Greek. But I think Pamputt's solution is better. [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 21:08, 18 June 2023 (UTC)
 +
:::[[USer:Olaf]], I don't like either. But 186 recording is about 8 minutes work, and it have been confusing us for 3 years. Do point to that. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 19:35, 20 June 2023 (UTC)
 +
::::Deleting 186 recordings is about the same amount of time as modifying the language statement. This is manageable by hand and I would prefer not to delete them. I do not have time for now but I will try to do it before the end of the month. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 11:47, 21 June 2023 (UTC)
  
'''To be reviewed, recording with another browser or device :'''
+
== Any Recording limitation in Lingua Libre ==
# With your usual web browser, go to [[Special:RecordWizard|Record Wizard (studio)]] > Step 3, enter your web browser name then 15 words in your language > Record, publish.
 
# Come on [[LinguaLibre:Chat room#Reviews-ready]] > Post a message with your web browser, its version [optional], and your OS.
 
# Add some information so we know which of your recording are associated with this alternative browser or device.
 
| title = Click to see the review process
 
}}
 
  
=== Review-ready ===
+
Hello,I want to know any recording limitation in Lingua Libre. Because I'm planning a screen-cast in Tamil language. If anyone know please reply. Thank you [[User:Sriveenkat|Sriveenkat (🎤) ]] ([[User talk:Sriveenkat|talk]]) 11:11, 1 August 2023 (UTC)
* I recorded 10+ audios with Chrome 89.0.4389.114 (Official Build) (64-bit) : <s>all good for me, no review needed</s>. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:35, 27 April 2021 (UTC)
+
:I you are not an [[c:Commons:Patrol#Autopatrol|autopatrolled user]] on Wikimedia Commons, then you cannot upload more than 380 audios per 72 minutes. If you want to record more words within this timeslot, then you should request for [[c:Commons:Requests_for_rights#Autopatrol|this right]]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 14:15, 1 August 2023 (UTC)
::{{ping|Yug}} Could you try 20 more with an up-to-date version of Chrome? — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 18:38, 27 April 2021 (UTC)
+
::Hi, {{ping|Pamputt}}, I don't record 380 audios within 72 minutes. I'm planning to create screen-cast tutorial video in Tamil language. So I ask this question. Thank you for your reply [[User:Sriveenkat|Sriveenkat (🎤) ]] ([[User talk:Sriveenkat|talk]]) 14:35, 1 August 2023 (UTC)
:::{{ping|WikiLucas00}} Done. I'am not sure, but I may have the bug as well. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 19:42, 27 April 2021 (UTC)
 
::::{{ping|Yug}} The majority of [https://commons.wikimedia.org/w/index.php?title=Special:ListFiles/Yug&ilshowall=1 your last recordings] contain at least a click. — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 19:56, 27 April 2021 (UTC)
 
  
=== Samples ===
+
== Exclusion list for generators? ==
:''Under investigation: Some contributors experience parasitic saturation (“Pock!”) or dust while other don't.''
 
:''Please review your recent recordings and help expand table below so we can identify a recurring pattern among affected contributors vs non-affected ones.''
 
{| class="wikitable"
 
! ||Username || # reviewed || % affected || Example file || Web Browser + version || Comment
 
|-
 
| [[Special:Contributions/DSwissK|c]] || [[User:DSwissK]] || 15 || 33% (5) || [[File:LL-Q150 (fra)-DSwissK-gratter.wav]] <br> [[File:LL-Q150 (fra)-DSwissK-béquille.wav]] <br> [[File:LL-Q150 (fra)-DSwissK-avec.wav]] ||  || New echo bug?
 
|-
 
| [[Special:Contributions/Natschoba|c]] || [[User:Natschoba]] || 20 || 95% (19) || [[File:LL-Q188_(deu)-Natschoba-der_Wunsch.wav]]<br>[[File:LL-Q188 (deu)-Natschoba-Anspruch erheben.wav]]<br>[[File:LL-Q188 (deu)-Natschoba-der Unfall.wav]]<br>[[File:LL-Q188 (deu)-Natschoba-der Teil.wav]] || ||  Several thousands of recordings before. No hardware change.
 
|-
 
| [[Special:Contributions/Andreea Teodoraa|c]] || [[User:Andreea Teodoraa]] || 11 || 75% (8) || [[File:LL-Q7913 (ron)-Andreea Teodoraa-muscă.wav]]<br>[[File:LL-Q7913 (ron)-Andreea Teodoraa-otravă.wav]]<br>[[File:LL-Q7913 (ron)-Andreea Teodoraa-ofițer.wav]] || || Several thousands of recordings before. Tried different mics and platforms, same behaviour.
 
|-
 
| [[Special:Contributions/GeoMechain|c]] || [[User:GeoMechain]] || 15 || 0% (0) ||  || || 
 
|-
 
| [[Special:Contributions/ClasseNoes|c]] || [[User:ClasseNoes]] || 15 || 0% (0) ||  || || 
 
|-
 
| [[Special:Contributions/Hsarrazin|c]] || [[User:Hsarrazin]] || 14 || 30% (4) || [[File:LL-Q150 (fra)-Hélène (Hsarrazin)-corné.wav]]<br>[[File:LL-Q150 (fra)-Hélène (Hsarrazin)-Bellevigne-les-Châteaux.wav]]<br>[[File:LL-Q150 (fra)-Hélène (Hsarrazin)-Saint-Sylvain-d’Anjou.wav]] ||  ||
 
|-
 
| [[Special:Contributions/ᱥᱟᱹᱜᱩᱱ ᱗|c]] || [[User:ᱥᱟᱹᱜᱩᱱ ᱗]] || 2 || 100% (2)|| [[File:LL-Q33965 (sat)-ᱵᱳᱫᱤ ᱵᱟᱥᱠᱤ (ᱥᱟᱹᱜᱩᱱ ᱗)-ᱢᱟᱨᱥᱟᱞ.wav]]<br>[[File:LL-Q33965_(sat)-ᱵᱳᱫᱤ_ᱵᱟᱥᱠᱤ_(ᱥᱟᱹᱜᱩᱱ_᱗)-ᱠᱟᱹᱢᱤ.wav]] ||  || Only 2 audios.
 
|-
 
| [[Special:Contributions/Zoyahssn|c]] || [[User:Zoyahssn]] || 2 || 100% (2) || [[File:LL-Q1860 (eng)-Md Anan Islam (Zoyahssn)-Md Anan Islam.wav]] || ||  Suspects: Hardware & sound setting issue
 
|-
 
| [[Special:Contributions/Olaf|c]] || [[User:Olaf]] || 15 || 0% (0) || — ||  || All recent recordings ok. <small>(I have these clicks in every recording session, but I remove all such occurrences during the review phase. Only because of this it's 0%.[[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 23:44, 1 May 2021 (UTC))</small>)
 
|-
 
| [[Special:Contributions/WikiLucas00|c]] || [[User:WikiLucas00]] || 60 || 75% (45) || [[File:LL-Q150 (fra)-WikiLucas00-chapeaux.wav]]<br/>[[File:LL-Q150 (fra)-WikiLucas00-apologétique.wav]]<br/>[[File:LL-Q150 (fra)-WikiLucas00-sacerdotal.wav]]<br/>[[File:LL-Q150 (fra)-WikiLucas00-érythrocyte.wav]] || Brave 1.23.73 (Chromium: 90.0.4430.85)  || See [https://commons.wikimedia.org/wiki/Special:ListFiles/WikiLucas00 my 2021-04-26 10pm CEST series]
 
|-
 
| [[Special:Contributions/WikiLucas00|c]] || [[User:WikiLucas00]] || 300 || 0% (0) || All files are OK || Firefox 88.0.1, External microphone  || Perfectly fine. See [https://commons.wikimedia.org/wiki/Special:ListFiles/WikiLucas00 my 2021-05-06 9am CEST series]
 
|-
 
| [[Special:Contributions/Le Commissaire|c]] || [[User:Le Commissaire]] || ?? || ?% (?) || || Opera, Desktop Computer, External microphone  || Speed-up bug occurred when loading a 1000-words-long list into RW. Tried with loading only 250 words and recording again, went fine.
 
|}
 
  
== Publish on Wikimedia Commons ==
+
Hello, if there isn't a feature like this somewhere already, I propose a per-user blacklist of sorts, which would allow users to select words which would be excluded when you choose one of the generator options to generate words. I'm currently going through a list of words in a Wiktionary category, and I'm confronted with a growing list of words that I can't deal with because they aren't suitable for pronunciation (e.g. particles that surround other arbitrary words), or they're just homophones of something I've already recorded, etc. What would be necessary, techniaclly, in order to make this happen? [[User:Kiril kovachev|Kiril kovachev]] ([[User talk:Kiril kovachev|talk]]) 12:39, 10 August 2023 (UTC)
 +
:Hi {{u|Kiril kovachev}}, I have opened a [[phab:T344221|Phabricator ticket]] for this request. If you know Javascript, you may have a look to the [https://github.com/lingua-libre/RecordWizard code] to propose a patch. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 05:52, 15 August 2023 (UTC)
  
Hello, I just tested, but my records are not published on Commons. My tests: on Firefox, then on Chrome, with 50, then with 1 expression (s), with license CC3.0-BY-SA and CC1.0. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 06:51, 2 May 2021 (UTC)[[File:LiLi April 2021 - Publish on Wikimedia Commons.png|thumb|Problème de publication sur Wikimedia Commons]]
+
== Barnstar Award Template ==
:[[phab:T281636]] —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 07:10, 2 May 2021 (UTC)
 
:: Usually I have the same with the first two recordings in a session. Then I can upload them again at the end. Try again with more recordings, and using "retry filed upload" button. [[User:Poemat|Poemat]] ([[User talk:Poemat|talk]]) 08:07, 2 May 2021 (UTC)
 
::: Yup, I had this bug many times. (I say "had" because I don't remember having encountered it after the fire incident.) Just don't give up and it should be published eventually. [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 11:56, 2 May 2021 (UTC)
 
::::(As of 3 May 2021 and as I checked, I'm not aware of any code changes ([https://github.com/lingua-libre/RecordWizard/commits/master history]) which may have of affected this. Seb35 made some other code change this same day.) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:47, 3 May 2021 (UTC)
 
I add a user who has the same problem: {{u|Le Commissaire}}. —[[User:Eihel-LiLi|Eihel-LiLi]] ([[User talk:Eihel-LiLi|talk]]) 15:33, 6 May 2021 (UTC)
 
:::::Bonjour {{ping|Seb35}}, Faudrait voir avec {{u|Le Commissaire}} si le problème persiste aussi (avant de clore le ticket Phab. Sincères salutations. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 10:01, 4 June 2021 (UTC)
 
::::::J’ai mis un message à Le Commissaire sur sa page de discussion.
 
::::::Le problème que vous avez eu était spécifique à votre compte, c’est peut-être arrivé à d’autres personnes mais ça semble assez rare. Aussi, à partir du moment où un utilisateur a réussi à faire un envoi vers Commons, alors c’est un problème différent du vôtre ([[:phabricator:T275957|celui-ci, qui ressemble mais l’erreur est intermittente]]). Plus globalement, il faudrait que le message d’erreur soit explicite plutôt que d’aller à chercher dans la console du navigateur, je vais ouvrir un ticket Phabricator en ce sens. [[User:Seb35|Seb35]] ([[User talk:Seb35|talk]]) 10:28, 4 June 2021 (UTC)
 
  
== Translation admins ==
+
There is any Barnstar Award Template for Lingua Libre? [[User:Sriveenkat|Sriveenkat (🎤) ]] ([[User talk:Sriveenkat|talk]]) 07:06, 13 September 2023 (UTC)
 +
:There are [[Template:50k barnstar]] and [[Template:Speaker of the month]] and maybe other. [[User:WikiLucas00|WikiLucas00]] may know other barnstars. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 21:11, 13 September 2023 (UTC)
 +
::{{ping|Pamputt|WikiLucas00}} Ok Pamputt, I want give barnstar award for Some Beginner Speakers. It will be a motivating for them. Am I right?[[User:Sriveenkat|Sriveenkat (🎤) ]] ([[User talk:Sriveenkat|talk]]) 11:46, 14 September 2023 (UTC)
 +
:::Hello {{ping|Pamputt|Sriveenkat}}! Indeed, it would be a nice idea to offer awards for beginners, such as a barnstar for passing 1000 recordings for example. All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 16:08, 16 September 2023 (UTC)
  
I updated [[phab:T262855|this ticket]], explaining our need of translation admins. I'm espacially thinking of {{u|Sabelöga}} and {{u|Eihel}}, who have the skills and the needs to get this rights (e.g. [[Special:Diff/484898|here]]).<br/>
+
==1,000,000th ==
If the community agrees, we can ask the developper team currently working on the project to implement this new status into Lingua Libre, and we will then be able to elect new translation admins on LiLi.
+
* N  ! 08:38 కంటగిల్లు (Q1094614)‎ diffhist +3,648‎ V Bhavya talk contribs block ‎Created a new Item
You can vote by using <code><nowiki>{{Support}} or {{Oppose}}</nowiki></code>.
+
* N  ! 08:38 కంటగించు (Q1094613)‎ diffhist +3,636‎ V Bhavya talk contribs block ‎Created a new Item
<br/>All the best, — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 12:21, 4 May 2021 (UTC)
+
* '''N  ! 08:38 కంటకితము (Q1094612)‎ diffhist +3,636‎ V Bhavya talk contribs block ‎Created a new Item'''
:Hello [[User:WikiLucas00|WikiLucas]], Especially since the tvar translation variables have just changed. —[[User:Eihel-LiLi|Eihel-LiLi]] ([[User talk:Eihel-LiLi|talk]]) 16:32, 5 May 2021 (UTC)
+
* N  ! 08:38 కంటకుడు (Q1094611)‎ diffhist +3,624‎ V Bhavya talk contribs block ‎Created a new Item
=== Vote ===
+
* N  ! 08:38 కంటక (Q1094610)‎ diffhist +3,588‎ V Bhavya talk contribs block ‎Created a new Item
* {{Support}} (proposer) — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]]
+
* N  ! 08:38 కంటబడు (Q1094609)‎ diffhist +3,612‎ V Bhavya talk contribs block ‎Created a new Item
* {{Support}} We are are early stage for the communnity, having ''3 active referents for any given administrative task is required'' (see also [[:en:Bus factor]]). It is also necessary to document process as we see them appears, in a concise therefore maintainable way. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:09, 4 May 2021 (UTC)
+
[[User:Yug|Yug]] ([[User talk:Yug|talk]])
*:In this project, the rights associated (example: ''pagetranslation'') with translation administrators are already contained in the administrators. In addition, an administrator can self-grant the right without going through a formal request (on any WM). I therefore think that we are far from the indispensable (wo)man (especially after Strasbourg IMHO). Also, if I want to continue on this project and following the previous section… —[[User:Eihel-LiLi|Eihel-LiLi]] ([[User talk:Eihel-LiLi|talk]]) 16:29, 5 May 2021 (UTC)
 
*::{{ping|Eihel-LiLi}} "Active" [and skilled] is an important word. I'm admin but not active on translations pages. We have about 4 admins truly active this past 6 months, AFAIK only WikiLucas was admin while truly active [and skilled] on pagetranslation. Adding 2+ more is required. Seems on the way. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:59, 6 May 2021 (UTC)
 
*:::And Pamputt too (already TA on WD for example). Cordially. —[[User:Eihel-LiLi|Eihel-LiLi]] ([[User talk:Eihel-LiLi|talk]]) 15:14, 6 May 2021 (UTC)
 
* {{Support}} Agree to ask for this new status. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 15:46, 4 May 2021 (UTC)
 
* {{Support}} Agreed. [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 18:31, 4 May 2021 (UTC)
 
* {{Weak support}} —[[User:Eihel-LiLi|Eihel-LiLi]] ([[User talk:Eihel-LiLi|talk]]) 15:49, 6 May 2021 (UTC)
 
* {{Support}} J’ai confiance. [[User:Lyokoï|Lyokoï]] ([[User talk:Lyokoï|talk]]) 17:57, 10 May 2021 (UTC)
 
* {{Support}} I'm up for it! --[[User:Sabelöga|Sabelöga]] ([[User talk:Sabelöga|talk]]) 18:53, 19 May 2021 (UTC)
 
  
=== Discussion ===
+
== Why Lingua Libre Bot isn't running Wikidata? ==
* I'd rather see [[User:Titodutta|Titodutta]]. —[[User:Eihel-LiLi|Eihel-LiLi]] ([[User talk:Eihel-LiLi|talk]]) 01:20, 6 May 2021 (UTC)
 
:{{ping|Eihel-LiLi}} {{u|Titodutta}} is already an admin on LiLi, which means he has the <code>pagetranslation</code> right. Implementing this ''translation admin'' status would allow us to grant some users the <code>pagetranslation</code> right without granting them all admin rights (like the right to <code>delete</code> pages or <code>block</code> users for instance). — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 07:31, 6 May 2021 (UTC)
 
::Ah OK. I took the most prolific users, but I remembered that [[User:WikiLucas00|you]] and Pamputt are TAs… —[[User:Eihel-LiLi|Eihel-LiLi]] ([[User talk:Eihel-LiLi|talk]]) 15:04, 6 May 2021 (UTC)
 
  
== Browsing the sound library ==
+
{{ping|Poslovitch|Pamputt|WikiLucas00}}Why Lingua Libre Bot isn't running in Wikidata? {{u|Darafsh}} asked about in Wikidata Lexicographical data Telegram Group. What's the problem? Please kindly tell the issue. Thanks-[[User:Sriveenkat|Sriveenkat]] () ([[User talk:Sriveenkat|talk]])  16:12, 6 October 2023 (UTC)
 +
:{{ping|Sriveenkat}} could you point to an Lingua Libre item and a Wikidata item or lexeme that has not received the pronunciation? This will help to test and find what is wrong. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 19:22, 6 October 2023 (UTC)
 +
::Hi {{ping|Pamputt}} Recorded Audios doesn't received in the Wikidata Items and Wikidata Lexemes!. The User {{u|Darafsh}} have recorded some many words for Wikidata Lexeme Project. but never audios added to the Wikidata Lexemes. You can see the [[wikidata:Special:Contributions/Lingua Libre Bot]] The last contribution on 23:49, 9 September 2023. So, Iam just asking run the Lingua Libre Bot on Wikidata. I'm also recorded some words for Wikidata Lexeme Project I waited for some days, But never my audios added to wikidata lexemes. So, I run QuickStatements for Adding My audios.. Now User Darafsh also run QuickStatements for adding he's audios.. I think so many users using Lingua Libre for Automatically adding audios on Wikidata and some wikitionaries. I hope you understand Thankyou Regards [[User:Sriveenkat|Sriveenkat]] () ([[User talk:Sriveenkat|talk]])  05:38, 7 October 2023 (UTC)
 +
:Thanks to {{ping|Sriveenkat}} to start the discussion. If you need some examples, you may see Mazanin's contributions on [https://commons.wikimedia.org/wiki/Special:Contributions/Mazanin Commons]. This is the recorded audio: [https://commons.wikimedia.org/wiki/File:LL-Q9168_(fas)-Mazanin_(%D9%85%D8%A7%D8%B2%D9%86%DB%8C%D9%86)-%D9%87%D9%85%D8%A8%D8%A7%D8%B4%DB%8C.wav] and this is the lexeme entry on Wikidata: [https://www.wikidata.org/wiki/Lexeme:L1010467] but they are not connected yet. [[User:Darafsh|Darafsh]] ([[User talk:Darafsh|talk]]) 12:07, 7 October 2023 (UTC)
  
{{u|Nicolas NALLET}} is currently working on the page that will display the recordings of Lingua Libre, and would like to know the list of filters that we would like to use on this page (e.g. by language, by speaker, by date...)
+
== SiteNotice ==
 +
Hi,<br />Translations are not working for Sitenotice. Install CentralNotice? ―[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 14:31, 7 October 2023 (UTC)
  
Feel free to suggest other filters or give your opinion on suggested filters 🙂 — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 12:58, 20 May 2021 (UTC)<br/><small>(pinging {{ping|Yug|Pamputt|Titodutta}} — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 15:48, 20 May 2021 (UTC))</small>
+
== Global bot status ==
:Great news!
+
Lingualibre Bot has been [https://meta.wikimedia.org/w/index.php?title=Steward_requests/Bot_status&diff=prev&oldid=25702991 approved]. cc {{ping|Pamputt|Poslovitch|WikiLucas00}}. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 12:31, 10 October 2023 (UTC)
:The most obvious ones are, I guess, the following:
+
:Thank you for the request and congrats on the approval! '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 12:40, 16 October 2023 (UTC)
:* by language
 
:* by speaker
 
:* by speaker's language proficiency (beginner, etc.)
 
:* by genre (male, female, etc.)
 
:--[[User:Poslovitch|Poslovitch]] ([[User talk:Poslovitch|talk]]) 13:38, 20 May 2021 (UTC)
 
*Hello {{u|WikiLucas00}} and {{u|Poslovitch}}
 
**by cat (<code>deepcat</code>, <code>incategory</code>)
 
**by coord (<code>nearcoord</code>, <code>boost-nearcoord</code>)
 
**by link (<code>linksto</code>)
 
:The codes in parentheses are those of '''''CirrusSearch''''', an extension that can be added to LiLi. Poslovitch's proposals also have filters contained in '''''WikibaseCirrusSearch''''' (<code>haswbstatement</code>). Tell me what you think of this. Cordially. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 20:36, 20 May 2021 (UTC)
 
::{{ping|Eihel}} could you describe a bit how do you imagine this would work? (since the recordings on Lingua Libre don't have ''cat'' or ''coord'' at all, and could have ''link'' but I couldn't find any examples, I'm a bit confused and would like to know more). Same question for CirrusSearch, we could look into it to see if it can be installed, but what use do you see for it? (the only use I know is for ''WikibaseCirrusSearch''). Cheers, [[User:VIGNERON|VIGNERON]] ([[User talk:VIGNERON|talk]]) 14:42, 26 May 2021 (UTC)
 
:::Code on github please. You may check Forvo and Codepen to find elegant html5 audio element and css. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 22:00, 26 May 2021 (UTC)
 
::::Hello {{ping|VIGNERON}}, The WikibaseCirrusSearch extension requires the installation of the CirrusSearch extension. This means that it does not change much. It is true that my proposals are not very Catholic, but this project will evolve over time. To begin with, this page contains a cat (not all LiLi TPs contain a cat, this should be corrected). However, since you want an example, [https://www.wikidata.org/w/index.php?sort=last_edit_desc&search=all%3A+insource%3A%2F%5C%5B%5C%5BUser%5C%3AVIGNERON%5C%7CVIGNERON%5C%5D%5C%5D%5C+%5C%28%5C%5B%5C%5BUser%5C+talk%5C%3AVIGNERON%2F++insource%3A%2F%5C%5B%5C%5BUser%5C%3AEihel%5C%7CEihel%5C%5D%5C%5D%5C+%5C%28%5C%5B%5C%5BUser%5C+talk%5C%3AEihel%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1&ns120=1&ns146=1 here is one] (the TPs where we both participated with ''insource''). Best regards. [[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 09:54, 4 June 2021 (UTC)
 
::::For example, the lists - which are the way to correctly make a significant number of records - were already numerous before Strasbourg. Now only one language letter appears (a). A search on its history for its own lists is possible knowing how they were recorded. But for example, if I want the lists in French in a search, "List:Fra" is not sufficient, because we only get a part. In the future, categories should be created for lists: by user, by language, by set (from the same record session) and by subject (fruit, animals, etc.). Otherwise it will quickly be insurmountable from a moment. Cordially. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 14:04, 4 June 2021 (UTC)
 
  
== Plans for the next armageddon? ==
+
== ExternalTools - Wikidata Query Service - Recording Indian Actor and Actress Names in Tamil ==
  
Are there any contingency plans implemented after the Big Fire? A regular backup for example? [[User:Poemat|Poemat]] ([[User talk:Poemat|talk]]) 22:49, 24 May 2021 (UTC)
+
{{ping|Yug|Pamputt|WikiLucas00}} I am now interested in Recording Indian Actor and Actress Names in Tamil. So I make a [https://w.wiki/8G6T query], I Input that query url in ExternalTools. A error comes "Result must contain both "id" and "label" field." I think something need to modify on this query. Please anyone help for this. Thanks [[User:Sriveenkat|Sriveenkat]] ([[User talk:Sriveenkat|talk]]) 19:58, 24 November 2023 (UTC)
:{{ping|Poemat}} good question, thanks for asking. There is obviously some plans. I'll let {{ping|Seb35|Nicolas NALLET|Michael Barbereau WMFr}} complete and/or correct me but right now, there is daily backups on a server in an other datacenter. Cheers, [[User:VIGNERON|VIGNERON]] ([[User talk:VIGNERON|talk]]) 12:47, 26 May 2021 (UTC)
+
:{{ping|Sriveenkat}}, [https://w.wiki/8Gev this] works. Please note there is 6982 items if we remove the LIMIT, and I don't how the systems works with such larger list. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 23:13, 25 November 2023 (UTC)
 +
::{{ping|Yug}} Thanks for your reply. The query doesn't works for me :( Error in ExternalTools "undefine" [[User:Sriveenkat|Sriveenkat]] ([[User talk:Sriveenkat|talk]])  06:03, 26 November 2023 (UTC)
 +
:::{{ping|Sriveenkat}}, in Wikifata QS you have to run the query to check if it is working and providing data, if so go to the URL bar, copy that long url. Come back to Lingualibre Step 3, external tool, paste that long url. It worked for me. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 06:00, 27 November 2023 (UTC)
 +
::::{{ping|Sriveenkat}} Sorry, I missed something. On the Query Service bottom right, click "Link" > then on "SPARQL endpoint" : copy this url. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 08:25, 27 November 2023 (UTC)
 +
:::::{{ping|Yug}} Works with copying SPARQL endpoint link. Thank you much. I'm planning to record more proverbs, usage examples, places, persons, Lingualibre is really more comfortable to record it. Thanks Again [[User:Sriveenkat|Sriveenkat]] ([[User talk:Sriveenkat|talk]]) 22:54, 27 November 2023 (UTC)
  
== Request for Mon language Code= mnw ==
+
== Logo redesign propositions ==
{{done}}<br/>
 
Do not have Mon language for this [[File:LL-Q9217 (tha)-咽頭べさ-Mon people.wav]] so I added Thai language I would like to have this problem resolved thanks. <small>message posted by</small> [[User:咽頭べさ]] ([[User talk:咽頭べさ|talk]])
 
  
:Hello again {{ping|咽頭べさ}} thank you for pointing out that Mon language was missing on Lingua Libre! I added it, you should from now on be able to record words in this language 🙂 Please read the message I posted on your talk page before recording new words.
+
I had a bit of fun yesterday contributing to one of my favourite projects in a slightly different way. I've kept the ideas (microphone, wings) and colours of the current logo but made it a bit more polished. I've already taken a few opinions on Discord but I wanted to get a more general opinion. What do you think?
:All the best, — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 16:40, 27 May 2021 (UTC)
 
  
== Celebrating the coming 500k milestone ==
+
Just so you know, I won't be at all offended if the community prefers to keep the current logo, because there are some very good reasons for keeping it (I'm thinking in particular of all the printed materials, the fact that it's simple (easy to draw by hand if we don't have a printer and maybe more "readable" if very small), its declination for sign languages, etc.).
  
Hello {{ping|DenisdeShawi|DSwissK|Eihel-LiLi|Julien Baley|KlaudiuMihaila|Lepticed7|Lyokoï|Olaf|Pamputt|Poemat|Poslovitch|Sabelöga|Theklan|Titodutta|Yug|सुबोध कुलकर्णी}}
+
<gallery style="text-align:center;"  heights="200px"  widths="200px">
 
+
File:Proposition refonte logo Lingua Libre (1).svg|Proposition 1
As you may have seen, we recorded 30,000 pronunciations during the current month (2nd most active month ever), the very first full calendar month since the rebirth of the website, after the datacenter fire that stalled the project for 6 weeks. If we keep a similar pace, we should reach in June the important milestone of 500,000 recordings made on Lingua Libre. That is incredible.
+
File:Proposition refonte logo Lingua Libre (2).svg|Proposition 2
 
+
</gallery>
I wanted to ask you all, '''how do you want to celebrate this milestone?''' Feel free to suggest anything below, and let's try to celebrate it properly 🙂
+
[[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 08:59, 3 December 2023 (UTC)
 
+
:{{Ping|DSwissK}} hello,
All the best<br>— '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 14:33, 27 May 2021 (UTC)
+
:We can add your proposition in the set of logos ideas within a Wikimedia Commons [[:commons:Category:Proposed Lingua Libre logo|Category:Proposed Lingua Libre logo]], for reference later on. But to be honest, good logo design requires design experience, artistic intuition, brand and public awareness, which are harder to gather than it seems. It also must fit a project's phase and branding strategy, when the project needs a new logo and project members willing to shift from the current high visibility logo to a new one. All together changing a logo is not something easy to push for. I made a similar answer [https://github.com/lingua-libre/SignIt/pull/41 here] few month ago about Lingua Libre SignIt. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 12:23, 4 December 2023 (UTC)
: Hi there, I remember registering numbers up to 1399 in French ([[c:File:LL-Q150 (fra)-Poslovitch-1399.wav]]). I abide to get that number up to 4242 once we reach that milestone ! --[[User:Poslovitch|Poslovitch]] ([[User talk:Poslovitch|talk]]) 18:18, 27 May 2021 (UTC)
+
:: {{Ping|Yug}} hi,
::Some kind of reward would be nice, like a star for the home-pedia user page. Or a physical sticker sent by post, similar to what Wikimedia does from time to time. Or an online event of sorts. [[User:KlaudiuMihaila|KlaudiuMihaila]] ([[User talk:KlaudiuMihaila|talk]]) 16:45, 29 May 2021 (UTC)
+
:: Thank you for your input. I appreciate you explaining the complexities - you raise great context I had not fully considered. [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 09:05, 6 December 2023 (UTC)
::: We gather and make an apéro. [[User:Lepticed7|Lepticed7]] ([[User talk:Lepticed7|talk]]) 16:54, 29 May 2021 (UTC)
 
:::: Maybe an online event is the simple to do actualy. What did you think about a Live on Twitch with some guests about Lingua Libre, its history, how people made some very big recording session, how its help describe language, etc… ? [[User:Lyokoï|Lyokoï]] ([[User talk:Lyokoï|talk]]) 10:22, 1 June 2021 (UTC)
 
:::: It's possible to have some budget for celebrating :)[[User:Xenophôn|Xenophôn]]([[User talk:Xenophôn|talk]]) 08:54, 8 June 2021 (UTC)
 
 
 
== Failed to upload on Wiki Commons ==
 
 
 
Hi, I am an editor from Central Bikol Wiktionary. I have tried to record words and it went through. But it has failed to be uploaded on commons. I think it's the second time to happen. This was only after the Lingua Libre has came back. My internet connection is stable so I guess there might be some internal problems. I hope not. [[User:Kunokuno|Kunokuno]] ([[User talk:Kunokuno|talk]]) 14:58, 28 May 2021 (UTC)
 
:Hello {{ping|Kunokuno}} I'm truly sorry that this problem occurred, thanks for warning us about it.
 
:Could you please tell us your current setup (device, browser, microphone)? How many words did you record? Could you try to reproduce the bug with 10 words, and then look at your browser's console (instructions [https://kb.mailster.co/how-can-i-open-the-browsers-console/ here]) to tell us the error message if there is one?
 
:Thank you in advance.
 
:All the best. — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 16:21, 28 May 2021 (UTC)
 
::Hello {{ping|Kunokuno}},
 
::Did you retried and do you stil have the same problem? (there has been some fixes recently, it shouldn't happen anymore but I want to make sure everything is correct right now).
 
::Cheers, [[User:VIGNERON|VIGNERON]] ([[User talk:VIGNERON|talk]]) 08:41, 4 June 2021 (UTC)
 
[[File:Lingua libre error.png|thumb|Lingua libre error]]
 
:::Hello everyone, sorry for the late response. My records are still not getting through to commons. The record was successful, but it cannot be upload on commons. My device is a intel core i5 laptop, browser is google chrome, and I'm using a headset with a built in microphone. I have also tried recording on my phone but it has the same error. I have tried doing the screenshot for the error message, if there's any. Please check here. Sorry, I am not quite knowledgeable on the codes and programming languages. [[User:Kunokuno|Kunokuno]] ([[User talk:Kunokuno|talk]]) 13:53, 18 June 2021 (UTC)
 
 
 
== 500000! ==
 
Lili reached 500 000 recordings. Congratulations to everybody! [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 12:56, 15 June 2021 (UTC)
 
: Congrat dear all speakers! It’s unbelievable! \o/ [[User:Lyokoï|Lyokoï]] ([[User talk:Lyokoï|talk]]) 23:30, 15 June 2021 (UTC)
 
::Indeed, congratulations to all of you, let us go to the million o_O. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:57, 16 June 2021 (UTC)
 
  
== Lingua Libre video tutorial ==
+
== Hebrew diacritics (Niqqud) ==
  
Hi everyone! I made a short video tutorial for Lingua Libre, in French. If you like it, I could create one in English and we could include it in the {{templ|Welcome}} template, to help newcomers.<br/>Here is the video, please tell me your thoughts about it! <small>also available [https://www.youtube.com/watch?v=DYqiJa5QOuM here on YouTube]</small>
+
In Hebrew we use diacritics (Niqqud) to determine how to pronounce the words.
[[File:Tutoriel Lingua Libre.webm|thumb|left|Lingua Libre tutorial in French]]{{clr}}
 
All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 10:04, 23 June 2021 (UTC)
 
  
:I really like it. It is not too long, very clear, etc. So I think it would be a good idea to create one in English. Few remarks:
+
Niqqud is usually common in the following cases:
:* if you create one video in English, is it possible just to make the movie with the interface in English and then to create the text as subtitle (Wikimedia Commons supports subtitles), so that it would be easy to translate the subtitles in several languages (remain the problem of the interface itself in English).
+
# Young kids or people learning the language.
:* on Wikimedia Commons, I think you should write what music is used in the video and where does it come from in order to be sure it is a free-licence music
+
# Formal use.
:Very nice job. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 18:56, 23 June 2021 (UTC)
+
# To distinguish between meanings when the base form is ambiguous.
::Thank you {{ping|Pamputt}}) ! I think I will indeed make a video with the interface in English, with no built-in subtitles as you suggested, and we will then be able to add TimedText subtitles on Commons. I think I'll also make a version with built-in substitles (so basically the same video as here but with everything in English), in order to have a cleaner English version to be post and share on YouTube.
 
::EDIT: I added English subtitles on the French video, to test the functionality, it seems to work well!
 
::Thank you for your remark about the music, I added the information on the file's description.
 
::See you! 🙂 — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 10:08, 24 June 2021 (UTC)
 
  
== Auto-inserting recorded words to Wiktionary ==
+
This is a short example:
 +
* Base form: גזר (GZR)
 +
* Carrot: גֶּזֶר (Gezer)
 +
* Masculine cut: גָּזַר (Gazar)
 +
* Piece: גֶּזֶר (Gezer)
 +
This is the corresponding Wiktionary article: https://he.wiktionary.org/wiki/גזר
  
Hi, I am back after a long hiatus! :) I wanted to ask about auto-inserting recorded words to Wiktionary. Is it possible to automatically insert recorded files into the respective Wiktionary entries if I had imported those words from a specific Wiktionary category? For instance, I did a test batch today from [[:wikt:or:ଶ୍ରେଣୀ:ବାଲେଶ୍ୱରୀ_ଶବ୍ଦ|"ଶ୍ରେଣୀ:ବାଲେଶ୍ୱରୀ_ଶବ୍ଦ"]] from the Odia Wiktionary. The uploaded words do appear on Commons but I need to manually add each recording. Is there a way to automate that?
+
When fetching words from Wiktionary it's better to use the first headers instead of the item names because in many cases the term is ambiguous and the items name is the base form without any pronunciation guidance.
  
My second question is something that I had asked long back - is there a way to change (or choose from two options) the filename. For instance, I would like to use the Commmons convention of "TWO_LETTER_ISO_CODE-WORDNAME.EXTENSION" format (e.g. "or-କଳା.wav"). If there is already a file that exists, then the new file can be "or-କଳା-01.wav". In that way, viewing the words in the Commons category would be easier meaning "or-କଳା.wav" and "or-କଳା-01.wav" will appear close to each other. One can even check which of the recordings is better to use on Wikimedia projects. In the backend you can of course connect the files to your Wikibase by providing unique IDs to each recording.
+
As for Wikipedia etc. sometimes there's a word with the Niqqud inside the article but it will be a bit complicated to parse so we can skip that for now.
  
Hugs of solidarity for your grave loss because of the fire! With everything going on with COVID last and this year, this was horrible! <3 --[[User:Psubhashish|Subhashish]] ([[User talk:Psubhashish|talk]]) 14:45, 24 June 2021 (UTC)
+
== Lights on userrights ==
:Hi {{Ping|Psubhashish}} I'm Lingua Libre Bot's operator. It cannot operate on Wiktionaries on which it has not received the bot flag. Feel free to file a request on [[LinguaLibre:Bot]]. I'm falling behind with the various currently pending requests since I've been the handyman of Lingua Libre on and off, but at some point I'll be able to tackle these ;) --[[User:Poslovitch|Poslovitch]] ([[User talk:Poslovitch|talk]]) 15:23, 24 June 2021 (UTC)
+
Hello all,<br>
:: Hello all! {{ping|Poslovitch}} [[LinguaLibre:Bot#Bot_request_for_.7BOdia.7D_witkionary|done]], please let me know if there is anything that I could do.
+
I bumped again into [[LinguaLibre:User_rights]] and {{tl|Autopatrolled}}. To the extend of my knowledge we have no solution to this and no active user is munitoring this bottleneck. Is this assessment correct ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 21:03, 28 December 2023 (UTC)
  
=== File name ===
+
== A mobile app ==
:Hi {{u|Psubhashish}} regarding the second question about the filename, it has been decided to have only one record by word and by locutor. This means that if you record again the same word, the previous record will be replaced by the new one. Thus, it is possible to correct a bad/wrong pronunciation. Why would you like to record two times the same word? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 17:50, 24 June 2021 (UTC)
 
::{{ping|Psubhashish|Pamputt}} I think that Psubhashish is refering to the historical naming convention (NB there is no actual naming convention on Commons, it is merely some advice for naming files) of pronunciation files on Commons ([[c:Commons:Pronunciation_files_requests#Naming_the_files|see here]]), that was unchanged since 2005 and clearly insufficient. This page was suggesting just to put a 2-letter language code and the pronunced word in the filename, which was problematic as soon as another speaker pronounced the same word (that's why they suggested to add a number if it was the case). I changed this page recently, to advice users to display in the filename at least the language spoken (iso 639-3 if possible), the word written in the language's writing system, an identifier for the user, and a place related to the speaker (the place where they learned the language and/or where they live). Lingua Libre's automatic naming already does that, except for the place of learning/residence (which is for the moment only available on the speaker's element, on Lingua Libre). {{ping|Psubhashish}} I don't understand why you would want to change your filenames for some more reductive ones. The more precise the filename is, the better it is to know information about the speaker! And it is still very easy to search for a precise word, you just have to type the word+.wav in Commons, or the word itself directly in Lingua Libre's searchbar.
 
::All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 18:48, 24 June 2021 (UTC)
 
  
::: {{ping|Pamputt}} and {{ping|WikiLucas00}} I didn't actually mean to create ambiguous filenames based on the older convention. I was worried for the multiple kinds of naming inside the category [[:c:commons:Odia pronunciation]]. The way the files are organized there are or-NAME.extension (e.g. File:Or-ଅନ୍ୟ.wav). What I am proposing is slightly different than how you want to capture the information in the file. I am all for metadata being captured inside the description. In fact, I'd support to add a field to describe the ISO 639-2/639-3 three-letter-codes (e.g. Ori-nor-ଏଇଚି.wav). There is currently no link to the Lingua Libre QID and I'd propose to add that too.
+
I personally think that contributing using a browser is quite dangerous, Firefox on mobile, for example, has a very strict page unloading policy which leads to closing the tab while uploading thus losing the remaining data which wasn't uploaded yet (I found a workaround but it's not perfect), are there any thought about this? (Maybe even expanding the current [https://www.saveriomorelli.com/commonvoice/ CV Project] app by Saverio Morelli?)
  
::: What I was proposing was not to reduce information collected but simplifying the filename. We're struggling at the moment to use a bot, find and search and insert a file from Commons into a Wiktionary entry. I'd love to hear from you all what the issue would be if the file descriptions template ({{t|Lingua Libre record}} contains information such as language name, language ISO (including variation), language Glottocode (which linguists prefer because ISO is faulty. ref. requirements by language archives such as Living Tongues, ELR and Language Archive Cologne), and information about speaker's age range, gender and region (as dialects also vary from region to region, optional field as this is personal data).
+
== Is the Record Wizard not working for anyone else? ==
  
::: The filename, however, can be simpler as using a bot to search for duplicates is hard now for the community because the QID and username are included in the filename. What if all that information, as I explained above, are included in the information below in the template and the file name can be the ISO 639-1 (for standard spoken forms or macrolanguages) or ISO 639-2 or 639-3 (for dialects/variations)? As I had explained in my previous comment, nor-NAME.wav and nor-NAME-01.wav will appear close to eachother because of alphabetical sorting. An average user without the knowledge of bots can even manually test the quality of recordings if they are using files on different Wikimedia projects. Can at least this be piloted for one language? --[[User:Psubhashish|Subhashish]] ([[User talk:Psubhashish|talk]]) 02:12, 25 June 2021 (UTC)
+
My mic works with [https://mictests.com/ mictests.com], but [https://lingualibre.org/wiki/Special:RecordWizard the RecordWizard] doesn't pick anything up at the "check your microphone" stage. I've tried on both my phone and my laptop, and I can record sound in both cases, and I have the appropriate permissions enabled, but this particular website isn't detecting sounds. Is anyone else having this kind of problem? [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 23:43, 24 February 2024 (UTC)
 +
:Hello [[User:Grendelkhan]],
 +
:I just received a second such report. User also checked [https://mictests.com/ mictests.com] sucessfully.
 +
:On Firefox, Lingua Libre recording studio step 4, the microphone is allowed (we see the red microphone image on the left of the URL address). But after clicking the record button, no recording occurs.
 +
:* Mictests on other site : successful.
 +
:*Device: Notebook
 +
:*OS: ?  
 +
:*Browser: Firefox, Chrome.
 +
:*User: [[User:Akamycoco]].
 +
:*Languages affected: all.
 +
:*Dates : Worked on February 28. Stopped working on February 29.
 +
:Let's starts an investigation. Could you let me know your OS and precise web browser version ? (Help > About Chrome or similar)
 +
:Let me know as well if you have basic developer skills to Right-click on the staled page > Inspect > Console : are there any error message ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 07:55, 1 March 2024 (UTC)
  
I have created a sub-section just to make clearer the discussion. I am completely lost. Currently, the files created on Lingua LIbre are all named such as [[c:File:LL-Q33810_(ori)-Psubhashish-ଫସ୍କା.wav|File:LL-Q33810_(ori)-Psubhashish-ଫସ୍କା.wav]], which mean File:LL-QID (LANGUAGE_CODE)-(LOCUTOR NAME)-WORD.wav, with QID the identifier of the language on Wikidata<s>identifier of the recording on Lingua Libre</s>, LANGUAGE CODE can be either two or three letters (ISO 639-3) if there is no 2 letters code for the language, LOCUTOR NAME, the name of the person who record on Lingua Libre and WORD the word that has been recorded. So could you give us an example pointing to a file that has not a suitable name from your point of view? I think it will help me to get your point. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 06:13, 25 June 2021 (UTC)
+
::My laptop is using Google Chrome <tt>122.0.6261.94 (Official Build) (64-bit)</tt> on Linux (Debian Testing). No error messages in the console when I attempt the recording. My phone is using Chrome <tt>122.0.6261.90</tt> on Android 14 on a Pixel 5a. It ''does'' seem to work on Firefox <tt>115.7.0esr (64-bit)</tt> on my laptop. (I really should have checked that before.) So maybe this is solely a Chrome problem? [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 16:30, 2 March 2024 (UTC)
:{{ping|Pamputt}} Watch out. The QID is not the recording's, it's the language's Wikidata QID ;) --[[User:Poslovitch|Poslovitch]] ([[User talk:Poslovitch|talk]]) 08:13, 25 June 2021 (UTC)
 
::Indeed :) Thank you, I correct in my previous message. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 08:34, 25 June 2021 (UTC)
 
::: Hello all, what meant to say is I understand that you have a convention for LL. But I personally do not want my username of the QID of the language or too many signs or even blank spaces. All of that are a problem when it comes to a few thousand recordings by multiple authors where the same word recorded by different people do not even appear close to each other in a sorted list. As I had written earlier, metadata can be better captured in a more formatted way inside Commons and you're capturing it even better inside the Wikibase of LL. The question is whether the file name should have all the metdata or can it have even the most essential metadata. The username is irrelevant in a filename. If I click a picture of the Eiffel Tower or the Taj Mahal, my username appearing in the filename can only indicate a copyright owner pride. :D QID is a Wikimedian's paradise. It makes no sense to a common user. Entries on Commons are not just for use by Wikimedians but for the larger public. An ISO code (or a Glottolog ID) does this job (though one can argue that not all the people understand ISO codes). The three letter ISO code would address the language-dialect-variation in most cases. The word itself in the preferred script is self explanatory. All the metadata can be included inside the page using the LL template. I do not understand the insistence on adding additional info (QID and username). Also, just curious what really is the issue with ISO-FILENAME.EXTN (ori-କ.wav) for the first occurrence and ISO-FILENAME.EXTN (ori-କ-1.wav) for the second occurrence and so on? --[[User:Psubhashish|Subhashish]] ([[User talk:Psubhashish|talk]]) 09:19, 25 June 2021 (UTC)
 
::::Not all the languages we allow to record on LinguaLibre have an ISO code. That's why the QID is useful. --[[User:Poslovitch|Poslovitch]] ([[User talk:Poslovitch|talk]]) 09:33, 25 June 2021 (UTC)
 
::::: {{ping|Psubhashish}}, {{u|Poslovitch}} replied about the QID. About the username, the goal is to ensure that there is only one record per speaker. With such name, if you record twice the same words, only the lastest record will remain. It is very useful if you want to correct a wrong/bad pronunciation because the preivous recording is automatically replaced by the new one. Thus, no need for the user to ask for a deletion of the previous file on Wikimedia Commons.
 
::::: That's said, I do not see the benefits to shorten the filename name. If you are looking for a given word, using the search engine on Wikimedia Commons should find the recordings. If you are interested by mass import, so Lingua Libre Bot is probably the tool you are looking for. If you want to do it by yourself, there are already some Python codes (other that LLBot) that do this job. See for example [https://github.com/JackPotte/JackBot/blob/master/src/wiktionary/fr_wiktionary_import_from_commons.py this code] that is used on the French Wiktionary. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 11:09, 25 June 2021 (UTC)
 
{{ping|Pamputt}} thanks for sharing this. I share the same concern with you when it comes to ISO and had shared about Glottolog ID. Glottolog ID is something field researchers as Gregory Anderson (Living Tongues) or organizations such as LAC and ELP use. But apart from Glottolog being used by field researchers, the classification is indeed really detailed. Does using QID solve any particular issue? I am yet to have explore the LL Bot but have made a [[LinguaLibre:Bot#Bot_request_for_.7BOdia.7D_witkionary|request]]. BTW can the LL Bot be used for inserting files that are already there on a Commons folder? You still didn't share why the ISO-FILENAME.EXTN and ISO-FILENAME-01.EXTN option is a bad one and why "LL-QID (ISO)-USERNAME-NAME.wav" is preferred over the former for the languages with ISO standards. Also, have you considered the need for the same word being recorded multiple times by someone who speaks in different accents or there is a need for different intonations/moods? A word might be written the same way in a particular writing system but there are often aforementioned needs. If a new recording overwrites an existing one, many might accidentally overwrite audio files that are needed. --[[User:Psubhashish|Subhashish]] ([[User talk:Psubhashish|talk]]) 14:34, 25 June 2021 (UTC)
 
:{{ping|Psubhashish}} I let Poslovitch answer concerning LLBot.
 
:: ''Does using QID solve any particular issue?''
 
:Using QID allow us to be able to record any language/dialect even those that would not be yet available in Glottolog. In addition, we are sure that the QID is stable and will not change in the future.
 
::''You still didn't share why the ISO-FILENAME.EXTN and ISO-FILENAME-01.EXTN option is a bad one and why "LL-QID (ISO)-USERNAME-NAME.wav" is preferred over the former for the languages with ISO standards.''
 
:This is what I tried to explain in the previous message. This is used to manage double recording and to correct bad pronuncitation files easily. If we use "ISO-FILENAME.EXTN", it is not linked to a locutor and so it means several files can be created by the same locutor, and the "bad" files will be kept. A name such as "LL-QID (ISO)-USERNAME-NAME.wav" solves this problem (maybe "LL" is not needed but it is only two letters). In addition, how you would record word from dialects or languages that do not have ISO codes if we use something like "ISO-FILENAME.EXTN"?
 
::''Also, have you considered the need for the same word being recorded multiple times by someone who speaks in different accents or there is a need for different intonations/moods? A word might be written the same way in a particular writing system but there are often aforementioned needs. If a new recording overwrites an existing one, many might accidentally overwrite audio files that are needed.''
 
: This are really rare cases. If a user wants to record himself/herself with several accents, probably most of the recordings will not be "natural", which mean the audio files will be poor quality for reusing. That's said, there is a way to manage words that spell the same but have differents pronunciations. In such cases, it is possible to add in bracket a precision about the word we want to record. For example in French, we have [[c:File:LL-Q150 (fra)-0x010C-fils (pluriel de fil).wav|File:LL-Q150 (fra)-0x010C-fils (pluriel de fil).wav]] (fils (plural of fil)) and [[c:File:LL-Q150 (fra)-0x010C-fils (enfant).wav|File:LL-Q150 (fra)-0x010C-fils (enfant).wav]] (fils (child)). So that, using the bracket, we are sure about the user intent [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:53, 25 June 2021 (UTC)
 
  
== LinguaLibreBot pour le Wiktionnaire en Chaoui ==
+
== Automatic categorization isn't documented. ==
  
Bonjour, je veux relancer la discussion pour permettre le bot LinguaLibre d'ajouter des audio sur shy.wiktionary.org . je suis le seul admin de ce projet. je peux vous aider pour l'algorithme des pages, si vous n'êtes pas contre. merci d'avance.--[[User:Reda Kerbouche|Reda Kerbouche]] ([[User talk:Reda Kerbouche|talk]]) 12:48, 2 July 2021 (UTC)
+
So far as I can tell, this isn't documented: if, for user Foo, category <tt>Lingua Libre pronunciation by Foo</tt> exists on Commons, then all uploads will be categorized into that category. This is helpful! It's also easy to backfill after the fact using [[:commons:Help:Gadget-Cat-a-lot]]. I'm not sure where to document this, but it seems reasonable to do so ''somewhere''. [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 16:26, 3 March 2024 (UTC)
: Salut {{u|Reda Kerbouche}} : on est train de réfléchir comment prendre en charge au mieux les différents Wiktionnaires. Si vais essayer de me motiver pour proposer quelque chose durant l'été. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 12:54, 2 July 2021 (UTC)
 

Latest revision as of 16:26, 3 March 2024

Chat rooms in various languages:
English · 🌐

Chatroom FAQ

How to download all audios of one language? By speaker?

Datasets are availale here. A script is updating the datasets every 2 days, using CommonsDownloadTool. For more, see Help:Download datasets.

How to add missing languages?

Administrators can add new languages on demand, they do so within few days. Please provide your language's ISO 639-3 code and/or its Wikidata ID. For more, see Help:Add a new language.

How to keep my wikimedia project up to date?

Contact Poslovitch, the master of Lingua Libre Bot. For more info, check out Help:Bots and LinguaLibre:Bot.

What IRL events are coming? When? Where?

Please see LinguaLibre:Events.

How to translate LinguaLibre User Interface into a new language?

Go to translatewiki.net. For more, see Help:Translate.

How to archive sections which have been answered?

After reviewing the section, add {{done}} ~~~~ to the top of the section. After few days to 2 weeks, move the section's code to [[LinguaLibre:Chat_room/Archives/year]].

Archives
20222021202020192018

Results of Coverage Test of French Lemma and Non-Lemma forms is English Wiktionary

While playing around with generating lists for pronunciation from Wiktionary, I decided to run a few tests on the current coverage of French lemma and non-lemma forms in English Wiktionary. I choose French because it is the largest datasets in LL.

Current Coverage of French in Lingua Libre

  • Total French Entries in Lingua Libre by a native speaker: 233 982
  • Unique French Entries in Lingua Libre by a native speaker: 154 358
  • Percentage of overlap: 34%
  • Term with the greatest number of pronunciations: "blanc" with 40

Current Coverage of Category:French lemmas

  • Total entries in Category:French lemmas: 84 482
  • Pronounced entries: 50 917
  • Entries with pronunciation: 33 565
  • Coverage Percentage: 60.27%

Current Coverage of Category:French non-lemma forms

  • Total entries in Category:French non-lemma forms: 29 1225
  • pronounced entries: 26 791
  • Entries with pronunciation: 264 434
  • Coverage Percentage: : 9.20%

For me, there are several lessons to be drawn.

  1. First, there has been amazing growth on LL. Covering 60.27% percent is a real achievement.
  2. The overlap percentage is quite small overall.
  3. There needs to be a clearer sense of when LL should stop requesting pronunciations for a certain term because 40 pronunciations of "blanc" seems a bit excessive.
  4. A need exists to continue pro-actively targeting entries in Wiktionary that are not in Lingua Libre. Currently, 297 999 French lemma and non-lemma forms require pronunciations.
  5. Generating lists from Wiktionary and checking coverage is not as hard as I thought.
  6. Lingua Libre has almost caught up with Forvo in the number of French pronunciations (233 982 vs 254, 703). Overall, Lingua Libre has shown amazing and healthy progress in a very short period of time. I'm excited about these results. Languageseeker (talk) 03:07, 1 June 2022 (UTC)
@Languageseeker This investigation is pretty cool. (I'm not sure i understand all your numbers yet, but i will read again when back on my PC). Its quite nice to see we are reaching Forvo level for our lead language. It's possible we have more unique words than forvo since we have user:Olafbot actively guiding and pushing us on that path.
On Lili we have chosen to be a learning AND linguistic diversity audio database. When you account for gender, regional accents, age, voice type, having 40 french audios for a word is still 400+ voices short.
Also, all contributors are not able to contribute audio perfect files due to various shortcomings (hardware, no recording room, no noose cancelling system, etc). We lack proper rating and review system. It's on our [slow] roadmap tho. 😉
PS: Should i answer to you in French i get a feeling you are French or learning it. Yug (talk) 15:07, 1 June 2022 (UTC)
@YUG Salut, Yug. Oui, je suis en train d'apprendre le français. Comme nous avons discutez pendant notre reunion, c'est difficile de definer les limits d'une language. Comme je le vois, les formes lemma ne suffit pas. Maintenant, je suis en train de crée un Olafbot sur steroid pour francais. Mon plan est de réaliser un program python qui peux analyser les modèle utilizer sur Wiktionary. Languageseeker (talk) 15:48, 7 June 2022 (UTC)
Hi @Languageseeker . I'm sorry I did not visit the Chat Room in a long time, and missed your report. Very interesting, good job! I remember a request I made to Olaf some time ago: it would be interesting to have a list similar to the one Olafbot is updating, but containing only lemmas of the target language (to quickly have nearly all lemmas of a dictionary illustrated with an audio pron). Also, I suggest you to use the categories of the French version of Wiktionary when you plan to work on French (and some other languages, that are more extensively described there). As you can see here, the category gathering French lemmas is more than 3 times more complete on the fr. version than on the en. version of Wiktionary. As you mentioned, these numbers are exciting, let's keep up the good work! All the best — WikiLucas (🖋️) 15:47, 26 November 2022 (UTC)
@WikiLucas00 Sorry, I totally forgot about your request. The list is now ready for French: List:Fra/Filtered-lemmas-without-audio-sorted-by-number-of-wiktionaries. It's produced like the other lists, but it's limited to words from Catégorie:Lemmes_en_français. The list will be refreshed together with the rest. Olaf (talk) 16:54, 14 May 2023 (UTC)
Hello @Olaf ! Thank you so much for this list, it's going to be very useful for sure! Let's cover 100% of Lemmas 😎 I'll tell the French contributors on Discord about it 😉 All the best — WikiLucas (🖋️) 22:18, 20 May 2023 (UTC)

How to create user page

Hello, my user name is Ngangaesther from Kenya. I am still stuck on how am supposed to create my user page kindly help regards Esther

Odia language missing from Stats/Languages

Hi there, for some reason, the Odia-language stats are missing from the Stats/Languages page. Also, "The most prolific speakers for the current month " section in the Stats/Speakers page is not loading at all since the time I checked last (about 10 days). I have tried on Chromium and Firefox and the result is the same even after clearing cache. --Subhashish (talk) 19:40, 28 July 2022 (UTC)

Hello Subhashish, it should be back online. We had a hackathon to put it back. We are calling for devs to push forwards. Yug (talk) 11:07, 10 August 2022 (UTC)
Thank you for the update, Yug. --Subhashish (talk) 14:00, 10 August 2022 (UTC)

Manually-coded languages

I came across meta:Lingua Libre/SignIt recently (via betawiki) and was wondering if manually-coded languages would be appropriate for this as well? These are languages in sign modality, but strongly tied to a spoken/written language; they usually adopt the grammar of the nonmanual language, choosing instead to simply transpose the vocabulary. This means they are most often used in application-specific and pidgin contexts (Pidgin Sign for English and diver's signs are examples). In particular, I am interested in toki pona luka, a manual form of toki pona (Q338540). Since the vocab is the same as spoken/written toki pona, there are a minimal number of lexemes overall, so having a complete set of signs is easily achievable. Manually-coded languages including toki pona luka are generally not given a separate ISO 639 code since they are in effect equivalent to scripts. Would this cause a problem for the infrastructure as currently designed? Arlo Barnes (talk) 05:56, 17 August 2022 (UTC)


Hello Arlo Barnes,

I understand "manually coded languages" as synonymous to "signed languages", am I correct?
If there is no distinct ISO for the signed language, we could still:

  • Create a new wikidata item without ISO, which will be used as identifier by LinguaLibre infrastructure
  • Use the spoken/write language ISO, and create lists of words all suffixed by (signed).

Either of those solutions could work.

If you have some knowledge of signed toki pona luka please let me know. We are adding features on Lingualibre and SignIt in order to be able to record video of signed words by late 2022. We are almost there. If you would like to record some basic signed words to share with the world, then let me know. Yug (talk) 20:58, 17 August 2022 (UTC)

Signed languages and manually-coded languages share similarities (the manual modality) and differences (since sign languages are 'native' to the signed modality, they use it more fully, having complete deixis and time-reference systems, use of handshape classifiers, etc.) -- 'luka' means 'hand'/'five', so that's the part of the name that indicates the manual modality, but otherwise it's just garden-variety toki pona. I am interested in using SignIt to record this vocab, yes. The '(signed)' suffix seems like a good way to do it. Arlo Barnes (talk) 13:16, 19 August 2022 (UTC)
Arlo Barnes: We increasingly have tools to update and correct sign language recordings, so the suffix (signed) or the solution we choose appears incorrect, we still can correct it later using that bot.
I would encourage you to first train yourself and learn that manually-coded language over the coming months. Indeed, we still have a very last bug within our video recording chain, which makes rightful videos appears as audio on Commons. We expect to solve this last issue this fall (September or October ?). So for now, I encourage you to rest well, reload energy, to get ready to record later this year. Maybe identify near you some suitable place with elegant monochrome wall to film over or consider building yourself a low-cost recording studio,. Etc. We can discuss it to keep it low cost and effective if you are interested, as I'm also looking for such walls and/or considering building one for myself.
See also : Minimal Sign Language Studio guideline. Yug (talk) 22:30, 19 August 2022 (UTC)

Update my username

I have changed my Wikimedia username but the previous name still appears in Lingua Libre. I know it's not included in unified logins. Anyway, please update my username to Aishik Rehman. Hirok Raja (talk) 15:14, 1 September 2022 (UTC)

Hi Hirok Raja¸would you have an example of what you would like to see to be changed? I think you are talking about the filename but I am not sure, so with one example, it would be clearer. Pamputt (talk)
@Pamputt
1. Top menubar of lingualibre.org showing 'Hirok Raja' as my profile name.
2. After uploading when I try to check my uploads in Commons, it takes me to https://commons.m.wikimedia.org/wiki/Special:ListFiles/Hirok_Raja page.
3. 'Hirok Raja' being used as Default recorder in the file names and description
4. Change speaker name to 'Aishik Rehman' every time while recording is quite annoying to me.
5. Even here 'Hirok Raja' is showing as my signature by default ): Hirok Raja (talk) 19:16, 2 September 2022 (UTC)
I suspect this is due to long term cookies. Would be interesting to push a clean up for your connection cookies for Lingualibre, it will log you out, then come back here. On firefox.
Open about:preferences#privacy > Go to "Cookies and Site Data"> Click "Manage Data" > Search "Lingualibre" > Remove selected. Yug (talk) 21:10, 2 September 2022 (UTC)

Siège communautaire de Wikimédia France – ouverture du vote / Community representative to Wikimédia France’s board - votes are opened

(English version below. Do not hesitate to correct my English translation.)

(Message copié depuis le bistro du jour par Lepticed7 (talk))

Bonjour,

En tant que président de la commission électorale pour l'élection du siège communautaire au conseil d'administration de Wikimédia France, je vous annonce que le vote ouvre aujourd'hui (13 septembre) à 0h CEST. Il se terminera le 26 septembre à 23h59 CEST.

Comme il y a trois ans, le scrutin est public sur Meta. Les pages de votes sont disponibles dans la catégorie correspondante ou en lien sur la page principale. C'est un scrutin par approbation, le candidat qui aura le plus grand nombre de voix sera donc déclaré élu. Vous pouvez voter pour autant de candidats que vous le souhaitez.

Si vous avez des questions, vous pouvez les poser sur la page de discussion ou par courriel à election@wikimedia.fr.

Pour la commission électorale, Mathis B, le 12 septembre 2022 à 22:00 (CEST)


(Message copied from the French Wikipedia Bistro by Lepticed7 (talk))

Hello,

as the chairman of the electoral commission for the election of the community representative to Wikimédia France’s board, I announce that votes open today (13th september) at 0:00 CEST. They will be closed on 26th september at 23:59 CEST.

Like it was the case three years ago, voting is on Meta. Voting pages are available in the corresponding category or as links in the main page. The elected candidate will be the one with the most approbation votes. You can vote for as many candidates as you wish.

If you have any questions, you can ask them on the Talk page on Meta, or by email at election@wikimedia.fr.

For the electoral commission, Mathis B, 22:00, 12 septembre 2022 (CEST)

Is there a way to exclude username from Wikimedia Commons upload file name?

See also Help:Renaming.

This seems redundant and takes up a lot of space --Middle river exports (talk) 20:22, 9 October 2022 (UTC)

@Middle river exports Welcome MRE,
You could name your speaker with a single character I guess.
But keeping the name is voluntary. Each speaker has his/her own voice, which we want to document. If, outside of Wikimedia, you want to remove part of the filename, we have a technical tutorial to do so. See Help:Download datasets and Help:Renaming. Ping us back if your dataset is not up to date. Yug (talk) 13:16, 10 October 2022 (UTC)
I have solved this now by just changing my username to something shorter. This way I can upload English as Usmaan (عثمان) for example where instead of just repeating the username it shows two scripts which is more useful. (Apparently few enough people have Arabic script usernames that short common words are mostly available.) --عثمان (talk) 20:23, 10 October 2022 (UTC)
All Unicode characters should be ok, in words and usernames ;) Yug (talk) 19:46, 11 October 2022 (UTC)

Username update request

I realised my username on Mediawiki didn't carry over here when I changed it. On thus site could I please have it changed to: عُثمان --عثمان (talk) 08:45, 10 November 2022 (UTC)

Data on LinguaLibre:Stats isn't consistant with Wikipedia Commons's Category

On the Stats page, the French have 254,387 records

https://lingualibre.org/wiki/LinguaLibre:Stats/Languages

Meanwhile, the Category on commons.wikimedia.org has 253,464 records

https://commons.wikimedia.org/wiki/Category:Lingua_Libre_pronunciation-fra

The stats display more records. This data inconsistency is strange. -- User:Shenlebantongying, 10:36, 23 december 2022.

This means some item page exist here, but no audio are on Commons.
Item creation here and upload are done at step 5 of the recording, nearly simultaneously.
So I don't know what is going on. Yug (talk) 17:41, 26 December 2022 (UTC)

c:Category:Lingua Libre pronunciation-bxg

All files in this category are tagged with wrong language. I have requested moves for files in the category, but what's more to be done?--GZWDer (talk) 13:05, 12 January 2023 (UTC)

Thanks for reporting. Actually all these items are erroneous (see Special:WhatLinksHere/Q590228):
I have not checked yet if corresponding recordings are still on Commons. Pamputt (talk) 16:11, 13 January 2023 (UTC)

I can not publish my records recorded via Lingua Libre.

Dear Colleagues,

It records, but when I press the button to publish it on Wikimedia Commons. It does not work. It returns as "Retry failed upload" Any idea? Thank you. Key Mîrza (talk) 05:09, 28 January 2023 (UTC)

Is it happening for all your recordings or only some of them? Pamputt (talk) 08:49, 28 January 2023 (UTC)
It was all good until a month ago. Nowadays I am on a vacation in another city and trying to enter to my accout and make some more records. I can enter into my account and I can create records, but I can not publish them. I stuck at publishing stage. Nothing publishing. None of my records publishing. I even tried to record via my cell phone, even there nothig publishing. By the way, I just saw your previous message wecoming me. Thank you, for your kind wish. Best wishes... Key Mîrza (talk) 09:57, 28 January 2023 (UTC)
Hmmm, I do not know what to say. Sometimes some recordings do not upload but they other do. When none recording uploads, I do not know what could be the origin. Could you try with another webbrowser (firefox or Chrome)? To go further, I think we would need a Javascript expert that could have some hints. @Poslovitch & Lepticed7 maybe ? Another question, how many words do you try to record? If this is a lot, could you try with only a few (less than 10 for example). Pamputt (talk) 15:42, 28 January 2023 (UTC)
I tried 11 words together, then even 1 word only for testing purpose. Nothing worked. You said Java. Do I need java to be able to work with the application? If so, that I need to install Java. Because I formatted my PC. May be it is not installed. Thank you. Key Mîrza (talk) 17:06, 28 January 2023 (UTC)
Java is different than Javascript. Javascript is language supported by the webbrowser so you do not need to install anything else than a webbrowser to record pronunciations on Lingua Libre. Unfortunately, I cannot dig further in this direction because I almost know nothing about Javascript. Pamputt (talk) 21:18, 28 January 2023 (UTC)
Thank you, anyway. Key Mîrza (talk) 22:38, 28 January 2023 (UTC)
Key Mîrza, thank you a lot for your voice, it make us discover new languages. Please be aware Lili works best on solid desktop computers. Also, you likely have a limit of 380 records uploads per 72 minutes. So you may need to leave your tab open, and click "retry" after that. You can expand those right by making a demand on Commons. See LinguaLibre:User rights. Contact us if you think it may be that. Yug (talk) 15:07, 5 February 2023 (UTC)
It's confirmed, as all new contributor you are limited to 380 uploads per 72h. You can get more userrights by requesting those rights on Commons. Yug (talk) 15:15, 5 February 2023 (UTC)

Late 2022-2023 Winter report

Hello all, allow me to share few overall news from the various recent, ongoing, or near-future efforts.

  • 🤖 User:Pamputt has taken over Lingualibre Bot and added support for the Kurdish wiktionary. See github.
  • 🌏 Melody (WMFr intern) and myself made a mini-editathon on writing template emails for outreach. See Lingualibre:Events.
  • ⚡ User:Elfix and myself will attend are collaborating for sparql requests (me) optimization (Elfix). We aim to create and languages gallery this spring.
  • 🔴 Wikimedia France's freelance on the record wizard is back on track, delivery of fixes should occur around May-June.
  • 🙋‍♀️ Adelaide (WMFr) mentioned the wish of a second intern on Lingualibre outreach this summer, to reuse Melody's assets, expand actions and geographic diversity.
  • 🫱🏼‍🫲🏽 Wikimedia France yearly strategic meetup is this week, and is expected to strengthen its (linguistic) diversity and metrics axes, for which Lingualibre is one of their champions.
  • 🧓 Eve and myself (likely) will be present at Toulouse's Forom des Langues, in May, where ~60+ languages associations are present.

For specific deadlines and events coming soon, please also check Lingualibre:Events/Program. We always welcome contributors. When necessary, WMFr may refund transportation costs. Worth a try ! Yug (talk) 15:07, 5 February 2023 (UTC)

Edit your nickname

Good evening, I would like to change my nickname because it did not update when I was renamed Manjiro91 then Manjiro5 instead of GamissimoYT on Wikimedia projects. Thanks in advance Regards manȷıro💬 22:53, 23 February 2023 (UTC)

Tool to prepare words for Lingua Libre

Preparing words to be used in Lingua Libre has always been challenging. But I think this is a shared challenge. Crawling text from different sources and creating a clean list of words is very important. I've used Tito's instructions in the past, but using multiple tabs and multiple tools is not the best user experience. So, I thought I'd create something that is functional for me and simple enough to be tweaked. Introducing "Prepare words for Lingua Libre". The tool is currently set for Odia but can be easily tweaked for other languages using non-Latin scripts. I'd request Lingua Libre core team to incorporate the tool into Lingua Libre so that users can use the platform to create a wordlist. Extracting words from any random text is always hard, especially new contributors. --Subhashish (talk) 03:44, 14 March 2023 (UTC)

Hi Psubhashish. This is really nice. Do you think it would be easy to adapt it to create a new generator? Generators can be used by anyone after they import them in their common.js. Pamputt (talk) 06:44, 14 March 2023 (UTC)
Thanks User:Pamputt. That would be fantastic, but I probably don't have the right knowhow for doing that. I did take ChatGPT's help to create a .js version from the HTML code I had shared earlier but would appreciate any help. I think having a tool inside Lingua Libre would be great so really liked the idea of new generators. Common users would like things well packaged rather than jumping from one platform to another. --Subhashish (talk) 13:09, 14 March 2023 (UTC)

Problème de publication des enregistrements

Bonjour, il y a quelques années, j'ai renommé mon compte GamissimoYT en Manjiro91. Plus tard, je l'ai renommé Manjiro5. Le problème est que le renommage de mon compte global Wikimedia ne s'est pas fait sur Lingua Libre. Je ne peux donc pas publier les audios que j'enregistre sur LinguaLibre et n'apparaissent pas non plus sur Commons. Pourriez-vous m'aider ? manȷıro💬 08:41, 26 April 2023 (UTC)

Renommer un dialecte en langue

Bonjour,

J'avais fait la demande pour l'ajout de "Teochew dialect" il y a quelques années lors de mes premiers essais. Cependant, il paraît plus pertinent de juste laisser "teochew" tout court sans le mot dialecte. Serait-il possible de faire ce changement.

Assassas77 (talk) 19:41, 7 May 2023 (UTC)

Check-green.svg Done Solved here by User:Assassas77 ! It's a wiki :) Yug (talk)

MediaWiki:Lang/*

What are the MediaWiki:Lang/* messages for? For example, MediaWiki:Lang/awa? It looks like they mostly just repeat the language code in the content. --Amir E. Aharoni (talk) 07:21, 24 May 2023 (UTC)

Where are the Greek recordings?

According to the statistics page there are 130 recordings of the Greek language (Q205, ISO: gre). However there is no category commons:category:Lingua Libre pronunciation-gre defined or any recordings added to this category. There is a category commons:category:Lingua Libre pronunciation-ell, but it is empty. What happened to the 130 Greek recordings? Olaf (talk) 20:16, 9 June 2023 (UTC)

Hi Olaf, for unclear reason (probably historical reason), it seems that all Greek recordings are categorized in Category:Lingua Libre pronunciation-other. We have to move all these recordings in the good catagory (I do not know if Commons has a some automatic tool for such job). And also redirect commons:category:Lingua Libre pronunciation-ell to c:category:Lingua Libre pronunciation-gre. Pamputt (talk) 07:24, 10 June 2023 (UTC)
Hi Pamputt. This happened because in wikidata:Q9129#P220 both ISO 639-3 codes are deprecated, and entity:getBestStatements function, used in commons:Module:Lingua Libre record#L-46, doesn't accept deprecated entries, so the module can't get the language code and falls back to "other" category. We could change the Wikidata entry and the files would be moved automatically. However code "gre" must stay deprecated, because it is unclear if it refers to ancient or modern Greek. It would be better to promote "ell" to normal entry. Then changes in Greek (Q205) would be also needed. It looks like bulk moving Lingua Libre recordings around doesn't require admin rights, so I can fix this issue if you agree to change the Greek language code to "ell" instead of "gre". Olaf (talk) 08:46, 10 June 2023 (UTC)
Hi Olaf thank you for your investigation. So, I have modified Greek (Q205) to fix the issue on the Lingua Libre side. For Wikimedia Commons, you can go ahead. Pamputt (talk) 08:11, 18 June 2023 (UTC)
Thanks, Pamputt. It's not as easy, as I thought. Setting Greek ISO 639-3 code to normal from obsolete creates constraint validation with Modern Greek with the same code. In fact, LinguaLibre shouldn't record Greek words as Greek (Q9129) but rather as Modern Greek (Q36510). In fact Modern Greek is also defined in LinguaLibre: Modern Greek (Q279). Olaf (talk) 13:26, 18 June 2023 (UTC)
If I understand correctly, the easiest way to manage this case would be to delete Greek (Q205), so that no one can record in "this language" and thus select only Modern Greek (Q279). If so, I would require to replace all Lingua Libre statements that use Greek (Q205) by Modern Greek (Q279). There is currently 137 items that use Greek (Q205), so I think it is manageable by hand. Olaf, what do you think about this "workaround"? Pamputt (talk) 16:48, 18 June 2023 (UTC)
This would be perfect, it also requires renaming the 137 recordings in Commons, but it can be done. What about the datasets to be downloaded from LinguaLibre, will they change automatically? Olaf (talk) 21:08, 18 June 2023 (UTC)
Olaf, Pamputt, I had nearly similar case with Chinese ISOs zho vs cmn. I have about 186 zho items (see Help:SPARQL for maintenance)]] which have the wrong iso. My plan is :
  • to delete those audios, very simply, on both Lingualibre and Commons. The alternative would be to edit them all on both sites.
  • to discourage recording or delete that Lili Qid.
so I may work on those audio, some day... Hugo en résidence (talk) 17:36, 18 June 2023 (UTC)
I don't like deleting good recordings as a way of dealing with wrong categorization. Moreover some of them are probably in use, because Olafbot might have added them to Polish Wiktionary. If there is no other option, just leave them where they are in Commons, and remove Greek from Lingua Libre alone in favor of Modern Greek. But I think Pamputt's solution is better. Olaf (talk) 21:08, 18 June 2023 (UTC)
USer:Olaf, I don't like either. But 186 recording is about 8 minutes work, and it have been confusing us for 3 years. Do point to that. Yug (talk) 19:35, 20 June 2023 (UTC)
Deleting 186 recordings is about the same amount of time as modifying the language statement. This is manageable by hand and I would prefer not to delete them. I do not have time for now but I will try to do it before the end of the month. Pamputt (talk) 11:47, 21 June 2023 (UTC)

Any Recording limitation in Lingua Libre

Hello,I want to know any recording limitation in Lingua Libre. Because I'm planning a screen-cast in Tamil language. If anyone know please reply. Thank you Sriveenkat (🎤) (talk) 11:11, 1 August 2023 (UTC)

I you are not an autopatrolled user on Wikimedia Commons, then you cannot upload more than 380 audios per 72 minutes. If you want to record more words within this timeslot, then you should request for this right. Pamputt (talk) 14:15, 1 August 2023 (UTC)
Hi, @Pamputt , I don't record 380 audios within 72 minutes. I'm planning to create screen-cast tutorial video in Tamil language. So I ask this question. Thank you for your reply Sriveenkat (🎤) (talk) 14:35, 1 August 2023 (UTC)

Exclusion list for generators?

Hello, if there isn't a feature like this somewhere already, I propose a per-user blacklist of sorts, which would allow users to select words which would be excluded when you choose one of the generator options to generate words. I'm currently going through a list of words in a Wiktionary category, and I'm confronted with a growing list of words that I can't deal with because they aren't suitable for pronunciation (e.g. particles that surround other arbitrary words), or they're just homophones of something I've already recorded, etc. What would be necessary, techniaclly, in order to make this happen? Kiril kovachev (talk) 12:39, 10 August 2023 (UTC)

Hi Kiril kovachev, I have opened a Phabricator ticket for this request. If you know Javascript, you may have a look to the code to propose a patch. Pamputt (talk) 05:52, 15 August 2023 (UTC)

Barnstar Award Template

There is any Barnstar Award Template for Lingua Libre? Sriveenkat (🎤) (talk) 07:06, 13 September 2023 (UTC)

There are Template:50k barnstar and Template:Speaker of the month and maybe other. WikiLucas00 may know other barnstars. Pamputt (talk) 21:11, 13 September 2023 (UTC)
@Pamputt & WikiLucas00 Ok Pamputt, I want give barnstar award for Some Beginner Speakers. It will be a motivating for them. Am I right?Sriveenkat (🎤) (talk) 11:46, 14 September 2023 (UTC)
Hello @Pamputt & Sriveenkat ! Indeed, it would be a nice idea to offer awards for beginners, such as a barnstar for passing 1000 recordings for example. All the best — WikiLucas (🖋️) 16:08, 16 September 2023 (UTC)

1,000,000th

  • N ! 08:38 కంటగిల్లు (Q1094614)‎ diffhist +3,648‎ V Bhavya talk contribs block ‎Created a new Item
  • N ! 08:38 కంటగించు (Q1094613)‎ diffhist +3,636‎ V Bhavya talk contribs block ‎Created a new Item
  • N ! 08:38 కంటకితము (Q1094612)‎ diffhist +3,636‎ V Bhavya talk contribs block ‎Created a new Item
  • N ! 08:38 కంటకుడు (Q1094611)‎ diffhist +3,624‎ V Bhavya talk contribs block ‎Created a new Item
  • N ! 08:38 కంటక (Q1094610)‎ diffhist +3,588‎ V Bhavya talk contribs block ‎Created a new Item
  • N ! 08:38 కంటబడు (Q1094609)‎ diffhist +3,612‎ V Bhavya talk contribs block ‎Created a new Item

Yug (talk)

Why Lingua Libre Bot isn't running Wikidata?

@Poslovitch, Pamputt, & WikiLucas00 Why Lingua Libre Bot isn't running in Wikidata? Darafsh asked about in Wikidata Lexicographical data Telegram Group. What's the problem? Please kindly tell the issue. Thanks-Sriveenkat () (talk) 16:12, 6 October 2023 (UTC)

@Sriveenkat could you point to an Lingua Libre item and a Wikidata item or lexeme that has not received the pronunciation? This will help to test and find what is wrong. Pamputt (talk) 19:22, 6 October 2023 (UTC)
Hi @Pamputt Recorded Audios doesn't received in the Wikidata Items and Wikidata Lexemes!. The User Darafsh have recorded some many words for Wikidata Lexeme Project. but never audios added to the Wikidata Lexemes. You can see the wikidata:Special:Contributions/Lingua Libre Bot The last contribution on 23:49, 9 September 2023. So, Iam just asking run the Lingua Libre Bot on Wikidata. I'm also recorded some words for Wikidata Lexeme Project I waited for some days, But never my audios added to wikidata lexemes. So, I run QuickStatements for Adding My audios.. Now User Darafsh also run QuickStatements for adding he's audios.. I think so many users using Lingua Libre for Automatically adding audios on Wikidata and some wikitionaries. I hope you understand Thankyou Regards Sriveenkat () (talk) 05:38, 7 October 2023 (UTC)
Thanks to @Sriveenkat to start the discussion. If you need some examples, you may see Mazanin's contributions on Commons. This is the recorded audio: [1] and this is the lexeme entry on Wikidata: [2] but they are not connected yet. Darafsh (talk) 12:07, 7 October 2023 (UTC)

SiteNotice

Hi,
Translations are not working for Sitenotice. Install CentralNotice? ―Eihel (talk) 14:31, 7 October 2023 (UTC)

Global bot status

Lingualibre Bot has been approved. cc @Pamputt, Poslovitch, & WikiLucas00 . Yug (talk) 12:31, 10 October 2023 (UTC)

Thank you for the request and congrats on the approval! — WikiLucas (🖋️) 12:40, 16 October 2023 (UTC)

ExternalTools - Wikidata Query Service - Recording Indian Actor and Actress Names in Tamil

@Yug, Pamputt, & WikiLucas00 I am now interested in Recording Indian Actor and Actress Names in Tamil. So I make a query, I Input that query url in ExternalTools. A error comes "Result must contain both "id" and "label" field." I think something need to modify on this query. Please anyone help for this. Thanks Sriveenkat (talk) 19:58, 24 November 2023 (UTC)

@Sriveenkat , this works. Please note there is 6982 items if we remove the LIMIT, and I don't how the systems works with such larger list. Yug (talk) 23:13, 25 November 2023 (UTC)
@Yug Thanks for your reply. The query doesn't works for me :( Error in ExternalTools "undefine" Sriveenkat (talk) 06:03, 26 November 2023 (UTC)
@Sriveenkat , in Wikifata QS you have to run the query to check if it is working and providing data, if so go to the URL bar, copy that long url. Come back to Lingualibre Step 3, external tool, paste that long url. It worked for me. Yug (talk) 06:00, 27 November 2023 (UTC)
@Sriveenkat Sorry, I missed something. On the Query Service bottom right, click "Link" > then on "SPARQL endpoint" : copy this url. Yug (talk) 08:25, 27 November 2023 (UTC)
@Yug Works with copying SPARQL endpoint link. Thank you much. I'm planning to record more proverbs, usage examples, places, persons, Lingualibre is really more comfortable to record it. Thanks Again Sriveenkat (talk) 22:54, 27 November 2023 (UTC)

Logo redesign propositions

I had a bit of fun yesterday contributing to one of my favourite projects in a slightly different way. I've kept the ideas (microphone, wings) and colours of the current logo but made it a bit more polished. I've already taken a few opinions on Discord but I wanted to get a more general opinion. What do you think?

Just so you know, I won't be at all offended if the community prefers to keep the current logo, because there are some very good reasons for keeping it (I'm thinking in particular of all the printed materials, the fact that it's simple (easy to draw by hand if we don't have a printer and maybe more "readable" if very small), its declination for sign languages, etc.).

DSwissK (talk) 08:59, 3 December 2023 (UTC)

@DSwissK hello,
We can add your proposition in the set of logos ideas within a Wikimedia Commons Category:Proposed Lingua Libre logo, for reference later on. But to be honest, good logo design requires design experience, artistic intuition, brand and public awareness, which are harder to gather than it seems. It also must fit a project's phase and branding strategy, when the project needs a new logo and project members willing to shift from the current high visibility logo to a new one. All together changing a logo is not something easy to push for. I made a similar answer here few month ago about Lingua Libre SignIt. Yug (talk) 12:23, 4 December 2023 (UTC)
@Yug hi,
Thank you for your input. I appreciate you explaining the complexities - you raise great context I had not fully considered. DSwissK (talk) 09:05, 6 December 2023 (UTC)

Hebrew diacritics (Niqqud)

In Hebrew we use diacritics (Niqqud) to determine how to pronounce the words.

Niqqud is usually common in the following cases:

  1. Young kids or people learning the language.
  2. Formal use.
  3. To distinguish between meanings when the base form is ambiguous.

This is a short example:

  • Base form: גזר (GZR)
  • Carrot: גֶּזֶר (Gezer)
  • Masculine cut: גָּזַר (Gazar)
  • Piece: גֶּזֶר (Gezer)

This is the corresponding Wiktionary article: https://he.wiktionary.org/wiki/גזר

When fetching words from Wiktionary it's better to use the first headers instead of the item names because in many cases the term is ambiguous and the items name is the base form without any pronunciation guidance.

As for Wikipedia etc. sometimes there's a word with the Niqqud inside the article but it will be a bit complicated to parse so we can skip that for now.

Lights on userrights

Hello all,
I bumped again into LinguaLibre:User_rights and {{Autopatrolled}}. To the extend of my knowledge we have no solution to this and no active user is munitoring this bottleneck. Is this assessment correct ? Yug (talk) 21:03, 28 December 2023 (UTC)

A mobile app

I personally think that contributing using a browser is quite dangerous, Firefox on mobile, for example, has a very strict page unloading policy which leads to closing the tab while uploading thus losing the remaining data which wasn't uploaded yet (I found a workaround but it's not perfect), are there any thought about this? (Maybe even expanding the current CV Project app by Saverio Morelli?)

Is the Record Wizard not working for anyone else?

My mic works with mictests.com, but the RecordWizard doesn't pick anything up at the "check your microphone" stage. I've tried on both my phone and my laptop, and I can record sound in both cases, and I have the appropriate permissions enabled, but this particular website isn't detecting sounds. Is anyone else having this kind of problem? Grendelkhan (talk) 23:43, 24 February 2024 (UTC)

Hello User:Grendelkhan,
I just received a second such report. User also checked mictests.com sucessfully.
On Firefox, Lingua Libre recording studio step 4, the microphone is allowed (we see the red microphone image on the left of the URL address). But after clicking the record button, no recording occurs.
  • Mictests on other site : successful.
  • Device: Notebook
  • OS: ?
  • Browser: Firefox, Chrome.
  • User: User:Akamycoco.
  • Languages affected: all.
  • Dates : Worked on February 28. Stopped working on February 29.
Let's starts an investigation. Could you let me know your OS and precise web browser version ? (Help > About Chrome or similar)
Let me know as well if you have basic developer skills to Right-click on the staled page > Inspect > Console : are there any error message ? Yug (talk) 07:55, 1 March 2024 (UTC)
My laptop is using Google Chrome 122.0.6261.94 (Official Build) (64-bit) on Linux (Debian Testing). No error messages in the console when I attempt the recording. My phone is using Chrome 122.0.6261.90 on Android 14 on a Pixel 5a. It does seem to work on Firefox 115.7.0esr (64-bit) on my laptop. (I really should have checked that before.) So maybe this is solely a Chrome problem? Grendelkhan (talk) 16:30, 2 March 2024 (UTC)

Automatic categorization isn't documented.

So far as I can tell, this isn't documented: if, for user Foo, category Lingua Libre pronunciation by Foo exists on Commons, then all uploads will be categorized into that category. This is helpful! It's also easy to backfill after the fact using commons:Help:Gadget-Cat-a-lot. I'm not sure where to document this, but it seems reasonable to do so somewhere. Grendelkhan (talk) 16:26, 3 March 2024 (UTC)