LinguaLibre

Chat room/Archives/2021

< LinguaLibre:Chat room

Issue with the Main page

This section should be moved to LinguaLibre:Technical board.

-- Done

Hi, the main page here uses a sollution that requires the MediaWiki:Lang and its subpages to be populated, which they aren't which makes the main page not switch languages even if there is a translation available in that language and the language has been set. Could someone look into this if it's possibly to rework the structure or maybe somehow import the MediaWiki:Lang subpages? --Sabelöga (talk) 17:40, 16 January 2021 (UTC)

Hello @Sabelöga: thank you very much for this remark! I just imported the MediaWiki:Lang subpages from Meta, and it seems to be working as of now 🙂 All the best. — WikiLucas (🖋️) 00:09, 17 January 2021 (UTC)
That's excellent, that you so much for amending this, regards --Sabelöga (talk) 21:59, 18 January 2021 (UTC)
Thanks a lot! The main page looks fine --Higa4 (talk) 04:34, 19 January 2021 (UTC)

Images missing

-- Done

This section should be moved to LinguaLibre:Technical board.

Images on Help:Add_a_new_language show up as missing on my end. Have they moved or is this some error? --Sabelöga (talk) 21:08, 19 January 2021 (UTC)

Hello @Sabelöga: they are not actually missing (for example, I can see them on the page you are talking about), but I also have experienced similar issues on the website. Some images seem to randomly disappear for no reason, and to come back after a while, without any modification on the page. We will talk about it during the next team meeting. — WikiLucas (🖋️) 21:19, 19 January 2021 (UTC)
Actually, some images are really missing in the section "I know what I'm doing". This comes from the fact all images have been lost when the website have migrated to the new design. I had opened a ticket about that but I think we will never find them back. So we should create new screenshot when we discover such missing image. I will try to do for the page you've mentioned. Pamputt (talk) 23:06, 19 January 2021 (UTC)
Yes, those were the one I were talking about. --Sabelöga (talk) 16:56, 23 January 2021 (UTC)

Translation error?

The translation units on Help:Configure_your_microphone does not align properly with each other. What I mean is that the translation units include several section when the software should just pick the one for each unit. I removed the __TOC__ from the page since the TOC will appear anyway. So could a translation adminstrator mark the page for translation again and let's see if that solves the issue. --Sabelöga (talk) 16:56, 23 January 2021 (UTC)

-- DoneWikiLucas (🖋️) 22:21, 23 January 2021 (UTC)

RecordWizard drops syllable

-- Done -- can be archived. Yug (talk) 16:03, 27 January 2021 (UTC)

Prior to several days ago I was recording word pronunciations without problems, but now I can't. Several days ago RecordWizard started removing certain sounds from my voice, most frequently syllables "s" and "f". If I spell a word, for example "syllable", it records it as "yllable", just like if I never spelled "s". If I spell "sophicated", it records "soicated".

This sort of behaviour is not present at the test sound stage (the first thing RecordWizard asks user to do), it captures my speech perfecty. However the very same issue is present on different devices with different operation systems, different browsers and different microphones.

My guess is that maybe some changes to noise recognition were deployed several days ago, and it now misinterprets those syllables as background noice. Anyway, I will be grateful for suggestions on how to fix this issue. --Tohaomg (talk) 07:43, 26 January 2021 (UTC)

Hi Tohaomg, not easy to say what happens here. I am pretty sure nothing change at the backend since several months. With your examples "syllable" and "sophicated", does it happen every time you try to pronounce these words or does it happen randomly? In the first case, can other contributors try to record these words and see if the problem occurs for them as well? Myself, I just tried and I did not see this problem. Pamputt (talk) 21:39, 26 January 2021 (UTC)
I am not trying to record exactly those words, they are just examples to show you what I mean. I am actually trying to record words in Ukrainian language. When I try to record words, in some 3 cases out of 4 syllables are dropped, and in 1 out of 4 they are not, so I need to do in average 4 attempts to record a word. And this problem appeared abruptly several days ago, everything worked fine before. --Tohaomg (talk) 08:09, 27 January 2021 (UTC)
Could it come from your microphone? Did you try to record with other hardware for a test? Pamputt (talk) 10:36, 27 January 2021 (UTC)
Yes, it happens on different devices. --Tohaomg (talk) 11:59, 27 January 2021 (UTC)

Solved it. Turns out, this effect is present only when loading lists longer than several hundred words. My next theory is that it was due to some sort of RAM shortage. Thank you for your time. --Tohaomg (talk) 13:06, 27 January 2021 (UTC)

This bug stays weird...
Anyway, thanks Tohaomg for your audios <3 Yug (talk) 16:03, 27 January 2021 (UTC)
Happy to see that you've found a workaround. Indeed, what you guess could be the reason of the problem because the server is currently not very robust. It should evolve in the coming weeks/months. Pamputt (talk) 20:04, 27 January 2021 (UTC)

Technical > Github Winter 20-21 review

Following October 2020's 0x010C's departure we've reviewed the human needs for maintenance of various technical subprojects. Thanks to 4 months community effort things are in better position now :

  • Definitions: All repositories are now well defined via a clean, one sentence descriptor. It maps sub-projects, so new volunteers know quickly what repository does what. See github.com/lingua-libre.
  • Mentors: 2/3 of repositories now have a volunteer referee-mentor with "correct" understanding, able to discuss the repository, guide new comers.
  • Documentations: Most repositories are "correctly" documented via an existing readme.md. Improvement always welcome.
  • Web servers: Wikimedia France hired a new Sysops, which guide and team up with volunteers on the server issues. Welcome to WMFR's MickeyBarber/Michael.
  • Maintenance: Wikimedia France is reviewing freelance candidates for deeper mediawiki and recordwizar coding support. Thanks to Adelaide & WMFR's team.
  • Globalization: Wikimedia France has plan to expand volunteership toward India. Thanks to Adelaide & WMFR's team.

All pretty positive. Pamputt, Jitrixis, Poslovitch, Adelaide, Mikey and myself pushed forward on these fronts.

Still ! The following repositories are currently leaderless and contributorless:

  • LinguaRecorder: Powerfull JS library to manage audio recording : intelligent cutting with regular padding, saturation control, various export options,...
  • RecordWizard: MediaWiki extension allowing mass recording of clean, well cut, well named pronunciation files.
  • QueryViz: MediaWiki extension adding a <query> tag to display sparql queries results inside wiki pages

LinguaLibre Bot is under review by Poslovitch but may gain from some more love. LinguaLibre Bot it's the most impactful yet underused piece of our sub-projects since it needs to be authorized per target language (ex: add audios to tamil wikipedia articles) and is only authorized for few languages & wiki :

  • Lingua-Libre-Bot: Mediawiki bot facilitating the resuse of Lingua Libre's audio records on many wikis, including wikipedias and wiktionaries.

Satelite linguistic project :

  • SignIt: LinguaLibre SignIt is a web-browser extension which translates a word in Sign Language, in order to learn sign language while reading online.

The end ! Thanks to all those who helped and are joining in :) Yug (talk) 13:11, 1 February 2021 (UTC)

LinguaLibre International call (France-India-others)

-- Done -- Please refer to LinguaLibre:Events#2021_International_call. Yug (talk) 11:46, 12 February 2021 (UTC)

Namaskara/Hello,
Earlier we noted that we started getting more participation from India (I am from India as well). In October last year when I had around 15,000 uploads, I then contacted Wikimedia France with the following idea, what I wrote in the email then:

I believe in India, as we have many languages and dialects, the tool is specially relevant. I wanted to have a discussion with the people working on the project, or can help with this idea.

This email was followed by a call with Adelaide and most possibly Lyokoi joined. Adelaide kindly invited me to attend another call, where I could briefly meet many of you.
Now, I know, there is more interest from India (different languages). You might have seen some work in Marathi very recently. Just two days ago I attended a brief India (Maharashtra) LinguaLibre meeting. I did not expect to see so many participants, but it looks like around 10 or so people are interested to record Marathi pronunciation.

So, is it possible to have a France-India call? It is absolutely OK for me to make it an "international call", so that everyone can join. Here we can have some of the people from India (any country) who can tell their plans, ask questions, get to know from you, or share experience. There might be ideas and questions related to setting up the project page etc. Around 5 or more people from India will be interested to join, I think.

PS: Most possibly there are LinguaLibre calls arranged. Adelaide kindly invited me to two such calls. Otherwise, I do not get to know about these calls. If these calls are open and anyone can join, possibly can we announce the call dates and time on this Project Chat, so that anyone interested/eligible can join?

Regards. --টিটো দত্ত (Titodutta) (কথা) 20:03, 1 February 2021 (UTC)

Hello Titodutta,
Nice to hear that news of a 10 people Lingualibre workshop in India's Marathi community. This is wonderful.
Adelaide is definitively the person to contact for institutional relationship and workshops. She is coordinating-piloting-animating this project for Wikimedia France and knows who is who, where the human resources are, what is our wish-list and next moves.
I'am interested by this call as well. Santosh, an Indian wikimedian contacted us as well (via email) for a similar need.
I will send you an email to group us all. Yug (talk) 09:41, 2 February 2021 (UTC)
Yes, it would be good to have documentation or project page creation process on this site. Other than Marathi, briefly, a few Kannada students from a south Indian college started working on Lingua Libre (you may see an event page m:Alva's Wikipedia Student's Association/Events/Lingua Libre training session). Similarly, you might have seen some involvement from the Punjabi community where User:Nitesh Gill and a couple of other Punjabi community members are working.
Other than Indian languages, there is a good response from Japanese, Ukrainian, and a few other languages. From all these the thought of the "International call" came to my mind.
Other than small projects we can also think of "small events" in the future, such as a LinguaLibre-a-thon or Libre-a-thon (similar to edit-a-thon, for example on World Environment Day we can get together and records pronunciation which is related to the environment, and not on Commons. This can be a small event where we record our own respective languages). Regards. --টিটো দত্ত (Titodutta) (কথা) 23:51, 2 February 2021 (UTC)
@Titodutta and सुबोध कुलकर्णी: From what I see now with France and India, It seems the best seeds are with already very active wikipedians with interest in languages.
We also have a group of successful seeding due to already active wikimedian who have some institutional roles (Lyokoi, WikiLucas, Titudutta, सुबोध कुलकर्णी_Subodh). Basically, according to data, one out of 10 speaker who tried Lingualibre really stick in. So you need someone really active and outreaching, training 20, 30 people to initiate a local community.
Note: I create LinguaLibre:Events#2021_International_call, please fill in informations as needed. Yug (talk) 16:07, 8 February 2021 (UTC)

Marathi language stats

This section should be moved to LinguaLibre:Technical board.
-- Done Yug (talk) 16:09, 8 February 2021 (UTC)

Mar records @2600 on Commons:Category:Lingua_Libre_pronunciation-mar Wikimedia Commons, but it is not reflected in LL stats - records per lang. It is just 163. Could anyone please look into and resolve? सुबोध कुलकर्णी (talk) 05:08, 3 February 2021 (UTC)

Hi सुबोध कुलकर्णी. Lingua Libre suffers a bug since the end of 2020. New developers are looking to this issue. Let us hope it will be fixed in the coming weeks. Pamputt (talk) 07:05, 3 February 2021 (UTC)
@सुबोध कुलकर्णी and सुबोध कुलकर्णी: it's fixed ! Data are back online thanks to the devs hired by Wikimedia France and Adelaide. You can also use the {{User records-mar}} template on userpages to tag speakers/uploaders. Yug (talk) 14:17, 11 February 2021 (UTC)

Stats : toward records and beyond...

Folks, given the stats page is broken [paid devs will fix it in coming weeks thanks to Wikimedia France !], I jumped with some regex to do the maths:

We will likely reach 400,000 this very months. This feast is wildly due to the recent rise of Indic languages. We must also notice that most languages only have from 3 to 50 words, people trying out. Best results are achieved if we get users commit a bit, then things truly take off. Other thing, our 7 most active users provided 200,000 of our audios. 20 users contributed more than 3000 audios, and 20 others between 3000 and 1000 audios, so about 10% of speakers really hit it off. Quite interesting ! In my opinion we still have bottle necks on :

  • reaching out to diverse & minority languages ;
  • getting contributors to contribute consistently ;
  • and creating words lists for our users.

Inventing and exploring new methods for each of these bottlenecks is always welcome. Recent success with Marathi (Commons:Category:Lingua Libre pronunciation-mar‎: 15 C, 3,011 F) is a great example of reaching outside our usual pool, we surely may learn from this initiative.
I will hide in the code below the per-language stats as in tsv format, in case you want to check those. Yug (talk) 21:34, 6 February 2021 (UTC)

10% of speakers commiting to 1000 recording or more is very interesting. So it suggest that, if we give a workshop to 10 people, one committed speaker will emerge. Thereby kick starting this language. Yug (talk) 01:20, 8 February 2021 (UTC)