|
|
(175 intermediate revisions by 35 users not shown) |
Line 6: |
Line 6: |
| <!-- **** DO NOT EDIT CONTENT ABOVE **** --> | | <!-- **** DO NOT EDIT CONTENT ABOVE **** --> |
| | | |
− | == Datasets out of date ==
| |
− | Hello. It seems that the datasets page, although it claims to run every 2 days, is completely out of date: all the available zips are from April 2020 or November 2019 (and the full zip from May 2019). Is this a known problem? Is there a plan to address it? [[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baley|talk]]) 23:17, 27 August 2020 (UTC)
| |
− | :Indeed, it seems to have an issue with the dataset updating. I opened a [[phab:T261519|Phabricator ticket]] about this issue. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 18:24, 28 August 2020 (UTC)
| |
| | | |
− | == Publish on Wikimedia Commons == | + | == Is the Record Wizard not working for anyone else? == |
| | | |
− | Hello, I just tested, but my records are not published on Commons. My tests: on Firefox, then on Chrome, with 50, then with 1 expression (s), with license CC3.0-BY-SA and CC1.0. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 06:51, 2 May 2021 (UTC)[[File:LiLi April 2021 - Publish on Wikimedia Commons.png|thumb|Problème de publication sur Wikimedia Commons]]
| + | My mic works with [https://mictests.com/ mictests.com], but [https://lingualibre.org/wiki/Special:RecordWizard the RecordWizard] doesn't pick anything up at the "check your microphone" stage. I've tried on both my phone and my laptop, and I can record sound in both cases, and I have the appropriate permissions enabled, but this particular website isn't detecting sounds. Is anyone else having this kind of problem? [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 23:43, 24 February 2024 (UTC) |
− | :[[phab:T281636]] —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 07:10, 2 May 2021 (UTC) | + | :Hello [[User:Grendelkhan]], |
− | :: Usually I have the same with the first two recordings in a session. Then I can upload them again at the end. Try again with more recordings, and using "retry filed upload" button. [[User:Poemat|Poemat]] ([[User talk:Poemat|talk]]) 08:07, 2 May 2021 (UTC)
| + | :I just received a second such report. User also checked [https://mictests.com/ mictests.com] sucessfully. |
− | ::: Yup, I had this bug many times. (I say "had" because I don't remember having encountered it after the fire incident.) Just don't give up and it should be published eventually. [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 11:56, 2 May 2021 (UTC) | + | :On Firefox, Lingua Libre recording studio step 4, the microphone is allowed (we see the red microphone image on the left of the URL address). But after clicking the record button, no recording occurs. |
− | ::::(As of 3 May 2021 and as I checked, I'm not aware of any code changes ([https://github.com/lingua-libre/RecordWizard/commits/master history]) which may have of affected this. Seb35 made some other code change this same day.) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:47, 3 May 2021 (UTC) | + | :* Mictests on other site : successful. |
− | I add a user who has the same problem: {{u|Le Commissaire}}. —[[User:Eihel-LiLi|Eihel-LiLi]] ([[User talk:Eihel-LiLi|talk]]) 15:33, 6 May 2021 (UTC)
| + | :*Device: Notebook |
− | :::::Bonjour {{ping|Seb35}}, Faudrait voir avec {{u|Le Commissaire}} si le problème persiste aussi (avant de clore le ticket Phab. Sincères salutations. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 10:01, 4 June 2021 (UTC) | + | :*OS: ? |
− | ::::::J’ai mis un message à Le Commissaire sur sa page de discussion. | + | :*Browser: Firefox, Chrome. |
− | ::::::Le problème que vous avez eu était spécifique à votre compte, c’est peut-être arrivé à d’autres personnes mais ça semble assez rare. Aussi, à partir du moment où un utilisateur a réussi à faire un envoi vers Commons, alors c’est un problème différent du vôtre ([[:phabricator:T275957|celui-ci, qui ressemble mais l’erreur est intermittente]]). Plus globalement, il faudrait que le message d’erreur soit explicite plutôt que d’aller à chercher dans la console du navigateur, je vais ouvrir un ticket Phabricator en ce sens. [[User:Seb35|Seb35]] ([[User talk:Seb35|talk]]) 10:28, 4 June 2021 (UTC) | + | :*User: [[User:Akamycoco]]. |
| + | :*Languages affected: all. |
| + | :*Dates : Worked on February 28. Stopped working on February 29. |
| + | :Let's starts an investigation. Could you let me know your OS and precise web browser version ? (Help > About Chrome or similar) |
| + | :Let me know as well if you have basic developer skills to Right-click on the staled page > Inspect > Console : are there any error message ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 07:55, 1 March 2024 (UTC) |
| | | |
− | == Exclusion lists ==
| + | ::My laptop is using Google Chrome <tt>122.0.6261.94 (Official Build) (64-bit)</tt> on Linux (Debian Testing). No error messages in the console when I attempt the recording. My phone is using Chrome <tt>122.0.6261.90</tt> on Android 14 on a Pixel 5a. It ''does'' seem to work on Firefox <tt>115.7.0esr (64-bit)</tt> on my laptop. (I really should have checked that before.) So maybe this is solely a Chrome problem? [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 16:30, 2 March 2024 (UTC) |
− | If anyone uses the regularly updated [[user:Olafbot|Olafbot's]] lists of wanted words ([[List:Fra/Lemmas-without-audio-sorted-by-number-of-wiktionaries]], etc.), and spotted an item that should be removed without recording, you can use the brand new exclusion lists to remove it. For example on the list [[List:Fra/Lemmas-without-audio-sorted-by-number-of-wiktionaries]] there was the word "abandonar", which apparently doesn't belong to the contemporary French corpus. Having added it on the exclusion list (here: [[user:Olafbot/exclusion list/Fra]]) the bot knows this item should never appear in French lists it maintains, and [https://lingualibre.org/index.php?title=List:Fra/Lemmas-without-audio-sorted-by-number-of-wiktionaries&diff=619214&oldid=606068 removes it] during the next update.
| |
| | | |
− | Each "Lemmas without audio" list ({{Olafbot-wikt}}) has a corresponding exclusion list ({{Olafbot-exclusion}}). I hope it will help.
| + | == Automatic categorization isn't documented. == |
| | | |
− | Normally I would add a link to the exclusion list in a description of each lemmas list, but unfortunately, Lingua Libre engine doesn't allow adding any kind of comments or descriptions to lists, so this ad is the only way to spread a word about the new functionality. [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 09:54, 13 September 2021 (UTC)
| + | So far as I can tell, this isn't documented: if, for user Foo, category <tt>Lingua Libre pronunciation by Foo</tt> exists on Commons, then all uploads will be categorized into that category. This is helpful! It's also easy to backfill after the fact using [[:commons:Help:Gadget-Cat-a-lot]]. I'm not sure where to document this, but it seems reasonable to do so ''somewhere''. [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 16:26, 3 March 2024 (UTC) |
− | :{{ping|Olaf}} Thank you so much for this useful new function! Indeed, the Record Wizard does not yet understand comments, categories nor templates on List pages, but this will be considered for future updates. — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 18:48, 13 September 2021 (UTC)
| |
| | | |
− | == Ajout d'une nouvelle langue == | + | == Understanding lingua-libre == |
| | | |
− | Bonjour !
| + | Hi, I am creating this discussion to understand lingua-libre better |
| | | |
− | Je souhaite ajouter la langue Q3196953 mais en suivant la [https://lingualibre.org/wiki/Help:Add_a_new_language/fr procédure], je ne vois pas LinguaImporter. Quelqu'un peut-il me dire pourquoi?
| + | == Uploads are failing == |
| + | :''TLDR: Large amount of users reporting failure to upload at step 5 : [[User:Grendelkhan|Grendelkhan]], [[User:Culex|Culex]], [[User:XANA000|XANA000]], [[User:Ardzun|Ardzun]] (Indonesian languages), [[User:Penn Zero MSSJ|Penn Zero MSSJ]], [[User:Univòc64]] (Whistled Occitan) and [[User:Akamycoco]] (Taiwanese languages). This likely only tip of iceberg. Only few users were able to [https://lingualibre.org/index.php?hidebots=1&translations=filter&hidepageedits=1&hideWikibase=1&hidelog=1&namespace=0&limit=1000&days=14&enhanced=1&title=Special:RecentChanges&urlversion=2 record in May], with atypically low number of recordings. Indonesia workshop with ~15 participants critically affected. Investigation ongoing. [[User:Hugo en résidence|Hugo en résidence]] ([[User talk:Hugo en résidence|talk]]) 14:20, 13 May 2024 (UTC)'' |
| | | |
− | Cdt,
| + | I can record words, but uploading them to Commons fails. The JavaScript console has the following message: |
− | BamLifa
| |
− | : {{ping|BamLifa}} c'est parce que tu n'es pas administrateur. Je viens d'importer le {{Q|646152}} [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 17:16, 13 September 2021 (UTC)
| |
− | ::{{ping|Pamputt}}, merci beaucoup pour cette précision. Si cette option n'est réservée qu'aux admins, pourquoi en parler dans la doc sans cette précision ? En plus, vue la multitude des langues que nous avons qui n'existent pas encore chez Lingua libre, ne pensez-vous pas que vous devriez simplifier cette tâche ? J'ai encore une autre langue à ajouter, le Bira (bila). [[User:BamLifa|BamLifa]] ([[User talk:BamLifa|talk]]) 12:41, 20 September 2021 (UTC)
| |
− | :::{{ping|BamLifa}} c'est indiqué sur cette page (c'est même le titre de la section (Outil pour les administrateurs)). Je ne me rappelle pas pourquoi c'est réservé aux admins mais ça limite au moins les vandales qui voudraient importer des choses qui ne sont pas des langues. Bref, j'ai importé le {{Q|656403}} et le {{Q|656404}}. Si ce ne sont pas les bonnes langues, peux-tu me donner le code ISO 639-3 correspondant (ou au moins l'identifiant Wikidata) ? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 14:06, 20 September 2021 (UTC)
| |
− | ::::{{ping|Pamputt}}, Merci beaucoup. [[User:BamLifa|BamLifa]] ([[User talk:BamLifa|talk]]) 05:34, 22 September 2021 (UTC)
| |
| | | |
− | == Lists still don't work properly ==
| + | : <tt>'''Your IP address is in a range that has been [[m:Special:MyLanguage/Global blocks|blocked on all Wikimedia Foundation wikis]].''' The block was made by [[User:EPIC|EPIC]]. The reason given is ''[[m:Special:MyLanguage/NOP|Open proxy/Webhost]]: See the [[m:WM:OP/H|help page]] if you are affected''. * Start of block: 10:09, 1 May 2024 * Expiry of block: 10:09, 1 May 2027 Your current IP address is 2001:41d0:304:100::4790. The blocked range is 2001:41D0:0:0:0:0:0:0/33. Please include all above details in any queries you make. If you believe you were blocked by mistake, you can find additional information and instructions in the [[m:Special:MyLanguage/No open proxies|No open proxies]] global policy. Otherwise, to discuss the block please [[m:Steward requests/Global|post a request for review on Meta-Wiki]]. You could also send an email to the [[m:Special:MyLanguage/Stewards|stewards]] [[m:Special:MyLanguage/VRT|VRT]] queue at "stewards@wikimedia.org" including all above details.`, blockinfo: {…}, "*": "See https://commons.wikimedia.org/w/api.php for API usage. Subscribe to the mediawiki-api-announce mailing list at <https://lists.wikimedia.org/postorius/lists/mediawiki-api-announce.lists.wikimedia.org/> for notice of API deprecations and breaking changes." |
| | | |
− | {{Ping|WikiLucas00}} {{Ping|Poslovitch}} It's better than [[LinguaLibre:Chat_room#Lists_stopped_working|before]], but still, sometimes the Record Wizard hangs when a list is chosen.
| + | This is not my IP address shown in the error message, and whatismyip confirms that I'm not behind a proxy. The Global block request [https://meta.wikimedia.org/wiki/Steward_requests/Global/2024-w18#Global_block_for_Special:Contributions/2001:41D0:0:0:0:0:0:0/33 is here]. Is this affecting anyone else? I lost a heap of recordings. [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 22:26, 4 May 2024 (UTC) |
− | Then I have to reload the page, and try again. Usually the second or the third time of trying the same list, it starts to work.
| + | :Uploads are failing for me today too, even though I am recording with my account. [[User:Culex|Culex]] ([[User talk:Culex|talk]]) 15:04, 8 May 2024 (UTC) |
− | Probably a race condition. [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 09:47, 30 September 2021 (UTC)
| + | :: Idem--[[User:XANA000|XANA000]] ([[User talk:XANA000|talk]]) 16:49, 9 May 2024 (UTC) |
− | :{{ping|Olaf}}It also happens to me sometimes, but I think that it could be related to the button for removing words you already recorded. When you load a list of words you never recorded (typically Olafbot's lists), ticking the button seems to kill the loading. Best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 10:23, 30 September 2021 (UTC) | + | ::: I can record, but i couldn’t uploaded until today. I was able to upload once yesterday, but after that I couldn't upload any more. [[User:Ardzun|Ardzun]] ([[User talk:Ardzun|talk]]) 06:04, 11 May 2024 (UTC) |
− | :: Thank you. Indeed, with this switch unchecked everything seems to work. [[User:Olaf|Olaf]] ([[User talk:Olaf|talk]]) 16:02, 1 October 2021 (UTC) | + | :I guess I'm not the only one who's been trying for weeks but could not publish audio after 1 May. Hope someone can fix it. [[User:Penn Zero MSSJ|Penn Zero MSSJ]] ([[User talk:Penn Zero MSSJ|talk]]) 20:54, 13 May 2024 (UTC) |
− | | + | ::[[User:Univòc64]] (Whistled occitan) and [[User:Akamycoco]] (Taiwanese languages) also reported issues. |
− | == Liste des mots à prononcer ==
| + | ::It seems time to add a sitenotice warning. [[User:Hugo en résidence|Hugo en résidence]] ([[User talk:Hugo en résidence|talk]]) 14:07, 13 May 2024 (UTC) |
− | | + | ::In may we have mostly : 556 recordings by 7 users on May 1th, 174 recordings on May 11th ([[Special:Contributions/Austin Zhang|Austin Zhang]]), then nothing. |
− | Salut ! Existe-t-il une page où des mots peuvent être ajoutés pour qu'un bon samaritain puisse parler ? [[User:Vivaelcelta|Vivaelcelta]] ([[User talk:Vivaelcelta|talk]]) 11:30, 3 October 2021 (UTC)
| + | ::If we compare with [https://public-paws.wmcloud.org/User:Yug/QueryLingualibre-monthly.ipynb known monthly recordings], our average months recently was 30k audios, the lowest ones were 5k audios, May 2024 is heading toward 1200 audios or 5% of the average month and 20% of the lowest months. Something weird is going on indeed. |
− | :Bonjour {{u|Vivaelcelta}}, les listes sont faites pour cela. Vous pouvez [[Special:MyLanguage/Help:Create_your_own_lists|créer votre propre liste]] qui pourra ensuite être enregistrée par n'importe qui. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:50, 3 October 2021 (UTC) | + | {| class=wikitable |
− | :: Merci {{u|Pamputt}}. — [[User:Vivaelcelta|Vivaelcelta]] ([[User talk:Vivaelcelta|talk]]) 22:38, 3 October 2021 (UTC) | + | ! Most prolific speakers for the current month || Months since 2022 |
− | | + | |- |
− | == Projet Outils pour la patrouille ==
| + | | |
− | :''See [[LinguaLibre:Events/Patrol assistance tool prototyping project]].'' | + | <query _pagination="10" locutor="<translate><!--T:7--> Item (locutor Qid)</translate>" locutorLabel="<translate><!--T:8--> Speakers of the Month</translate>" nb="<translate><!--T:9--> Number of records</translate>"> |
− | {{LangSwitch
| + | SELECT ?locutor ?locutorLabel ?nb WHERE { |
− | |fr=Salut, | + | { |
− | | + | SELECT ?locutor (COUNT(?record) as ?nb) |
− | cette semaine commence un projet menés par des étudiants des formations IARF-RODECO de l’Université Toulouse 3 - Paul Sabatier concernant le prototypage d’outils de patrouille. Je suis, assisté par Adélaïde Calais, le superviseur de ce projet. Les étudiants sont en informatique avec une spécialisation en intelligence artificielle. L’idée est de leur faire prototyper (voire développer) des outils pour aider la patrouille de Lingua Libre en détectant automatiquement toutes sortes de problèmes. Nous avons déjà identifier quelques problèmes : clics, grésillements, bruits parasites et mauvaises prononciations (libellés et enregistrements pas raccord).
| + | WHERE { |
− | | + | ?record prop:P2 entity:Q2 . # Q2: record, P2: instance of. |
− | Et nous avons besoin de la communauté sur deux points :
| + | ?record prop:P5 ?locutor . # Property:P5: speaker |
− | # y a-t-il d’autres problèmes auxquels vous pensez ?
| + | ?record prop:P6 ?date . |
− | # nous avons besoin, pour que les étudiants puissent travailler, d’enregistrements avec défauts. Si vous les avez réenregistrés, c’est pas grave, Commons a un historique. N’hésitez pas à nous communiquer les enregistrements qui ont ou avaient des défauts !
| + | FILTER ( YEAR(?date) = YEAR(NOW()) && MONTH(?date) = MONTH(NOW()) ) |
− | | + | } |
− | Enfin, j’ai créé une page de projet accessible [[Special:MyLanguage/LinguaLibre:Events/Patrol_assistance_tool_prototyping_project|ici]] (page traduite).
| + | GROUP BY ?locutor ?locutorLabel |
− | | + | ORDER BY DESC(?nb) |
− | (Si certain·es peuvent traduire ce message en anglais, c’est super cool.)
| + | LIMIT 50 |
− | | + | } |
− | À+,
| + | SERVICE wikibase:label { |
− | |en=Hi,
| + | bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en" . |
− | | + | ?locutor rdfs:label ?locutorLabel . |
− | This week, a project lead by student of University Toulouse 3 - Paul Sabatier is starting. It will be about the prototyping of patrolling tools. I supervise this project, assisted by Adélaïde Calais. The students study computer science with a specialization in Artificial Intelligence. The aim is to have them prototyping (or even developing) tools to help Lingua Libre's patrol, by automatically detecting any kind of mistake/error related to the files. We already identified a few types of mistakes: clicks, crackles, pops and labelling issues (wrong label/wrong language).
| + | } |
− | | + | } |
− | We need the community on two points :
| + | ORDER BY DESC(?nb) |
− | # are there other problems you could think of?
| + | </query> |
− | # we need some recordings having issues, in order for the students to be able to work. If you already recorded them again, it is not a big deal, Commons has a file history. Don't hesitate to provide us the files that have or had problems.
| + | | |
− | | + | <pre> |
− | Lastly, I created a project page, available [[Special:MyLanguage/LinguaLibre:Events/Patrol_assistance_tool_prototyping_project|here]].
| + | { date:2022-01, records: 21290, speakers: 46, languages: 28 }, |
− | | + | { date:2022-02, records: 3894, speakers: 40, languages: 17 }, |
− | See you,}}
| + | { date:2022-03, records: 8357, speakers: 61, languages: 21 }, |
− | [[User:Lepticed7|Lepticed7]] ([[User talk:Lepticed7|talk]]) 09:19, 19 October 2021 (UTC)
| + | { date:2022-04, records: 5454, speakers: 34, languages: 18 }, |
− | :Hello [[User:Lepticed7|Lepticed7]], Translated page —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 19:49, 22 October 2021 (UTC) | + | { date:2022-05, records: 4702, speakers: 59, languages: 30 }, |
− | ::[[User:Lepticed7|Lepticed7]], [[User:Adélaïde Calais WMFr|Adélaïde]], could you specify the dates for this project ? | + | { date:2022-06, records: 7675, speakers: 41, languages: 18 }, |
− | ::Also, were your point 1 and two answered by the community somewhere ? (If not I could give it a try) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 13:19, 15 November 2021 (UTC)
| + | { date:2022-07, records: 4364, speakers: 37, languages: 22 }, |
− | ::: {{ping|Yug}} Hi, I updated the project page with the dates. And I didn’t get any answers to my questions. [[User:Lepticed7|Lepticed7]] ([[User talk:Lepticed7|talk]]) 11:25, 28 November 2021 (UTC) | + | { date:2022-08, records: 9544, speakers: 45, languages: 23 }, |
− | | + | { date:2022-09, records: 5802, speakers: 113, languages: 30 }, |
− | == Rashidun Caliphate ==
| + | { date:2022-10, records: 6931, speakers: 74, languages: 32 }, |
− | | + | { date:2022-11, records: 8461, speakers: 54, languages: 34 }, |
− | Hello {{ping|Zinou2go}},
| + | { date:2022-12, records: 11882, speakers: 54, languages: 23 }, |
− | [https://commons.wikimedia.org/wiki/File:LL-Q13955_(ara)-Zinou2go-الخلافة_الراشدة.wav LL-Q13955 (ara)-Zinou2go-الخلافة الراشدة.wav] is problematic (currently {{Q|Q204439}} on LiLi): it contains several cuts (clicks). I proposed the file for deletion in Commons. The recordings seem to be working better, could you record Rashidun Caliphate again? I didn't check the other records, but they are likely to have "clicks" as well. Also, can an admin delete this item on LiLi, please? Cordially. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 15:31, 12 November 2021 (UTC)
| + | { date:2023-01, records: 18150, speakers: 48, languages: 29 }, |
− | :{{ping|Eihel}} Please do not nominate files for deletion before asking for the speaker to record it again and waiting a while for their answer. Also, these recordings will come useful for the team currently working on the audio issues of Lingua Libre, so we'd better not delete them (I thought you read my messages on Discord about this). — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 15:48, 12 November 2021 (UTC) | + | { date:2023-02, records: 32441, speakers: 65, languages: 29 }, |
− | ::{{Ping|WikiLucas00}}, J'ai enlevé la suppression sur Commons. —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 15:54, 12 November 2021 (UTC) | + | { date:2023-03, records: 11527, speakers: 61, languages: 30 }, |
− | | + | { date:2023-04, records: 8451, speakers: 58, languages: 35 }, |
− | == Code of Conduct ==
| + | { date:2023-05, records: 21282, speakers: 97, languages: 49 }, |
− | Hi everyone, I just noticed again MediaWiki's [[:mw:Code of Conduct]] (2015) and Wikimedia Foundation's [[:foundation:Universal Code of Conduct]] (2021/02). Back in 2015, 0x010C included the first one as a condition to contribute to [https://github.com/lingua-libre/RecordWizard RecordWizard's codebase]. As far as I know, Lili.org and its community, so far, [https://lingualibre.org/index.php?search=Code+of+conduct has no Code of Conduct]. We may be ''implicitely'' binded by it or by some Wikimedia France's Code of Conduct, but it would be cleaner to ''explicitly'' adopt one and display it here, in written. We could therefor do the following :
| + | { date:2023-06, records: 17940, speakers: 56, languages: 35 }, |
− | # Short round to confirm with have nothing in place so far.
| + | { date:2023-07, records: 75825, speakers: 74, languages: 38 }, |
− | # Vote for 2 months to adopt the most recent [[:foundation:Universal Code of Conduct]] (2021/02)
| + | { date:2023-08, records: 32681, speakers: 54, languages: 30 }, |
− | # Copy the text into [[LinguaLibre:Universal Code of Conduct]].
| + | { date:2023-09, records: 28813, speakers: 114, languages: 30 }, |
− | [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:48, 14 November 2021 (UTC)
| + | { date:2023-10, records: 60317, speakers: 167, languages: 47 }, |
− | === Pre-discussion ===
| + | { date:2023-11, records: 49704, speakers: 140, languages: 55 }, |
− | Do we already have a Code of Conduct binding LinguaLibre ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:48, 14 November 2021 (UTC)
| + | { date:2023-12, records: 42383, speakers: 114, languages: 41 }, |
− | | + | { date:2024-01, records: 40572, speakers: 112, languages: 40 }, |
− | === Vote ===
| + | { date:2024-02, records: 22385, speakers: 197, languages: 57 }, |
− | ''Are you for or against adopting the [[:foundation:Universal Code of Conduct]] (2021) as a code of conduct for LinguaLibre's community ?''<br>
| + | { date:2024-03, records: 16997, speakers: 173, languages: 48 }, |
− | ''Possible votes : {{tl|support}} • {{tl|weak support}} • {{tl|weak oppose}} • {{tl|oppose}}''
| + | { date:2024-04, records: 8733, speakers: 117, languages: 42 }, |
− | * {{Support}} (proposer) — better to be explicit, have a framework in place, just to be clear to all on where we stand. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:48, 14 November 2021 (UTC)
| + | { date:2024-05, records: 556, speakers: 7, languages: 7 } |
− | | + | </pre> |
− | == Lingua Libre website should be more appealing to Language Learners ==
| |
− | :''See also [https://forvo.com Forvo.com].'' | |
− | It would be useful if LinguaLibre follows the example of Forvo to increase the number of language learners interested in the Project.
| |
− | | |
− | Forvo.com has a way of displaying the information that engage users and makes it very easy to find pronunciations.
| |
− | | |
− | For example, if someone wants to learn how to pronounce "Honoré de Balzac" in French, it would be faster to find the audio on Forvo than on LinguaLibre. Also, Forvo displays the data in a way more appealing to language learners:
| |
− | * https://forvo.com/search/Honoré_de_Balzac/
| |
− | * https://lingualibre.org/index.php?search=Honoré+de+Balzac
| |
− | '''Would it be possible to improve the way that data is displayed on LinguaLibre to make it more appealing to Language Learners ?'''
| |
− | ''In such way, the number of active users recording audios would increase significantly.'' -- [[User:Marreromarco|Marreromarco]]
| |
− | :Some people previously reported such "issue". There is a [[phab:T252319|ticket]] on Phabricator to keep this in mind. However, the priority is currently given to develop patrol tools for Lingua Libre and we do not expect to see major improvements related to the audio brosing in the coming months (at least if we have no more external developers). I think it is like this because Lingua Libre has been though so that it helps for recording, not for listening; the second is let to the other Wikimedia projects, mainly Wiktionaries et Wikidata. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:00, 14 November 2021 (UTC) | |
− | ::YES ! There are oral discussions and proposals in this direction, but LinguaLibre being a volunteers-based team, we are moving slowly. Forvo is a for-profit entity, it locks the copyright and resale of recordings made on its platform to the speaker-creator and to themselves, to then sell those recordings with a profit. They therefor have money and swift decision-making to sustain their UI/UX efforts. We are shorter on those sides. --[[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:30, 14 November 2021 (UTC)
| |
− | === Sound Library's forking and hacking ===
| |
− | '''On the [[LinguaLibre:Explore_the_sound_library|Sound Library]] side''', I was able to duplicate/fork it, which allows to start hack its CSS. Copy those codes into your own namespace :
| |
− | * [[User:Yug/common.js]] → [[Special:MyPage/common.js]]
| |
− | * [[User:Yug/MediaWiki:SoundLibrary.js]] → [[Special:MyPage/MediaWiki:SoundLibrary.js]]
| |
− | * [[User:Yug/LinguaLibre:Explore_the_sound_library]] → [[Special:MyPage/LinguaLibre:Explore_the_sound_library]]
| |
− | In those codes, you then have to replace all occurrences of "Yug" by your username, and it's should work. You can start hacking toward a more elegant interface. Note: the JS copy is in your *personal* JS and has a "stop" condition so the various JS instances won't fight. --[[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:30, 14 November 2021 (UTC)
| |
− | | |
− | == Allow recording only in the user's Native Language to avoid passing "mispronunciations" to Wiktionary ==
| |
− | | |
− | I started a discussion on the German Wiktionary because some words on LinguaLibre are not available on the DeWikt. The German Community told me that LinguaLibre adds words into Commons, but the Bot only accepts audios from “few” trusted users using a filter.
| |
− | | |
− | The English and German Wiktionaries use a bot called "DerbethBot" to add audios from Commons. However, the English Wiktionary community asked to block Lingua Libre's recordings because there were non-native speakers recording audios and the Bot had no way to differentiate them from Native speakers. After the audios were introduced in the English Wiktionary they had to forbid adding audios from LinguaLibre:
| |
− | | |
− | https://en.wiktionary.org/wiki/Wiktionary:Beer_parlour/2020/July#Labeling_non-native_audio
| |
− | | |
− | I believe that it is necessary to avoid giving “mispronunciations” to Wictionaries. That is similar to vandalism on a Wiktionary if the reader doesn't know that it is hearing a bad pronunciation and believes that it is “native speaker”:
| |
− | | |
− | ''Some suggestions:''
| |
− | 1) Would it be possible to name the audios files to specify if the speaker is a native or not? For example, if a French speaker records the word "maison" it could be named '''"maison-fr-native.ogg"''' . If a language learner records the same word : '''"maison-fr-learner.ogg"'''
| |
− | | |
− | 2) A radical way to address the issue would be to only allow to record in one's native language. Of course, users could change it, but strong warnings could be added and always remind people to record only their native language. Forvo seems to take this approach.
| |
− | | |
− | It might be valuable for Linguists to have recordings of non-native speakers to study their accent features in an L-2 Language. However, in my humble opinion the pronunciations added to Wiktionary should be only native speakers and bots should have a way to differentiate them.
| |
− | | |
− | Link to the German Wiktionary discussion about LinguaLibre:
| |
− | https://de.wiktionary.org/wiki/Wiktionary:Teestube#:~:text=von%20technischer%20seite%20gibt%20es%20keinem%20problem%2C%20zwei%20bots%20auf%20de.wiktionary%20arbeiten%20zu%20lassen.
| |
− | :Hi, this depends on the Wikitionary policy, and it could be different from a language to another one. Anyway, it is already possible to select only recordings done by native speaker. To do that, the speaker has to fill the {{P|16}} property ith the value {{Q|15}} (see for example {{Q|466}}). Other values for {{P|16}} are given [[Special:WhatLinksHere/Q5|here]]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:38, 16 November 2021 (UTC)
| |
− | | |
− | | |
− | == Sursilvan ==
| |
− | :{{done}}
| |
− | [[Special:Contributions/Franz.Roos.1955|User:Franz.Roos.1955]] made 2 recordings in [[:en:wp:Sursilvan]] : rauna ([[Q689785]]), tschitta ([[Q689786]]). Sursilvan has no iso code. Do we have a procedure for such languages ? (I forgot if the case already shown up). [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:37, 17 November 2021 (UTC)
| |
− | :There is not issue. It simply uses the Wikidata identifier when there is no ISO code. Se for example {{Q|1186}}. To record in such languages, we have to create an item for this language/dialect on Lingua Libre, and this is already done for {{Q|74905}}. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 21:59, 17 November 2021 (UTC)
| |
− | ::Thank Pamputt for the clarification. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 23:12, 17 November 2021 (UTC)
| |
− | | |
− | == [[commons:commons:structured data]] ==
| |
− | | |
− | I've been very pleased with LL's tooling, that does so much of the process of uploading to Commons, sensible naming, description-writing, and categorisation for me; however, I have an idea for an additional step LL could automate. This is in Commons' no-longer-so-new structured data section, which manifests (among other ways) as a tab on the file page.
| |
− | | |
− | As an example of what could be automatically added to a file's datastore, there is a property called 'audio transcription' which serves a similar role to Commons' TimedText subtitle functionality (silly example: [[commons:TimedText:051226-kakapo-billbooming.ogg.en.srt]]) but for shorter clips -- in other words, seemingly designed with applications like LinguaLibre in mind.
| |
− | | |
− | Since these are of the so-called 'monolingual text' datatype, the source language can be specified (or where not part of the main set of languages Wikimedia uses, the special code 'mis' is used and 'language of work or name' used as a qualifier) at the same time as the actual text that is being spoken, which LL has access to since the audio file started out as a text prompt!
| |
− | | |
− | What think y'all? [[User:Arlo Barnes|Arlo Barnes]] ([[User talk:Arlo Barnes|talk]]) 04:25, 19 November 2021 (UTC)
| |
− | :Hi {{u|Arlo Barnes}} there is [[phab:T239272|Phabricator ticket]] about this topic. Currently there are not yet all properties on Wikidata to fit all Lingua Libre properties. For example, I [[d:Wikidata:Property proposal/language level|proposed to create]] a property for the language level of a speaker but it did not get enough support. SO I guess, we should first list all properties we would like to add on SDC. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:18, 19 November 2021 (UTC)
| |
− | | |
− | == [Feature Request] Play next sound automatically while checking recordings ==
| |
− | | |
− | After recording sounds it is important to check them to verify their quality. However, it is very tiring to record 380 words and afterwards have to click 380 times on the ''“Next button”'' while checking them.
| |
− | | |
− | '''After recording, would it be possible to add a button to "Play next sound automatically" ?''' [https://i.imgur.com/XwC34pj.png Screenshot Here] [[User:Marreromarco|Marreromarco]] ([[User talk:Marreromarco|talk]]) 04:09, 20 November 2021 (UTC)
| |
− | :Agreed, it is already [[phab:T218372|tracked on Phabricator]]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 09:45, 20 November 2021 (UTC)
| |
− | | |
− | == "How to use Lingua Libre for your language learning" ==
| |
− | | |
− | I recently found a "new" way to benefit from the sounds on Lingua Libre. I would suggest that it could be advertised on the Lingua Libre main website and on the Wikipedia in French/English:
| |
− | * [[:en:wp:GoldenDict|GoldenDict]] is a FOSS Dictionary application very valuable for language learners.
| |
− | | |
− | A way to benefit from Lingua Libre recordings is to download the datasets, unzip them and "load" the sounds on GoldenDict (as Sound Directories. [https://i.imgur.com/9avJDgS.png Screenshot here]). In such a way, users have easily an offline "Pronunciation Dictionary". It is very easy to do. Here is an [https://i.imgur.com/axRHruk.png screenshot] of how it looks to GoldenDict the French word "fuir". Another example [https://i.imgur.com/Rq0nQCt.png here].
| |
− | | |
− | Lingua Libre sounds can be used with GoldenDict OFFLINE. That is a huge advantage in developing countries, where language learners often do not have reliable internet connection.
| |
− | | |
− | ''It would be valuable to create a description on the Lingua Libre website about'' '''"How to use Lingua Libre sounds for your language learning"''' .
| |
− | | |
− | There it would be possible to describe how to use the audios offline with GoldenDict, etc. If more methods are developed (Anki add-on), better GUI, Android App, etc. they could be explained there.--[[User:Marreromarco|Marreromarco]] ([[User talk:Marreromarco|talk]]) 04:41, 20 November 2021 (UTC)
| |
− | :1) '''Reuse of datasets :''' Yes ! Dataset download and reuse must be showcasted and strengthened. I think a "Reuses gallery" page could be created, with screenshot and minimal how-to for GoldenDict, Anki and others.
| |
− | :2) '''Anki:''' You are the 4th or 5th contributor to rise the need for an Anki add-on. We need to do something on this side, yes. It's more than 1~2 days work and too big for a volunteer work, so we need to apply for a grant. I'am looking in and mapping our options at the moment ({{tl|Grants table}}). At one point we have to jump in and design a project, yes.
| |
− | :3) For '''e-learning app''', a 5k€ project was designed by myself a year ago. The funding by local regional government was declined, but it could easily be refreshed.
| |
− | :We have to redesign some projects and apply in early 2022. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:28, 23 November 2021 (UTC)
| |
− | ::The core question is the Human Resources.
| |
− | ::'''*Daily routines*''' keeps WikiLucas, Pamputt, Poslovitch and myself –aka the community-side contributors— busy maintaining the place, welcoming and guiding new users, cleaning pages, etc. We are now quite smooth, successful and stable on this side.
| |
− | ::To '''*push forward*''' on developments, UI, tools, e-learning, communication, grants, we each have one or two side projects in mind, pushing those <u>''slowly''</u>. But as always in FOSS projects the task ahead is much larger and we could achieve much more with more human resources.
| |
− | ::'''Overall''', it's possible we are at a new turning right now. As things are stable, with road maps available, '''we just need 1 to 3 new coordinators and communicants contributors to tip the dynamic into forward-offensive mode''', with communication therefor new arrivals, new speakers, new devs, new coordinators and really push forward with new events/workshop, funds and SMART features.
| |
− | ::@[[User:Marreromarco|Marreromarco]], I'am currently writing down structuring "community how to" to ease new contributor's jumping in (see [[LinguaLibre:Roles]], [[LinguaLibre:Workshops]], {{tl|Grants table}}). You are doing a nice push on communication (It's FOSS) and with your questions you are mapping out Lili's needs. Pamputt and WikiLucas are following our progresses. All this is pretty interesting. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:48, 23 November 2021 (UTC)
| |
− | | |
− | :I would like to work on the "Public Relations" Department of LinguaLibre! - EDIT (28th Nov. 2021) : '''Any PR campaign would fail miserably if there is no search function.''' I explain the reasons at the end of this section: [[LinguaLibre:Events/Winter 2021-2022 Public Relations Campaign]]
| |
− | | |
− | [[User:Marreromarco|Marreromarco]] ([[User talk:Marreromarco|talk]]) 23:49, 23 November 2021 (UTC)
| |
− | ::Sound good :) Your outreach to YouTubers and popular FOSS blogs is spot on.
| |
− | ::I am back from a wikibreak, I am cleaning up some last pages, then since the maintenance side is stable I would like to focus my energy on projects design –recording rare languages, technology, PR campaign– and associated grant requests to secure funding and the actual realization of those visions. We can collaborate. You lead on the PR : design your campaign. I can review and help it to fit some Grants formats. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:00, 24 November 2021 (UTC)
| |
− | | |
− | I created a new wiki page in the "events" section of a "PR Campaign for 2022". Please visit [[LinguaLibre:Events/Winter 2021-2022 Public Relations Campaign]] and participate in the discussion with new ideas. EDIT (28th Nov. 2021) I will NOT contribute anymore to a PR campaign. the reasons are explained as comment on the relevant section [[User:Marreromarco|Marreromarco]] ([[User talk:Marreromarco|talk]]) 21:20, 25 November 2021 (UTC)
| |
− | | |
− | == Creating a LL catgory for a dialect ==
| |
− | | |
− | Would be grateful if someone could tell me if it's possible to create a LL category for a dialect?
| |
− | | |
− | We're working in Konkani, which has its own (but small) Wikipedia at http://gom.wikipedia.org Under Konkani, there are some dialects spoken, the pronunciation of one can be different from the other.
| |
− | | |
− | Would like to create a category for Saxtti (the Salcete dialect of Konkani). This will ensure that readings don't get overwritten by other dialects. Also, it would allow the recordings of many others which might have already been done in Konkani as a how.
| |
− | | |
− | Question: How do we create space for the dialects of a language?
| |
− | | |
− | Thanks very much, in advance! --[[User:Fredericknoronha|Fredericknoronha]] ([[User talk:Fredericknoronha|talk]]) 13:34, 27 November 2021 (UTC)
| |
− | :Hello {{ping|Fredericknoronha}} and welcome to Lingua Libre. I imported {{Q|700683}} (gom) as it was not on Lingua Libre yet. On Lingua Libre, dialects are treated the same way as languages. You can create an element for your dialect on Wikidata (example for [https://www.wikidata.org/wiki/Q35359 auvergnat dialect]) and tell us once it is ready, so that we can import it on Lingua Libre with an admin tool. You can also directly create an element for your dialect on Lingua Libre, following the steps described at [[Special:MyLanguage/Help:Add_a_new_language|Help:Add a new language]] and taking example of {{Q|1186}}. Don't hesitate to ping an admin if you have any questions.
| |
− | :All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 15:35, 27 November 2021 (UTC)
| |
− | ::''« there are some dialects spoken, the pronunciation of one can be different from the other. […] This will ensure that readings don't get overwritten by other dialects. »''
| |
− | ::If the writing are similar but only the pronunciation differs depending on where the speaker comes from, it looks like different accents.
| |
− | ::Recordings are specific to a word, a language and a speaker. Which means me recording in French the word "bonjour" will be one audio file on Lili. WikiLucas can record in French the same word "bonjour", it will create an other audio file on Lili. My recording(s), since i come from the South West, will carry the southern accent. Recordings by WikiLucas, who lives 700km East of me, will cary the Lyon area accent. Lingualibre will store 2 recordings, one per user. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 21:59, 27 November 2021 (UTC)
| |
− | : Hello {{u|Fredericknoronha}}, I have imported {{Q|701734}} so that you can now record words in that dialect. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 17:21, 28 November 2021 (UTC)
| |
− | | |
− | == Feedback about Lingua Libre by Professor Carol Genetti, PhD ==
| |
− | | |
− | '''Dear Members of Lingua Libre,
| |
− | '''
| |
− | I am pleased to share a message from Professor [https://en.wikipedia.org/wiki/Carol_Genetti Carol Genetti], a linguist and leading expert in endangered languages. Professor Genetti is author of one of the best books in the field of Linguistics called "How Languages Work". Her vast knowledge and experience are extremely valuable and after reviewing Lingua Libre she said:
| |
− | | |
− | ''Thank you for contacting me and letting me know about this initiative. It is an interesting idea. I especially like the multilingual menus -- very helpful.''
| |
− | | |
− | ''Are you aware of [https://www.endangeredlanguages.com/ this website], hosted by the University of Hawaii (and, I believe, funded by Google). So one thing that occurs to me is the proliferation of such sites. How will people in an endangered-language community find out about their options, and then make an informed choice about which of these online resources will be best over time for their communities? Should such efforts cross-reference each other?''
| |
− | | |
− | ''My second thought has to do with longevity. It takes a significant commitment to support a site like this over time. The challenge is having someone who can keep such sites funded, working, organized, relevant, and engaging users over time. How will you make sure that the data will be available in 10, 50, 150 years? Maybe you get that automatically by being associated with Wikipedia. If so, state that. Also, there should be a clear statement of how such data might be used, and by whom, so speakers know that if they record a wordlist, someone might use if for some purpose without their permission (is that right?).
| |
− | ''
| |
− | ''I'm sorry to have to bring a down-to-earth message to the inspiration and passion for endangered languages that has clearly fueled this work, but having seen other initiatives stumble in this way, I wanted to be sure that you are thinking about this. Speakers will be entrusting you with such valuable pieces of their lives and their cultures. How will you safeguard this over time? Let people know.
| |
− | ''
| |
− | ''Those issues aside, here are a couple of other comments:''
| |
− | | |
− | * There should be a statement targeted for speakers of endangered languages - why would they want to do this? What is the value for them and their communities? What will happen to the recordings? etc.''
| |
− | * Will you provide speakers with suggestions for what vocabulary to record, e.g. greetings, colors, verb forms?''
| |
− | * It would be helpful if it was clear from the large list of languages which ones have recordings. Maybe put those in a different color font?''
| |
− | * It would be helpful to include translations of the words into one of the world's major languages or the national language. Otherwise, someone's grandkids coming to this in 30 years will not know what the words mean.''
| |
− | * Do you want to move beyond single words to a piece of connected discourse, such as a short poem or story, a song, or the reading of some common text (such as a sentence from the UN Declaration for Linguistic Rights)?''
| |
− | * Should there be a means to flag inappropriate content?''
| |
− | | |
− | ''I hope that you find this helpful. And I'm so glad you liked my book! It is lovely to hear that people have found it helpful.''
| |
− | | |
− | ''Carol Genetti''
| |
− | ''Vice Provost for Graduate and Postdoctoral Programs''
| |
− | ''NYU Abu Dhabi''
| |
− | ''(she/her/hers)''
| |
− | | |
− | [[User:Marreromarco|Marreromarco]] ([[User talk:Marreromarco|talk]]) 09:23, 4 December 2021 (UTC)
| |
− | :Hey, this is some interesting feedback.
| |
− | :* "What will happen to the recordings?": Our homepage lacks such important information. We should plan a redesign for 2022 (inspired by the homepage of [https://commonvoice.mozilla.org/ Common Voice]?) so that we finally have a homepage that properly explains what Lingua Libre is and can do.
| |
− | :* "Suggestions of things to record?": This already exists. They're called Lists. We have some pending improvements on that matter (easier to find and contribute to, etc.)
| |
− | :* "Show which languages have recordings": The datasets page could help, but I guess it would be interesting to put that on an easy-to-find page (again, like [https://commonvoice.mozilla.org/fr/languages Common Voice's languages page]?)
| |
− | :* "Include translations of the words into one of the world's major languages or the national language": we only support "transcription" for now.
| |
− | :** How could we even "link" the recordings to translations? (Lexemes? Plain text?)
| |
− | :** Who would have to do that? (the locutor? a dedicated team of contributors?)
| |
− | :** Where would it be done? (in the RecordWizard?)
| |
− | :** -> That's an interesting thing to think about, but might be slightly out of scope right now
| |
− | :* "Sentences, stories, songs...?": Yes, indeed. The Record Wizard is already able to do that (with some config tweaks that have to be done by the locutor), but it would be great to streamline this further. Dedicated UI, ability to record an audiobook (or Wikipedia, Wikisource, Wikinews article) as a mixture of sentences that can be stored locally before being all merged together into one audio file sent to Commons, ability for multiple contributors to work on the same book/article... That's something we should also discuss with the [https://librivox.org/ Librivox] folks: they use Audacity so far, but they might be interested in a tool that's better suited to their needs.
| |
− | :* "flag inappropriate content?": My insight is focused on technical stuff. This sounds more like some editorial guidelines that would have to be debated by the community.
| |
− | :* "'''longevity'''?": Should Lingua Libre vanish tomorrow, the audio recordings are not lost. They're all stored on Wikimedia Commons, and that makes them as "immortal" as files stored on hard disks, SSDs, CDs or magnetic bands and mirrored half a dozen times around the world can be. However, I can't say much about our Wikibase, which, at the current time, '''is the only place where all the recordings and locutor-related metadata is stored'''. That's a serious single point of failure. There are no dumps and therefore no mirrorring. We'll definitely have to discuss it with Wikimedia France and the Tech Team.
| |
− | :Hopefully my answers are clear and comprehensible. I'm pleased to have received feedback from Pr. Genetti. Now it's our turn to take matters in our hands! --[[User:Poslovitch|Poslovitch]] ([[User talk:Poslovitch|talk]]) 13:13, 5 December 2021 (UTC)
| |
− | | |
− | == How to delete lists? ==
| |
− | :{{Done}}
| |
− | Hello, recently I completed some lists. Now everything is done and those lists are needless. Is there any possibility to delete lists? Greetings --[[User:Onkel Tomm|Onkel Tomm]] ([[User talk:Onkel Tomm|talk]]) 10:02, 10 December 2021 (UTC)
| |
− | :{{Ping|Onkel Tomm}} hello, admins can delete those lists. The lists you created are [https://lingualibre.org/index.php?target=Onkel+Tomm&namespace=142&tagfilter=&newOnly=1&start=&end=&limit=50&title=Special%3AContributions here]. Which ones should I delete ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:25, 10 December 2021 (UTC)
| |
− | ::Hello Yug, please delete all 8 lists, because they are all finally finished. Thanks. --[[User:Onkel Tomm|Onkel Tomm]] ([[User talk:Onkel Tomm|talk]]) 13:44, 10 December 2021 (UTC)
| |
− | :{{Ping|Onkel Tomm}} We are clean ! thank for asking, it keeps the place clean :) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:10, 10 December 2021 (UTC)
| |
− | | |
− | == Case study ==
| |
− | Hello all, I noticed a file upload which gather interesting use cases.
| |
− | | |
− | {| class="wikitable"
| |
− | ! Item || Label || Speaker || Account || Filename || Category
| |
| |- | | |- |
− | | [[Q709231]] ([https://lingualibre.org/index.php?title=Q709231&oldid=689510 arch.]) || "Ingenieur" || [[Q674858]] 'fleur' || User:Beat_Ruest || [[:File:LL-Q150_(fra)-fleur_(Beat_Ruest)-Ingenieur.wav]] || [[:commons:Category:Lingua Libre pronunciation by Beat Ruest]] | + | ! Daily recordings over April and May 2024 || |
| |- | | |- |
− | | — || Mispelling of "Ingénieur" || – || – || Carries the misspelling || Category page was not created, therefor virtually "lost" to Wikimedia Commons and [[:commons:Category:Lingua_Libre_pronunciation_by_user]]. | + | | |
− | |}
| + | <query _pagination="40"> |
− | | + | SELECT |
− | Questions:
| + | ?yearmonthday |
− | * Question 1: How do we handle mispelling ? I assume renaming ALL THREE of the [[Q709231]]'s label AND Property:P3 'recording' AND Wikimedia file [[:File:LL-Q150_(fra)-fleur_(Beat_Ruest)-Ingenieur.wav]] rename. Is that ok or will it break something ?
| + | (COUNT(DISTINCT ?record) AS ?records) |
− | * Question 2: Category should be automatically created. How do we go for this ? I assume a request on [[LinguaLibre:Bot]]
| + | (COUNT(DISTINCT ?speaker) AS ?speakers) |
− | * Question 3: What about the category by *speaker/voice* ([[Q709231]] 'fleur'), which curently doesn't exist, and which can have multiple speakers with the same name 'fleur' ?
| + | (COUNT(DISTINCT ?language) AS ?languages) |
− | [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:39, 10 December 2021 (UTC)
| |
− | : Question 1: it is a good start. I guess, we need to fix it both on Lingua Libre and on Wikimedia Commons
| |
− | : Question 2: you speak about categories on Wikimedia Commons? If so, I guess a bot can do it (Lingua Libre Bot or another one).
| |
− | : Question 3: actually the speaker is identified as "fleur (Beat Ruest)". Only one locutor of Beat Ruest can use the nickname "fleur".
| |
− | : [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 11:23, 20 December 2021 (UTC)
| |
− | ::Q1, Q2 agree.
| |
− | ::Q3 : {{ping|Pamputt}} check the categories on [[:commons:File:LL-Q150_(fra)-fleur_(Beat_Ruest)-Ingenieur.wav]]. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:56, 20 December 2021 (UTC)
| |
− | :::{{ping|Yug}} you mean the problem is [[:c:File:LL-Q150_(fra)-fleur_(Beat_Ruest)-Ingenieur.wav]] is categorized in "Category:Lingua Libre pronunciation by Beat Ruest" and not in "Category:Lingua Libre pronunciation by fleur (Beat Ruest)" or similar name? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:57, 5 January 2022 (UTC)
| |
− | ::::Yes, we dont have categorization by '''speaker''' "Fleur (Beat Ruest)". Low importance, but could be a feature request. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:01, 5 January 2022 (UTC)
| |
− | | |
− | == Gestion de doublons ==
| |
− | :''See also [[Help:Homographs]]'' (new, needs review!)
| |
− | | |
− | Bonsoir !
| |
− | | |
− | Il y a-t-il une gestion de doublons dans LL pour les mots d'une même langue ? [[User:BamLifa|BamLifa]] ([[User talk:BamLifa|talk]]) 13:45, 18 December 2021 (UTC)
| |
− | :Bonjour [[User:BamLifa|BamLifa]], si un même locuteur enregistre le même mot alors l'enregistrement précédent sera écrasé (un même locuteur ne peut enregistrer qu'une seule fois le même mot). En revanche, rien n'empêche l'enregistrement d'un même mot par plusieurs locuteurs et locutrices différentes, c'est même un des objectifs de Lingua Libre : mettre en lumière la diversité des prononciations. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 11:19, 20 December 2021 (UTC)
| |
− | ::@[[User:Pamputt|Pamputt]] : Comment sont alors gérés les homographes non homophones ? ^^ [[User:Totodu74|Totodu74]] ([[User talk:Totodu74|talk]]) 00:03, 5 January 2022 (UTC)
| |
− | | |
− | :::Bonjour [[User:Totodu74|Totodu74]], il est possible d'ajouter des indications entre parenthèses (cette information est stockée à l'aide de {{P|18}}). Voir par exemple {{Q|1685}} et {{Q|1686}}. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:55, 5 January 2022 (UTC)<br>
| |
− | | |
− | :::@[[User:Totodu74|Totodu74]], salut, la question des homographes est en partie résolue dans nos langues africaines qui sont essentiellement des langues à tons. --[[User:Rçag|Rçag]] ([[User talk:Rçag|talk]]) 11:18, 9 January 2022 (UTC)
| |
− | :Rçag, could you explain your solution a bit so we learn from it.
| |
− | :{{Ping|BamLifa|Rçag|Pamputt|Totodu74}} the page [[Help:Homographs]] is there to gather best practices. It's new, review and edits welcome. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:05, 12 January 2022 (UTC)
| |
− | | |
− | == Comment changer de pseudonyme ==
| |
− | | |
− | Bonjour, sur les projets de Wikimedia, mon pseudonyme est Manjiro91 (et anciennement GamissimoYT), comment change-t-on de pseudonyme ?
| |
− | [[User:GamissimoYT|GamissimoYT]] ([[User talk:GamissimoYT|talk]]) 17:13, 11 January 2022 (UTC)
| |
− | :Bonjour {{u|GamissimoYT}}. Lingua Libre utilise le même pseudo que celui qui est en utilisation sur Wikimedia Commons. Donc si vous voulez utiliser le pesudonyme Manjiro91, déconnectez-vous de Lingua Libre, puis de Wikimedia Commons. Ensuite, connectez vous à Commons avec le pseudo Manjiro91 et enfin reconnectez vous à Lingua Libre. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 21:05, 11 January 2022 (UTC)
| |
− | {{Notif|Pamputt}} Mon pseudonyme Wikimedia Commons est Manjiro91 (anciennement GamissimoYT mais le changement de pseudonyme ne s'effectue pas sur LiLi. [[User:GamissimoYT|GamissimoYT]] ([[User talk:GamissimoYT|talk]]) 13:38, 12 January 2022 (UTC)
| |
− | :{{ping|GamissimoYT}}, tu as bien fait les connexions/déconnexions dans l'ordre que j'ai indiqué ? Si tu es sûr que tu es connecté avec Manjiro91 sur Wikimedia Commons, alors tu peux essayer de te déconnecter de Lingua Libre et te reconnecter dans la foulée. Essayer de vider le cache du navigateur peut peut-être aidé aussi. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:37, 13 January 2022 (UTC)
| |
− | | |
− | == Merging of items about languages ==
| |
− | :''See also [[Help:SPARQL]] and [[Help:SPARQL for maintenance]].''
| |
− | Hi y'all,
| |
− | | |
− | For the record, I just merge a couple of items about the same language:
| |
− | * {{Q|52071}} in {{Q|73}}
| |
− | * {{Q|139228}} in {{Q|183}}
| |
− | * {{Q|170137}} in {{Q|359}}
| |
− | * {{Q|683869}} in {{Q|418}}
| |
− | * {{Q|646169}} in {{Q|6714}}
| |
− | * {{Q|570518}} in {{Q|52069}}
| |
− | * {{Q|538624}} in {{Q|84030}}
| |
− | * {{Q|646173}} in {{Q|390314}}
| |
− | * {{Q|646161}} in {{Q|502754}}
| |
− | * {{Q|570510}} in {{Q|489393}}
| |
− | | |
− | I detected them with this SPARQL query:
| |
− | | |
− | <syntaxhighlight lang="sparql">
| |
− | SELECT ?idWD (COUNT(?item) AS ?compte) (GROUP_CONCAT(?item) AS ?items) WHERE { | |
− | ?item prop:P2 entity:Q4 ; prop:P12 ?idWD .
| |
− | }
| |
− | GROUP BY ?idWD
| |
− | HAVING ( ?compte > 1 )
| |
− | </syntaxhighlight>
| |
− | | |
− | Ping {{ping|WikiLucas00}} it seems you are responsible for some of them...
| |
− | | |
− | Cheers, [[User:VIGNERON|VIGNERON]] ([[User talk:VIGNERON|talk]]) 09:29, 19 February 2022 (UTC)
| |
− | :Thanks VIGNERON for finding them and cleaning it. Now what to do with recording items that use the doublon language item (for example with [[Special:WhatLinksHere/Q52071|Duala]]). I think we must modify {{P|4}} for all recording items so that languages are not counted twice and also to clean up the database (there are also transcription problems for items listed in the Duala example). [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:16, 19 February 2022 (UTC)
| |
− | ::Thank you {{ping|VIGNERON}} for pointing these out. As you can see, most of them were not created manually but using the tool (the pages wheighted circa 4kB, with labels in many languages). It seems that the Lingua Importer tool has (or had?) a problem, but I could not reproduce it (trying to import languages that are already in LL wikibase).<br/> During last summer's hackathon we talked a bit about languages in our wikibase, but I can't remember why we need to have language elements in our Wikibase, and not just use the existing base of WikiData 🤔 — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 23:23, 19 February 2022 (UTC)
| |
− | | |
− | == MediaWiki customizations of LinguaLibre ==
| |
− | | |
− | Love the MediaWiki skin of LinguaLibre and I am curious of skin and customizations made. Who are the authors? (can not see credits) --[[User:Zblace|Zblace]] ([[User talk:Zblace|talk]]) 10:15, 19 February 2022 (UTC)
| |
− | :The skin is known as BlueLL. The source code is available on [https://github.com/lingua-libre/BlueLL github]. It has been developed by Wikimedia France in 2020. That's said, it is true there is no licence and credits on Github. I will ask to {{u|Adélaïde Calais WMFr}} if she remember anything so that I can the missing informations. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:58, 19 February 2022 (UTC)
| |
− | ::Hi {{ping|Zblace}}, this skin's author is [[User:0x010C]], and its opensource. Can be reused freely. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 22:45, 22 May 2022 (UTC)
| |
− | | |
− | == New property: translation ==
| |
− | Hello, I've created {{P|38}} to be used in case there is no writing in the recording language but instead a translation in the vehicular language. See for example what I did [https://lingualibre.org/index.php?title=Q212431&type=revision&diff=743039&oldid=191330 here] and [https://lingualibre.org/index.php?title=Q58994&type=revision&diff=743044&oldid=580313 there]. Do you agree with that? Any comment? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:33, 19 February 2022 (UTC)
| |
− | :It's a good idea! Many users tend to add a translation as they find it important for other people to have. It will also be handy for cases like your second example, where we only have the translation but not the transcription of the source language: we will be able to query the base to see all audios of a language that have a translation. — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 23:28, 19 February 2022 (UTC)
| |
− | ::I am thinking about a way to populate automatically this property via the Record Wizard. Currently, it seems that the Record Wizard populates {{P|18}} when something is written between brackets (see {{Q|1685}} for example but I have not checked recently). So, if we modify the Record Wizard code, it is possible to recognize this is a translation in another language and so to populate {{P|38}}. But I would like to be sure to propose the best way to do it before asking for such development. The idea is to be managed automatically (or at least not completely manually). [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 00:18, 20 February 2022 (UTC)
| |
− | | |
− | == Lingua Libre Wishlist for 2022-2023 ==
| |
− | | |
− | Hi everyone !
| |
− | <br/>This week, Wikimedia France is preparing its budget for the fiscal year to come : July 2022 to June 2023. If there are things you would like to see done or to do with our help on Lingua Libre, please share it on this page : https://lingualibre.org/wiki/LinguaLibre:2022-2023_projection
| |
− | <br/>Have a great week-end ! --[[User:Adélaïde Calais WMFr|Adélaïde Calais WMFr]] ([[User talk:Adélaïde Calais WMFr|talk]]) 17:23, 11 March 2022 (UTC)
| |
− | : {{u|marreromarco}} Thank you for your suggestions. However, I have some reservations about "Add function to "Request" a Pronunciation to Native Speakers" at this current stage for two reasons. First, this will require quite a bit of moderation to correct requests for grammar and spelling (e.g. HASBAND) as well as remove terrible requests. This will place a large burden on a few users and can easily lead to questionable decisions by moderators. Second, Forvo is flooded with requests that are either overly specific (e.g. "He came back from abyss and won the tie.") and, therefore, likely benefit only one user. IMHO, Rdrg109 proposal to focus on providing pronunciations for entries on the various wiktionaries is a better approach to building up the LL at this point. It will provide a solid foundation for users to find any word in LL. It might be a better time to open up LL to general requests once this project is completed and the community has grown. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 15:49, 21 May 2022 (UTC)
| |
− | | |
− | == How to get the city country label in SPARQL ==
| |
− | :''See also [[Help:SPARQL]].''
| |
− | I'm working on an Anki extension for LL, but I'm having a little trouble writing the sparql query. In short, I want to be able to get the city and country for a recording in LL. However, when I query P14, I get the link to the item instead of 'residence': {'type': 'literal', 'value': 'Q142'} or 'residence': {'type': 'literal', 'value': 'Q142'}. Instead I hope to get city:"" and country "France" for the first query city:"Paris" and country:"France" for the second one. Any ideas? [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 20:23, 19 May 2022 (UTC)
| |
− | :Hi {{u|Languageseeker}} thanks for your work on a Anki extension. Could you post here the query you have now? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:58, 20 May 2022 (UTC)
| |
− | ::Hi {{u|Pamputt}} . The query that I'm using is a very lightly modified version of the bot query.
| |
− | | |
− | :: <syntaxhighlight lang="sparql">ENDPOINT = "https://lingualibre.org/bigdata/namespace/wdq/sparql"
| |
− | API = "https://lingualibre.org/api.php"
| |
− | BASEQUERY = """
| |
− | SELECT DISTINCT
| |
− | ?record ?file ?transcription ?recorded
| |
− | ?languageIso ?languageQid ?languageWMCode
| |
− | ?residence ?learningPlace ?languageLevel
| |
− | ?speaker ?linkeduser
| |
| WHERE { | | WHERE { |
− | ?record prop:P2 entity:Q2 . | + | ?record prop:P5 ?speaker . |
− | ?record prop:P3 ?file .
| |
| ?record prop:P4 ?language . | | ?record prop:P4 ?language . |
− | ?record prop:P5 ?speaker . | + | ?record prop:P6 ?date . |
− | ?record prop:P6 ?recorded . | + | BIND( SUBSTR(str(?date), 0, 11) as ?yearmonthday ) |
− | ?record prop:P7 ?transcription .
| + | { SELECT ?record |
− | ?language prop:P13 ?languageIso.
| + | WHERE { |
− | ?speakerLanguagesStatement llq:P16 ?languageLevel .
| + | ?record prop:P2 entity:Q2 . |
− | ?speaker prop:P11 ?linkeduser .
| + | ?record prop:P6 ?date . |
− | ?speaker prop:P14 ?residence .
| + | FILTER(?date >= "2024-04-01T00:00:00Z"^^xsd:dateTime) |
− | ?speaker llp:P4 ?speakerLanguagesStatement .
| + | FILTER(?date < "2024-05-30T00:00:00Z"^^xsd:dateTime) |
− | ?speakerLanguagesStatement llv:P4 ?speakerLanguages .
| + | } |
− | OPTIONAL { ?speakerLanguagesStatement llq:P16 ?languageLevel . }
| |
− | FILTER( ?speakerLanguages = ?language) .
| |
− | SERVICE wikibase:label {
| |
− | bd:serviceParam wikibase:language "en" . | |
| } | | } |
− | #filters
| + | } |
− | }"""</syntaxhighlight>
| + | GROUP BY ?yearmonthday |
| + | ORDER BY (?yearmonthday) |
| + | </query> |
| + | | <= stops on 2024.05.01<br>Note: [[Special:Contributions/Austin Zhang|Austin Zhang]] recorded 174 audios on 05.11 |
| + | |} |
| + | [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:39, 14 May 2024 (UTC) |
| | | |
− | :: Currently, I'm running it with filters = "" because it seems that a query for a single term takes around 70s, while fetching a single transcription takes about 145 seconds. My plan is to group the results by transcription and then write that into a json file to avoid the costly query. Basically, I need the speaker name, the term, their country, their city, the ISO code of the language, date created, and the filename, languageLevel. | + | === Fixed === |
| + | Both IP ranges 2001:41D0:0:0:0:0:0:0/32 and 2001:41D0:0:0:0:0:0:0/33 were subject to global Wikimedia block at one point (see [https://meta.wikimedia.org/w/index.php?title=Steward_requests/Global&oldid=26774369#Unregistered_users_only_block_for_the_range_2001:41D0:0:0:0:0:0:0/32 Global ban range_2001:41D0:0:0:0:0:0:0/32]). Following our request, the ban have been reconfigured and uploads from LinguaLibre are possible again. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:38, 14 May 2024 (UTC) |
| + | :I can record and upload since yesterday with my account, so that seems fixed. But it seems the stats are still not updated. [[User:Culex|Culex]] ([[User talk:Culex|talk]]) 12:08, 15 May 2024 (UTC) |
| | | |
− | :: For example, for the term un chien, the json would look like:
| + | === Logs === |
− | :: { "term": {"un chien": {"speaker": "Julien Baley", "language": "fra", "city": "", "country": "France", "recorded": "2020-11-27", "filename": "LL-Q150_(fra)-Julien_Baley-un_chien.wav", "languageLevel": "Q15"}}} [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 23:17, 20 May 2022 (UTC) | + | For references, I investigated the relevant block logs and uploads logs for May 2024.<br>Conclusion: the uploads collapse is coherent with the IP Ban. Still, given bug reports from Akamycoco in *March* and 咽頭べさ [[:c:File:Lingua_Libre_error_2024.webm|on step 4]], I suspects other bugs are lingering around. |
| + | {| class=wikitable |
| + | !width=50%| Global IP bans |
| + | ! Lingualibre uploads logs |
| + | |- |
| + | | |
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F32&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 18:46, 13 May 2024] EPIC talk contribs changed global block settings for 2001:41d0::/32 talk with an expiration time of 00:51, 10 May 2026 (anonymous users only) (No open proxies <!-- SCLT ID: Possible VPN or Colocation -->) |
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F32&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 00:51, 10 May 2024] AmandaNP talk contribs globally blocked 2001:41d0::/32 talk with an expiration time of 00:51, 10 May 2026 (No open proxies <!-- SCLT ID: Possible VPN or Colocation -->) |
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F33&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 17:02, 9 May 2024] EPIC talk contribs changed global block settings for 2001:41d0::/33 talk with an expiration time of 17:09, 1 May 2027 (anonymous users only) (Open proxy/Webhost: See the help page if you are affected) |
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F33&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 17:09, 1 May 2024] EPIC talk contribs blocked 2001:41d0::/33 talk with an expiration time of 2 years, 364 days, 12 hours, 21 minutes and 36 seconds (anonymous users only, account creation disabled) (Open proxy/Webhost: See the help page if you are affected) |
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F33&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 17:09, 1 May 2024] EPIC talk contribs globally blocked 2001:41d0::/33 talk with an expiration time of 17:09, 1 May 2027 (Open proxy/Webhost: See the help page if you are affected) |
| + | | |
| + | * : [https://commons.wikimedia.org/wiki/Special:RecentChanges?hidebots=1&translations=filter&hidecategorization=1&hideWikibase=1&tagfilter=OAuth+CID%3A+1735&limit=500&days=30&urlversion=2 Uploads via Lingualibre resumed]. |
| | | |
− | == Contribution: Python program to download all files created by a specific user ==
| + | 13 May 2024 |
− | :''See also [[Help:Download datasets]].''
| + | * [... Many more uploads] |
− | I wrote a [https://github.com/rkosov/Lingua-Libre-User-Audio-Downloader python program] that downloads all the files created by one user. For video files, it downloads the full webm. For audio files, the default is to download the wave file. However, for audio files, you can optionally choose either mp3 or ogg files. Currently, the configuration requires a minor modification of lluad.py. If there is strong demand, I will write a command line parser for it. Please report any bugs or errors on the github page. Feature requests are welcome. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 02:28, 20 May 2022 (UTC)
| + | * Upload log 23:39 Elwinlhq talk contribs uploaded File:LL-Q5218 (que)-Elwinlhq-apaqay.wav Tag: Lingua Libre [2.2] |
− | :{{Ping|Languageseeker}} please add your tool to [[Help:Download datasets]]. It lists several tools with different specifics, your tool is welcome and may help some Python users as well. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 22:41, 22 May 2022 (UTC) | + | * Upload log 19:05 Assassas77 talk contribs uploaded a new version of File:LL-Q9192 (cmn)-Assassas77-八角.wav Tag: Lingua Libre [2.2] |
− | | + | * Upload log 19:05 Assassas77 talk contribs uploaded File:LL-Q9192 (cmn)-Assassas77-八角.wav Tag: Lingua Libre [2.2] |
− | == Garbage Values in prop:P14 ==
| + | * Upload log 16:38 Oh! Tea<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Oh!_Tea Commons > User:Oh!_Tea : « nothing » on Commons]</ref> talk contribs uploaded File:LL-Q36759-Austin Zhang-sih8 buh8 sah8 nah4.wav Tag: Lingua Libre [2.2] |
− | :''See also [[Help:SPARQL for maintenance]] and [[Help:SPARQL_for_maintenance#.E2.9C.85_Speakers_.E2.86.92_Undefined_place_of_residence]].'' | + | 11 May 2024 |
− | As part of my Anki project, I queried the entire LL database and I'm trying to parse the output of ?speaker prop:P14 ?residence. I've noticed that there are a number of garbage values in provided for P14, such as Q1, Q2, Q103962887, Q6099648, Strasbourg. There seem to be three cases.
| + | * Upload log 20:21 Oh! Tea talk contribs uploaded File:LL-Q36759-Austin Zhang-buah8.wav Tag: Lingua Libre [2.2] |
− | # Users wishing to enter an extremely vague place such as Earth or the Universe. These should be set to None.
| + | * []... +172 recording by User:Oh! Tea] |
− | # Users accidentally linking to a disambiguation page. These require correction.
| + | * Upload log 18:56 Oh! Tea talk contribs uploaded File:LL-Q36759-Austin Zhang-a2.wav Tag: Lingua Libre [2.2] |
− | # Users not even entering a Wikidata item which require manual correction.
| + | 10 May 2024 |
− | | + | * Upload log 06:08 CapitainAfrika<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:CapitainAfrika Commons > User:CapitainAfrika : « IP block exempt » on Commons]</ref> talk contribs uploaded File:LL-Q36217 (lin)-CapitainAfrika-Wiki na monɔkɔ mua bísó.wav Tag: Lingua Libre [2.2] |
− | To solve the root of the problem, I propose that P14 should be restricted to only Wikidata items that exist and have P17. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 21:22, 25 May 2022 (UTC)
| + | * Upload log 00:14 Ardzun<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Ardzun Commons > User:Ardzun : « nothing »]</ref> talk contribs uploaded File:LL-Q13324 (min)-Ardzun-mada.wav Tag: Lingua Libre [2.2] |
− | :{{Ping|Languageseeker}} it's a good find. If you still have that SPARQL query under hand please add it into [[Help:SPARQL for maintenance]]. Yes, it's something we should clean up i think. There may be some few case where the speaker dont want to share its location but in 95% of cases i think we can go ahead, correct or ask them to correct it. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 12:39, 26 May 2022 (UTC) | + | 9 May 2024 |
− | :I noticed that when creating a new speaker, place of learning is optional. Not cool. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 21:32, 27 May 2022 (UTC) | + | * Upload log 17:08 Àncilu<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Àncilu Commons > User:Àncilu : « Autopatroller » on Commons]</ref> talk contribs uploaded File:LL-Q652 (ita)-XANA000-orsù.wav Tag: Lingua Libre [2.2] |
− | :: {{ping|YUG}} For the life of me, I can't get the federated query to work, but I have a separate query to get the location and country labels from wikidata. These are the problematic ones. Note, that Q20 is on the list because Q20 "Norway" is missing P17 | + | * Upload log 17:05 Àncilu talk contribs uploaded File:LL-Q652 (ita)-XANA000-frac.wav Tag: Lingua Libre [2.2] |
− | | + | 5 May 2024 |
− | * ['MichaelSchoenitzer', None]
| + | * Upload log 21:15 Benoît Prieur<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Benoît_Prieur Commons > User:Benoît_Prieur : « Administrator » on Commons]</ref> talk contribs uploaded File:LL-Q8785 (hye)-Benoît Prieur-Artsakh.wav Tag: Lingua Libre [2.2] |
− | * ['D.Muralidharan', None] | + | 1 May 2024 |
− | * ['Kaderousse', None]
| + | * Upload log 16:09 Penn Zero MSSJ<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Penn_Zero_MSSJ Commons > User:Penn Zero MSSJ : « nothing » on Commons]</ref> talk contribs uploaded File:LL-Q9199 (vie)-Penn Zero MSSJ-hệ số.wav Tag: Lingua Libre [2.2] |
− | * ['Krokus', None] | + | * Upload log 16:09 Penn Zero MSSJ talk contribs uploaded File:LL-Q9199 (vie)-Penn Zero MSSJ-hỗn số.wav Tag: Lingua Libre [2.2] |
− | * ['विदुला टोकेकर', 'Q103962887']
| + | * Upload log 16:09 Penn Zero MSSJ talk contribs uploaded File:LL-Q9199 (vie)-Penn Zero MSSJ-hằng đẳng thức.wav Tag: Lingua Libre [2.2] |
− | * ['DoctorandusManhattan', 'Q2']
| + | * [... Many more uploads] |
− | * ['Justforoc', 'Q2'] | + | |- |
− | * ['Student16 de', None]
| + | |colspan=2| <small><references /></small> |
− | * ['Didierwiki', 'Q6099648'] | + | |} |
− | * ['Sarah2149', None] | + | [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:38, 14 May 2024 (UTC) |
− | * ['DomesticFrog', 'Q1'] | |
− | * ['Drkanchi', None]
| |
− | * ['Satdeep Gill', None]
| |
− | * ['Iwan.Aucamp', 'Q20']
| |
− | * ['Skimel', 'Q2']
| |
− | * ['Abeɣzan', None]
| |
− | * ['Gibraltar Rocks', None]
| |
− | * ['Bomdapatrick', None]
| |
− | * ['Ibtissam RAHMOUNI', None]
| |
− | * ['Trabelsiismail', None]
| |
− | * ['Ziko', 'Q2']
| |
− | * ['Youcefelallali', None]
| |
− | * ['Foxxipeter7', None]
| |
− | * ['Celevra089', None]
| |
− | * ['Bodhisattwa', None]
| |
− | * ['Atudu', None]
| |
− | * ['KageyamaxNishinoya', 'Q30915818']
| |
− | * ['Darkdadaah', None]
| |
− | * ['JayashreeVI', None]
| |
− | * ['रश्मीमहेश', 'Q103962887']
| |
− | * ['गीता गोविंद नेने', 'Q103893785']
| |
− | * ['Awangba Mangang', None]
| |
− | * ['Abigaljo', None]
| |
− | * ['FaelDaug', 'Q29423162']
| |
− | [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 02:16, 30 May 2022 (UTC) | |
| | | |
− | == Anki Extension Release == | + | == Kinyarwanda language representation == |
| | | |
− | I just released [https://ankiweb.net/shared/info/124265771 Lingua Libre and Forvo Addon]. It has a number of advanced options to improve search results and can run either as a batch operation or on an individual note. | + | I'm Robert RUGAMBA from Rwanda and i belong to Wikimedia Rwanda as a volunteer and event organizer. |
| + | I'm exited to explore this platform of lingua libre and i wish my local languages to be add and represented. the wikidata rabel is: https://www.wikidata.org/wiki/Q33573 |
| | | |
− | By default, it first checks Lingua Libre and, if there are no results on Lingua Libre, it then checks Forvo. To run as a pure Lingua Libre extension, you will need to set "disable_Forvo" to <code>True</code> in your configuration section.
| + | Thanks. [[User:Annick green|Annick green]] |
| + | :{{Done}} This language was already on Lingualibre as [[Q285]]. If you open [[Special:RecordWizard]], at step 2, add it to your list of known languages. Please type in « Kinyarwanda », «Ikinyarwanda » and you should find it. Only user who have declared to know Kinyarwanda can record in Kinyarwanda. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:50, 27 June 2024 (UTC) |
| | | |
− | Please reports bugs, issues, ideas on [https://github.com/rkosov/Lingua-Libre-and-Forvo-Audio-Downloader github]. I would love any feedback. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 02:23, 31 May 2022 (UTC)
| + | == Rename my pseudonym == |
| | | |
− | == Results of Coverage Test of French Lemma and Non-Lemma forms is English Wiktionary ==
| + | Hello. I've renamed my account on wikimedia sites but can't log in directly from this username here. Do i have something to do ? My old username is '''ElsaBester''' and the new one is '''L'embellie'''. Thanks ! |
| + | :Hello [[User:ElsaBester|L'embellie]], |
| + | :I may ping [[User:WikiLucas00|WikiLucas00]], but I think we don't currently have solution for your issue. |
| + | :We are phasing out this wiki, we hope to release a new Lingualibre this winter or early 2025. So this issue will be irrelevant by then. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:00, 22 August 2024 (UTC) |
| + | ::Hey there {{ping|ElsaBester|Yug}}. Sorry I don't have a solution, but I found this in the Chat Room's archives: [[LinguaLibre:Chat_room/Archives/2023#Update_my_username]]. Good luck — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 18:46, 26 August 2024 (UTC) |
| + | :::Hello {{ping|ElsaBester}} you may also look at my latest reply on [[User talk:Yug]], it's not a great option but maybe you'll want to try it. All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 12:24, 31 August 2024 (UTC) |
| | | |
− | While playing around with generating lists for pronunciation from Wiktionary, I decided to run a few tests on the current coverage of French lemma and non-lemma forms in English Wiktionary. I choose French because it is the largest datasets in LL.
| + | == Two French words that are impossible to record == |
| | | |
− | Current Coverage of French in Lingua Libre
| + | Hi, |
− | * Total French Entries in Lingua Libre by a native speaker: 233 982
| |
− | * Unique French Entries in Lingua Libre by a native speaker: 154 358
| |
− | * Percentage of overlap: 34%
| |
− | * Term with the greatest number of pronunciations: "blanc" with 40
| |
| | | |
− | Current Coverage of [https://en.wiktionary.org/wiki/Category:French_lemmas Category:French lemmas]
| + | Two words are impossible to record (even before uploading): ''esclavesse'' and ''scribesse'' (all my attempts with other words work). [[User:Avatea|Avatea]] ([[User talk:Avatea|talk]]) 18:49, 30 August 2024 (UTC) |
− | * Total entries in Category:French lemmas: 84 482
| + | :Hi {{ping|Avatea}}. Sorry for the late reply. I couldn't reproduce the issue on my side, as you can see ({{Q|1385666}}, {{Q|1385667}}) I just recorded a few words ending with -esse, including the two words you mention, without encountering any issue. Did you try again recently? All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 14:42, 21 September 2024 (UTC) |
− | * Pronounced entries: 50 917
| + | :: Hi {{ping| WikiLucas00}} |
− | * Entries with pronunciation: 33 565
| + | :: No. I just tried, I was able to record another word, but still not those two. [[User:Avatea|Avatea]] ([[User talk:Avatea|talk]]) 19:06, 21 September 2024 (UTC) |
− | * Coverage Percentage: 60.27%
| + | :: After several dozen new recordings (and I had made hundreds of others before), still unable to record these two words. Tested on macOS and Windows. [[User:Avatea|Avatea]] ([[User talk:Avatea|talk]]) 21:26, 7 October 2024 (UTC) |
| | | |
− | Current Coverage of [https://en.wiktionary.org/wiki/Category:French_non-lemma_forms Category:French non-lemma forms]
| + | == Supprimer deux enregistrements incorrects. == |
− | * Total entries in Category:French non-lemma forms: 29 1225
| |
− | * pronounced entries: 26 791
| |
− | * Entries with pronunciation: 264 434
| |
− | * Coverage Percentage: : 9.20%
| |
| | | |
− | For me, there are several lessons to be drawn.
| + | Bonjour! À cause d'une erreur lors d'écriture et parce que je l'ai fait pressé, j'ai enregistré par erreur deux termes: *"[[Q1387394|escaramón]]" et son pluriel *"[[Q1387395|escaramones]]". Serait-il possible de supprimer ces fichiers enregistrés ? J'ai déjà fait les enregistrements corrects de ces mots bien écrits et avec la prononciation correcte: "[[Q1387396|escamarón]]", "[[Q1387397|escamarones]]". Vous pouvez vérifier l’exactitude de ce terme [https://diccionariu.alladixital.org/index.php?cod=21008 ici]. Désolé pour le dérangement. --[[User:Limotecariu|Limotecariu]] ([[User talk:Limotecariu|talk]]) 20:31, 28 September 2024 (UTC) |
− | # First, there has been amazing growth on LL. Covering 60.27% percent is a real achievement.
| |
− | # The overlap percentage is quite small overall.
| |
− | # There needs to be a clearer sense of when LL should stop requesting pronunciations for a certain term because 40 pronunciations of "blanc" seems a bit excessive.
| |
− | # A need exists to continue pro-actively targeting entries in Wiktionary that are not in Lingua Libre. Currently, 297 999 French lemma and non-lemma forms require pronunciations.
| |
− | # Generating lists from Wiktionary and checking coverage is not as hard as I thought.
| |
− | # Lingua Libre has almost caught up with Forvo in the number of French pronunciations (233 982 vs 254, 703). Overall, Lingua Libre has shown amazing and healthy progress in a very short period of time. I'm excited about these results. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 03:07, 1 June 2022 (UTC)
| |
− | :{{Ping|Languageseeker}} This investigation is pretty cool. (I'm not sure i understand all your numbers yet, but i will read again when back on my PC). Its quite nice to see we are reaching Forvo level for our lead language. It's possible we have more unique words than forvo since we have [[user:Olafbot]] actively guiding and pushing us on that path.
| |
− | :On Lili we have chosen to be a learning AND linguistic diversity audio database. When you account for gender, regional accents, age, voice type, having 40 french audios for a word is still 400+ voices short. | |
− | :Also, all contributors are not able to contribute audio perfect files due to various shortcomings (hardware, no recording room, no noose cancelling system, etc). We lack proper rating and review system. It's on our [slow] roadmap tho. 😉
| |
− | :PS: Should i answer to you in French i get a feeling you are French or learning it. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:07, 1 June 2022 (UTC)
| |