|
|
(938 intermediate revisions by 72 users not shown) |
Line 1: |
Line 1: |
| {{#SUBTITLE:{{/Header}}}} | | {{#SUBTITLE:{{/Header}}}} |
| + | {{Lang-CR}} |
| <indicator name="talk"></indicator> | | <indicator name="talk"></indicator> |
− | | + | {{LL:Chat room/FAQ}} |
| __TOC__ | | __TOC__ |
| + | <!-- **** DO NOT EDIT CONTENT ABOVE **** --> |
| | | |
− | == Chatroom FAQ ==
| |
− | * '''How to download all audios of one language ? By speaker ?'''
| |
− | ** Datasets per language are listed at [https://lingualibre.fr/datasets/ https://lingualibre.fr/datasets/]. A short server-side script runs automatically every 2 days; it uses [https://github.com/lingua-libre/CommonsDownloadTool lingua-libre/CommonsDownloadTool]. For more, see [[Help:Download from LinguaLibre]].
| |
| | | |
− | * '''How to add missing languages ?'''
| + | == Is the Record Wizard not working for anyone else? == |
− | ** Administrators can add new languages, usually within a few days. Users should provide their language's [[:wikipedia:iso-639-3|ISO 639-3]] code and a link to the en.wikipedia.org article. The common English name and the Wikidata ID are optional. For more, see [[Help:Add a new language]].
| |
| | | |
− | * '''How to archive sections which have been answered ?'''
| + | My mic works with [https://mictests.com/ mictests.com], but [https://lingualibre.org/wiki/Special:RecordWizard the RecordWizard] doesn't pick anything up at the "check your microphone" stage. I've tried on both my phone and my laptop, and I can record sound in both cases, and I have the appropriate permissions enabled, but this particular website isn't detecting sounds. Is anyone else having this kind of problem? [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 23:43, 24 February 2024 (UTC) |
− | ** After reviewing the section, add `<code><nowiki>{{done}} -- can be closed ~~~~</nowiki></code>` to the top of the section. After a few days to 2 weeks, move the section's code to [[LinguaLibre:Chat_room/Archives/2018]].
| + | :Hello [[User:Grendelkhan]], |
| + | :I just received a second such report. The user also checked [https://mictests.com/ mictests.com] successfully. |
| + | :On Firefox, Lingua Libre recording studio step 4, the microphone is allowed (we see the red microphone image on the left of the URL address). But after clicking the record button, no recording occurs. |
| + | :* Mictests on other site : successful. |
| + | :*Device: Notebook |
| + | :*OS: ? |
| + | :*Browser: Firefox, Chrome. |
| + | :*User: [[User:Akamycoco]]. |
| + | :*Languages affected: all. |
| + | :*Dates : Worked on February 28. Stopped working on February 29. |
| + | :Let's start an investigation. Could you let me know your OS and precise web browser version? (Help > About Chrome or similar) |
| + | :If you have basic developer skills, please also right-click on the stalled page > Inspect > Console: are there any error messages? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 07:55, 1 March 2024 (UTC) |
| | | |
− | * '''How to keep my wikimedia project up to date ?'''
| + | ::My laptop is using Google Chrome <tt>122.0.6261.94 (Official Build) (64-bit)</tt> on Linux (Debian Testing). No error messages in the console when I attempt the recording. My phone is using Chrome <tt>122.0.6261.90</tt> on Android 14 on a Pixel 5a. It ''does'' seem to work on Firefox <tt>115.7.0esr (64-bit)</tt> on my laptop. (I really should have checked that before.) So maybe this is solely a Chrome problem? [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 16:30, 2 March 2024 (UTC) |
− | ** Contact [[User talk:0x010C|User:0x010C]], the botmaster of Lingua Libre Bot. For more, see [[Help:Bots]].
| |
| | | |
− | * '''What IRL events are coming ? When ? Where ?'''
| + | == Automatic categorization isn't documented. == |
− | ** Nothing coming. For more, see [[LinguaLibre:Events]].
| |
| | | |
− | * '''How to translate LinguaLibre User Interface into a new language ?'''
| + | So far as I can tell, this isn't documented: if, for user Foo, category <tt>Lingua Libre pronunciation by Foo</tt> exists on Commons, then all uploads will be categorized into that category. This is helpful! It's also easy to backfill after the fact using [[:commons:Help:Gadget-Cat-a-lot]]. I'm not sure where to document this, but it seems reasonable to do so ''somewhere''. [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 16:26, 3 March 2024 (UTC) |
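The rule described above can be written down as a tiny helper. This is only a sketch of the naming pattern reported in this post: the function name <code>speaker_category</code> is hypothetical, and whether the Record Wizard normalises usernames exactly this way is an assumption.

```python
def speaker_category(username: str) -> str:
    """Return the Commons category title described in the post above."""
    # MediaWiki treats underscores and spaces in titles as equivalent,
    # so we normalise to spaces (assumption: no other normalisation is needed).
    name = username.replace("_", " ").strip()
    return f"Category:Lingua Libre pronunciation by {name}"


if __name__ == "__main__":
    print(speaker_category("Foo"))
```

If that category page exists on Commons, uploads by user Foo would land in it according to the behaviour reported here; the helper only builds the title and does not check Commons.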
− | ** Go to [https://translatewiki.net/w/i.php?title=Special:Translate&group=mwgithub-recordwizard&language=fr&filter=%21translated&action=translate translatewiki.net], change the url part <code>fr</code> into your language's [[:en:List_of_ISO_639-2_codes|ISO 639-2 code]]. For more, see [[Help:Translate]].
| |
| | | |
− | == Custom Commons filename == | + | == Understanding lingua-libre == |
− | :{{Done}}
| |
− | Sorry if this is obvious.
| |
| | | |
− | How can I upload files to Commons with my preferred file name?
| + | Hi, I am creating this discussion to understand lingua-libre better |
| | | |
− | For example [[:File:LL-Q1860 (eng)-Commander Keane-phonate.wav]] should ideally be named [[:File:En-au-phonate.ogg]]. Regards --[[User:Commander Keane|Commander Keane]] ([[User talk:Commander Keane|talk]]) 05:37, 21 January 2020 (UTC)
| + | == Uploads are failing == |
− | :Hi, the files are named this way to allow several people to record the same word. Thus, it is possible to have several recordings of "phonate" from different places in Australia (and other countries), and also different pronunciations from the same location (gender, age, ...). Including the username in the filename is a way to avoid reusing the same file name. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 06:58, 21 January 2020 (UTC) | + | :''TLDR: A large number of users are reporting failure to upload at step 5: [[User:Grendelkhan|Grendelkhan]], [[User:Culex|Culex]], [[User:XANA000|XANA000]], [[User:Ardzun|Ardzun]] (Indonesian languages), [[User:Penn Zero MSSJ|Penn Zero MSSJ]], [[User:Univòc64]] (Whistled Occitan) and [[User:Akamycoco]] (Taiwanese languages). This is likely only the tip of the iceberg. Only a few users were able to [https://lingualibre.org/index.php?hidebots=1&translations=filter&hidepageedits=1&hideWikibase=1&hidelog=1&namespace=0&limit=1000&days=14&enhanced=1&title=Special:RecentChanges&urlversion=2 record in May], with an atypically low number of recordings. An Indonesia workshop with ~15 participants was critically affected. Investigation ongoing. [[User:Hugo en résidence|Hugo en résidence]] ([[User talk:Hugo en résidence|talk]]) 14:20, 13 May 2024 (UTC)'' |
| | | |
− | ::Is it possible to have my dialect (AU) specified on the Commons file description page?--[[User:Commander Keane|Commander Keane]] ([[User talk:Commander Keane|talk]]) 03:54, 23 January 2020 (UTC)
| + | I can record words, but uploading them to Commons fails. The JavaScript console has the following message: |
− | ::{{ping|Commander Keane}} a dialect is a language without an army. Most dialects actually have a dedicated ISO 639-3 code. In your case it seems more like an ''accent''. Your files will be tagged with you as the author, as well as with the linguistic properties defined on your speaker profile (mainly name, gender and place of learning). Tagging your files with <code>AU</code> would require a bot. Bots are simple programs that perform automatic actions via the [[:mw:API]], possibly on Commons, to enrich the file pages. You are not the first to request such a feature, so it may come one day. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:48, 22 September 2020 (UTC)
| |
− | ::For what you want, see [[:mw:API:Edit]] and [[:mw:API:Get]]; there are Python and JavaScript avenues. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 05:47, 23 September 2020 (UTC)
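The naming convention Pamputt describes in this section can be illustrated by splitting the example title into its parts. This is a minimal sketch: the pattern is inferred from the single filename quoted above, the function name <code>parse_ll_filename</code> is hypothetical, and a speaker name that itself contains a hyphen would be split incorrectly.

```python
import re
from typing import Optional

# Pattern inferred from the one example in this thread; it is an assumption,
# not a documented specification of Lingua Libre file names.
LL_FILENAME = re.compile(
    r"^LL-(?P<item_id>Q\d+) \((?P<iso>[a-z]+)\)-"
    r"(?P<speaker>.+?)-(?P<word>.+)\.(?P<ext>\w+)$"
)


def parse_ll_filename(title: str) -> Optional[dict]:
    """Split a Lingua Libre style file title into its parts, or return None."""
    if title.startswith("File:"):
        title = title[len("File:"):]
    match = LL_FILENAME.match(title)
    return match.groupdict() if match else None


if __name__ == "__main__":
    print(parse_ll_filename("File:LL-Q1860 (eng)-Commander Keane-phonate.wav"))
    print(parse_ll_filename("File:En-au-phonate.ogg"))
```

The second call returns <code>None</code> because the preferred name from the question carries no speaker, which is exactly the information the longer name is there to preserve.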
| |
| | | |
− | == Have your say on Lingua Libre's 2020-2021 strategy and more ==
| + | : <tt>'''Your IP address is in a range that has been [[m:Special:MyLanguage/Global blocks|blocked on all Wikimedia Foundation wikis]].''' The block was made by [[User:EPIC|EPIC]]. The reason given is ''[[m:Special:MyLanguage/NOP|Open proxy/Webhost]]: See the [[m:WM:OP/H|help page]] if you are affected''. * Start of block: 10:09, 1 May 2024 * Expiry of block: 10:09, 1 May 2027 Your current IP address is 2001:41d0:304:100::4790. The blocked range is 2001:41D0:0:0:0:0:0:0/33. Please include all above details in any queries you make. If you believe you were blocked by mistake, you can find additional information and instructions in the [[m:Special:MyLanguage/No open proxies|No open proxies]] global policy. Otherwise, to discuss the block please [[m:Steward requests/Global|post a request for review on Meta-Wiki]]. You could also send an email to the [[m:Special:MyLanguage/Stewards|stewards]] [[m:Special:MyLanguage/VRT|VRT]] queue at "stewards@wikimedia.org" including all above details.`, blockinfo: {…}, "*": "See https://commons.wikimedia.org/w/api.php for API usage. Subscribe to the mediawiki-api-announce mailing list at <https://lists.wikimedia.org/postorius/lists/mediawiki-api-announce.lists.wikimedia.org/> for notice of API deprecations and breaking changes." |
| | | |
− | Dear all,
| + | This is not my IP address shown in the error message, and whatismyip confirms that I'm not behind a proxy. The Global block request [https://meta.wikimedia.org/wiki/Steward_requests/Global/2024-w18#Global_block_for_Special:Contributions/2001:41D0:0:0:0:0:0:0/33 is here]. Is this affecting anyone else? I lost a heap of recordings. [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 22:26, 4 May 2024 (UTC) |
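The two values quoted in the error message can be checked against each other with Python's standard <code>ipaddress</code> module. This is only a sketch that confirms the arithmetic of the message; the helper name <code>in_blocked_range</code> is hypothetical, and the check says nothing about whose address it is.

```python
import ipaddress


def in_blocked_range(address: str, blocked_range: str) -> bool:
    """Return True if the address falls inside the given CIDR range."""
    network = ipaddress.ip_network(blocked_range, strict=False)
    return ipaddress.ip_address(address) in network


if __name__ == "__main__":
    # Values copied from the error message quoted above.
    reported_ip = "2001:41d0:304:100::4790"
    blocked = "2001:41D0:0:0:0:0:0:0/33"
    print(in_blocked_range(reported_ip, blocked))
```

Running the same check on your own public address is a quick way to see whether you are personally inside the blocked range or whether the block applies to some other machine involved in the upload.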
− | | + | :Uploads are failing for me today too, even though I am recording with my account. [[User:Culex|Culex]] ([[User talk:Culex|talk]]) 15:04, 8 May 2024 (UTC) |
− | I hope you are all doing fine in these difficult times.
| + | :: Idem--[[User:XANA000|XANA000]] ([[User talk:XANA000|talk]]) 16:49, 9 May 2024 (UTC) |
− | + | ::: I can record, but I have not been able to upload up to today. I was able to upload once yesterday, but after that I couldn't upload any more. [[User:Ardzun|Ardzun]] ([[User talk:Ardzun|talk]]) 06:04, 11 May 2024 (UTC) |
− | For those of you who couldn't take part in the volunteer meeting, you can find the key points that were addressed here : https://notes.wikimedia.fr/p/2020-02-29_Lingua_Libre
| + | :I guess I'm not the only one who's been trying for weeks but could not publish audio after 1 May. Hope someone can fix it. [[User:Penn Zero MSSJ|Penn Zero MSSJ]] ([[User talk:Penn Zero MSSJ|talk]]) 20:54, 13 May 2024 (UTC) |
− | | + | ::[[User:Univòc64]] (Whistled occitan) and [[User:Akamycoco]] (Taiwanese languages) also reported issues. |
− | Following the discussions we had during this meeting and the vote of the budget by the association last Saturday, here are some elements that I would like to open up for discussion :
| + | ::It seems time to add a sitenotice warning. [[User:Hugo en résidence|Hugo en résidence]] ([[User talk:Hugo en résidence|talk]]) 14:07, 13 May 2024 (UTC) |
− | + | ::In May we mostly have: 556 recordings by 7 users on May 1st, 174 recordings on May 11th ([[Special:Contributions/Austin Zhang|Austin Zhang]]), then nothing. |
− |
| + | ::Compared with [https://public-paws.wmcloud.org/User:Yug/QueryLingualibre-monthly.ipynb known monthly recordings], our recent average month was about 30k audios and the lowest months about 5k audios. May 2024 is heading toward 1200 audios, or roughly 4% of the average month and 24% of the lowest months. Something weird is indeed going on. |
− | | + | {| class=wikitable |
− | - with regards to the logo : given that the graphic redesign in process is finishing soon and that the search for a new logo has not been successful https://phabricator.wikimedia.org/T240552, I suggest that we adapt the current logo to the new colour and keep it until further notice.
| + | ! Most prolific speakers for the current month || Months since 2022 |
− | | + | |- |
− |
| + | | |
− | | + | <query _pagination="10" locutor="<translate><!--T:7--> Item (locutor Qid)</translate>" locutorLabel="<translate><!--T:8--> Speakers of the Month</translate>" nb="<translate><!--T:9--> Number of records</translate>"> |
− | - with regards to the strategy : several of you manifested the need to make the Lingua Libre community bigger and more international, I therefore suggest that we work on this aspect in the September 2020-June 2021 timeframe with :
| + | SELECT ?locutor ?locutorLabel ?nb WHERE { |
− | | + | { |
− | 1) the integration of a more ergonomic and multilingual discussion space inviting discussion and collaboration, notably for newcomers on the website
| + | SELECT ?locutor (COUNT(?record) as ?nb) |
− | | + | WHERE { |
− | 2) the development of Say It, an audio variant of the Sign It extension https://addons.mozilla.org/fr/firefox/addon/lingua-libre-signit/#&gid=1&pid=3 that would display instead of the word in sign language, its pronunciation from the LL audio in a pop-up window when highlighting the word in any webpage, this would allow for a more concrete and immediate impact for a broader range of web users, not only wikimedians
| + | ?record prop:P2 entity:Q2 . # Q2: record, P2: instance of. |
− | | + | ?record prop:P5 ?locutor . # Property:P5: speaker |
− | 3) the organisation of a series of hackathons intended to develop Lingua Libre bots for other Wiktionaries - these could make Lingua Libre better known rapidly and efficiently
| + | ?record prop:P6 ?date . |
− | | + | FILTER ( YEAR(?date) = YEAR(NOW()) && MONTH(?date) = MONTH(NOW()) ) |
− | 4) once the graphic redesign is complete, a series of editathons for the renewal and development of the website contents: the improvement of tutorial pages aiding contribution in particular, but also the writing of project pages showing what certain communities have achieved and are working on with Lingua Libre, and translation into as many languages as necessary and possible
| + | } |
− | | + | GROUP BY ?locutor ?locutorLabel |
− |
| + | ORDER BY DESC(?nb) |
− | | + | LIMIT 50 |
− | For those who wish to take part in the development of Lingua Libre in the long term, I suggest that we schedule a remote meeting to discuss the execution of the 4 points above (or their revision!) before the official beginning of the 2020-2021 year in June.
| + | } |
− | | + | SERVICE wikibase:label { |
− | In order to ensure the stability and coherence of the project, we could also discuss on this occasion the constitution of a strategic volunteer committee to supervise the phases of development of Lingua Libre along the semesters.
| + | bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en" . |
− | | + | ?locutor rdfs:label ?locutorLabel . |
− |
| + | } |
− | | + | } |
− | Thank you for taking the time to vote for the future of the logo here : https://framadate.org/gSfBpVYeqzYWXLn6
| + | ORDER BY DESC(?nb) |
− | | + | </query> |
− | and write down your availability for a meeting in April-May 2020 if interested : https://framadate.org/kdn7tGoqDgjpAj5i
| |
− | | |
− | you can also give us your feedback on the 4 points above on this pad : https://notes.wikimedia.fr/p/2020-02-29_Lingua_Libre_Follow-up
| |
− | | |
− |
| |
− | | |
− | Thank you in advance for your replies, apologies for the length of this email, and good luck for the lockdown...
| |
− | | |
− | One last thing: do not hesitate to forward this email to all those who you suspect may not be on the mailing list but would be interested to join, or to post this message below in various discussion pages (to be polished at will).
| |
− | | |
− | Hi! In case you have ever contributed to Lingua Libre but are not on the mailing list, please consider signing up here : https://lists.wikimedia.fr/info/lingualibre to receive updates and take part in the discussions around the project :)
| |
− | | |
− | Best regards, Emma Vadillo Quesada
| |
− | | |
− | == #Wikicheznous contest on Wikimedia projects ==
| |
− | |
− | Hello everyone,
| |
− | <br/>Wikimédia France is launching the #wikicheznous contest during the lockdown: from 8 April to 6 May, you are invited to contribute to the Wikimedia projects as well as to Lingua Libre and Vikidia. On Lingua Libre, contributions in languages other than English and French are eligible. Simply sign up on the [https://outreachdashboard.wmflabs.org/courses/Wikim%C3%A9dia_France/Wiki_chez_nous/enroll dashboard] so that your contributions are counted via Commons. So grab your microphones! And do not hesitate to take a look at the other Wikimedia projects.
| |
− | <br/> For more information, see: https://www.wikimedia.fr/lancement-de-loperation-wikicheznous/
| |
− | <br/>Good luck and have fun.
| |
− | --[[User:Adélaïde Calais WMFr|Adélaïde Calais WMFr]] ([[User talk:Adélaïde Calais WMFr|talk]]) 12:00, 8 April 2020 (UTC)
| |
− | | |
− | == Record Wizard translation == | |
− | :{{done}} {{ping|0x010C}} I assume this issue is closed. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:40, 22 September 2020 (UTC)
| |
− | Hi, can someone please merge the translation at Translate.wiki? I translated Record Wizard into Japanese several weeks ago. Thanks in advance. [[User:Higa4|Higa4]] ([[User talk:Higa4|talk]]) 14:31, 9 April 2020 (UTC)
| |
− | :Hi Higa4 and thank you for the translations into Japanese. I do not know how often Lingua Libre gets the new translations from Translate Wiki. [[User:0x010C|0x010C]] certainly knows. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 14:33, 12 April 2020 (UTC) | |
− | ::Thanks for your comment. Anyway, I hope it can be done someday when someone has time. [[User:Higa4|Higa4]] ([[User talk:Higa4|talk]]) 07:41, 15 April 2020 (UTC)
| |
− | :::Hi Higa4, translations are usually pulled every day from TranslateWiki, but due to a major technical overhaul in recent weeks, this has been temporarily stopped. Translations will be pulled again in a week or two :). Thanks for your involvement! — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 08:04, 21 April 2020 (UTC) | |
− | | |
− | == First try ==
| |
− | |
− | I made a first attempt at recording! I find the whole thing quite nice. I recorded more than 50 words, I'm proud! See https://commons.wikimedia.org/w/index.php?title=Special:ListFiles/Touam
| |
− | |
− | The inevitable questions:
| |
− | * What happens next for these words to reach Wiktionary?
| |
− | * I noticed that the category "Lingua Libre pronunciation by Touam" on Commons remained a red link... Why? What needs to be done to turn it blue, or at least into something browsable?
| |
− | * Are there plans for anything other than words?
| |
− | |
− | In any case, I really like this tool. I will try to keep going. --[[User:Touam|Touam]] ([[User talk:Touam|talk]]) 20:20, 23 April 2020 (UTC)
| |
− | :Hello Touam and welcome to Lingua Libre. First of all, congratulations on these first recordings. Hopefully this is the start of a very long series. Regarding your questions:
| |
− | :* What happens next for these words to reach Wiktionary?
| |
− | ::They are added automatically every night by [https://fr.wiktionary.org/wiki/Sp%C3%A9cial:Contributions/Lingua_Libre_Bot Lingua Libre Bot].
| |
− | :* I noticed that the category "Lingua Libre pronunciation by Touam" on Commons remained a red link... Why? What needs to be done to turn it blue, or at least into something browsable?
| |
− | ::Yes, I believe this category has to be created manually. You can take inspiration from [https://commons.wikimedia.org/wiki/Category:Lingua_Libre_pronunciation_by_Pamputt my own category].
| |
− | :* Are there plans for anything other than words?
| |
− | :: What do you have in mind in particular? It is possible to record videos for words in sign language. For anything else, feel free to say what you would like.
| |
− | : [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 19:54, 27 April 2020 (UTC)
| |
− | :: Thanks Pamputt for your help. I would like to continue, but uploads to Commons no longer work? --[[User:Touam|Touam]] ([[User talk:Touam|talk]]) 13:01, 29 April 2020 (UTC)
| |
− | :::Strange, it works for others. Can you try again? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 09:56, 30 April 2020 (UTC)
| |
− | :::: I just tried again and it still won't upload to Commons. I recorded "acronymie" and "anonymie". It just says "No upload succeeded" and shows a red exclamation mark next to each word in the Record Wizard. I use Firefox on Linux Mint. I think that is everything. And I am indeed logged in, as you can see from my signature. (Which, by the way, is not among the buttons of the wikicode editor??) --[[User:Touam|Touam]] ([[User talk:Touam|talk]]) 20:01, 30 April 2020 (UTC)
| |
− | ::::: Tried again this morning with "je", "tu", "il" (I keep making my words shorter and shorter), still the same. I am in utter despair. --[[User:Touam|Touam]] ([[User talk:Touam|talk]]) 06:14, 1 May 2020 (UTC)
| |
− | :::::: So this is the last step, where Lingua Libre tries to send the recordings to Wikimedia Commons. It sometimes happens to me on a few recordings. Can you try another browser, just to see whether it goes better? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 11:04, 1 May 2020 (UTC)
| |
− | ::::::: Yes, I just tried with Chromium Version 81.0.4044.122 (Official build) Built on Ubuntu, running on LinuxMint 19.3 (64-bit), same bad result. --[[User:Touam|Touam]] ([[User talk:Touam|talk]]) 13:05, 1 May 2020 (UTC)
| |
− | :::::::: Hmm, that is indeed frustrating. I have no idea what the cause is. Maybe 0x010C has some explanation. And just to be sure, you have no problem uploading files directly from the Wikimedia Commons site? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 15:20, 1 May 2020 (UTC)
| |
− | :Hello,
| |
− | :A steward blocked a large IP range on all Wikimedia projects. The block of the Lingua Libre server was accidental collateral damage. Thibaut, a Commons administrator, has just lifted the block on Commons to solve the problem; thanks to him.
| |
− | :Regards, — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 15:49, 1 May 2020 (UTC)
| |
− | :: Yes, thanks to both of you, I was finally able to record "je", "tu", "il"! I will now commit to recording more complex words... If you have any advice for me... --[[User:Touam|Touam]] ([[User talk:Touam|talk]]) 16:59, 1 May 2020 (UTC)
| |
− | ::: Great. As for word lists, it depends on what you feel like. If you want, you can record the names of the villages around you or in your département. You can import word lists by fetching the contents of a dictionary category directly (verbs, names of professions, animals, technical vocabulary, etc.). In short, there is plenty of choice. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 19:02, 1 May 2020 (UTC)
| |
− | :::: There, I just did a bit more than 100 words, and everything went fine. I noticed that on Wiktionary the words are not added to my watchlist, whereas on Commons they are. Bug or feature? ([https://fr.wiktionary.org/wiki/it%E2%80%99s_not_a_bug,_it%E2%80%99s_a_feature it’s not a bug, it’s a feature] on Wiktionary). Personally, I would prefer these words to be added to my Wiktionary watchlist. --[[User:Touam|Touam]] ([[User talk:Touam|talk]]) 12:50, 2 May 2020 (UTC)
| |
− | ::::: As for the watchlist, it is easy to do on Commons because you are the creator of the file as the ''uploader''. On Wiktionary, it is Lingua Libre Bot that adds the audio pronunciations to the pages, so I do not know whether it is technically possible to add a page to someone else's watchlist (I would tend to think not). [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 08:47, 3 May 2020 (UTC)
| |
− | | |
− | == Compiling texts ==
| |
− | :{{done}}
| |
− | Hello,
| |
− | |
− | Is there a tool to which one can submit a text and which will automatically compile the recordings of its words?
| |
− | |
− | [[User:AirSThib|AirSThib]] ([[User talk:AirSThib|talk]]) 13:11, 2 May 2020 (UTC).
| |
− | :Hello AirSThib, just to be sure I understand what you want: you would like to copy and paste a long text into the word-adding window and have Lingua Libre automatically "split" it into words so that you can then record them one by one. Is that right? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 08:48, 3 May 2020 (UTC)
| |
− | ::Hello @[[User:Pamputt|Pamputt]]. Actually it is rather the opposite: I would like to enter a text and have Lingua Libre compile the words, joining them end to end to create a recorded text. [[User:AirSThib|AirSThib]] ([[User talk:AirSThib|talk]]), 08:46, 4 May 2020 (UTC).
| |
− | :::No, it is not yet possible to record a text, a poem or anything else long. For now Lingua Libre detects silences and moves on to the next word; that is its only mode of operation. But your request comes up regularly, so I opened [https://phabricator.wikimedia.org/T251866 a ticket on Phabricator] to keep track of it. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 10:23, 5 May 2020 (UTC)
| |
− | :{{ping|AirSThib}} Did Pamputt answer your question? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:39, 22 September 2020 (UTC)
| |
− | | |
− | == One word, one language, one page ==
| |
− | | |
− | The most important change I think to attract people to the project is to make it super user-friendly to browse existing content. That should be done by having a separate page per word per language. When people can browse and listen to the existing content easily, they will be motivated to add content themselves. Compare a site like Forvo which has 9535 NL speakers contributing https://forvo.com/languages/nl/, whereas LinguaLibre has only 1 NL speaker.
| |
− | | |
− | A possible structure would be:
| |
− | | |
− | * /fr/ would be the home page for French words
| |
− | * /fr/word/chien would be the page for the French word "chien"
| |
− | * /fr/audio/joe-bloggs would be the page listing the links to all the recordings from Joe Bloggs
| |
− | | |
− | A more sophisticated URL scheme could be:
| |
− | * /fr/word/chien-12345 would be the page for the French word "chien" with unique id 12345
| |
− | * /en/fr/word/dog-12345 would be the EN URL for the french word "chien" with unique id 12345
| |
− | | |
− | | |
− | The current "Browsing the sound library" is extremely user-unfriendly. It is really only a "track down the zip which you then have to download and unzip" feature. The "Browsing the sound library" page https://lingualibre.fr/wiki/Help:Sound_library which then takes you to https://lingualibre.fr/wiki/LinguaLibre:Records which then spends forever loading is a complete turn-off to all but the most dedicated visitors.
| |
− | | |
− | Some may say that the content will be used by other sites (such as Wiktionary or Wikidata), so it is not necessary, but Lingua Libre will only be able to do a good job of feeding these other sites if it does a good job itself of being a fantastic browsing tool for the source recordings.
| |
− | | |
− | In future, it would be great if all power users could use an API to go directly to a particular page and get the recording(s), e.g. /lingua-libre/fr/chien would give me the recording of that particular word in that particular language.
| |
− | | |
− | === Examples from other sites ===
| |
− | ==== Shtooka ====
| |
− | Shtooka used to be like this, that you could simply browse to the page for a particular recording. This was great. Lingua Libre has lost a lot of this simplicity, and so has taken a large step backwards in terms of easily attracting people to the concept.
| |
− | | |
− | ==== Lingopolo ====
| |
− | My own site https://lingopolo.org/, has one word per language per page, e.g. https://lingopolo.org/dutch/word/dog https://lingopolo.org/thai/word/dog https://lingopolo.org/french/word/dog although I think in some ways it makes more sense to use the word in the language on the URL. Notice too how I have a page per audio author, e.g. https://lingopolo.org/dutch/audio/J.vdleeNL and a page listing all audio authors (per language) https://lingopolo.org/dutch/audio
| |
− | | |
− | ==== Forvo ====
| |
− | https://forvo.com/ gives a good example of one way this can be organised, but also of just how much people go mad to help. The https://forvo.com/languages/ page gives an excellent overview of which languages are well supported. Each language has its own home page, e.g. https://forvo.com/languages/nl/ for Dutch, where you see a link stating the number of pronounced words. https://forvo.com/languages-pronunciations/nl/ shows you all the pronunciations, by word. Any individual word has its own page, like https://forvo.com/word/jongen/. Forvo, though, thinks of the pronunciation of a word first and the language second: for example, "main" means very different things in English and French, but Forvo puts them both on the same page https://forvo.com/word/main/, even though they are separated by language. I would not propose that Lingua Libre go that far, but rather that it link to words spelt the same in other languages, so the Lingua Libre pages would be "English" -> "main" with a link to the "French" -> "main" page.
| |
− | | |
− | ==== Pixabay ====
| |
− | https://pixabay.com/ is not a recording site at all, but it has a very well thought-out structure for its URLs. Each image has its own page, but the interesting thing is that each image page can be accessed in different languages. For example, the EN image https://pixabay.com/photos/pug-puppy-dog-animal-cute-690566/ also has an FR URL https://pixabay.com/fr/photos/pug-chiot-chien-des-animaux-mignon-690566/ and an NL version https://pixabay.com/nl/photos/pug-puppy-hond-dierlijke-cute-690566/ etc. -- Rugops
| |
− | | |
− | :Hi [[User:Rugops|Rugops]]. Thank you for this very interesting feedback. For sure, Lingua Libre needs to be improved to become more "user-friendly". Browsing audio recording may be one way to do it. Find a way to propose list of words to record more easily would be another. I open a [https://phabricator.wikimedia.org/T252319 a Phabricator ticket] to think more about your ideas. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:16, 10 May 2020 (UTC)
| |
− | ::[[User:Rugops|Agree]]. More could be done for browzability and maintenance (verify audio, request re-recording to speakers), etc.
| |
− | ::It should also be noted that LinguaLibre is the DIRECT CHILD of Shtooka. Nicolas Vion who created Shtooka and myself who was PhD student at INALCO and volunteer at Wikimedia looked for way to move Shtooka from a C++ desktop based recorder toward an online HTML5-based recording apps. We then connected to Wikimedia, Lyokoi, Remy Gerbet so Nicolas was hired as freelance by Wikimédia France and got a month or two dedicated to create LinguaLibre v1.0 which was PHP-based.
| |
− | ::One core issue is that 2000s' Nicolas moved from being a young and passionate developer in his 20s to new life priorities and developments, aka the classical open source project's evolution and [https://en.wikipedia.org/wiki/Benevolent_dictator_for_life Benevolent dictator]'s drifting away for new adventures and transmitting the project to a new generation. Software development is since mainly done by sprints, via a talented developer as well, but integrating the project into the Wikimedia and MediaWiki galaxy has naturally absorbed a lot of dev energy (second issue). Overall, all strength of Shtooka haven't been moved to LinguaLibre, while LinguaLibre also has strengths that Shtooka never had.
| |
− | ::As of today (May 29), a new UI has been rolled out, showing Wikimedia France's will to improve the project. But more should be done in terms of sound-specific UX and features. Feature requests must be collected here or, better, on Phabricator, and WM France must be notified of the users' requests :) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:53, 30 May 2020 (UTC)
| |
− | :: [[User:Languageseeker|Agree]] I think it's important to also manage alternative spellings and accents. This is an issue that Forvo is currently failing to manage, and it takes up considerable time for editors. Right now, Lingua Libre uses geographic region as a crude stand-in for accent, but I think it's important to explicitly state the accent. I also think that we need a discussion of whether or not to include (in)definite articles. Lastly, we should also think about how to make the files easily scrapable so that they can be batch imported into a program such as Anki. We should also do more to delete poorly pronounced or recorded pronunciations. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 17:07, 20 October 2020 (UTC)
| |
− | ::Edit: {{ping|Rugops}} You inspired me to create a [[phab:T266306|phabricator ticket]] on how to do this. I'd love your feedback on it. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 02:01, 23 October 2020 (UTC)
| |
− | | |
− | == TypeError: this.pastRecords is undefined ==
| |
− | | |
− | Hello, everyone.
| |
− | | |
− | When I try to select words to record from a French Wiktionary category while excluding terms that have already been recorded, I get the error ''TypeError: this.pastRecords is undefined''. Unchecking the option that excludes already recorded terms solves the problem, but I don't really want to re-record words that have already been done. This smells like a bug; could someone fix it?
| |
− | | |
− | [[User:LoquaxFR|LoquaxFR]] ([[User talk:LoquaxFR|talk]]) 08:59, 12 May 2020 (UTC)
| |
− | :Hello,
| |
− | :I'll look into it, but I'm not sure I'll have time to fix it quickly, because I'm currently preparing the deployment of the next major version of Lingua Libre and the RecordWizard (around the beginning of next week, I think). Until that version (which does not have this bug) is out, you can deselect the option in the generator and remove the words once they are in the main list by clicking the ''Retirer les mots déjà enregistré'' ("Remove words already recorded") button.
| |
− | :Sorry for the trouble :/.
| |
− | :Regards — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 12:44, 12 May 2020 (UTC)
| |
− | ::Well, I just tried again and it works, even without your workaround. So much the better. Thanks for the tip in any case, and good luck!
| |
− | ::[[User:LoquaxFR|LoquaxFR]] ([[User talk:LoquaxFR|talk]]) 13:35, 12 May 2020 (UTC)
| |
− | | |
− | == Wikidata ==
| |
− | | |
− | Hello, step by step we are spreading Lingua Libre among the community in Wikimedia Spain, and two questions have arisen. On the one hand, would it be possible for a bot to automatically add the audios from Wikidata to the different Wiktionaries? On the other hand, can different accents of the same language be added to the audio statement in Wikidata? Thanks. [[User:Rodelar|Rodelar]] ([[User talk:Rodelar|talk]]) 12:09, 22 May 2020 (UTC)
| |
− | : Hello Rodelar, thanks for adding Spanish pronunciations.
| |
− | : About Wikidata, I added your request on [https://phabricator.wikimedia.org/T224312 this Phabricator ticket] in order to remember it.
| |
− | : To add audio pronunciations to the Wiktionary pages, there are at least two options:
| |
− | :# the current method is to write a bot that adds them. The code is available [https://github.com/lingua-libre/Lingua-Libre-Bot here]. Lingua Libre Bot is already running for the fr and oc Wiktionaries. You can have a look at the code for [https://github.com/lingua-libre/Lingua-Libre-Bot/blob/master/wikis/ocwiktionary.py oc Wiktionary] and try to adapt it for eswiktionary. If you do, you can send a pull request to have it added to the Lingua Libre Bot code. Then the bot will add the new audio pronunciations (in any language) to the Spanish Wiktionary. This has to be done separately for each Wiktionary because the page structure differs from one Wiktionary version to another. And here comes the second "solution".
| |
− | :# the other method is to get the pronunciation data (and other data) directly from Wikidata and to display them in the Spanish Wiktionary. Wikipedia already does that (with the infobox, for example). This requires that access to the lexicographical data be enabled. The [https://phabricator.wikimedia.org/T212843 T212843 ticket] follows progress on that, but it is not yet possible to access them.
| |
− | : Cheers [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 06:29, 23 May 2020 (UTC)
| |
− | | |
− | == speedy and/or delete ==
| |
− | Hello,<br />
| |
− | It can happen that a Q item is no longer needed (improper recording, different WM page, incorrect title, etc.). It can be deleted from Commons, but it remains here. To begin with, I propose creating a page dedicated to deletion, with speedy and/or delete template(s).<br />
| |
− | I was not happy with one of my creations, so I deleted the file on Commons in order to replace it with another one, using my own tools to put everything back in order on LL. In short, by the time the new one was created, Q309179 had disappeared. What do you think about speedy and delete? Any comments? —[[User:Eihel|Eihel]] ([[User talk:Eihel|talk]]) 17:35, 29 May 2020 (UTC) <small>PS: I have already added a template. See [[LinguaLibre:Administrators' noticeboard]]</small>
| |
− | :Hi Eihel, yes, why not. Note however that if a pronunciation is incorrect, re-recording the word will upload the new pronunciation to Commons in place of the old recording. Also, since file names are generated automatically by Lingua Libre, the cases to handle should be relatively rare. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 12:46, 31 May 2020 (UTC)
| |
− | ::In general, we have a weak point in the dynamic management of audio files: pleasant browsing, renaming, deletion, etc. Pleasant browsing is mentioned above in the comparison with Shtooka. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:24, 3 June 2020 (UTC)
| |
− | | |
− | == 2020.05.29 - new LinguaLibre UI and UX ==
| |
− | | |
− | Hi, let's create below a list of points to review and improve. The discussion must be centered around finding practical, rapid solutions to the issues found :) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:20, 30 May 2020 (UTC)
| |
− | ===CSS===
| |
− | * {{Done}} CSS could be improved. As admins, where could we edit it or suggest modifications? (ex: <code>h2 { margin-top: 1em; }</code>)
| |
− | *: I guess we should edit [[MediaWiki:Common.css]] to modify that. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 12:51, 31 May 2020 (UTC)
| |
− | *::The best would be to make a pull request on [https://github.com/lingua-libre/BlueLL the skin's git repository]. — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 14:36, 1 June 2020 (UTC)
| |
− | * Recording icon: the previous version had a reddish microphone icon to highlight the "Record Wizard" button. I guess the icon was dropped on purpose.
| |
− | *: Which icon are you talking about? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 12:51, 31 May 2020 (UTC)
| |
− | *::[[MediaWiki:Common.css]] now contains guidelines on how to submit new CSS to Lingua Libre. Thanks [[user:Pamputt|Pamputt]] & [[User:0x010C|0x]] ;) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:15, 3 June 2020 (UTC)
| |
− | *:::[[File:Breezeicons-status-22-mic-red-LinguaLibre.svg|32px]] <- this icon, which we [https://lingualibre.org/index.php?title=MediaWiki:Common.css&oldid=83430 previously integrated via CSS]. The bright red was chosen on purpose to attract the visitor's eyes to the recording button. Also, the new skin is white/blue centered. All of this must be balanced. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:19, 3 June 2020 (UTC)
| |
− | *::::{{ping|Yug}} This icon was purposely removed by the UI/UX specialist who created the new mockups. — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 11:03, 3 June 2020 (UTC)
| |
− | ===Content===
| |
− | * {{done}} [[LinguaLibre:Stats#Number_of_records_per_languages]] ([https://lingualibre.org/index.php?title=LinguaLibre:Stats&action=edit&section=3 edit SPARQL query]) - the table is not human-readable. English names or ISO 639-3 codes are needed. LL's language items locally just have an English name and a Wikidata id... Editing the query so it displays the English name and/or queries the ISO 639-3 code from Wikidata would be appreciated.
| |
− | *: Full language names have been dropped since the database has grown too much for the request to respond without a timeout. This may be restored once the work on either the SPARQL endpoint performance or the QueryViz caching feature has been done. — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 14:36, 1 June 2020 (UTC)
| |
− | | |
− | ===Baleswari Odia (dialect of Odia language) and Odia===
| |
− | [[File:Requested features for Lingua Libre.jpg|thumb|400px|Some requested features for Lingua Libre including an option for changing the default naming option and custom metadata (particular multilingual descriptions in wiki-code). (''details below'')]]
| |
− | Just recorded over 300 words in the Baleswari dialect of the Odia language. The new UI is certainly better, more effective and faster than the last one.
| |
− | ====a. Multiple recordings of the same word====
| |
− | :{{done}} (part of a group of 4 sections)
| |
− | However, the "remove words already recorded" feature does not detect words recorded by the same user on a different date. So, a newer version of the same recording gets updated on Commons. This is not useful. Ideally:<br>
| |
− | One should be able to upload multiple recordings of the same word. While uploading, they should be able to see the duplicates, and have an option to remove some/all of the duplicates from the new list. If a user decides to re-record an existing word, new files should be created (''instead of uploading a new version of an existing file as is happening now''), e.g. if the old file was "OLDNAME.wav", the new file should be "OLDNAME_01.wav". If both "OLDNAME.wav" and "OLDNAME_01.wav" exist, then the third recording should be "OLDNAME_02.wav".
| |
− | *: Thanks for your recordings [[User:Psubhashish|Psubhashish]].
| |
− | *:: About the first point (removal of duplicates), I will check and open a Phabricator ticket if I can reproduce it, because this is definitely a regression compared to the previous version of the website.
| |
− | *::: Thanks, please do add "psubhashish1" to the subscriber list when you create the ticket. --[[User:Psubhashish|Subhashish Panigrahi]] ([[User talk:Psubhashish|talk]]) 13:18, 4 June 2020 (UTC)
| |
− | *::: [[User:Psubhashish|Psubhashish]], I just tested and if I click on "remove all the duplicates" at the "recording list" step, the words that I added and I have already recorded are removed. Could you try again? If it does not work for you, can you open a Phabricator ticket and describe exactly what you do in order to be able to reproduce? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 15:33, 2 June 2020 (UTC)
| |
− | *:::: It is working for recordings made from a particular list on LinguaLibre. But, when I try to record the pronunciation of a word that I myself had recorded earlier, it doesn't flag that a duplicate exists on Commons. Ideally, it should let me know that a recording that I myself made already exists so that I can decide if I want to record or not. If I decide to rerecord, it should go as a new recording with a suffix "_01.FILENAME". --[[User:Psubhashish|Subhashish Panigrahi]] ([[User talk:Psubhashish|talk]]) 13:18, 4 June 2020 (UTC)
| |
− | *:::::Currently it works this way. When you create (or load) a list of words to record, there is always a button that lets you remove from the list all the words you have already recorded. Lingua Libre does not tell you directly that there are words in the list that you have already recorded (I think because the user experience is better this way, with fewer messages). Having different recordings of the same word is currently only possible if you add information in brackets after the word you want to save (example: "cat (some information)"). That said, I do not really see what use case would need such a feature. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:45, 5 June 2020 (UTC)
| |
− | *::::::{{ping|Psubhashish}} Regarding the deduplication feature, it is working fine on my side. If it's not working on your side, it may be related to the Odia script, which in Unicode can have several code point sequences for the same symbol, and this causes trouble when we do comparisons. This issue will need to be investigated further. Could you provide one or two examples of transcriptions with which the deduplication feature doesn't work for you?
| |
− | *::::::Regarding your second point, this will not be done, as we don't want to create duplicate files on Commons. If a record has the exact same metadata, it should replace the previous one. If you want to record another speaker, create a new speaker profile for them (step 2 of the Record Wizard). If you want to record in a different dialect/language, add this new language/dialect to your speaker profile (on step 2) and select it for your records (on step 3); see also my answer in section D below. If you want to record a [https://en.wikipedia.org/wiki/Heteronym_(linguistics) heteronym], you can add a textual qualifier in brackets at the end of the transcription of that word, for example: "desert (arid region)" and "desert (leave)".
| |
− | *::::::Best regards — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 23:28, 8 June 2020 (UTC)
| |
− | *::::::: I feel that for heteronyms it would be better to have the textual qualifier as a separate metadata item. Otherwise, it might get difficult to find them. It's also easier to ensure accurate formatting if it's done automatically than if users do it manually. Currently, Forvo has a big problem where there is no standard way to distinguish heteronyms, leading to quite a bit of difficulty in finding them. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 19:38, 20 October 2020 (UTC)
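The Unicode issue mentioned above (several code point sequences for the same symbol) can be illustrated with a minimal Python sketch. It is only an illustration of the general technique, not the Record Wizard's actual code: the function names are hypothetical, and whether this is the real cause of the reported problem has not been confirmed in this thread.

```python
import unicodedata


def normalize_transcription(text: str) -> str:
    """Return one canonical form of a transcription, for comparison only.

    NFC maps canonically equivalent code point sequences to the same
    sequence, so visually identical words compare equal afterwards.
    """
    return unicodedata.normalize("NFC", text.strip())


def remove_already_recorded(candidates, recorded):
    """Keep only the candidates whose normalized form was not recorded yet."""
    seen = {normalize_transcription(word) for word in recorded}
    return [word for word in candidates if normalize_transcription(word) not in seen]


# The Odia letter RRA can be encoded as U+0B5C or as U+0B21 followed by U+0B3C.
single = "\u0b5c"
sequence = "\u0b21\u0b3c"
print(single == sequence)  # raw comparison treats them as different strings
print(normalize_transcription(single) == normalize_transcription(sequence))
```

Comparing normalized forms while storing the transcription exactly as typed keeps the original text intact and only changes how duplicates are detected.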
| |
− | | |
− | ==== b. Custom-categorization:====
| |
− | :{{done}} (part of a group of 4 sections)
| |
− | There is no option for a user to decide about the categories. I might want to add a custom category (say a category for each date) for a particular batch.
| |
− | *:: About categories, you would like to be able to create your own categories on Wikimedia Commons. For example, instead of automatic categorisation in [[c:Category:Lingua Libre pronunciation by Psubhashish|Category:Lingua Libre pronunciation by Psubhashish]], you would like to be able to set a custom name for a category. This category would be categorized in Category:Lingua Libre pronunciation by Psubhashish, which means all custom categories would be subcategories of the main categories created automatically by Lingua Libre. Do you agree? If so, I think it looks like what is asked in [[phab:T201135|T201135]].
| |
− | *::: Pardon for repeating the question. I see a discussion from 2018 but it doesn't tell me how to add a custom category for a batch. Can you probably explain here or, much better, add to the help page? --[[User:Psubhashish|Subhashish Panigrahi]] ([[User talk:Psubhashish|talk]]) 13:18, 4 June 2020 (UTC)
| |
− | *::::This feature does not exist yet. The Phabricator ticket is just there to remind that this feature is frequently asked and should be considered by developers. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 22:04, 8 June 2020 (UTC)
| |
− | *:::::{{ping|Psubhashish}} This is indeed an interesting feature, I will prioritize it for the next update. — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 23:07, 8 June 2020 (UTC)
| |
− | *::::::{{Ping|Pamputt|Psubhashish}} a way to go would be to have a bot which uses [[:mw:API:Edit]] on the list of audio files. Basic JS skills are enough to go forward, and LinguaLibre will sooner or later need such a bot for maintaining Commons pages. Just... let's keep it in mind. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:31, 22 September 2020 (UTC) (PS: I'm learning about Commons bots at the moment, come back to me if needed. Do we have other bot masters here?)
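As a rough illustration of the bot idea above, here is a minimal Python sketch (Python rather than JS, purely for brevity) that only builds the POST parameters for an <code>action=edit</code> request appending a category to each file page. It sends nothing over the network. The file titles, category name and token value are hypothetical placeholders; a real bot would first log in and obtain a CSRF token (<code>action=query&meta=tokens</code>), and should follow the Commons bot policy.

```python
def build_category_edit(file_title: str, category: str, csrf_token: str) -> dict:
    """Build the POST parameters of a MediaWiki action=edit request that
    appends one category link to an existing file description page."""
    return {
        "action": "edit",
        "format": "json",
        "title": file_title,
        "appendtext": f"\n[[Category:{category}]]",
        "summary": f"Add [[Category:{category}]]",
        "bot": "1",
        "nocreate": "1",  # fail instead of creating a page that does not exist
        "token": csrf_token,
    }


def build_batch(file_titles, category, csrf_token):
    """One edit request per file of the batch."""
    return [build_category_edit(title, category, csrf_token) for title in file_titles]


# Hypothetical batch: titles, category and token are placeholders.
batch = build_batch(
    ["File:Example recording 1.wav", "File:Example recording 2.wav"],
    "Example custom batch category",
    "PLACEHOLDER_TOKEN",
)
for params in batch:
    print(params["title"], "->", params["appendtext"].strip())
```

Each dictionary would then be sent as a POST request to the wiki's <code>api.php</code> by whatever HTTP client the bot uses.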
| |
− | | |
− | ====c. Custom metadata parameters:====
| |
− | :{{done}} (part of a group of 4 sections)
| |
− | There should be at least one additional parameter for the metadata (description, etc. that appear on Commons) so that the user can add some additional information. I personally speak in multiple accents and I'd like to denote the accent used for each batch separately. Having this option would be of great help. It is not practical to edit hundreds of files manually to make such changes once uploaded on Commons. --[[User:Psubhashish|Subhashish Panigrahi]] ([[User talk:Psubhashish|talk]]) 07:40, 2 June 2020 (UTC)
| |
− | | |
− | *:: Metadata: this is an interesting idea. You would like to have something like « free text » that would be a parameter attached to one speaker (you can define several speakers for yourself, one for each accent). A Phabricator ticket should be opened to track this idea.
| |
− | *: [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 11:46, 2 June 2020 (UTC)
| |
− | *:: I have added this to Phabricator ([[phab:T254241|T254241]]). --[[User:Psubhashish|Subhashish]] ([[User talk:Psubhashish|talk]]) 06:43, 5 June 2020 (UTC)
| |
− | *::: See the image above for an example how I'd personally like to use a bilingual wikicode-based description as opposed to the current one. The latter is not very helpful for someone who is viewing a standalone file. --[[User:Psubhashish|Subhashish]] ([[User talk:Psubhashish|talk]])
| |
− | | |
− | ====d. Standardized naming:====
| |
− | :{{done}} (part of a group of 4 sections)
| |
− | Currently, Lingua Libre follows a naming which prefixes a long text whereas audio recordings of pronunciations are generally named on Commons in "<code>LANGUAGECODE-DIALECT OR VARIATION CODE-WORD</code>" format e.g. if the word "''color''" needs to be recorded in an American accent, an ideal way of naming it would be "File:En-us-color.wav" whereas "en" stands for English, and "us" for American. In the picture uploaded above, I have used a similar format ("ori" being the language code for "Odia" and "nor" being the code for the Northern Balasore (or Baleswari Odia) dialect). I understand that Lingua Libre follows a different format. But can a user opt (or modify in a batch) the naming that they prefer? Better, can Lingua Libre suggest a standardized naming for users so that the recordings on Commons have a much more standardized naming? The naming that I've suggested is something I learned from others on Commons but they make sense from a linguistics standpoint. It's simple, short and does the job. I had to use another code and spend hours to rename only a few hundred files whereas having an option to change in the first place while uploading could have been much easier. --[[User:Psubhashish|Subhashish]] ([[User talk:Psubhashish|talk]])
| |
− | :It has been decided to record only the language and the place where the speakers learnt their language. If I understood well, this is more relevant from a linguist's point of view because most people are not aware that they speak a specific variety of their language. For example, we could use en-us, but why would it be more relevant than en-us-Texas or en-us-Florida, where the accents are probably different? Maybe [[User:Lyokoï|Lyokoï]] or [[User:Noé|Noé]] could say more about this point. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 22:11, 8 June 2020 (UTC)
| |
− | ::Hi {{ping|Psubhashish}}
| |
− | ::We know this naming convention that exists on Commons perfectly well, and it's true that its shortness is an advantage. But we purposely decided not to use it. We designed our naming convention so that our filenames can be as precise and unique as possible: a file corresponds to a transcription recorded in a specific language/dialect by a specific person.
| |
− | ::* The other naming convention doesn't allow 2 people to record the same word in the same language without resorting to ''tricks'' like appending 2, 3, 4,...; that's why we include the name of the speaker in the title.
| |
− | ::* Language codes used on Commons are sometimes a bit random, especially for minor languages, which often do not have standardized codes. But as Lingua Libre aims to be able to record all languages, common or minor, we preferred using Wikidata Qids for every language; it may be less pleasant to read but it fits every known language/dialect. To follow Pamputt's example, we have a standardized code for Texan English, which is [https://www.wikidata.org/wiki/Q7707309 Q7707309] ;).
| |
− | ::By applying those rules to each record, we are sure to have a <u>consistent</u> naming convention, for all languages and dialects, supporting records of the same words by multiple speakers, and that's why we can <u>safely</u> overwrite files if the same speaker records the same word in the same language.
| |
− | ::(for your case of two dialects, you won't get any problem if you want to record the same words one time in ''standard''(?) Odia (WD:[https://www.wikidata.org/wiki/Q33810 Q33810] = LL:[[Q336]]) and another time in Baleswari Odia (WD:[https://www.wikidata.org/wiki/Q4850727 Q4850727] = LL:[[Q322719]]), as long as you select the right language/dialect before the recording process.)
| |
− | ::The real issue is that we have imported on Lingua Libre only a subset of all available languages/dialects on Wikidata (as it's growing fast), and we have to manually import missing ones from time to time :/
| |
− | ::Best regards — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 23:05, 8 June 2020 (UTC)
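The naming rules described in this reply can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the thread does not give the exact filename pattern, so the <code>LL-{qid}-{speaker}-{word}.wav</code> layout and all function names below are hypothetical. What the sketch does reflect is the stated principle: one file per (language/dialect, speaker, transcription), with a re-recording replacing the earlier file.

```python
def record_filename(language_qid: str, speaker: str, transcription: str,
                    extension: str = "wav") -> str:
    """Build a filename from language Qid, speaker and transcription.

    The exact pattern is an assumption made for this sketch.
    """
    return f"LL-{language_qid}-{speaker}-{transcription.strip()}.{extension}"


def store_record(store: dict, language_qid: str, speaker: str,
                 transcription: str, audio: bytes) -> bool:
    """Store a record; return True when it replaced an existing file."""
    name = record_filename(language_qid, speaker, transcription)
    replaced = name in store
    store[name] = audio
    return replaced


store = {}
# Same speaker, same word, two different dialects: two distinct files.
store_record(store, "Q33810", "Psubhashish", "word", b"first take")
store_record(store, "Q4850727", "Psubhashish", "word", b"dialect take")
# Same speaker, same word, same language: the earlier file is replaced.
was_replaced = store_record(store, "Q33810", "Psubhashish", "word", b"second take")
# A bracketed qualifier yields a separate file for a heteronym.
store_record(store, "Q33810", "Psubhashish", "desert (arid region)", b"noun")
store_record(store, "Q33810", "Psubhashish", "desert (leave)", b"verb")
print(was_replaced, sorted(store))
```

Because the speaker and the language Qid are part of the name, two speakers recording the same word never collide, which is the property the ''tricks'' of the shorter convention try to work around.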
| |
− | | |
− | ==='''Discussion'''===
| |
− | Hi Yug, I guess it would be better to open tickets on [[phab:project/view/3393/|Phabricator]] to keep track of all these issues and be able to discuss each one more easily (in a structured way). [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 12:51, 31 May 2020 (UTC)
| |
− | :Thanks Pamputt :) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 07:51, 1 June 2020 (UTC)
| |
− | ::+1 to pamputt, phabricator is more appropriate for that for advanced users. — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 14:36, 1 June 2020 (UTC)
| |
− | | |
− | == Bugs ==
| |
− | === Sped-up recordings ===
| |
− | Hello,
| |
− | My recordings from today came out sped up. Fortunately, I noticed quickly. A few examples: [[Q332977]] [[Q332978]] [[Q332979]] [[Q332980]] [[Q332981]] [[Q332982]].
| |
− | |
− | PS: The "Commencer une nouvelle discussion" ("Start a new discussion") link above does not seem to work.
| |
− | | |
− | [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 08:36, 28 June 2020 (UTC)
| |
− | :Hi [[User:DSwissK|DSwissK]], strange problem. I opened a [[phab:T256663|Phabricator ticket]] about it. I also opened [[phab:T256665|another one]] about the "start a new discussion" link, because I could not find how to fix it myself. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 17:40, 29 June 2020 (UTC)
| |
− | ::{{ping|DSwissK|Pamputt}} I got the same feedback about sped-up audio from [[User:Luilui6666|Luilui6666]] for Cantonese today. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:24, 16 July 2020 (UTC)
| |
− | ::[https://lingualibre.org/index.php?title=Special:Contributions/Luilui6666&dir=prev&offset=20200709043912&limit=500&target=Luilui6666 Contributions] > Example (corrupted): [https://lingualibre.org/wiki/Q338365 Q338365] [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:42, 16 July 2020 (UTC)
| |
− | ::Should we review and remove all the bad audios, so that it becomes easier to re-record? And where should we remove them, here or on Commons? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 17:44, 16 July 2020 (UTC)
| |
− | :::{{ping|Yug}} We can list such items [[LinguaLibre:Misleading_items|here]]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:44, 18 July 2020 (UTC)
| |
− | | |
− | === ratelimited ===
| |
− | :{{done}} ''See [[LinguaLibre:User rights]], [[phab:T260649|T260649]], [[phab:T245214|T245214]], [[:Commons:Commons:Guide_to_batch_uploading#Rate_limits]], [[:mw:Special:MyLanguage/Manual:$wgRateLimits]]. Test your ratelimit : [https://commons.wikimedia.org/w/api.php?action=query&meta=userinfo&uiprop=ratelimits here].''
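The rate-limit check linked in the note above returns JSON, which can be read programmatically. The Python sketch below parses such a response and derives a safe pacing between uploads. The sample response is hypothetical (its values are taken from the "380 uploads per 72 minutes" figure quoted later in this section), the nesting by action and user group is an assumption about the response layout, and the function names are made up for illustration.

```python
def upload_limit(response: dict, group: str = "user"):
    """Extract (hits, seconds) for uploads from a userinfo API response.

    Returns None when no upload limit is listed for the given group.
    """
    ratelimits = response.get("query", {}).get("userinfo", {}).get("ratelimits", {})
    limit = ratelimits.get("upload", {}).get(group)
    if limit is None:
        return None
    return limit["hits"], limit["seconds"]


def seconds_between_uploads(hits: int, seconds: int) -> float:
    """Minimum average spacing between uploads that stays under the limit."""
    return seconds / hits


# Hypothetical sample: 380 uploads per 72 minutes (4320 seconds).
sample = {
    "query": {
        "userinfo": {
            "ratelimits": {"upload": {"user": {"hits": 380, "seconds": 4320}}}
        }
    }
}

hits, seconds = upload_limit(sample)
print(round(seconds_between_uploads(hits, seconds), 2))
```

With these figures, a client that spaces its uploads by roughly 11.4 seconds on average would stay under the limit instead of having requests rejected as <code>ratelimited</code>.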
| |
− | [[User:Luilui6666|Luilui6666]] has also had about 1/4 of her audios fail to upload over the past 3 months. She either has to wait and retry or give up. The last event, on July 10th, returned an error mentioning
| |
− | {| class="wikitable"
| |
| | | | | |
− | <nowiki>[RequestQueue]</nowiki> Reject ratelimited
| + | <pre> |
− | Object:
| + | { date:2022-01, records: 21290, speakers: 46, languages: 28 }, |
− | *: "See https://commons.wikimedia.org/w/api.php for API usage. Suscrib..."
| + | { date:2022-02, records: 3894, speakers: 40, languages: 17 }, |
− | code:"ratelimited"
| + | { date:2022-03, records: 8357, speakers: 61, languages: 21 }, |
− | info:"You've exceeded your rate limit. Please wait some times and tray again...
| + | { date:2022-04, records: 5454, speakers: 34, languages: 18 }, |
− | |}
| + | { date:2022-05, records: 4702, speakers: 59, languages: 30 }, |
− | I can't say more. Have any of you encountered such an event? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:30, 16 July 2020 (UTC)
| + | { date:2022-06, records: 7675, speakers: 41, languages: 18 }, |
− | :I do not know either. You should open a [[phab:project/view/3393/|Phabricator ticket]] about this issue. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:46, 18 July 2020 (UTC)
| + | { date:2022-07, records: 4364, speakers: 37, languages: 22 }, |
− | :: {{ping|Luilui6666}} {{ping|Yug}} {{ping|Pamputt}} This error could be due to your user rights on Wikimedia Commons. Without the "autopatrolled" rights, you are not able to upload more than ~400 files/hour. — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 23:40, 17 August 2020 (UTC) | + | { date:2022-08, records: 9544, speakers: 45, languages: 23 }, |
− | :::Indeed, this is a possible explanation. I opened [[phab:T260649|T260649]] to keep track. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 06:23, 18 August 2020 (UTC)
| + | { date:2022-09, records: 5802, speakers: 113, languages: 30 }, |
− | :::{{ping|Pamputt|WikiLucas00|0x010C}} the classic 1-hour recording sprint generates 800 records. It is common to do a 2-hour, 1500+ record sprint per day. Also, we must take note that 0x010C won't be able to save us from now on. Do we have an identified fallback? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 05:59, 5 September 2020 (UTC) | + | { date:2022-10, records: 6931, speakers: 74, languages: 32 }, |
− | :{{ping|Pamputt}} I searched in the following without success:
| + | { date:2022-11, records: 8461, speakers: 54, languages: 34 }, |
− | :* [[:Commons:Commons:Autoconfirmed_users|Commons:Autoconfirmed_users]] -- account must be 4 days old to perform <code>reupload</code> over existing files
| + | { date:2022-12, records: 11882, speakers: 54, languages: 23 }, |
− | :* [[:mw:Special:MyLanguage/Manual:User rights|mw:Manual:User rights]] | + | { date:2023-01, records: 18150, speakers: 48, languages: 29 }, |
− | :* [[:Commons:Special:ListGroupRights]]
| + | { date:2023-02, records: 32441, speakers: 65, languages: 29 }, |
− | :I didn't find anything relevant to the upload amount. Luilui6666's limitation happened more than 4 days after her account creation. I'm not sure what went on. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 07:35, 18 September 2020 (UTC)
| + | { date:2023-03, records: 11527, speakers: 61, languages: 30 }, |
− | ::{{ping|Pamputt|}} Found it in [[:Commons:Commons:Guide_to_batch_uploading#Rate_limits|Commons:Guide_to_batch_uploading#Rate_limits]] ! | + | { date:2023-04, records: 8451, speakers: 58, languages: 35 }, |
− | ::{| class="wikitable"
| + | { date:2023-05, records: 21282, speakers: 97, languages: 49 }, |
− | |
| + | { date:2023-06, records: 17940, speakers: 56, languages: 35 }, |
− | ;Rate limits
| + | { date:2023-07, records: 75825, speakers: 74, languages: 38 }, |
− | Normal users on Commons are [[:mw:Special:MyLanguage/Manual:$wgRateLimits|rate limited]] to 380 uploads per 72 minutes. Users granted <code>image-reviewer</code>, <code>patroller</code>, or <code>autopatrolled</code> status have a <code>ratelimit</code> of 999 uploads per 1 second. Users can apply for these user rights at [[:commons:COM:RFR|COM:RFR]].
| + | { date:2023-08, records: 32681, speakers: 54, languages: 30 }, |
− | |}
| + | { date:2023-09, records: 28813, speakers: 114, languages: 30 }, |
− | ::It mentions [[:mw:Special:MyLanguage/Manual:$wgRateLimits]]. This ''"380 uploads"'' match the ''"around 400 uploads"'' described by [[User:Luilui6666]]. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 12:37, 18 September 2020 (UTC)
| + | { date:2023-10, records: 60317, speakers: 167, languages: 47 }, |
− | | + | { date:2023-11, records: 49704, speakers: 140, languages: 55 }, |
− | === Cannot access the Wizard with the Occitan interface ===
| + | { date:2023-12, records: 42383, speakers: 114, languages: 41 }, |
− | | + | { date:2024-01, records: 40572, speakers: 112, languages: 40 }, |
− | Hello,
| + | { date:2024-02, records: 22385, speakers: 197, languages: 57 }, |
− | I still have a problem recording when the interface is in Occitan. I have to run a public workshop this summer and I am forced to switch to French.
| + | { date:2024-03, records: 16997, speakers: 173, languages: 48 }, |
− | * If I click the record button at the top of the page, I get the following error:
| + | { date:2024-04, records: 8733, speakers: 117, languages: 42 }, |
− | Fatal error: Maximum execution time of 30 seconds exceeded in /home/www/lingualibre.org/includes/cache/MessageCache.php on line 812
| + | { date:2024-05, records: 556, speakers: 7, languages: 7 } |
− | * If I click the record button at the bottom of the home page, I get the following error:
| + | </pre> |
− | Fatal error: Maximum execution time of 30 seconds exceeded in /home/www/lingualibre.org/languages/Language.php on line 198
| |
− | [[User:Guilhelma|Guilhelma]]
| |
− | :I added the new error messages to the [[phab:T210477|Phabricator ticket]] about the problems with the Occitan version. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 08:55, 19 July 2020 (UTC)
| |
− | ::{{ping|Guilhelma}}, is this bug confirmed and recurring? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:21, 22 September 2020 (UTC)
| |
− | Yes, the bug is confirmed
| |
− | Fatal error: Maximum execution time of 30 seconds exceeded in /home/www/lingualibre.org/languages/Language.php on line 4422[[User:Guilhelma|Guilhelma]]
| |
− | | |
− | === Adding the Mozabite language ===
| |
− | :{{done}}
| |
− | Hello, my mother tongue is Mozabite (mzb), Q36149 on Wikidata.
| |
− | I cannot contribute in this language. Could you please add it? --[[User:Arha06|Arha06]] ([[User talk:Arha06|talk]]) 19:24, 22 July 2020 (UTC)
| |
− | : {{ping|Arha06}} hello, and thank you for your recordings on Lingua Libre. I have just added [[Q339099|Mozabite]] to Lingua Libre, so words can be recorded in this language from now on. Happy contributing. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 08:26, 24 July 2020 (UTC)
| |
− | | |
− | === Adding list from Wikidata ===
| |
− | Hello. It seems the interface has changed since I last used it, and I cannot see how to create a word list from a Wikidata query. Could someone tell me the best way of doing this? Thanks. [[User:Jason.nlw|Jason.nlw]] ([[User talk:Jason.nlw|talk]]) 08:49, 17 August 2020 (UTC)
| |
− | :Hi [[User:Jason.nlw|Jason.nlw]], as far as I remember it has never been possible to generate such list but I may be wrong. I opened a [[phab:T260650|feature request]] on Phabricator. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 06:26, 18 August 2020 (UTC)
| |
− | * The only workaround for now: run a query --> download the label list as CSV --> copy the column --> create a local list on LiLi. This won't remember or link the Wikidata items, though, and the bot won't work either. You can only record the words. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 00:42, 19 September 2020 (UTC)
| |
− | | |
− | == Adding list into RecordWizard ==
| |
− | :{{done}} | |
− | Hello. The Record Wizard offers the Local List, Nearby and Wikimedia category buttons as ways to generate a list of words to record. The alternative seems to be typing words, one by one, in the "Type here the word to record". I have my own list in a file, but when I paste it in that field, it thinks it's 1 word instead of many. Is there a way I can provide my own list without having to type the words one by one? [[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baley|talk]]) 22:19, 27 August 2020 (UTC)
| |
− | :Hi [[User:Julien Baley|Julien]], it is possible to create local list as much as you want/need. So to create your own, you can paste the content of your file into [[List:Fra/Julien Baley]] (for example); each word is separated by #. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 18:19, 28 August 2020 (UTC) | |
− | ::Thanks a lot! Is there any naming convention, or can I create whatever I want? [[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baley|talk]]) 19:19, 28 August 2020 (UTC) | |
− | :::No, so far there is no naming convention. However, if you want your list to be recorded by other users, you should use a self-explanatory name. You can see some examples [https://lingualibre.org/index.php?search=List%3Afra%2F*&title=Special%3ASearch&profile=advanced&fulltext=1&ns142=1 here]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 19:46, 28 August 2020 (UTC) | |
− | ::::{{ping|Julien Baley}} please browse [[Help:Main]] ;) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:04, 22 September 2020 (UTC) | |
− | | |
− | == Datasets out of date ==
| |
− | Hello. It seems that the datasets page, although it claims to run every 2 days, is completely out of date: all the available zips are from April 2020 or November 2019 (and the full zip from May 2019). Is this a known problem? Is there a plan to address it? [[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baley|talk]]) 23:17, 27 August 2020 (UTC)
| |
− | :Indeed, it seems to have an issue with the dataset updating. I opened a [[phab:T261519|Phabricator ticket]] about this issue. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 18:24, 28 August 2020 (UTC) | |
− | | |
− | == How much to record at once? ==
| |
− | :{{done}} | |
− | Hello! I have a little question regarding risk management; when I record words, where is the data stored before it's uploaded? What happens if my Internet connection dies out? Can I refresh the page, or am I losing all the non-uploaded recordings? What are your suggestions in that respect? [[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baley|talk]]) 09:32, 2 September 2020 (UTC)
| |
− | :{{ping|Julien Baley}} I am not sure (to be tested) but I would say that the recordings are stored on the server as long as your browser is not closed. So if your internet connection dies out but you do not change the web page, you may be able to send to server the words from the last you recorded as soon as your internet connection is restored. Maybe it is even possible to continue to record if recordings are stored locally on your computer before to be sent to the server. Yet, as I said, it should be tested. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 22:26, 9 September 2020 (UTC)
| |
− | ::{{ping|Julien Baley}} The audio files are first stored locally, in your browser tab's memory. If an upload stalls, KEEP THIS TAB OPEN and click "Upload" (or "Retry Upload"?) again a few hours later. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 07:58, 18 September 2020 (UTC) | |
− | ::For new accounts, there may be some limit in the number of daily uploads. We suspect something around 400 uploads. If so, keep the computer and browser tab open, stay on that page, and retry upload later. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 11:58, 18 September 2020 (UTC) | |
− | :*Greetings, what does "new account" mean? How many days, how many edits? This would be good to document these. Good wishes, --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 00:33, 19 September 2020 (UTC) +the stat details you have given above about Commons upload rate, are helpful, regards. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 00:39, 19 September 2020 (UTC) | |
− | ::Hello ! As you noticed in the other section I confirmed that "new users" (according to Wikimedia Commons) are limited to 380 upload per 72 minutes. | |
− | ::Checking on [https://commons.wikimedia.org/w/api.php?action=query&list=users&ususers=0x010C%7CYug%7CWikiLucas00%7CLuilui6666%7CShufaBot%7CJulien_Baley%7CTitodutta&usprop=groups%7Ceditcount%7Cregistration this Commons API], I can see that [[User:Titodutta]] has lot of higher-rights groups providing a ratelimit of 999 uploads / sec, while [[user:Julien Baley]] is still a new user within initial groups <code>[ "*", "user", "autoconfirmed" ]</code> which provide a maximum of 380 uploads per 72 minutes. See [[LinguaLibre:User_rights#User_rights_on_Commons]].
| |
− | ::{{ping|Julien Baley}}, you previously recorded 60 audio files. If your recorded files above 380 are not uploading... keep the browser tab open, then after one and half hour, click again to upload. You may also request higher rights on Commons, in line with [https://commons.wikimedia.org/w/index.php?title=Commons:Requests_for_rights&oldid=467003025#User:Luilui6666_(LinguaLibre_project) this request]. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:19, 22 September 2020 (UTC) | |
− | :::{{ping|Yug}} Oh, it may not show on my account, because I'm recruiting people to record words in other languages, and I'd like them not to encounter any problem with the uploading. I usually have a list of several hundreds of words ready, but I can aim to keep it under 380 to avoid any issue. [[User:Julien Baley|Julien Baley]] ([[User talk:Julien Baley|talk]]) 11:32, 30 September 2020 (UTC) | |
− | ::::{{ping|Julien Baley}} I think the uploader is your account [[user:Julien Baley]], and the speaker doesn't necessarily have an account on either LinguaLibre or Commons. They just need to be ''defined'' in LinguaLibre.
| |
− | ::::If your speaker creates their own account and works independently, you could follow [[LinguaLibre:User_rights#Request_new_user_rights]] to request user rights on Commons for that account. Such requests have been granted quickly; see the request I made for Luilui6666. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 19:39, 3 October 2020 (UTC) | |
− | | |
− | == Pages translation ==
| |
− | I would like to be able to mark pages for translation, but I don't have the user rights (pagetranslation) to do so. This rights are restricted to sysops (see [[Special:ListGroupRights]]). Should we create a translation administrator user group? Are there plans for creating a page like [[LinguaLibre:Requests for rights|this]] in the future? — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 03:08, 13 September 2020 (UTC)
| |
− | :{{ping|WikiLucas00}} indeed, currently there are not a lot of different user rights available here (bot, admin, bureaucrat). If you think we should have more, please feel free to open a ticket asking for that on [[phab:project/view/3393/|Phabricator]]. About, [[LinguaLibre:Requests for rights]], the same, feel free to create and initialize this page :D [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 17:06, 13 September 2020 (UTC) | |
− | ::{{ping|Pamputt}} I created [[phab:T262855|a task on Phabricator]]. Let's first see how it evolves before creating a Request page. — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 19:42, 14 September 2020 (UTC)
| |
− | ::* Greetings, not sure specifically about this right, but most of the rights are managed at Localsettings.php ([[:mw:Manual:User rights]]). Good wishes. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 00:35, 19 September 2020 (UTC) | |
− | ==== New admins ? ====
| |
− | ''See also [[Special:ListUsers/sysop]]''
| |
− | | |
− | {{ping|Pamputt|WikiLucas00|Titodutta|Lyokoï}} I think it would be nice to make WikiLucas an admin. We are a micro-wiki, WikiLucas has proven to be active and knowledgeable, all lights are green to make him a sysop. I would also encourage to have one or two Indian admins. Indian users are the second largest community here, they bring new insights to the projects, let's empower them properly. Any idea who among this later community would need the admin tools ? (page translate, page deletion, language import) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 05:09, 23 September 2020 (UTC)
| |
− | | |
− | Checking over [[Special:ListUsers/sysop]] I also notice :
| |
− | * Bureaucrats: 0x010C is taking a year off; Xenophon is a WMfr staff member with the bureaucrat right for security reasons but barely active here; GrandCelinien... I have barely crossed paths with him; that leaves Pamputt as the single active bureaucrat. [[:en:Bus factor|Not enough]]. We need at least 3 '''active''' bureaucrats. I propose promoting Lyokoï to bureaucrat if he is OK with it. He is a regular contributor and a solid bet. We will also need someone on the Indian/Asian side soon. Bureaucrats can mainly give users more rights, such as admin status. It's not much, but when we need it, we need it, and relying on a single bureaucrat is not good practice.
| |
− | *: I’m OK to be a bureaucrat. If you want it, I see no problem. [[User:Lyokoï|Lyokoï]] ([[User talk:Lyokoï|talk]]) 16:36, 6 October 2020 (UTC)
| |
− | * Admins: WikiLucas is an obvious candidate, he bumped into limitations (page translation right above). I see about 3 Indian contributors quite engaged here, could we promote one ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 05:24, 23 September 2020 (UTC)
| |
− | ::{{ping|Yug}} I am clearly not opposed to have more bureaucrat or admin, neither to have some Indian contributors among them. That being said, I don't think we're in a hurry (the Lingua Libre community is not very active at the moment). I prefer to take some time to give the rights to people involved in Lingua Libre, so that we can be sure that they will use their rights for at least a few months. Yet, if someone requests admin or bureaucrat right, just ask (there is no bureaucratic procedure yet here). [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 06:25, 23 September 2020 (UTC) | |
− | :::{{ping|Yug|Pamputt|Lyokoï|Titodutta}} I agree with Yug, I would be more valuable to the project as an admin. As Pamputt has pointed out, the project is not very active for the moment, but in the light of future events -- for instance the training course I will be giving this month with Emma Vadillo to the alumni of the ''INaLCO'' in Paris <small>(being able to quickly delete the potential mistakes of the learners would be worthy)</small>, or the possible nomination of the project to the [[m:Coolest_Tool_Award|Coolest Tool Awards]]), its outreach will grow, so will the community. — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 17:12, 3 October 2020 (UTC) | |
− | :{{Ping|Pamputt}} per the request just above let's '''grant WikiLucas00 adminship''', he is one of the most active here anyway, and admin is just an active user with a toolbox to add languages, block users, add translatable pages. | |
− | :As '''for my general argument''', since we are a small wiki and most admins/bureaucrats only pass by rarely or in occasional sprints (my case), we therefore need a high ratio of admins/bureaucrats so that there is always one around checking [[Special:RecentChanges]]. | |
− | :Last, '''as for Bureaucrats''', we are failing the [[:en:Bus factor|Bus factor/Bus test]]: it is an organizational risk we should never accept. I recommend adding one bureaucrat for sure, keeping Xenophon as a distant backup. I would also recommend keeping the door open for one more, preferably from the East-Asian community (different timezone, human network, strategic opportunity, etc.).
| |
− | :We also need to '''recruit an admin on Commons''' able to do mass delete when we provide a list of files. Do we have this already ? {{ping|VIGNERON}} ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 19:23, 3 October 2020 (UTC)
| |
− | ::Ok, I granted [[User:WikiLucas00|WikiLucas00]] as admin. For Commons, it would be indeed interested to have such profile. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 21:04, 4 October 2020 (UTC) | |
− | ::: {{ping|Yug|Pamputt}} Thank you for your trust. — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 21:25, 4 October 2020 (UTC)
| |
− | ::OK for me to give admin status to WikiLucas. [[User:Lyokoï|Lyokoï]] ([[User talk:Lyokoï|talk]]) 16:26, 6 October 2020 (UTC)
| |
− | | |
− | == Userrights==
| |
− | :{{done}} | |
− | :::<s>On '''user rights''', I think it would be good to change a bunch of them on LinguaLibre. LinguaLibre is not a central Wikimedia project, better to have lower "gates".</s> (We depend on Commons)
| |
− | :::{{ping|Titodutta}} As I understand it, we '''depend upon Commons for upload rights.''' Each MediaWiki has built-in <code>groups</code> such as <code>users</code>, <code>autopatrolled</code>, <code>admin</code>, etc., each with specific user rights allowing actions and rate limits defining how many times an action may be performed within a time period. So far, I found: | |
− | :::* <code>user</code> group's upload right: <code>"user": { "hits": 380, "seconds": 4320 }</code> (72 mins)
| |
− | :::* <code>autopatrolled</code> group's upload right: <code>"autopatrolled": { "hits": 32, "seconds": 60 }</code>
| |
− | :::Most LinguaLibre contributions are made via confirmed Wikimedian accounts, likely to be part of the <code>autopatrolled</code> group on Commons, so we rarely run into any upload limit. We run into it when a non-Wikimedian suddenly rushes into LinguaLibre, as Luilui6666 did: 5000 uploads within one month <small>(I made a student-rate donation in exchange for this dedicated work, worth it!).</small>
| |
− | :::Can we tell Commons API's ''"Hey, this user account is ok, please grant it <userright>"'' or should we specifically ask for userights there via [[:Commons:Commons:Requests_for_rights|Commons:Requests_for_rights]] and a mentor-based application. By example, I would ask there for User:Luilui6666 to get <code>Autopatrolled users</code> so her upload ratelimit move from 380 per 72mins (group <code>users</code>) to 999/sec. Some digging in this direction would be welcome, so I just created: | |
− | :::* '''[[LinguaLibre:User rights]]''', a new page to expand according to our emerging knowledge on both user rights and <code>ratelimit</code>. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:50, 20 September 2020 (UTC)
| |
− | ::::* For upload we need to depend on Wikimedia Common's rate and rules, and I believe that's for good. I agree with you that we'll very rarely face this problem for 2 reasons: a) most of us have rights on Wikimedia Commons, b) 380 per 72mins is also too high. Anyway, the only solution I can think of, if an editor is uploading too many words per hour, and uploaded around 500–1,000 files on Commons, try the "autopatrolled" right on Wikimedia Commons. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 18:57, 20 September 2020 (UTC) | |
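As a worked example of the figures quoted in this thread (380 uploads per 4320 seconds, i.e. 72 minutes, for the <code>user</code> group), here is a minimal Python sketch estimating how many batches a recording session needs and the minimum waiting time between them. It assumes a simple model where the limit resets fully after each window; how Commons actually counts the window may differ, and the function name <code>upload_plan</code> is only illustrative.

```python
import math

# Values quoted in the thread for the Commons `user` group.
HITS_PER_WINDOW = 380
WINDOW_SECONDS = 4320  # 72 minutes


def upload_plan(total_files, hits=HITS_PER_WINDOW, window=WINDOW_SECONDS):
    """Return (number of batches, minimum total wait in minutes).

    Assumption: the limit resets completely once a window has elapsed,
    so every batch after the first one costs one full window of waiting.
    """
    if total_files <= 0:
        return 0, 0.0
    batches = math.ceil(total_files / hits)
    wait_minutes = (batches - 1) * window / 60
    return batches, wait_minutes


if __name__ == "__main__":
    for count in (60, 380, 1000):
        batches, wait = upload_plan(count)
        print(f"{count} files: {batches} batch(es), at least {wait:.0f} min of waiting")
```

Under this model, a list of 1000 words needs 3 batches and at least 144 minutes of waiting, which is why keeping lists under 380 words is the simplest option for new accounts.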
− | | |
− | == Is the language importer no longer working? ==
| |
− | :{{done}} -- it works, issue closed. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 19:52, 6 October 2020 (UTC) | |
− | Hello, I am trying to import the Dagbani language (Q32238 on WD) following [https://twitter.com/dagbaniwiki/status/1306912106776858624 a request on Twitter], but the import tool does not work. Is there a way to fix it or work around it? [[User:Lyokoï|Lyokoï]] ([[User talk:Lyokoï|talk]]) 18:00, 19 September 2020 (UTC)
| |
− | :Along with the sped-up audio, it seems we have a few very annoying bugs. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:05, 21 September 2020 (UTC) | |
− | ::I opened a [[phab:T263477|ticket on Phabricator]]. Creating the item manually may work around the problem. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 15:49, 21 September 2020 (UTC) | |
− | ::: {{ping|Pamputt}} How do I do it manually? [[User:Lyokoï|Lyokoï]] ([[User talk:Lyokoï|talk]]) 18:30, 23 September 2020 (UTC)
| |
− | :::: {{ping|Lyokoï}} using [[Special:NewItem]], you can create a new item for your language. Then you need to add the properties (you can use [[Q21]] as a model). I am not sure the system will recognize it automatically, though. It costs little to try, so we will find out quickly. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 01:01, 24 September 2020 (UTC)
| |
− | ::::: {{ping|Pamputt}} Thanks, I will try! [[User:Lyokoï|Lyokoï]] ([[User talk:Lyokoï|talk]]) 18:32, 27 September 2020 (UTC)
| |
− | :{{ping|Lyokoï}} I just tested with Bankon [[Q386221]], and it seems to work. More details on how to do it are [[Help:Add_a_new_language/fr#Créer_une_nouvelle_langue_manuellement|here]], although all the images have disappeared. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:44, 29 September 2020 (UTC) | |
− | :: {{ping|Pamputt}} It works, I tested it too. [[User:Lyokoï|Lyokoï]] ([[User talk:Lyokoï|talk]]) 14:43, 30 September 2020 (UTC)
| |
− | :::{{ping|Lyokoï|Pamputt}} I tested with Western Kurdish (Kurmanji), via the administrator language import tool in the top-right Action tab: it worked. Chrome + Ubuntu 20.04. I think we can close this bug and just keep an eye on it. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:29, 6 October 2020 (UTC) | |
− | ::::Indeed, it works here as well. I really do not understand what happened because no one touched the code of [[MediaWiki:Gadget-LinguaImporter.js]] last days ... So OK to close the bug report. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 19:02, 6 October 2020 (UTC) | |
− | :::::Maybe [[User:Pamputt|you]] imported an existing language ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 19:52, 6 October 2020 (UTC) | |
− | :::::No, for sure not. Few weeks ago, the gadget did not allow to enter anything in the field where you type the Wikidata QID. So, no explanation so far but not a big deal. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 19:54, 6 October 2020 (UTC) | |
− | | |
− | == 0x010C year offgrid : preparations ==
| |
− | Hello folks, [[User:0x010C|0x010C]] announced by email his upcoming departure from the project for a year+ off grid (he will tell more here if he wishes to ;) ). We can't fully replace our [[:en:Benevolent_dictator_for_life#;)|benevolent lead developer]]. But could we brainstorm to see where he was active, and how best to fill the gap? I'm kick-starting this table, but I have a bias since I don't know every task 0x010C was taking on, nor do I know all active users on the project and your full skillsets. Please help us fill in the gaps. 0x010C will be available between '''Oct. 15th and October 30th to pass on some know-how''' to whoever wishes to learn. Let's prepare our questions properly for this transition. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:01, 21 September 2020 (UTC)
| |
− | | |
− | {| class="wikitable sortable"
| |
− | ! Critical || Task / Aspect || Requirement ? || Who else knows ?<br>Who wish to learn ?|| Satisfy<br>requirements (%)
| |
− | |-style="background:#FFAA0066;border-color:#FFAA0099"
| |
− | | high || Server maintenance || 1. Has back-end sysop knowledge<br>2. Has access rights to WMFr server (see WMFr sysop).<br>3. Knows how to maintain/restarts scripts and processes.<br>4. Knows how to restart NGINX server || NEEDS SOMEONE || 0% so far
| |
| |- | | |- |
− | | high || Edit recording wizard JS library || 1. Has advanced javascript know how.<br>2. Knows where js code is {link to js repository}<br>3. Edit and test js code locally .<br>4. Has access rights to push. || None or Nicolas Vion ? || 0% or 60%
| + | ! Daily recordings over April and May 2024 || |
| |- | | |- |
− | | high || Deploying fix into production || 1. Has back-end sysop knowledge<br>2. Has access rights to server. <br>3. Has access rights to pull corrected code.<br>4. Knows how to rebuild/deploy. || WMFR's new server sysop ? (to be hired) || Uncertain | + | | |
− | |-
| + | <query _pagination="40"> |
− | | high || Add new language to LinguaLibre || 1. Has <code>administrator</code> user rights<br>2. Can read tutorial {add tutorial link here} || Has done it: Pamputt, Lyokoy, Yug, ... || 90%
| + | SELECT |
− | |-
| + | ?yearmonthday |
− | | high || Read Phabricator task, fix code || 1. Has background knowledge to understand bug description.<br>2. Edit code, test locally.<br>4. Has access rights to push. || No replacement for real code, code deployment.<br>Replacements available for CSS, wiki content fixes. || 30%
| + | (COUNT(DISTINCT ?record) AS ?records) |
− | |-
| + | (COUNT(DISTINCT ?speaker) AS ?speakers) |
− | | medium || Assign user rights || 1. Has bureaucrats status<br>2. Know how to assign new user rights. || [[Special:ListUsers/sysop]]: 0x010C, GrandCelinien, Pamputt, Xenophôn.<br>Few more wouldn't hurt to counter unequal activity levels. || 100%
| + | (COUNT(DISTINCT ?language) AS ?languages) |
− | |-
| + | WHERE { |
− | | medium || Github repository manager || 1. Have access to repository {link here}.<br>2. Has <code>owner</code> status.<br>3. Can manage userrights || Has basic understanding: Yug, Pamputt... || 70%
| + | ?record prop:P5 ?speaker . |
− | |-
| + | ?record prop:P4 ?language . |
− | | medium || Create tasks on Phabricator || 1. Have account on phabricator.<br>2. Has background knowledge to write sharp bug / task description.<br>3. Know to manage Phabricator tasks || Pamputt, Lyokoï, Yug, ... || 70%
| + | ?record prop:P6 ?date . |
− | |-
| + | BIND( SUBSTR(str(?date), 0, 11) as ?yearmonthday ) |
− | | low - assumed very stable || LinguaLibre -> Wikimedia Commons API communication || 1. Knows NodeJS (?) scripts.<br>Well documented on [[:mw:API]].<br>2. Knows where to edit existing nodes scripts.<br>3. Can test locally.<br>4. Has access rights to push. || Has basic understanding: Yug, others ? || 10%
| + | { SELECT ?record |
− | |-
| + | WHERE { |
− | | low || Update site's CSS || 1. Edit [[MediaWiki:Common.css]] (hack)<br>or<br>1. Edit {git repository page}. || Has basic understanding of the hack way: Yug, ... || 30%
| + | ?record prop:P2 entity:Q2 . |
− | |- | + | ?record prop:P6 ?date . |
− | | colspan=5| <small>Please help describe where we need help to take on 0x010C's skills. This year long's departure is the opportunity to us to increase our know how in these various fields. See also: [[:Commons:Category:Lingua_Libre]].</small>
| + | FILTER(?date >= "2024-04-01T00:00:00Z"^^xsd:dateTime) |
− | |-
| + | FILTER(?date < "2024-05-30T00:00:00Z"^^xsd:dateTime) |
− | | colspan=5| <center>[[File:2018-12_Lingua_Libre_webrequest_flow.png|center|700px]]</center>
| + | } |
| + | } |
| + | } |
| + | GROUP BY ?yearmonthday |
| + | ORDER BY (?yearmonthday) |
| + | </query> |
| + | | <= stops on 2024.05.01<br>Note: [[Special:Contributions/Austin Zhang|Austin Zhang]] recorded 174 audios on 05.11 |
| |} | | |} |
− | :Thanks for that overview. For now, the worst part is that there is no developer at all. Without one, I think we can only list all the issues we encounter on Phabricator while waiting for one to be hired by WMFr or someone else. Considering all the tickets opened on Phabricator, a new developer could be busy for several months, especially if he/she does not know the project as well as 0x010C does. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 15:56, 21 September 2020 (UTC)
| + | [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:39, 14 May 2024 (UTC) |
− | ::Yes. I'm quite worried about the sped-up-audio and add-language bugs. The first literally throws hours of effort in the trash and '''pollutes''' existing audio datasets, which is really bad. The second prevents diversity growth. It would be good to mount an emergency budget to pay 0x010C to fix these 2 critical Phabricator issues before he leaves. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:42, 21 September 2020 (UTC)
| |
− | ==== Feedbacks from 0x010C ====
| |
− | Ok! So, I just got a correct phone-call with 0x010C! 0x has great projects ahead it's awesome, so happy.<br>
| |
− | Also, he pointed out the following points:
| |
− | * 0x010C will pass by LL's discussion page to '''add pointers''' to the table above and answer questions.
| |
− | ** This will occurs in late October.
| |
− | * 0x010C will ''not'' be able to '''inspect the speeding-up bug''' : we need to find an alternative to fix this.
| |
− | * 0x010C underlined the most critical need : '''a server sysop''', able to do server maintenance and restarts processes. Some peripheral routines such as the SPARQL counter occasionally fails and needs to be restarted manually.
| |
− | | |
− | First, '''on the speeding-up bug''', we therefore have 2 ways forward:
| |
− | # Corner the bug. We currently suspect it to be linked to Chrome. Test more. With various browsers. Record 30 words, listen to them, then report results (see section below)
| |
− | # Hire a freelance asap. The bug is suspected to be within https://github.com/lingua-libre/LinguaRecorder . There, the last 2 modifications and prime suspects are:
| |
− | #* 2020-05-09 [https://github.com/lingua-libre/LinguaRecorder/commit/102aa5041cbe24255fdb522bb045f693e9ca05fd#diff-e3f94ea1709f1bc0a8f6d9b4d22192f2 src/AudioRecord.js]
| |
− | #* 2020-04-28: [https://github.com/lingua-libre/LinguaRecorder/commit/102aa5041cbe24255fdb522bb045f693e9ca05fd#diff-e3f94ea1709f1bc0a8f6d9b4d22192f2 src/LinguaRecorder.js]
| |
− | | |
− | Secondly, on the '''critical server's maintenance, Wikimedia France's server sysop is also leaving soon''' and the next one is not yet identified. So we may need to send a far reaching call for a server sysop's help, either a volunteer, or the sysop of some friendly chapter ? (UK? DE? IT?) It could equally be the opportunity to open up deeply to non-French member. LL is 3 years old yet most of the LL's admins are French. Not smart. I would especially encourage to open up toward the Indian community, Odia, Tamil, who have been quite active and with high quality feedbacks. They themselves developed a shell-based audio recording tools few years back, so there is there a culture which value orality and acts to protect it. The call shouldn't be limited to this community. The Mediawiki community (techs), Commons, Wikidata, could have some relevant volunters with the needed skillsets. Would be good if we could write together a call to find a volunteer server sysop, together with a call for more diverse contributions to LinguaLibre.
| |
− | | |
− | While we can co-write this call asap, I think we should first corner the speeding-up bug before asking people to join in and contribute. So as long as the speeding-up bug is unsolved, we can only call for a server sysop, not for more contributors.
| |
| | | |
− | Meanwhile, please update the table above as you see fit. I'm sure I forgot a lot of things. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 17:52, 22 September 2020 (UTC)
| + | === Fixed === |
| + | Both IP ranges 2001:41D0:0:0:0:0:0:0/32 and 2001:41D0:0:0:0:0:0:0/33 were subject to a global Wikimedia block at one point (see [https://meta.wikimedia.org/w/index.php?title=Steward_requests/Global&oldid=26774369#Unregistered_users_only_block_for_the_range_2001:41D0:0:0:0:0:0:0/32 Global ban range_2001:41D0:0:0:0:0:0:0/32]). Following our request, the ban has been reconfigured and uploads from LinguaLibre are possible again. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:38, 14 May 2024 (UTC) |
| + | :I can record and upload since yesterday with my account, so that seems fixed. But it seems the stats are still not updated. [[User:Culex|Culex]] ([[User talk:Culex|talk]]) 12:08, 15 May 2024 (UTC) |
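For anyone checking whether a given server address falls under these ranges, here is a minimal Python sketch using the standard <code>ipaddress</code> module. The two ranges come from the message above; the two sample addresses are made up for illustration and are not known Lingua Libre addresses.

```python
import ipaddress

# The two globally blocked ranges mentioned above.
BLOCK_32 = ipaddress.ip_network("2001:41D0:0:0:0:0:0:0/32")
BLOCK_33 = ipaddress.ip_network("2001:41D0:0:0:0:0:0:0/33")

# The /33 is the lower half of the /32, so any address in the /33
# is covered by both blocks.
NESTED = BLOCK_33.subnet_of(BLOCK_32)


def matching_blocks(address):
    """Return the blocked ranges (compressed notation) containing `address`."""
    ip = ipaddress.ip_address(address)
    return [str(net) for net in (BLOCK_32, BLOCK_33) if ip in net]


if __name__ == "__main__":
    print("Is /33 inside /32?", NESTED)
    # Hypothetical sample addresses, for illustration only.
    for sample in ("2001:41d0:1::1", "2001:41d0:8000::1", "2001:db8::1"):
        print(sample, "->", matching_blocks(sample))
```

This is why both block log entries matter: lifting or relaxing only one of the two ranges would not necessarily unblock an address that sits in both.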
| | | |
− | == Speeding-up bug : call for testers == | + | === Logs === |
− | Please, we need testers to corner that nasty bug ! Could you test recording with various browsers ? Go to [[Special:RecordWizard]], pick a random language, record 30 words, listen to them, [DO NOT UPLOAD], then report here the resulting pentad ;)
| + | For reference, I investigated the relevant block logs and upload logs for May 2024.<br>Conclusion: the collapse in uploads is consistent with the IP ban. Still, given bug reports from Akamycoco in *March* and 咽頭べさ [[:c:File:Lingua_Libre_error_2024.webm|on step 4]], I suspect other bugs are still lingering. |
− | * Test list (suggested) : <code>List:Kur/Test</code> (10 words)
| + | {| class=wikitable |
− | * Username : <code>yourusername</code>
| + | !width=50%| Global IP bans |
− | * Speeding bug : <code>true|false</code>
| + | ! Lingualibre uploads logs |
− | * Web browser : <code>name-version</code> | |
− | * OS : <code>name-version</code>
| |
− | * Microphone : <code>internal|external</code> | |
− | [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:00, 22 September 2020 (UTC) PS: {{ping|DSwissK}} | |
− | {|class="wikitable sortable" style="width:100%;" | |
− | ! Tester username || Success rate || Web browser + version || OS + version || Microphone type || Comments | |
| |- | | |- |
− | | DSwissK || 0% (speeding bug occurs at every recording) || Google Chrome 84.0.4183.127 || Android 9 || internal || List:Fra/Dico des Ados (3k+ words) | + | | |
− | |-
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F32&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 18:46, 13 May 2024] EPIC talk contribs changed global block settings for 2001:41d0::/32 talk with an expiration time of 00:51, 10 May 2026 (anonymous users only) (No open proxies <!-- SCLT ID: Possible VPN or Colocation -->) |
− | | ClasseNoes || 0% (speeding bug occurs at every recording) || Google Chrome || ChromeOS || internal || List:Fra/Dico des Ados (3k+ words)
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F32&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 00:51, 10 May 2024] AmandaNP talk contribs globally blocked 2001:41d0::/32 talk with an expiration time of 00:51, 10 May 2026 (No open proxies <!-- SCLT ID: Possible VPN or Colocation -->) |
− | |-
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F33&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 17:02, 9 May 2024] EPIC talk contribs changed global block settings for 2001:41d0::/33 talk with an expiration time of 17:09, 1 May 2027 (anonymous users only) (Open proxy/Webhost: See the help page if you are affected) |
− | | Luilui6666 || 0% (speeding bug occurs<br>on one session) || Google Chrome || MacOS || || 126 (100%) audios of [https://lingualibre.org/index.php?title=Special:Contributions/Luilui6666&offset=20200717000000&limit=128&target=Luilui6666 the 04:5*am upload batch] are corrupted. This session contains longer than average phrases. Session before and after are ok.
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F33&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 17:09, 1 May 2024] EPIC talk contribs blocked 2001:41d0::/33 talk with an expiration time of 2 years, 364 days, 12 hours, 21 minutes and 36 seconds (anonymous users only, account creation disabled) (Open proxy/Webhost: See the help page if you are affected) |
− | |-
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F33&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 17:09, 1 May 2024] EPIC talk contribs globally blocked 2001:41d0::/33 talk with an expiration time of 17:09, 1 May 2027 (Open proxy/Webhost: See the help page if you are affected) |
− | | Yug || 100% (no bug) || Google Chrome 85.0.4183.121 (64-bit) || Ubuntu 20.04 || internal || Observed at step 5. Review before Publish. List:Kur/Test (10 words)
| + | | |
− | |-
| + | * : [https://commons.wikimedia.org/wiki/Special:RecentChanges?hidebots=1&translations=filter&hidecategorization=1&hideWikibase=1&tagfilter=OAuth+CID%3A+1735&limit=500&days=30&urlversion=2 Uploads via Lingualibre resumed]. |
− | | Yug || 100% (no bug) || Google Chrome 85.0.4183.121 (64-bit) || Ubuntu 20.04 || external || Observed at step 5. Review before Publish. List:Kur/Test (10 words)
| |
− | |-
| |
− | | Yug || 100% (no bug) || Chromium 85.0.4183.121 (64-bit) || Ubuntu 20.04 || internal || Observed at step 5. Review before Publish. List:Kur/Test (10 words)
| |
− | |-
| |
− | | Yug || 100% (no bug) || Chromium 85.0.4183.121 (64-bit) || Ubuntu 20.04 || external || Observed at step 5. Review before Publish. List:Kur/Test (10 words)
| |
− | |-
| |
− | | Yug || 100% (no bug) || Kiwi 77.0.3865.92 (2020-08-15) || Android 9 || external || Observed at step 5. Review before Publish. List:Kur/Test (10 words)
| |
− | |-
| |
− | | Yug || 100% (no bug) || Kiwi 77.0.3865.92 (2020-08-15) || Android 9 || internal || Observed at step 5. Review before Publish. List:Kur/Test (10 words)
| |
− | |-
| |
− | | Yug || 100% (no bug) || Chrome 80.0.3987.99 || Android 9 || external || Observed at step 5. Review before Publish. List:Kur/Test (10 words)
| |
− | |-
| |
− | | Yug || 100% (no bug) || Chrome 80.0.3987.99 || Android 9 || internal || Observed at step 5. Review before Publish. List:Kur/Test (10 words)
| |
− | |-
| |
− | | Pamputt || 100% (no bug) || Firefox 78.3.0 esr || Mageia Linux 7 || internal ||
| |
− | |-
| |
− | | DSwissK || 100% (no bug) || Google Chrome 86.0.4240.75 || Android 10 || internal || List:Fra/Dico des Ados (3k+ words)
| |
− | |-
| |
− | | DSwissK || 100% (no bug) || Google Chrome 86.0.4240.99 || Android 10 || external|| List:Fra/Dico des Ados/alea (54 words)
| |
− | |-
| |
− | | <add yourself> || || || || ||
| |
− | |}
| |
− | {{Ping|ClasseNoes|DSwissK}} It works fine with me on both Chrome and Chromium. Did you and could you try with other OSes ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:59, 6 October 2020 (UTC)
| |
− | :{{Ping|Titodutta|Lyokoï|Pamputt}}, we need help to corner the speed-up bug by testing on more devices. Could you help? The INALCO workshop is on Oct. 17th. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 17:32, 6 October 2020 (UTC) | |
− | ::I've added the information relative to my configuration. Another explanation may be the internet quality (bandwidth, latency, etc.). In the case of micro-cuts, some software accelerates the voice to make up for the delay. So maybe it could happen more often with a 3G connection than with optical fiber. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 19:50, 6 October 2020 (UTC)
| |
− | :::0x010C was suggesting a purely client-side issue. The recording into audio data is done client-side. I don't see a clear pattern emerging so far. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 21:52, 6 October 2020 (UTC) | |
− | ::::It's definitely NOT internet quality for ClasseNoes and myself. We had a good connection (not over mobile) and the bug occurred several days apart. [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 06:19, 8 October 2020 (UTC)
| |
− | :::::'''Current assessment:''' 3 out of 3 of our users affected by the speeding bug used Google Chrome (v84?), on Android 9, ChromeOS and macOS. The best lead we have so far is an issue related to Google Chrome's implementation of the recording API, due either to a recent Google Chrome update or to the recent LinguaLibre JS update done by 0x010C around May (see above for a link to the suspected JS code). I took a quick (3 min) look at "[https://www.google.com/search?q=Chrome+audio+recording+speed+bug&oq=Chrome+audio+recording+speed+bug&uact=5 Google Chrome + audio recording + speed bug]" but found nothing conclusive.
| |
− | :::::'''More tests?:''' Could you [[User:DSwissK|DSwissK]] & [[User:ClasseNoes|ClasseNoes]]* test again on the same devices (hardware, OS) but with a different web browser?
| |
− | :::::@ClasseNoes: could you check your exact Google Chrome's version ?
| |
− | :::::@DSwissK, after retesting on Android 9 with Chrome v.84, could you update Chrome to v.85 and test again on Android 9? Android 9 with Chrome v.85 works for me on small test lists (~10 items). Could you also say more: does this speeding bug show up ''EACH'' time you use this Android 9 / Chrome v.84 pair? Is there a saturation effect with longer lists?
| |
− | :::::<nowiki>*: Luilui6666 is a student who did a paid recording sprint on Cantonese and moved on. He has already voluntarily helped a lot with the `ratelimit` bug, so I can't request further free help from this side. PS: 270€ for 9h and 5000 audios despite the ratelimit bug. Really fruitful experiment! Worth it to boost a language.</nowiki> [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 12:18, 8 October 2020 (UTC)
| |
− | :::::: I was thinking maybe it was because of the huge list I'm using ([[List:Fra/Dico des Ados]]) but no, it works fine (see last row) on last Chrome version (and Android 10 that I flashed this week-end on the same smartphone). [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 07:23, 11 October 2020 (UTC)
| |
− | :::::::{{ping|DSwissK}} I suspect Google Chrome v84 to be the issue. But we have not enough details to be conclusive. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 08:49, 15 October 2020 (UTC)
| |
− | :::::::After verification, Luilui6666 had both a corrupted batch and a non-corrupted batch of audios about 30 min apart. Recordings are OK before and after [https://lingualibre.org/index.php?title=Special:Contributions/Luilui6666&offset=20200717000000&limit=128&target=Luilui6666 the corrupted session]. My Chrome-centered hypothesis is challenged. It must be something else. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:45, 17 October 2020 (UTC)
| |
− | ::::::::Yug, how did it go on Oct 17 ? Did you encounter that problem with some users ? [[User:DSwissK|DSwissK]] ([[User talk:DSwissK|talk]]) 11:10, 20 October 2020 (UTC)
| |
− | :::::::::{{ping|Eavq|wikiLucas00|DSwissK|Nicolas_Lopez_de_Silanes_WMFr}} Hello DSwissK. I was not there, I am too far away (near Spain, 800 km). We need to ask WikiLucas, Eavq and Nicolas. See also [[LinguaLibre:Formations_CCWL]]. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:16, 22 October 2020 (UTC) | |
− | | |
− | == Lingua Libre Story for September 2020 ==
| |
− | | |
− | :''This is not an official story or newsletter. This is an attempt by the project user(s) to share some updates about the program. There might be more stories which we have missed.''<br>
| |
− | September 2020 was an eventful month and we have seen a lot of activities of uploading new content and also around project-related discussion. Here are some of the best stories from September 2020.
| |
− | * '''300,000 files:''' On 10 September 2020 we completed 300,000 pronunciation uploads. After the launch in August 2018, the first 100,000 files were uploaded in April 2019, and the milestone of 200,000 files was reached in January 2020. As of 30 September 2020 there are 366 speakers at this project working in 92 languages.
| |
− | * '''Maximum number of pronunciations in a month:''' In September 2020, 23,209 files were uploaded. This is the maximum number of files uploaded ever in a particular calendar month (earlier it was 22,963 files in June 2020, and 22,293 files in May 2019).
| |
− | * '''Indian language in top 3 list:''' This month Bengali language came into the top three languages by the number of files uploaded using Lingua Libre. This is possibly the first time a non-European/Indian language came into the top three most-uploaded languages on the project. As of 30 September there were 26,757 files in Bengali (the top two languages by file count were French: 164,626 files and Esperanto: 28,100)
| |
− | * '''Project chat:''' Several discussions started on the Chat room, such as [[LinguaLibre:Chat_room#Speeding-up_bug_:_call_for_testers|Bug testing]] (you may help), [[LinguaLibre:Chat_room#0x010C_year_offgrid_:_preparations|Technical preparations]] etc.
| |
− | * '''Coming:''' 1) Oct. 17th's workshop at [[:en:INALCO|INALCO University]], Paris. This university teaches about 105 languages. 2) In late October, [[User:0x010C]] is willing to share server know-how before a year-long departure off-grid. | |
− | That's it. Have a good time. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 16:30, 1 October 2020 (UTC)
| |
− | :Thank you [[User:Titodutta]], it's an interesting format. We can also think of it as collaborative news-letter, edited here, then shareable to our networks. I added a "Coming" section. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:31, 4 October 2020 (UTC) | |
| | | |
− | == English label and non-English label ==
| + | 13 May 2024 |
− | Most probably I did not notice this earlier [[Q389651]]. Label: this is not English, this is Bengali. My language is set to Bengali as well. Sad thing is: this affects many Bengali files, if not all. I also saw one of the recent uploads in other language: Esperanto: [[Q389566]]. For Bengali, and several other languages the script is completely different. (a whole lot of bot work I think). Opinion? --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 22:04, 4 October 2020 (UTC)
| + | * [... Many more uploads] |
− | :{{ping|Titodutta}} not sure if this is really a bug. This behaviour comes from the fact that Lingua Libre uses Wikibase to handle its own items. Wikibase allows as many labels as there are languages, but we do not actually need any label on Lingua Libre. So, by default it is always English. That said, I understand it can be weird for some people, so I think the label should be the word that has been recorded, both in English and in the language of the word, so that it is displayed as is when you use Lingua Libre in your mother tongue. Or maybe it could be the same label for all languages. Anyway, although it is a bit strange, it is not a big deal because these labels are not used by anyone. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 19:41, 6 October 2020 (UTC) | + | * Upload log 23:39 Elwinlhq talk contribs uploaded File:LL-Q5218 (que)-Elwinlhq-apaqay.wav Tag: Lingua Libre [2.2] |
− | | + | * Upload log 19:05 Assassas77 talk contribs uploaded a new version of File:LL-Q9192 (cmn)-Assassas77-八角.wav Tag: Lingua Libre [2.2] |
− | == Other bugs from India ==
| + | * Upload log 19:05 Assassas77 talk contribs uploaded File:LL-Q9192 (cmn)-Assassas77-八角.wav Tag: Lingua Libre [2.2] |
− | Hi [[User:Titodutta|কথা]], I was happy to meet you tonight even if it was short. About the bugs you discussed, I have created [[phab:T264790|T264790]]. There is also the problem with the labels discussed above. You talked about a problem with duplicates in the word list, but I am not sure I understood correctly because I was not able to reproduce it. So could you open a bug report on Phabricator to describe what is wrong? If you are uncomfortable with Phabricator, you can describe the problem here and I will open the ticket over there. There was also another point, but I have forgotten it, so could you kindly remind me? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 19:45, 6 October 2020 (UTC)
| + | * Upload log 16:38 Oh! Tea<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Oh!_Tea Commons > User:Oh!_Tea : « nothing » on Commons]</ref> talk contribs uploaded File:LL-Q36759-Austin Zhang-sih8 buh8 sah8 nah4.wav Tag: Lingua Libre [2.2] |
− | * The 4 bugs we discussed today: | + | 11 May 2024 |
− | :a) Special pages showing errors, you have explained it above. | + | * Upload log 20:21 Oh! Tea talk contribs uploaded File:LL-Q36759-Austin Zhang-buah8.wav Tag: Lingua Libre [2.2] |
− | :: {{done}} [[phab:T264790|T264790]]. | + | * [... +172 recordings by User:Oh! Tea] |
− | | + | * Upload log 18:56 Oh! Tea talk contribs uploaded File:LL-Q36759-Austin Zhang-a2.wav Tag: Lingua Libre [2.2] |
− | : b)Post file move error on Wikimedia Commons: it has 2 types of problems: on Commons, after moved the files are not displayed, b) "Remove words already recorded" don't count those words, so if LL-...Hello.wav is moved to LL-...Bonjour.wav, LinguaLibre does not understand Bonjour is already recorded, and asks to record again. | + | 10 May 2024 |
− | :: This is currently an issue. Ideally, recordings should not be renamed manually, because what you described then happens. The problem comes from the fact that the Commons database and the Lingua Libre Wikibase are not connected. The workaround is to manually modify the Lingua Libre item corresponding to the recording once the renaming has been done on Commons.
| + | * Upload log 06:08 CapitainAfrika<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:CapitainAfrika Commons > User:CapitainAfrika : « IP block exempt » on Commons]</ref> talk contribs uploaded File:LL-Q36217 (lin)-CapitainAfrika-Wiki na monɔkɔ mua bísó.wav Tag: Lingua Libre [2.2] |
− | :: If Wikimedia France finds some money, a tool allowing to rename and to apply the needed changes in Lingua Libre could be developed. See [[phab:T264789|T264789]] for a brief overview (you should develop further what we would like this tool be able to do). | + | * Upload log 00:14 Ardzun<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Ardzun Commons > User:Ardzun : « nothing »]</ref> talk contribs uploaded File:LL-Q13324 (min)-Ardzun-mada.wav Tag: Lingua Libre [2.2] |
− | | + | 9 May 2024 |
− | : c) some words are being eliminated: I'll try to explain this: a particular word, such as "Paris", when I try to generate from a category from Wikipedia, I get this word and record it. Then I try another option: "Nearby" to generate words, and there also I get the same word. Now, ''possibly'' LiLi ''sometimes'' fails to understand the word is already recorded. This is not applicable for all files, I have seen this in 3-4 files. <br>Let me give a clear example, see [https://lingualibre.org/index.php?title=Q381622&oldid=330125#P19 this edit]. This is pretty clear I got the word from Wikipedia. The immediate [https://lingualibre.org/index.php?title=Q381622&type=revision&diff=339010&oldid=330125 next edit] I used "Nearby" to get a list.<br>I have checked it for words. I once I can find more samples, I'll report a bug. | + | * Upload log 17:08 Àncilu<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Àncilu Commons > User:Àncilu : « Autopatroller » on Commons]</ref> talk contribs uploaded File:LL-Q652 (ita)-XANA000-orsù.wav Tag: Lingua Libre [2.2] |
− | :: Hard to debug. I tried several times with several words and I was not able to reproduce this bug. Are you sure you clicked on "Remove words already recorded"? I am not sure opening a bug report is very useful until you have a word that triggers this bug. Did you try to rerecord [[Q381622]] to see whether this bug occurs again? | + | * Upload log 17:05 Àncilu talk contribs uploaded File:LL-Q652 (ita)-XANA000-frac.wav Tag: Lingua Libre [2.2] |
− | | + | 5 May 2024 |
− | : d) There are actually more bugs, which we did not discuss, that may need quick fix: For example if you use Vector Skin ([https://lingualibre.org/wiki/LinguaLibre:Main_Page?useskin=vector click here for preview]) at the top of the page it says "A maintenance operation is planned for today. ..." I am seeing this message for 2 months now. It might be a minor fix, perhaps we forgot to remove this notice. b) vector skin main page may need more work, as the main page is designed for LiLi skin, the recent files and other nice designs are not working in vector at all. | + | * Upload log 21:15 Benoît Prieur<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Benoît_Prieur Commons > User:Benoît_Prieur : « Administrator » on Commons]</ref> talk contribs uploaded File:LL-Q8785 (hye)-Benoît Prieur-Artsakh.wav Tag: Lingua Libre [2.2] |
− | :: Lingua Libre supports officially only one skin (BlueLLs). I guess Vector has not been disabled when we moved to the new version of the website. I will open a ticket to ask to remove Vector so that there is only one skin to support. See [[phab:T265079|T265079]]. | + | 1 May 2024 |
− | | + | * Upload log 16:09 Penn Zero MSSJ<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Penn_Zero_MSSJ Commons > User:Penn Zero MSSJ : « nothing » on Commons]</ref> talk contribs uploaded File:LL-Q9199 (vie)-Penn Zero MSSJ-hệ số.wav Tag: Lingua Libre [2.2] |
− | : e) Coming to internationalization (which is not a bug): some important pages also need to be in English, such as [[LinguaLibre:Privacy_policy]]. Of course the page can be marked for translation; however, until the page is also in English, it might be a bit difficult to translate directly from French. | + | * Upload log 16:09 Penn Zero MSSJ talk contribs uploaded File:LL-Q9199 (vie)-Penn Zero MSSJ-hỗn số.wav Tag: Lingua Libre [2.2] |
− | :: This is known. A working group is planned to work on the documentation page later this month. We will move all pages to English and make them translatable before the end of the year. | + | * Upload log 16:09 Penn Zero MSSJ talk contribs uploaded File:LL-Q9199 (vie)-Penn Zero MSSJ-hằng đẳng thức.wav Tag: Lingua Libre [2.2] |
− | | + | * [... Many more uploads] |
− | :(Fun fact: You saw I used the word "LiLi". Some time ago I posted on your talk page about the pronunciation of "LinguaLibre". I am aware that in the Indian community we often use the short form LiLi/Lili in our discussions, which is a [https://www.sheknows.com/baby-names/name/lili/ female name] in multiple languages including a few Indian languages.) | |
− | : It was indeed good to talk to you. It was very kind of you to switch to English briefly. As we are gradually seeing more contributions from India and other countries, we could possibly have a global meet or a France-India meet in future. | |
− | Regards. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 20:47, 6 October 2020 (UTC)
| |
− | :: {{support}} I approve using the nickname "'''LiLi'''"! — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 11:05, 7 October 2020 (UTC)
| |
− | ::Yes, it's an elegant nickname. Thanks for the suggestion :) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:58, 7 October 2020 (UTC) (The written "LL" doesn't translate well when we say it aloud in French.) | |
− | {{ping|Titodutta}} for '''e)''', I've marked the page for translation, and translated it into English :). Please do not hesitate if you see other pages in the same situation. — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 13:08, 8 October 2020 (UTC)
| |
− | :{{ping|Titodutta}} I answered point by point in your text to make the discussion more understandable. Feel free to reply below. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 19:08, 8 October 2020 (UTC)
| |
− | | |
− | == Add "Recent changes (non-audio)" to "Tools" menu ==
| |
− | :2. Reverted. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:06, 9 October 2020 (UTC)
| |
− | :1. {{Done}} -- this allows quick access to the list of recent changes, without the MASSIVE overload of audio recordings. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:21, 8 October 2020 (UTC)
| |
− | I just found out my [[User:Yug/common.js]] doesn't work; it doesn't even run a simple <code>console.log("Hello world!")</code>. Any idea why?
| |
− | | |
− | [[MediaWiki:common.js]] does work as expected. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:52, 8 October 2020 (UTC)
| |
− | :[[:mw:Manual:Interface/JavaScript]]: ''"If $wgAllowUserJs is set to true, users can customize the interface for only themselves by creating and importing personal scripts in certain user subpages."'' [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:59, 8 October 2020 (UTC)
| |
− | :Damn. I wanted to test on myself before to move to [[MediaWiki:common.js]] [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:00, 8 October 2020 (UTC)
| |
− | ::{{ping|Yug}} I reverted what you did. I think it is not a good idea to enable it for everyone because it loads more javascript for something that almost no one uses; it is possible to get the same results in a few clicks. So please, add this code only in your [[Special:MyPage/common.js|common.js]]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 19:20, 8 October 2020 (UTC)
| |
− | :::Hello Pamputt. As explained above, [[Special:MyPage/common.js]] is not activated on LL so I couldn't test it there first and we aren't able to do so. So I went ahead and tested this non-breaking change on the site-wide Common.js.
| |
− | :::Ok for the revert. Review and community discussion and approval was required, your input and revert are meaningful parts of this required discussion.
| |
− | :::'''As for the whole rationale...''' ''Recent changes'' is an access point which mainly allows active users to do patrolling.
| |
− | :::The current Recent changes ([[Special:RecentChanges]]) access point displays just the last 50 changes, while LiLi records between 300 and 700 audios per day. The stream of recent changes is therefore overloaded by a large number of audio files which no one but the speaker will actually create, edit or review. Does anyone browse those 300~700 audio changes daily? Listen to them? Can you or I review the correctness of Bengali recordings' file names? Or review the correctness of their content? Unlikely. As far as I can see, it's a stream of "done" things: there is no practical patrolling to do on this flow of audio files, nor is there any need to patrol them. This situation is specific to LinguaLibre. Most wikis are text-based. [[:Commons:Special:RecentChanges]] is a hybrid, with uploads but also a lot of file renaming, editing, discussions and project pages, so the stream is a mix. LinguaLibre file work is close to 100% uploads by the speaker. Then nothing: the stream is dispatched to Commons.
| |
− | :::On the other hand, the meaningful changes done on textual pages which require active monitoring is made harder since these textual changes are buried down among the number of audio files. It seems to me that patrolling edited text-pages (documentation, discussion, user pages) seems as or more relevant, but is currently made inaccessible or accessible via a more complex access point.
| |
− | :::But definitive adoption would need consensus, which we don't have. So no quick adoption is needed. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 08:57, 9 October 2020 (UTC)
| |
− | :* Suggestions: a) enable user common.js and common.css pages (I was not aware that these pages are disabled!), AND/OR b) put this as an opt-in gadget at [[Special:Preferences#mw-prefsection-gadgets|Special:Preferences/Gadgets]]. The "Gadgets" page is empty now, and gradually gadgets can be added, mostly to be opted in (or manually enabled) by interested users. Kind regards. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 23:20, 9 October 2020 (UTC)
| |
− | ::I do not know why user common.js is disabled. Anyway, I added a new gadget for this setting. So feel free to enable it. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 09:51, 10 October 2020 (UTC)
| |
− | :::I think the default php setting is without personal js. The developer has to set <code>$wgAllowUserJs</code> to true, which 0x010C apparently never did because the need never arose before. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 13:35, 12 October 2020 (UTC)
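
As a rough illustration of the "non-audio recent changes" idea discussed in this thread, the sketch below builds a MediaWiki API query (<code>list=recentchanges</code>) that leaves out the main namespace. It assumes that recording items live in namespace 0 and that the API is served at <code>https://lingualibre.org/api.php</code>; neither is confirmed above, and the namespace list is a hypothetical choice.

```python
from urllib.parse import urlencode

# Assumed endpoint; the thread does not state where the API is served.
API_URL = "https://lingualibre.org/api.php"

# Hypothetical choice: a few talk/user/project/help namespaces,
# leaving out namespace 0 where recording items are assumed to live.
NON_AUDIO_NAMESPACES = [1, 2, 3, 4, 5, 12, 13]


def recent_changes_url(namespaces, limit=50):
    """Build a list=recentchanges query restricted to the given namespaces."""
    params = {
        "action": "query",
        "list": "recentchanges",
        # Several namespace ids are passed as one pipe-separated value.
        "rcnamespace": "|".join(str(ns) for ns in namespaces),
        "rclimit": limit,
        "rcprop": "title|user|timestamp|comment",
        "format": "json",
    }
    return f"{API_URL}?{urlencode(params)}"


print(recent_changes_url(NON_AUDIO_NAMESPACES))
```

The function only builds the URL and performs no network request, so it can be tried without touching the live site.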
| |
− | | |
− | == Translation ==
| |
− | I translated into English several pages ([[LinguaLibre:Privacy policy]], [[Help:Your first record]] and [[LinguaLibre:About]]) that were originally written in French, and marked the new versions for translation (I also marked for translation the latest version of [[LinguaLibre:Stats]], which includes the latest crossed thresholds in the description paragraph, and a new row in a table). I think the translation of the pages will be easier for non-French-speakers as from now. Though, '''''every translation of these pages - '''except in French''' - is now outdated (for all or part of it)'''''.
| |
− | | |
− | While translating (or patrolling other people translating), please be careful with the code (one should not change the code on translation pages, only on the main page (/en)). — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 11:50, 10 October 2020 (UTC)<br/>
| |
− | <small>PS: for the Stats page, I changed the translation areas, to limit the amount of code in the translation, in order to limit the risks of translators breaking the code — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 22:13, 10 October 2020 (UTC)</small>
| |
− | | |
− | :Perfect. Meanwhile, I finished translating the main page and I added a button at the bottom to be able to translate this page. So, go ahead :) [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 15:25, 10 October 2020 (UTC)
| |
− | ::{{ping|Pamputt}} Great! All of this is a big step forward {{smile}}. I think we could also take some time to design improvements for the chat room. Like most "Village pumps", we could divide it into two parts: on the one hand permanent content, such as the FAQs, and on the other hand content such as this topic, archived after a while (for instance every 3-6 months, since it is not too active), in order to make the page lighter and easier to read. Also, the link "Start a new discussion" does not seem to be working. When I look at the source text of the Header it's clickable, but not on the actual page... — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 21:25, 10 October 2020 (UTC)
| |
− | :::{{ping|WikiLucas00}} Good idea to create a FAQ page in parallel of the chat room; feel free to start one. If you are interested in documentation, there is a meeting end of October (https://framadate.org/1C4aA6vVYWz2izgp). About archiving the chat room, this is done once a year (manually) ; see [[LinguaLibre:Chat_room/Archives/2018]] and [[LinguaLibre:Chat_room/Archives/2019]]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 08:25, 11 October 2020 (UTC)
| |
− | | |
− | == Stats page ==
| |
− | Is the Stats page loading now? <br>Some parts of the source page should not be taken by Fuzzybot to other language pages as it is. For example, the Statistics (En) page had language labels in "Fr" (which should have been in En, as the page was in En), while translating I fixed it. Now it is again broken, and I can not edit the bn page, other than translating. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 22:12, 10 October 2020 (UTC)
| |
− | :{{ping|Titodutta}} You are not able to change the labels while translating? for instance in this section, you could replace the words that are within quotation marks by words in Bengali: <nowiki><query yearmonth="Date" records="New records" speakers="Active speakers" languages="Active languages"></nowiki> — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 22:16, 10 October 2020 (UTC)
| |
− | ::Thanks, yes, [https://drive.google.com/file/d/1gUo138CfGh3Y6noWwQ144d7tMuCO2v01/view?usp=sharing translating] seems to be the only option. Regards. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 22:38, 10 October 2020 (UTC)
| |
− | ::* The stat page seems to be much slower now to load. Can anyone else check please? Regards. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 00:02, 11 October 2020 (UTC)
| |
− | : {{ping|Titodutta}} I just understood you were talking about the name of each language in the table, and not the title of the columns. Sorry for this. I added a new section to the translation; it's mostly some code, and in theory the translator only has to insert the language code instead of "en" in the section. The thing is, only "fr" and "en" seem to be working... I set the English stats page and every translated stats page to "en", except for French (it made more sense like this). I don't know where to find the "languageLabel" in order to translate them into other languages... <br/> I tried many changes to the requests and really felt the slowness of the current system while waiting every time for the tables to load. {{ping|Pamputt}}, do you know if we could add a cache to these pages (to be purged on a regular basis), to avoid having to load the whole request every time? (For translated stats pages, the waiting time is so long that I don't think many people wait until the end.) — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 00:54, 11 October 2020 (UTC)
| |
− | :* Bn is working fine at [[User:Titodutta/প্রশ্ন]], other languages should work fine as well. Each language is an item such as [[Q126]], which needs labels in different languages. Thanks. --[[User:Titodutta|টিটো দত্ত (Titodutta)]] ([[User talk:Titodutta|কথা]]) 01:00, 11 October 2020 (UTC)
| |
− | ::I am not really an expert of the SPARQL system. [[User:VIGNERON|VIGNERON]] knows much more about that. I only know there is a [[phab:T212079|bug report]] about the performance issues. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 08:11, 11 October 2020 (UTC)
| |
− | | |
− | | |
− | ==Priorities of Lingua Libre==
| |
− | I've looked around and I can't seem to find any priorities of this project. It seems that the overall goal is to record pronunciation, but how this will be done is less clear. Based on my experience with Forvo, I think that this will help the project.
| |
− | | |
− | === Words priority ===
| |
− | Focus on pronouncing headwords first. Forvo is flooded with overly specific phrases that only a few users will use. It would be helpful to scrape a large authoritative dictionary such as the OED, Duden, or TLFI to get a list of words. I don't think that words are under copyright.
| |
− | : We recommend frequency lists and authoritative lists, but the copyright status of those is ambiguous. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:47, 21 October 2020 (UTC)
| |
− | :: Could we use wiktionary to help create this official list? [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 22:45, 21 October 2020 (UTC)
| |
− | :::{{ping|Languageseeker}} see [[User:Titodutta#কোয়েরি]]'s wiki query. I think we can use wiktionary, but I am unclear how. Maybe it's even available in the Wizard as a built-in feature when you choose the list. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:41, 22 October 2020 (UTC)
| |
− | | |
− | === Words variations ===
| |
− | For each headword, pronounce it with the definite, indefinite, and solo; e.g. "the dog", "a dog", "dog". Also pronounce the declined forms in languages such as Latin or German. Group them all on one page under the headword. For phrases, there's no need to inflect or decline them.
| |
− | : Words variations and verbs are typical to English and western languages. I'am not sure how each language process those questions, but I think we have no recommendation in place. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:47, 21 October 2020 (UTC)
| |
− | :: This is why I believe it is important to put recommendations in place before the site gets too large. We don't want to have to manually deal with these issues later. I know that the editors of Forvo are struggling with precisely this issue, especially in English. The best thing to do would be to create a bot to tag alternative spellings, generate files for them, and automatically generate the pages for the alternative spelling. For example, in French, you have électroménager and électro-ménager. If a user pronounces either one of these orthographic variations, the bot should generate files and pages for both variations. Otherwise, we'll effectively be asking users to pronounce the same word multiple times. It would also probably be helpful to create a bot to scrape alternative spellings from wiktionary. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 22:44, 21 October 2020 (UTC)
| |
− | :::Maybe we should consider "Portals" per language, with the specific tips, recommended lists, and active/reference users. Seems a good idea (IF someone is ready to tackle it XD) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:18, 22 October 2020 (UTC)
| |
− | :::: {{ping|Yug}} You inspired me to create a [[phab:T266306|phabricator ticket]] on how to do this. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 02:00, 23 October 2020 (UTC)
| |
− | | |
− | === For verbs ===
| |
− | It's best to focus on the irregular and model ones first. It also makes sense to pronounce them in all the possible permutations. For example, in French, il/elle/on est should have the following entries: "il est", "elle est", "on est", "il/elle/on est", and "est".
| |
− | : As for the previous point, each user is free to record whatever he/she wants. So the question becomes how to manage/organise/browse all the recordings. On that point, everything remains to be done, so your point of view is more than welcome. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:06, 21 October 2020 (UTC)
| |
− | | |
− | === User supplied lists ===
| |
− | They are great, but quickly turn into a headache. They require lots of proofreading that can overwhelm editors. Only after we finish pronouncing all the headwords and verbs should we open this to general suggestions.
| |
− | :For "user supplied lists" and "site supplied lists" there are some ongoing efforts on our side. A user can create a list, which becomes a site supplied list for later users. [EDIT]: We have help pages recommending and demonstrating how to create frequency lists for better impact. See [[Help:Main]]> search "frequency". [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:47, 21 October 2020 (UTC)
| |
− | :: If I understand correctly, you suggest having "official" lists that are proofread so that we can propose these lists in priority to the users. I think it is a good idea because some users do not always know which words to record. We should open a Phabricator ticket to keep track of that feature request. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:06, 21 October 2020 (UTC)
| |
− | ::: Precisely, I'm not against allowing speakers to pronounce whatever they want, but we should have some official list of what we want pronounced. This is why I also suggested the different forms that we should target, especially if we begin to suggest words for users to pronounce. Otherwise, people will begin adding lists with misspelled words or phrases that have no widespread usage. This will create unnecessary work for editors to correct and delete. Do we really want discussions about whether or not "The Pink Adrietic restaurant will closed today at 9:30 due to an alien invasjon" should be on the official list at this stage? What will we do if a user adds 60,000 of these? [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 22:56, 21 October 2020 (UTC)
| |
− | ::::(Note: I edited my paragraph above.) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:43, 22 October 2020 (UTC)
| |
− | | |
− | === Site supplied lists ===
| |
− | Such lists of words that users can pronounce are better from a project management standpoint. This list should be randomized at each refresh in case the user is not interested in the selection. Let users see the list first and then make them log in.
| |
− | : I am not sure to understand exactly what you mean. Is it related to the previous item? Could you give an example? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:06, 21 October 2020 (UTC)
| |
− | :: It's related to the previous item. If we create official lists, we should not adopt the last-in/first-pronounced model of Forvo. On Forvo, the last word added is the first word in the list for users to pronounce. Speakers have no option to change the way that the list is generated. I believe that we should have more flexibility. Instead of displaying the last words first, I'm proposing that LL randomly sorts the list by default. We could also add a drop-down menu with: Random, Newest, and Oldest. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 23:00, 21 October 2020 (UTC) Edit: Official lists can also help prevent unnecessary duplication of effort. Look at the entry for "arbre" on French Wiktionary. Do we really need 32 pronunciations of "arbre"? Does it make sense to add 32 different sound files to one wiktionary page? [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 16:06, 22 October 2020 (UTC)
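The Random / Newest / Oldest drop-down proposed above could be sketched as follows. This is only an illustration, not Lingua Libre code: the function name `order_words`, the `(word, added_timestamp)` tuple shape, and the `seed` parameter are all assumptions introduced for the example.

```python
import random

def order_words(entries, mode="random", seed=None):
    """Return list entries in the order they should be shown to a speaker.

    entries: list of (word, added_timestamp) tuples (hypothetical shape).
    mode:    "random" (proposed default), "newest" or "oldest".
    seed:    optional, only there to make the shuffle reproducible.
    """
    if mode == "random":
        shuffled = list(entries)          # copy, so the caller's list is untouched
        random.Random(seed).shuffle(shuffled)
        return shuffled
    if mode == "newest":
        return sorted(entries, key=lambda e: e[1], reverse=True)
    if mode == "oldest":
        return sorted(entries, key=lambda e: e[1])
    raise ValueError(f"unknown mode: {mode}")

# Toy data: the timestamp is just an integer here.
words = [("arbre", 3), ("cigale de mer", 1), ("chien", 2)]
oldest_first = order_words(words, "oldest")
newest_first = order_words(words, "newest")
```

With a random default, a rarely recorded entry such as "cigale de mer" has the same chance of appearing at the top as a recently added one.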
| |
− | :::Note: There are various schools of thought here on LinguaLibre:
| |
− | :::* the '''linguists''', who would be happy to have 200,000 versions of "arbre" so one could study the variability.
| |
− | :::* the '''language teachers/learners/learning apps developers''', who want one clear and standard speaker for the 10~30,000 most frequent words, just once each and with zero holes in the dataset.
| |
− | :::[[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:45, 22 October 2020 (UTC)
| |
− | :::: I'm not for limiting the maximum number of pronunciations for an item, but I do not want a situation where "arbre" has 200,000 pronunciations and "cigale de mer" has zero. A site supplied list will make it more likely that rarer words will receive at least one pronunciation. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 02:07, 23 October 2020 (UTC)
| |
− | :::::+1. Maybe a specific label within the list's pagename. Ex: List:CMN/HSK-0001-to-8868_(RECOMMENDED) ?
| |
− | :::::This decision could be made within a language community via its Portal. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:11, 23 October 2020 (UTC)
| |
− | ::::::+1. I like the idea of adding labels. I think this should also help avoid any copyright claims. We can add tags such as "HSK 1.1" (HSK Revision 1 Level 1), "HSK 2.1" (HSK Revision 2 Level 1) and "HSK 3.1" (HSK Revision 3 Level 1) for the various iterations of HSK, and that way a user can easily see all the words pronounced for a given language list. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 00:48, 24 October 2020 (UTC)
| |
− | | |
− | === Accents matter ===
| |
− | :''See also: [[Help:Renaming]] (using metadata tags).
| |
− | They should be tagged as part of the filename. For example, <code>LL-Q1860_(eng)-Commander_Keane-phonate.wav</code> contains no accent information. <code>LL-Q1860_(eng_Au)-Commander_Keane-phonate.wav</code> would be better. Also, allow users to filter by accent codes. We would need to think about and propose a list of accents for each language.
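To make the filename proposal concrete, here is a minimal parsing sketch. It assumes the pattern `LL-<language Qid>_(<ISO code>[_<accent>])-<speaker>-<word>.wav` inferred from the two example filenames above, and it assumes the speaker name contains no hyphen; the function name `parse_filename` and the field names are hypothetical.

```python
import re

# Pattern inferred from the two example filenames; the optional "_Au" part is
# the accent code proposed in this thread, not an existing convention.
PATTERN = re.compile(
    r"^LL-(?P<lang_qid>Q\d+)_\((?P<iso>[a-z]{3})(?:_(?P<accent>[A-Za-z]+))?\)"
    r"-(?P<speaker>.+?)-(?P<word>.+)\.wav$"
)

def parse_filename(name):
    """Split a recording filename into its parts, or return None if it does not match."""
    match = PATTERN.match(name)
    return match.groupdict() if match else None

plain = parse_filename("LL-Q1860_(eng)-Commander_Keane-phonate.wav")
tagged = parse_filename("LL-Q1860_(eng_Au)-Commander_Keane-phonate.wav")
```

In this sketch `plain["accent"]` is `None` while `tagged["accent"]` is `"Au"`, so a filter by accent code only works for files that carry the extra tag.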
| |
− | : The choice was made to tag the location in the wiki metadata. Indeed, people are not always aware that they have an accent, and an accent can vary widely within a given country or region, so it becomes difficult to find the right granularity. Saving the location allows writing queries to get exactly the recordings we are interested in. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:06, 21 October 2020 (UTC)
| |
− | :: But geographic location is not an accurate predictor of an accent because people move around and some have speech impediments. For example, you can have an American living in Paris or a Parisian living in America. Who will speak the words more accurately? A person living in London could speak the Queen's English or have a Cockney accent. Especially for language learning, accents are important. We're not judging accents, but merely tagging their existence per speaker. If a user has an incorrect accent listed, I believe that mods should be able to change it and Lingua Libre should automatically retag all their pronunciations. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 23:06, 21 October 2020 (UTC)
| |
− | :::{{ping|Languageseeker}}: I believe information on accent is preserved thanks to the file '''metadata'''. To make it more visible, see [[Help:Renaming]]: fetch the metadata tag's value and push it into a new filename of your desired shape. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:48, 22 October 2020 (UTC)
| |
− | :::: {{ping|Yug}} I don't see accents on either user profiles or on Wiki Commons. See: speaker [[Q141723]] for instance. Accents should be easily visible and filterable. For example, on Commons there should be a category for Lingua Libre pronunciation in French (Parisian Accent). On the description page for a file on Commons, there should be "AccentId" under "languageId". [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 16:00, 22 October 2020 (UTC)
| |
− | :::::The profile contains the location of where you learnt a language. City and country, if I remember well. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 19:59, 22 October 2020 (UTC)
| |
− | ::::: {{ping|Yug}} Correct, but this does not necessarily translate into accent. We're making an assumption that an accent can be geographically located and that all individuals that inhabit that geographic location will have a particular accent. Also, an accent can span across geographic region as well. What's wrong with adding a third parameter called accent? Is there any reason that you feel that it would be detrimental? [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 20:10, 22 October 2020 (UTC)
| |
− | ::::::The filenames are already 3 times too long in my opinion.........
| |
− | ::::::I don't remember clearly. But maybe we then assumed the *speaker* to be the data marking the accent. {{reply to|Lyokoï|p=}} may remember. --[[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:16, 22 October 2020 (UTC)
| |
− | :::::::Could we set an advanced option in the user profile, where users can decide how they wish the language files to be named based on metadata? That way users can fine-tune the way they see the files.
| |
− | :::::::My point is that this is an invalid assumption. People speak with dialects and may have a speech disorder that Lingua Libre should have a metadata tag for. It's easier to add this at first than to have to add it manually later. It shouldn't be that hard to implement. Furthermore, it will also make it easier for users to filter pronunciations. Take, for instance, Russian, which has three major accents with twenty-four sub-divisions in Russia. Each accent occurs in hundreds of cities and villages. If we don't have dialect metadata, then we need to create a list with all of these geographical locations to group the pronunciations in the same dialect. If we have dialect metadata, then I can just filter by "Central Russian" or "Chukhloma enclave." All we would need to do is add the following three options to a person's user page: "Dialect" (required); "Dialect Sub-group" (optional); "Speech Disorder" (optional), and then have that propagate automatically to all their pronunciations. To help users and promote standardization, we can make these drop-down menus with an option for custom. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 01:12, 24 October 2020 (UTC)
| |
− | | |
− | === Authoritative lists (& copyrights) ===
| |
− | Lists such as HSK or JLPT should be a high priority. This would help language learners the most.
| |
− | : No problem to import such lists if they are not copyrighted. No idea on that. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:06, 21 October 2020 (UTC)
| |
− | ::{{ping|Languageseeker|Pamputt}} Actually, we are neither Wikipedia nor Commons. So we could set up our own copyright rule in accordance with the server's geographic location and associated laws. We can consider, like Anki and Skritter.com do, that we are just '''hosting content uploaded by the user, who is the legally responsible party''', and, as European and French law requires, we will take down any content '''following a formal complaint'''. This is the true legal requirement we have in France. Our (LinguaLibre) rules don't have to be as pro-active as Wikipedia or Commons, which decided to go beyond what is legally required. Where we put the cursor is really up to us. It's a matter of internal policies. Some tolerance such as the one I cited above could be greatly advantageous to LinguaLibre's objectives. Indeed, it's the strategy taken by Anki, Skritter, Memzine, Duolingo, and many other actors of online language learning. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:55, 22 October 2020 (UTC)
| |
− | ::: {{ping|Yug|Pamputt}} Agreed, we shouldn't worry too much about copyright until someone submits a claim, at which point we remove the entries. I think it'd be very difficult to file a copyright claim for "chien." We could also automatically reorder lists to prevent an argument that the specific order somehow puts them under copyright. We could also rely on public-domain corpora at first. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 16:04, 22 October 2020 (UTC)
| |
− | ::::'''Copyright violations''' are not possible for stand-alone lexemes such as "狗" (gǒu: dog) from HSK.
| |
− | ::::Lists are different: a list is a specific creation of someone's mind, so its author can claim copyright on a given list.
| |
− | ::::Still, the Chinese Ministry of Education, which authored the HSK '''''LIST OF SELECTED WORDS''''' and could legally file a copyright complaint, has never filed copyright claims against any for-profit company to have its online HSK lists removed. Then why would the MoE do so for a non-profit? And if they do, we can simply delete the said list(s).
| |
− | ::::'''Shuffling''' is not enough to claim '''difference and originality''', especially when your page is named "List:cmn/HSK1" ^^
| |
− | ::::To claim originality, the minimum would be to substantially edit the list. In order not to lose data, this leads to adding words. The HSK 1 to 6's 8800+ words could indeed be extended to 11000 via a merge with a relevant frequency list; around this threshold of difference we could start to claim originality. The algorithm could be slightly more complex, with ranking... You see the idea. But I think the fair-use option is more practical and relevant for us (see below).
| |
− | ::::'''Sum up:''' as I shared above, I think we could advise and state that :
| |
− | ::::* Our policy should be based on the law of the land (Europe & France's laws) and current observed online practices.
| |
− | ::::* Our users make a fair-use judgement and upload the lists.
| |
− | ::::* Our admins and/or bureaucrats (?) receive the copyright claims; one of them does a rapid review on a case-by-case basis, then removes the list if the complaint has merit.
| |
− | ::::This seems a good balance satisfying both relevant laws together with our project's objectives and interests. Can we push this way ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 19:40, 22 October 2020 (UTC)
| |
− | ::::: {{ping|Yug}} This is an awesome idea. I think that it would make a lot of sense to combine a set of frequency lists with official lists, then deduplicate them. This should prevent any copyright claims. We can even think about whether it makes sense to create frequency lists based on data in Project Gutenberg as part of this process. Then, we should also write a script to automatically add inflections, conjugations, articles, etc. based on wiktionary data. These lists could be the first set of official lists for LL. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 22:06, 22 October 2020 (UTC)
| |
− | ::::::We don't have the human resources to find and merge those lists as I wished above. It needs knowledge of the target language and of the available resources (only major, official languages such as EN/FR/ES/DE/JA/CN/KO have HSK/JLPT-like lists), plus programming skills. Then add free time and willingness. Having them all is very rare. Better to redefine our copyright rule toward more tolerance so we accept any list there is. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:09, 23 October 2020 (UTC)
| |
− | ::::::: Agreed. I do think that this might take too much time and dev energy for a temporary and potentially problematic task. I'm against user supplied lists at this stage for the same reason. I don't think that we have the resources to proofread and merge multiple lists. I'd propose focusing on creating an official list from the various languages of Wiktionary because that will have no copyright claims and will, eventually, contain all the words and phrases in a language. If we create a list from that, it will probably contain around 500,000 to 600,000 items per language, which will be enough for an initial set, and then we can open it up to user suggestions afterwards. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 00:39, 24 October 2020 (UTC)
| |
− | | |
− | === Recording quality and post recording clean ups matters ===
| |
− | Words pronounced with lots of static or background hum should be deleted unless it's a truly rare language.
| |
− | : We have no or weak process for that. There are documented methods to denoise audios, see : [[Help:SoX]]. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:47, 21 October 2020 (UTC)
| |
− | :: Yes, some tools have to be developed to control that. Ideas are welcome. About denoising, a [[phab:T251638|ticket]] already exists to add it by default in the Record Wizard. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:06, 21 October 2020 (UTC)
| |
− | ::: Most of this has to do with recording equipment. On Forvo, recording quality is usually fairly consistent for a given user. We may have to flag users with recording quality issues and delete their files. It could be a manual review process for a few files or a batch deletion. The same would be true for users with terrible or fake accents. (Think Dick Van Dyke in Mary Poppins) [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 23:11, 21 October 2020 (UTC)
| |
− | ::::Personally, I think we should argue more for good microphones and quiet rooms. We frequently chase after users and get slightly noisy audios which are not satisfactory. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:03, 22 October 2020 (UTC)
| |
− | :::: {{ping|Yug}} It seems that we need a statement on our expectations for recording quality. In the end, we only need one good pronunciation entry per language + accent. Terrible quality recordings help nobody unless they are the only ones that we have. It might be worth having a voting system such as on Forvo to help flag good/bad speakers. We can even toy with the idea of a speaker of the week or month to reward those who really help us out. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 16:46, 22 October 2020 (UTC)
| |
− | :+1. We need a quality statement somewhere. It's not perfect but it sets the tone a bit so we may require more quality from contributors. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:11, 22 October 2020 (UTC)
| |
− | :: I decided to create a draft of standards for Lingua Libre in [[phab:T266309|phabricator.]] [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 03:36, 23 October 2020 (UTC)
| |
− | :::{{reply to|Languageseeker}} This is not dev related, so it should move back here. We also need to check whether we already have something in place doing most of the job. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:03, 23 October 2020 (UTC)
| |
− | ==== Minimal threshold ? ====
| |
− | I also recommend pushing for sets of more than 1000 words. Basically, on the download page, we should compile lists by language, and all datasets (one speaker, one language) of fewer than 1000 audios should be ignored. We assume that sets of 1000+ audios are autoconfirmed, while sets of <1000 audios come from potential beginners and are likely noisy ''play-around'' (as for myself! My audios are just tests and not good!). Ideally we would have a download page such as:
| |
− | {| class="wikitable"
| |
− | |+ Download audios
| |
− | ! Language || All audios || Top 1 speaker || Autoconfirmed speakers (≥1000) || Other speakers (≤999)
| |
− | |-
| |
− | | French || 88,934 audios by 34 speakers || 47,076 audios by speaker Tom Smith|| 76,567 audios by 4 speakers || 12,367 audios by 30 speakers
| |
| |- | | |- |
− | | Gascon || ... || ... || ... || ... | + | |colspan=2| <small><references /></small> |
| |} | | |} |
− | [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:11, 22 October 2020 (UTC) | + | [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:38, 14 May 2024 (UTC) |
− | : Disagree, I don't think we should require a minimum number because that would discourage users. This would especially impact languages with fewer speakers and can perpetuate oppression. Instead, I think we should not divide audio into datasets by speaker. Moreover, a system to vote on pronunciations and report them can help to flag problems. As a final resort, we can vote on whether or not to batch delete the pronunciations of speakers that are particularly horrible. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 03:36, 23 October 2020 (UTC)
| |
− | ::{{ping|Languageseeker}} datasets are grouped by languages via downloadable zips, then by speakers.
| |
− | ::My proposal is to create variable packagings for one language: All / Top speaker / Autoconfirmed speakers / Non-autoconfirmed speakers.
| |
− | ::The ranking system / API would be great yes. Right now when I review a list of words I have to copy the filename(-filepath), store it, to then send a message "this audio is to redo". Not right. Maybe a smart template could do as of now. There is also the question of synch between Lili and Commons to keep in mind. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:00, 23 October 2020 (UTC)
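The four packagings proposed above (All / Top speaker / Autoconfirmed / Non-autoconfirmed) could be computed along these lines for one language. This is a sketch under assumptions: `package_speakers`, the `(speaker, filename)` record shape, and the default `threshold=1000` taken from the discussion are illustrative, not part of any existing download tool.

```python
from collections import Counter

def package_speakers(records, threshold=1000):
    """Group the speakers of one language into the proposed download packagings.

    records:   iterable of (speaker, filename) pairs (hypothetical shape).
    threshold: number of audios from which a speaker counts as autoconfirmed.
    """
    counts = Counter(speaker for speaker, _ in records)
    top = counts.most_common(1)[0][0] if counts else None
    return {
        "all": sorted(counts),
        "top": top,
        "autoconfirmed": sorted(s for s, n in counts.items() if n >= threshold),
        "other": sorted(s for s, n in counts.items() if n < threshold),
    }

# Toy data with a tiny threshold so the split is visible.
records = [("Tom", "a.wav"), ("Tom", "b.wav"), ("Tom", "c.wav"), ("Ana", "d.wav")]
packages = package_speakers(records, threshold=2)
```

Because the threshold is a parameter, a language community with few speakers could lower it rather than have all of its datasets fall into the "other" group.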
| |
− | | |
− | === Volume normalization ===
| |
− | :''{{done}} -- feature request on Phabricator. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:05, 23 October 2020 (UTC)''
| |
− | It should be automatically provided across the entire corpus. We don't want one word at 140db and the other at 20db.
| |
− | : Volume normalization : we already have some normalization, we reject low db and high db recordings. But I cannot specify the exact mechanism : per file ? per recording set? I think it's the former. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:47, 21 October 2020 (UTC)
| |
− | :: This has already been proposed and is saved in [[phab:T213535|ticket]]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:06, 21 October 2020 (UTC)
| |
| | | |
− | === Anki plugin === | + | == Kinyarwanda language representation == |
− | ::''{{done}} a proposal has been documented on phabricator. To keep in mind and follow through there. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:30, 22 October 2020 (UTC)''
| |
− | It would be great to develop an Anki plugin that would enable users to automatically add audio to flashcards. This is the biggest downside of Forvo, which requires users to manually add one word at a time. Providing an Anki plugin will help to popularize this project and attract new users. Since Anki is Python-based, this can be based on the French bot. Having a large group of testers can help to identify how the metadata of these files can be improved.
| |
− | : Anki plugin: YES, it's about 1~2 day work. Maybe Anki folks could help.
| |
− | :# Read Anki's documentation for Anki decks folder's syntax
| |
− | :# Download the target [https://lingualibre.org/datasets/ language folder]
| |
− | :# Create a bash script to pick up the filepaths, the words, and create the Anki decks file with the proper syntax.
| |
− | :# Document process to share file with Anki community [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:47, 21 October 2020 (UTC)
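Step 3 above could be sketched as follows, written in Python rather than the bash script suggested. The sketch assumes that a language folder has already been downloaded and unzipped, that its files end in `.wav`, and that the word is the last hyphen-separated part of the filename; `build_anki_rows` and `write_anki_tsv` are hypothetical names. The `[sound:...]` field syntax is the one Anki uses for audio in imported notes, and the audio files themselves still have to be copied into Anki's media folder by hand.

```python
import csv
from pathlib import Path

def build_anki_rows(folder, extension=".wav"):
    """Return (word, sound_field) rows for every recording found under folder."""
    rows = []
    for audio in sorted(Path(folder).rglob(f"*{extension}")):
        # Assumption: the recorded word is the last hyphen-separated part of the name.
        word = audio.stem.rsplit("-", 1)[-1].replace("_", " ")
        rows.append((word, f"[sound:{audio.name}]"))
    return rows

def write_anki_tsv(rows, out_path):
    """Write the rows as a tab-separated text file that Anki can import."""
    with open(out_path, "w", newline="", encoding="utf-8") as handle:
        csv.writer(handle, delimiter="\t").writerows(rows)
```

A call such as `write_anki_tsv(build_anki_rows("Q21-fra-French"), "fra.tsv")` would then produce the import file; the folder name here is only a placeholder. Words that themselves contain a hyphen would be truncated by the naive split, so a real tool should read the word from the file metadata instead.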
| |
− | :: Please, feel free to open a Phabricator ticket to keep in mind this need. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:06, 21 October 2020 (UTC)
| |
− | :::Appreciated. I'll write up a full proposal in the next few days and open a Phabricator ticket.
| |
− | :::[EDIT]:Ticket created for [[phab:T266209|Anki Plugin]]. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 05:22, 22 October 2020 (UTC)
| |
− | ::::Awesome. Thank you :) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:06, 22 October 2020 (UTC)
| |
| | | |
− | === Flac ===
| + | I'm Robert RUGAMBA from Rwanda and I belong to Wikimedia Rwanda as a volunteer and event organizer. |
− | Files should be uploaded and stored as '''flac files''' to enable '''tagging''' and reduce file size. All modern browsers support flac, and it has emerged as the default lossless audio compression format that is widely supported across different devices. Tagging will help keep the metadata with the file and enable easier renaming by end users or bots.
| + | I'm excited to explore this platform of Lingua Libre and I would like my local language to be added and represented. The Wikidata item is: https://www.wikidata.org/wiki/Q33573 |
− | : Tagging and flacs: we do tag files in the files' code; we already had a long discussion about file formats and chose to keep .wav, together with sharing scripts to mass convert to alternative formats. See [[Help:Convert_audios%3F]]. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:47, 21 October 2020 (UTC)
| |
− | :: I was not aware of the reason for choosing the wave format. Concerning FLAC, there is [[phab:T213534|a ticket]] explaining why it is interesting (in addition to the points given here). [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 16:06, 21 October 2020 (UTC)
| |
− | :::It's a decision by Mr. Vion and {{reply to|Lyokoï|p=}} if i remember well. Maybe worth creating a [[Help:Formats]].
| |
− | :::Side note: it also seems to be the time to create categories to class our help pages via several dimensions : scripts, lists, guidelines, recording, ... ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:32, 22 October 2020 (UTC)
| |
| | | |
− | === Discussions ===
| + | Thanks. [[User:Annick green|Annick green]] |
− | These are my few thoughts and I'd love to hear any feedback. [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 19:30, 20 October 2020 (UTC) Edited on [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 23:35, 20 October 2020 (UTC)
| + | :{{Done}} This language was already on Lingualibre as [[Q285]]. If you open [[Special:RecordWizard]], at step 2, add it to your list of known languages. Please type in « Kinyarwanda » or « Ikinyarwanda » and you should find it. Only users who have declared that they know Kinyarwanda can record in Kinyarwanda. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:50, 27 June 2024 (UTC) |
− | :{{ping|Languageseeker}}Hi there, thank you for this review. I edited your points and added bold so the key topics are more visible and we agree on naming for the discussion to continue. I will try to answer to several of your points [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:47, 21 October 2020 (UTC) | |
− | ::I reorganised by section and move your answer, Yug, so that it will be easier to follow the different points (I hope so). I will add some answers as well. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 15:49, 21 October 2020 (UTC)
| |
− | ::: [[User:Pamputt|Pamputt]] Thank you!! [[User:Languageseeker|Languageseeker]] ([[User talk:Languageseeker|talk]]) 23:12, 21 October 2020 (UTC)
| |
− | ::::Thanks, good by me. As long as it improves and is in good faith it's a good practice to allow. ;) [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 19:21, 22 October 2020 (UTC)
| |
| | | |
− | == Datasets to download == | + | == Rename my pseudonym == |
− | Hello team, I just noticed that https://lingualibre.org/datasets/, which is central for external developers such as Anki's community to reuse our audio work, has been lost in the recent UI revamp. Any idea where to put it back so it stays highly visible? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:50, 21 October 2020 (UTC)
| |
− | :The dates visible are also mainly from 2019. Any idea what they are ? First compilation ? Last compilation ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:53, 21 October 2020 (UTC)
| |
| | | |
− | == CSS fixes ==
| + | Hello. I've renamed my account on Wikimedia sites but can't log in directly with the new username here. Is there something I need to do? My old username is '''ElsaBester''' and the new one is '''L'embellie'''. Thanks! |
− | :See [[MediaWiki:Common.css]] | + | :Hello [[User:ElsaBester|L'embellie]], |
− | There is clearly some CSS to update. Most notably for the H2, H3, H4 section titles *within wikipedia pages*. I will test some solutions soon. Feel free to test other CSS aspects in [[MediaWiki:Common.css]] (admin only?) or in [[User:Yourname/common.css]]. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:55, 21 October 2020 (UTC)
| + | :I may ping [[User:WikiLucas00|WikiLucas00]], but I think we don't currently have a solution for your issue. |
| + | :We are phasing out this wiki, we hope to release a new Lingualibre this winter or early 2025. So this issue will be irrelevant by then. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:00, 22 August 2024 (UTC) |
| + | ::Hey there {{ping|ElsaBester|Yug}}. Sorry I don't have a solution, but I found this in the Chat Room's archives: [[LinguaLibre:Chat_room/Archives/2023#Update_my_username]]. Good luck — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 18:46, 26 August 2024 (UTC) |
| + | :::Hello {{ping|ElsaBester}} you may also look at my latest reply on [[User talk:Yug]], it's not a great option but maybe you'll want to try it. All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 12:24, 31 August 2024 (UTC) |
| | | |
− | {| class="wikitable" style="width:100%;"
| + | == Two French words that are impossible to record == |
− | ! Code !! Rendering
| |
− | |-
| |
− | |
| |
− | <source lang="wiki">
| |
− | == H2 == | |
− | Paragraph.
| |
− | :Paragraph.
| |
− | === H3 ===
| |
− | Paragraph.
| |
− | :Paragraph.
| |
− | ==== H4 ====
| |
− | Paragraph.
| |
− | :Paragraph.
| |
− | ===== H5 =====
| |
− | Paragraph.
| |
− | :Paragraph.
| |
− | ;Should be bold
| |
− | Paragraph.
| |
− | :Paragraph.</source>
| |
− | |}
| |
| | | |
| + | Hi, |
| | | |
− | == Proposed Community Standards ==
| + | Two words are impossible to record (even before uploading): ''esclavesse'' and ''scribesse'' (all my attempts with other words work). [[User:Avatea|Avatea]] ([[User talk:Avatea|talk]]) 18:49, 30 August 2024 (UTC) |
− | Based on a conversation with Yug, I've decided to draft a community compact statement that sets out our standards. Any and all feedback welcome.
| + | :Hi {{ping|Avatea}}. Sorry for the late reply. I couldn't reproduce the issue on my side, as you can see ({{Q|1385666}}, {{Q|1385667}}) I just recorded a few words ending with -esse, including the two words you mention, without encountering any issue. Did you try again recently? All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 14:42, 21 September 2024 (UTC) |
| + | :: Hi {{ping| WikiLucas00}} |
| + | :: No. I just tried, I was able to record another word, but still not those two. [[User:Avatea|Avatea]] ([[User talk:Avatea|talk]]) 19:06, 21 September 2024 (UTC) |
| | | |
| + | == Delete two incorrect recordings == |
| | | |
− | Lingua Libre is dedicated to providing free, high-quality recordings of words and phrases in all languages. To achieve this, we ask you to abide by the following community standards:
| + | Hello! Because of a typing error, and because I was in a hurry, I mistakenly recorded two terms: *"[[Q1387394|escaramón]]" and its plural *"[[Q1387395|escaramones]]". Would it be possible to delete these recordings? I have already made correct recordings of these words, properly spelled and with the correct pronunciation: "[[Q1387396|escamarón]]", "[[Q1387397|escamarones]]". You can verify that this term is accurate [https://diccionariu.alladixital.org/index.php?cod=21008 here]. Sorry for the inconvenience. --[[User:Limotecariu|Limotecariu]] ([[User talk:Limotecariu|talk]]) 20:31, 28 September 2024 (UTC) |
− | # All people and accents are welcome. However, do not assume an accent that you do not normally use.
| |
− | # Please fill out your profile information accurately. This enables the correct usage of your pronunciations.
| |
− | # Do not upload pronunciations that you did not create or that are posted elsewhere under a restrictive license.
| |
− | # Please record in a quiet room with no background noise audible when listening with headphones.
| |
− | # Do not include excessive silence before or after your pronunciations.
| |
− | # Record your pronunciations in a relaxed, neutral tone. If you wish to pronounce them fast, slow, or emphatically, please tag your pronunciations appropriately.
| |
− | # Discrimination, cyberbullying, harassment, stalking, or any other form of intimidation on the basis of age, accent, class, disability, ethnic identity, gender identity, geographical location, marital status, native language, political beliefs, race, religious identity, religious beliefs, sex, sexual orientation, or any other category will not be tolerated. Any such behavior will result in a permanent ban.
| |
− | # Lingua Libre reserves the right to delete any pronunciations that do not follow these guidelines.
| |