|
|
Line 1: |
Line 1: |
− | {{/Header}} | + | {{#SUBTITLE:{{/Header}}}} |
− | | + | {{Lang-CR}} |
| + | <indicator name="talk"></indicator> |
| + | {{LL:Chat room/FAQ}} |
| __TOC__ | | __TOC__ |
| + | <!-- **** DO NOT EDIT CONTENT ABOVE **** --> |
| | | |
− | == Chatroom FAQ ==
| |
− | * '''How to download all audios of one language ? By speaker ?'''
| |
− | ** Languages are there [https://lingualibre.fr/datasets/ https://lingualibre.fr/datasets/]. A short server-side script is auto-ran every 2 days, itself using [https://github.com/lingua-libre/CommonsDownloadTool lingua-libre/CommonsDownloadTool]. For more, see [[Help:Download from LinguaLibre]].
| |
− |
| |
− | * '''How to add missing languages ?'''
| |
− | ** Administrators can add new languages, they do so within few days. For users, please provide your language's [[:wikipedia:iso-639-3|iso-639-3]] code + link to the en.wikipedia.org's article. Optional infos are the common English name and wikidata IQ. For more, see [[Help:Add a new language]].
| |
− |
| |
− | * '''How to archive sections which have been answered ?'''
| |
− | ** After reviewing the section, add `<nowiki>{{done}} -- can be closed ~~~~</nowiki>` to the top of the section. After some days to 2 weeks, move the sectin's code to [[LinguaLibre:Chat_room/Archives/2018]].
| |
− |
| |
− | * '''How to keep my wikimedia project up to date ?'''
| |
− | ** Contact [[User talk:0x010C|User:0x010C]], the botmaster of Lingua Libre Bot. For more, see [[Help:Bots]].
| |
− |
| |
− | * '''What IRL event.s are coming ? When ? Where ?'''
| |
− | ** Paris's [[LinguaLibre:Hackathon_15-16_Décembre]] just finished. More events to come. For more, see [[LinguaLibre:Events]].
| |
− |
| |
− | == Utiliser le Lingua Libre Bot dans l'incubator:shy ==
| |
− |
| |
− | Est-ce que c'est possible de faire la même chose pour le wiktionnaire en Chaoui ? je veux dir est-il il possible d'utiliser votre bot sur notre wiktionnaire aussi ? je peux donner l'algorithme du [https://incubator.wikimedia.org/wiki/Wt/shy wiki-test]. Cordialement. -[[User:Reda Kerbouche|Reda Kerbouche]] ([[User talk:Reda Kerbouche|talk]]) 12:32, 8 July 2018 (UTC)
| |
− | :Oui bien sur ! Avez-vous un bistro / village pump / ... pour en discuter là-bas ? — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 15:24, 8 July 2018 (UTC)
| |
− | ::Oui il y a un bistro vierge du [https://incubator.wikimedia.org/wiki/Talk:Wt/shy/Asebtar_amenzu wiktionnaire Chaoui] que vous pouvez activer. Ou bien celui de [https://incubator.wikimedia.org/wiki/Incubator:Community_Portal l'incubator] où en peut discuter avec des administrateurs à propos de l'autorisation du bot. Cordialement. -[[User:Reda Kerbouche|Reda Kerbouche]] ([[User talk:Reda Kerbouche|talk]]) 18:26, 8 July 2018 (UTC)
| |
− | :::Je suis en ce moment en chemin pour Wikimania, je vais n'avoir que très peu de temps jusque là, mais je lancerais la discussion à mon retour. Cordialement — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 11:43, 11 July 2018 (UTC)
| |
− | :::: Bon voyage.--[[User:Reda Kerbouche|Reda Kerbouche]] ([[User talk:Reda Kerbouche|talk]]) 21:48, 11 July 2018 (UTC)
| |
− | [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] J'espère que vous m'avez pas oublié =) Car en septembre on lance un concours pour le wiktionnaire en Chaoui, et si on peut enregistrer des mots qui vont passer directement sur incubateur, je fais la promo de Lingualibre en même temps que la promo du concours.--[[User:Reda Kerbouche|Reda Kerbouche]] ([[User talk:Reda Kerbouche|talk]]) 14:01, 16 August 2018 (UTC)
| |
− | :[[User:Reda Kerbouche|Reda Kerbouche]], [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']], Is this {done} ? --[[User:Yug|Yug]] ([[User talk:Yug|talk]]) 11:04, 15 December 2018 (UTC)
| |
− |
| |
− | == Liste sur le modèle de Petscan ==
| |
− | Salut, est ce qu'il serait possible de faire une liste à la volée sur le modèle de ce qu'est capable de faire Petscan ? [https://petscan.wmflabs.org/?language=fr&project=wiktionary&categories=Lemmes%20en%20fran%C3%A7ais&negcats=Prononciations%20audio%20en%20fran%C3%A7ais&ns%5B0%5D=1&search_max_results=500&interface_language=fr&&doit= Ici], on a la liste de tous les lemmes du Wiktionnaire qui n'ont pas de catégorie « Prononciations audio en français » ce qui signifie qu'il n'ont pas le modèle « écouter » qui permet d'ajouter les entrées dans cette catégorie. Je trouve que la génération d'une telle liste serait vraiment sympa pour les Wiktionnaires. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 06:07, 12 July 2018 (UTC)
| |
− | :L'idée est bonne en effet, cependant ça représente un gros boulot à intégrer sur Lingua Libre. Je pense qu'il serait intéressant d'en discuter un peu et d'établir un petit cahier des charges de ce que l'on veut pouvoir faire (tout dans petscan n'est pas utile ici). — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 22:00, 14 July 2018 (UTC)
| |
− | ::[[User:0x010C|0x010C]], est ce que tu penses que l'exemple que j'ai donné ci-dessus (lemmes en français qui n'ont pas de prononciation) peut être implémenté à partir de [[MediaWiki:Gadget-Demo.js]]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 14:23, 14 October 2018 (UTC)
| |
− | :::Oui c'est exactement ça, il faut passer par la création d'un nouveau générateur de mots. Dans mon début de réflexion plus haut, je réfléchissais à comment implémenter les fonctionnalités de petscan dans un générateur. Sauf que niveau performance et rapidité, on pourrait jamais faire quelque chose d'utilisateur avec des catégories aussi grosse que "Lemmes en français", je m'explique. Petscan fait son travail de recherche et de recoupement côté serveur, directement sur une copie de la base de donnée des wikis (il peut ainsi en un coup explorer tous les enregistrements). Or ici, nous n'avons pas d'accès à la base de donnée et les calculs doivent être fait côté client, en javascript. On dépend donc de l'API des wikis en question pour récupérer les données, API qui n'est pas du tout faite pour travailler sur des catégories très grosses (ne peut retourner que 500 membres par requête, etc).
| |
− | ::: Bref, c'est pas possible. Cependant, on peut imaginer se reposer sur petscan pour faire le boulot chiant à notre place (ce générateur deviendrait complètement dépendant de cet outil externe, une panne de ce dernier bloquerait le fonctionnement du premier). Je vois trois options :
| |
− | :::# le générateur reprend un certain nombre de champs de petscan, et va à partir des valeurs fournies générer une requête à petscan (complexe pour l'utilisateur lambda, flexible pour l'utilisateur expérimenté) ;
| |
− | :::# le générateur propose à l'utilisateur de choisir parmi un certain nombre de requêtes petscan préparé à l'avance par nos soins (par exemple en cliquant sur "mots en français n'ayant pas de prononciation sur le wiktionnaire francophone", ta requête exposé plus haut serait utilisé), ou de coller l'URL / l'identifiant d'une requête qu'il a préparé / trouvé (plus simple à implémenter, nous oblige à créer pleins de requêtes pour supporter différentes langues, assez flexible) ;
| |
− | :::# on fait un générateur spécialisé "mots dont la prononciation est manquante" où il va automatiquement forger la requête petscan pour faire comme dans ton exemple pour la langue sélectionnée (facile d'utilisation, très spécifique mais potentiellement très utile, nous obligerait à renseigner manuellement les catégories wiktionnaire correspondante car je ne vois aucun moyen de deviner le nom de la catégorie d'une langue à partir de son code ou son id wikidata...)
| |
− | :::Qu'en penses-tu ?
| |
− | :::— [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 02:53, 16 October 2018 (UTC)
| |
− | ::::La première proposition me semble trop usine à gaz et bien que puissante, je ne pense pas qu'elle s'adresse au public de Lingua Libre.Entre les propositions 2 et 3, j'ai une préférence pour la 2 car elle est simple d'utilisation au premier abord (on utilise des requêtes pré-forgées) tout en permettant une utilisation avancée (avantage de la solution 1). Et par rapport à la solution 3, ça évite de la maintenance pour déterminer la langue d'une catégorie donc c'est plus maintenable sur le long terme à mon avis. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 06:23, 17 October 2018 (UTC)
| |
− | :::::@[[User:Pamputt|Pamputt]]: Entre deux avions, je viens de finir une première version du générateur petscan, activable via [[Special:Preferences#mw-prefsection-gadgets|préférences > gadgets]]. Est-ce que tu peux y jetter un œil et me dire ce que tu en penses avant que je continue et que je l'annonce plus largement ?
| |
− | :::::Merci — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 08:39, 22 October 2018 (UTC)
| |
− | ::::::[[User:0x010C|0x010C]], j'ai activé le gadget et je vois bien PetScan dans la liste. J'ai fait quelques essais et ça fonctionne bien. J'ai essayé avec l'URL du premier message et ça fonctionne nickel. En revanche, j'ai essayé avec [https://petscan.wmflabs.org/?language=fr&project=wiktionary&categories=Adjectifs%20en%20fran%C3%A7ais&negcats=Prononciations%20audio%20en%20fran%C3%A7ais&ns%5B0%5D=1&search_max_results=500&interface_language=en&active_tab= ça] et ça m'indique "Petscan output something weired with this URL, check it and come back afterwards.". En revanche si j'ajoute le « &doit= » à la fin, ça fonctionne correctement (est-il vraiment nécessaire) ?
| |
− | ::::::Autre point, cest-ce qu'il est déjà possible de préparer des requêtes pré-faites (« mots en français n'ayant pas de prononciation sur le wiktionnaire francophone », ...) ou pas encore ? En l'état c'est déjà super cool. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 17:04, 22 October 2018 (UTC)
| |
− | :::::::J'avais oublié que cetaines URL pouvaient ne pas avoir l'auto-run, c'est fix. Je réfléchis actuellement à la meilleur façon de faire en fait. Ma problématique, c'est qu'une requête comme « mots en français n'ayant pas de prononciation sur le wiktionnaire francophone » n'intéressera que ceux qui font des enregitrements en français, si un germanophone dois scroller 25 requêtes qui le concerne pas (et qu'il ne comprend surement pas) avant d'en trouver une en allemand, c'est pas cool pour lui.
| |
− | :::::::De là, trois idées qui me viennent en écrivant ces lignes :
| |
− | :::::::* Une page par langue, dans l'espace de nom list ([[List:fra]] ? [[List:fra-external]] ? [[List:fra-examples]] ? ...) qui regroupe via une liste à puce toutes les urls dispo pour une langue ;
| |
− | :::::::* Une fois ce travail fait, ce n'est pas très compliqué de supporter d'autres outils externes qui peuvent être appelé via une URL et renvoyer le résultat en JSON ; je pense notamment à querry.wikidata.org ;
| |
− | :::::::* Et là, plus une réflexion, est-ce que ça serait pertinent une fois que ça sera stable de l'intégrer au générateur "listes" actuel (genre avoir deux onglets dedans, "listes statique", "listes dynamiques/externes/..." ?), ou l'intégrer comme un nouveau générateur à part entière dans le core du RecordWizard ? (et du coup comment le nommer dans ce cas ?)
| |
− | :::::::Un avis externe me serait bien utile pour trancher tout cela :) — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 19:52, 22 October 2018 (UTC)
| |
− |
| |
− | == Variations géographiques ==
| |
− |
| |
− | Bonjour,
| |
− |
| |
− | Bravo pour ce projet très intéressant.
| |
− |
| |
− | Je me pose une question à propos des prononciations. Je suis du sud de la France et contrairement à une bonne partie du reste de la France, nous usons beaucoup de l'accent tonique (influence italienne et espagnole, j'imagine). Du coup, la prononciation de certains mots, et surtout des locutions, ont une rythmique différente par chez moi.
| |
− |
| |
− | Comment gérer ces variations de prononciation ? Ont-elles droit de cité ou comme les québécois doit-on privilégier un "Français international" neutre ?
| |
− |
| |
− | Pour finir sur le sujet, la prononciation de certains mots sont différentes chez nous : lait, mas, moins (avec un s !), etc. Comment intégrer ça dans Wiktionnaire ou Wikipédia ?
| |
− |
| |
− | [[User:Jpgibert|Jpgibert]] ([[User talk:Jpgibert|talk]]) 12:02, 13 July 2018 (UTC)
| |
− |
| |
− | :Bonjour,
| |
− | :Merci pour ton intérêt !
| |
− | :Non, il ne faut surtout pas privilégier un français "''neutre''". Chaque variation / accent locale est intéressent. En fait, juste avant de commencer à enregistrer il t'es demandé de remplir ton profil de locuteur, dans lequel tu peux renseigner ton lieu d'habitation / d'apprentissage d'une langue.
| |
− | :Lorsqu'un enregistrement est ajouté ensuite sur le Wiktionnaire par exemple, cette information y est inclu. Si plusieurs personnes ont enregistré les même mots, on pourra donc écouter les différences de prononciation de « lait » en Alsace, au Québec, en Occitanie, en Île de France, au Mali,... Et ça c'est cool :)
| |
− | :Cela répond à tes questions ?
| |
− | :Cordialement — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 21:55, 14 July 2018 (UTC)
| |
− |
| |
− | ::Bonjour [[User:0x010C]]
| |
− | ::Merci pour la réponse. Je m'inquiétais de la chose parce que s'il existe un code linguistique pour les variations du français au Québec (fr-CA) ou de Belgique (fr-BE), en revanche l'accent n'est pas pris en compte.
| |
− | ::Content d'apprendre que malgré mon accent, je serai le bienvenu. Bon pour le moment, faut que j'achète un bon micro avant de faire quoi que ce soit, mais dès que j'aurai ça, je tenterai de partager mon accent méridional.
| |
− | ::[[User:Jpgibert|Jpgibert]] ([[User talk:Jpgibert|talk]]) 12:31, 23 July 2018 (UTC)
| |
− |
| |
− |
| |
− |
| |
− | == How to add missing languages ? ==
| |
− | :See [[Help:Add_a_new_language]], for administrators. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 17:48, 22 November 2018 (UTC)
| |
− | I couln't find this language on lingua libre : https://en.wikipedia.org/wiki/Southern_Min
| |
− |
| |
− | Could you investigate, and explain. There are [https://en.wikipedia.org/wiki/Formosan_languages#List_of_languages 16 Taiwanese languages] to check. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:55, 22 July 2018 (UTC)
| |
− | * Need this as well : https://en.wikipedia.org/wiki/Baoul%C3%A9_language bci
| |
− | :I'll import them as soon as I'm back to my laptop, in ~2h30. @[[User:Yug|Yug]], Can you give me a direct link to each languages you want me to import? — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 12:49, 22 July 2018 (UTC)
| |
− | ::Both linked languages are now available on Lingua Libre. :)
| |
− | ::Best regards — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 15:35, 22 July 2018 (UTC)
| |
− | Hello @[[User:0x010C]], could you add the following languages :
| |
− | * living aboriginal languages alive : ami, tay, bnn, xnb, ckv, pwn, pyu, dru, sxr, xsy, trv, ssf, tsu, tao
| |
− | * Sino-Taiwanese languages : cmn, nan, hak.
| |
− | So Taiwanese linguists could work peacefully as needed. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 13:20, 6 August 2018 (UTC)
| |
− | :Hello @[[User:0x010C]], did I miss something in my guidances to you ? I search within the Recorder wizard but the autocompletion fail to return me a result for the iso:
| |
− | :* Taiwan needed: ami ([https://www.wikidata.org/wiki/Q715760 Q715760]), tay (Q715766), bnn (Q56505), xnb (Q172244), ckv (Q716627), pwn (Q715755), pyu (Q716690), dru (Q49232), sxr (Q716599), xsy (Q716695), trv (Q716686), ssf (Q676492), tsu (Q716681), tao (Q715760).
| |
− | :* Chineses needed: hak Q33375.
| |
− | :* Chineses which are already in : cmn (Chinese), nan (Southern Min)
| |
− | :I tested the record wizard by typing both iso639 codes and Plain English names. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 15:11, 21 November 2018 (UTC)
| |
− | ::[[Help:Add_a_new_language]] explains how to request a new language. I have done it for all your requests (see [[Special:RecentChanges]] for details). [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 17:08, 21 November 2018 (UTC)
| |
− | :::Thanks [[User:Pamputt|Pamputt]] ! [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 17:48, 22 November 2018 (UTC)
| |
− |
| |
− | == Thésaurus ==
| |
− |
| |
− | Bonjour,
| |
− |
| |
− | Durant la vidéo de présentation du projet par [[user:Lyokoï|Lyokoï]] ([https://www.youtube.com/watch?v=8p_z3jyihwU LetsContribute6]), j'ai appris qu'on pouvait générer des listes de mots à partir de catégories. Serait-il possible de faire le même genre de chose à partir d'un thésaurus ? Question subsidiaire, est-ce que ça à un intérêt ?
| |
− |
| |
− | [[User:Jpgibert|Jpgibert]] ([[User talk:Jpgibert|talk]]) 12:39, 23 July 2018 (UTC)
| |
− | :Ca pourrait effectivement être intéressant même si c'est plus compliqué à coder (j'imagine). Juste pour donner un exemple pour ceux qui ne voient pas ce dont il est question, on peut aller voir [https://fr.wiktionary.org/wiki/Th%C3%A9saurus:pain/fran%C3%A7ais ici]. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 21:30, 23 July 2018 (UTC)
| |
− | :: {{ping|Jpgibert}} Le plus simple pour faire ça, c'est de copier-coller le contenu du thésaurus et de séparer les mots avec un #. Ça doit demander quelques minutes pour être mis en forme, mais ce n'est pas non plus le Pérou. [[User:Lyokoï|Lyokoï]] ([[User talk:Lyokoï|talk]]) 14:22, 15 December 2018 (UTC)
| |
− |
| |
− | == General issues + issues with Odia and Asian writing systems ==
| |
− | :{{Done}}, all issue tracked on phabricator or explained below. Ready to archive. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 23:22, 23 December 2018 (UTC)
| |
− | I loved the current version! Truly admire the changes you all have made over time.
| |
− | I have also done a few recordings in my own language Odia to check for any error. Below are a few:
| |
− |
| |
− | # '''Tag already recorded items ([https://phabricator.wikimedia.org/T212580 T212580]):''' When a word has already been recorded and has been uploaded on Commons, does is not make sense to show it as a flag instead of letting any user to upload it directly?
| |
− | # '''Add custom commons categories ([https://phabricator.wikimedia.org/T201135 T201135]):''' Also, different languages have different additional categories which Lingua Libre does not let one to add. For instance, I generally add a user category to know how many audio files I have uploaded. For the files recorded using Lingua Libre, I don't see an option to add that optional category.
| |
− | # '''Remove duplicated words (in same session: explanation below ; across time: [https://phabricator.wikimedia.org/T212580 T212580]):''' If I am adding a wordlist before recording, is that possible to keep only one word if the same word is used multiple times? This would save some time for the uploader.
| |
− | # '''Monitor suspect cracking sound in audios ([https://phabricator.wikimedia.org/T201136 T201136]):''' There is a bit of crackling sound that is heard while monitoring the recorded words. Any particular reason?
| |
− | # '''Some words fails anyway ([https://phabricator.wikimedia.org/T212584 T212584]):''' Even though I am correctly pronouncing every word, I see a lot of red-labelled words.
| |
− | # '''Allow click-play-listen while recording ([https://phabricator.wikimedia.org/T212583 T212583]):''' While recording, I cannot check how the recording sounds like. I can only choose to re-record after hearing the recorded sound. Otherwise even having that option is of no use.
| |
− | # '''Remove underline ([https://github.com/lingua-libre/LinguaRecorder/commit/c48d2e6f6cb31acef8a39245bab1eccc5dbdb969 done]):''' While recording each word is seen as a green button and during the recording the word is underlined. This works well for Latin-based scripts. However for my script, Odia, and even many other Asian languages, this is a problem as we have diacritics and accent marks below the character. It becomes too hard at times to read when underlined. Also, the light green color and a white background is not accessible to people with corrections or color blindness. Maybe black background with white text will create more contrast and make it easier to read.
| |
− | # '''Last word cannot be re-recorded (explanation below):''' When you reach the last word of a batch and want to re-record that word, it doesn't allow you to click on the word button and re-record.
| |
− |
| |
− | Also, requesting to add the Warang Citi (used for Ho language) and Ol Chiki (used for Santali language).
| |
− |
| |
− | Thank you much again. I would really love to contribute more myself, and involve other community members. --[[User:Psubhashish|Psubhashish]] ([[User talk:Psubhashish|talk]]) 07:21, 26 July 2018 (UTC)
| |
− | :Hi!
| |
− | :First of all, thanks for your feedbacks, that's really helpful. Here are some details about your remarks:
| |
− | :# In my opinion, it is interesting to have several records of the same word by different users, the naming convention takes this into account to avoid records to be overridden by another user. But as I'm not sure I understood this point very well, don't hesitate to clarify it if my answer is mistaken.
| |
− | :# [https://phabricator.wikimedia.org/T201135 T201135]
| |
− | :# If I have correctly understood your point, that's already the case. You can't add duplicate words in the same record batch (if you try to do so, the second one will be dismissed).
| |
− | :# It's just a small file-loading issue, it will be fixed soon, see [https://phabricator.wikimedia.org/T201136 T201136]
| |
− | :# This is a major issue I'm already aware of. In some cases (~ 1 word out of 100), for some unknown reason, MediaWiki is mistaken in taking WAV files for executable files, so it refuses them...
| |
− | :# I'll try to add a way to listen the records while still in the recording studio.
| |
− | :# I wasn't aware of that particularities, I'll remove the underline. I'm not so fond of the white text on black, but I'll try to find something more accessible.
| |
− | :# Hum, this works well with me. When you have recorded the last word, the record automatically cuts off, did you click on the big red button to enable it again?
| |
− | :I've imported the Ho language, which was missing from Lingua Libre, but the two writing system you've mentionned are part of Unicode and should works, am I wrong?
| |
− | :Best regards — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 08:37, 3 August 2018 (UTC)
| |
− | ::+1 for point 7, the underline is also troublesome for Chinese. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 13:08, 6 August 2018 (UTC)
| |
− | ::Hi! Continuing the cleaning effort and tracking of issues, also to stay short and concise, I [https://lingualibre.fr/index.php?title=LinguaLibre:Chat_room&diff=next&oldid=63662 enhanced the initial post] with title and status (phabricator issue). Sorry for that, just cleaner. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 11:33, 24 December 2018 (UTC)
| |
− |
| |
− | == Première utilisation : quelques questionnements ==
| |
− |
| |
− | Bonjour !
| |
− |
| |
− | Tout d'abord, merci beaucoup pour ce super outil !
| |
− |
| |
− | J'ai remarqué quelques difficultés à l'usage. Peut-être que c'est juste parce que je suis nouvelle et pas au courant de toutes les options, mais voilà ma liste :
| |
− |
| |
− | # Sur une liste de 20 mots, il faut généralement que je reprenne l'enregistrement manuellement trois ou quatre fois parce que l'outil décide soudain de ne plus enregistrer. Quand je sélectionne un mot, même en cliquant sur le gros bouton rouge, il y a à peu près une chance pour deux pour que l'enregistrement se lance.
| |
− | # Mes mots sont très souvent coupés au début et à la fin (pour les noms propres en deux ou trois mots surtout) : peut-être qu'il serait pertinent d'avoir un petit bouton "next" pour marquer manuellement les fins de mots ? Sur 20 mots enregistrés, entre ceux que l'outil n'a pas envie de me laisser enregistrer (cf #1) et ceux qui sont coupés, m'en reste peu. Sur 3 listes d'une vingtaine de noms, j'en ai eu 2, 5 et 7 exploitables.
| |
− | # Sur une page d'enregistrement comme [[Q44570]], le lien vers la page Wikipédia met un + au lieu d'un _ entre les mots donc on arrive sur un lien rouge dans Wikipédia.
| |
− |
| |
− | Si ça peut servir, je suis sur la dernière version en date de Firefox au 10/10/18 & Windows 10.
| |
− |
| |
− | Pour le reste : c'est vraiment super, bravo pour tout ce travail ! Je vais continuer à faire joujou avec l'outil jusqu'à être bien familière avec.
| |
− |
| |
− | [[User:Exilexi|Exilexi]] ([[User talk:Exilexi|talk]]) 06:22, 10 October 2018 (UTC)
| |
− | :Les problèmes 1 et 2 sont en fait quasiment réglés avec un meilleur micro. Lingua Libre demande la permission pour un micro qui n'est pas mon micro par défaut, pour une raison inconnue.
| |
− | :Nouveau souci avec l'upload : tous les mots sauf 1 sont bien téléversés. Le bouton Commons s'affiche en grisé et rien ne se passe si je clique sur la petite croix à côté d'un mot : apparemment, c'est tout ou rien pour mettre sur Commons, donc je viens de perdre 29 mots parce qu'un seul refusait de s'uploader. [[User:Exilexi|Exilexi]] ([[User talk:Exilexi|talk]]) 06:44, 10 October 2018 (UTC)
| |
− | :: J'en ajoute un : j'avais enregistré 20 mots "autour de moi". Là, je viens d'en lancer 20 autres... et c'est les mêmes. Il pourrait être intéressant d'ajouter une option pour éviter d'enregistrer plusieurs fois la même chose (mon accent ne change pas d'un jour à l'autre). [[User:Exilexi|Exilexi]] ([[User talk:Exilexi|talk]]) 05:36, 11 October 2018 (UTC)
| |
− |
| |
− | Salut [[User:Exilexi|Exilexi]], quelques remarques ou éléments de réponse à tes commentaires
| |
− | # Lorsque tu décris que l'outil stoppe l'enregistrement, je pense que le problème vient de la qualité du micro. C'est ce que tu sembles avoir conclu également.
| |
− | # Lingua Libre découpe les mots automatiquement dès qu'il détecte un blanc. Pour les noms à rallonge, on pourrait envisager d'ajouter un bouton pour passer manuellement au mot suivant. Cela étant dit, ça perd un peu de l'intérêt de l'outil car ça devient beaucoup plus lent.
| |
− | # concernant le lien vers Wikipédia (avec un « + »), ça semble en effet un bogue. J'ai ouvert un [https://phabricator.wikimedia.org/T206801 ticket sur Phabricator].
| |
− | # pour les problèmes d'upload, quand un téléversement échoue, un [https://phabricator.wikimedia.org/T198014 ticket] existe déjà sur ce sujet.
| |
− | # pour les listes de mots, il est possible d'en créer soi-même. Il en existe déjà plusieurs en français (quelques dizaines) et moins dans les autres langues. Il est expliquer [[Help:Create_your_own_lists/fr|ici]] sur la façon de procéder. Si tu as besoin d'aide, fais-nous signe. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 20:13, 11 October 2018 (UTC)
| |
− |
| |
− | == Erreur sur Opera et IE en occitan ==
| |
− | :{{done}} -- no recent activity, soon to be archived. 10:40, 23 December 2018 (UTC)
| |
− | Bonjour,
| |
− | J'ai un problème quand je veux utiliser le Wizzard avec l'interface de Lingua Libre en occitan (sur Linux ou Windows):
| |
− | * avec Opera j'ai cette erreur : Impossible de traiter cette demande via lingualibre.fr à l'heure actuelle.
| |
− | * avec IE j'ai cette erreur : Le site Web ne peut pas afficher la page HTTP 500
| |
− | Par contre je n'ai aucun problème si je mets l'interface en français.
| |
− |
| |
− | [[User:Guilhelma|Guilhelma]]
| |
− | :Salut Guilhelma, j'ai déplacé ta question afin qu'elle soit plus visible. C'est à quel moment précisément que ce problème apparait ? [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 12:22, 4 November 2018 (UTC)
| |
− | ::Je suis en train d'investiguer pourquoi le RecordWizard se comporte ainsi concernant l'occitan, alors qu'il fonctionne bien avec toutes les autres langues que j'ai pu tester... je vous tiendrais au courant des avancements. — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 11:42, 12 November 2018 (UTC)
| |
− | : Désolée, je n'avais pas vu que le message avait été déplacé, cela arrive quand je clique sur Participer/ assistent d'enregistrement, Aure a le même problème aussi. [[User:Guilhelma|Guilhelma]]
| |
− | ::J'ai créé un [https://phabricator.wikimedia.org/T210477 ticket sur Phabricator] pour ne pas perdre de vue ce problème. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 07:00, 27 November 2018 (UTC)
| |
− | ::: [[User:Pamputt|Pamputt]], [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']], can we archive this discussion since you track this issue on Pharicator ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 13:55, 19 December 2018 (UTC)
| |
− |
| |
− | == Formosan languages workshop ==
| |
− | Hi there, I had an email exchange with Vicky, the [https://en.wikipedia.org/wiki/National_Chengchi_University NCCU] language researcher involved in Formosan languages protection. Some of her questions are [https://en.wikiquote.org/wiki/Toy_Story#Buzz_Lightyear beyond] my skills :
| |
− |
| |
− | 1. I couldn't find ais(Sakizaya), ami(Amis), trv(Truku) in the language list. Please add, thanks!
| |
− | 2. Can I add the dialect information in the speaker file?
| |
− | Because there are 42 dialects under 16 aboriginal languages, I had record Squliq dialect not C’uli’ dialect of Atayal language today.
| |
− | 3. I had add the Chinese translation after the aboriginal languages, is that ok for lingua libre?
| |
− | Or I only can type in aboriginal languages?
| |
− |
| |
− | I broke the questions in several subsections so a quick discussion may occurs for each. Please take notes that Vicky workshop is coming this week, so it would be cool to forward her practical solutions early.
| |
− | [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:38, 29 November 2018 (UTC)
| |
− |
| |
− | === 1) Requesting languages additions ===
| |
− | * Amis_language (iso: ami; wikidata: [[wikidata:Q35132|Q35132]]).
| |
− | * Sakizaya has no iso639, from my understanding. Sakizaya_language (iso: none, wikidata: [[wikidata:Q718269|Q718269]]), Nataoran_language (iso: ais, wikidata: [[wikidata:Q42508148|Q42508148]]).
| |
− | * Truku (no iso no wikidata) : is described in Wikipedia as the main component of Seediq language (iso: trv, wikidata: [[wikidata:Q716686|Q716686]]), already in LinguaLibre. Taiwanese linguist, the most experienced in the matter, are making a distinction.
| |
− | If I understand well, LL only requires wikidata ID. If so, I would recommend to add [[wikidata:Q35132|Q35132]] (amis), [[wikidata:Q718269|Q718269]] (Sakizaya). [[wikidata:Q42508148|Q42508148]] (Nataorans) and [[wikidata:Q716686|Q716686]] (Seediq) are already in I think. Truku may require a wikidata item creation, then integration in LL.
| |
− | [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:38, 29 November 2018 (UTC)
| |
− | :The four languages have been imported here: [[Q51311]] Seediq, [[Q51870]] Amis, [[Q51871]] Sakizaya and [[Q51872]] Nataoran and can be used for recording. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 04:15, 30 November 2018 (UTC)
| |
− |
| |
− | === 2) "There are 42 dialects under 16 aboriginal languages". ===
| |
− | We previously added 15 or 16 of these recognized languages into LinguaLibre (thanks x0 and Pamputt). Again, Taiwanese linguists are the experts on the matter, so what can we (LL) recommend for these 42 variants ? Two ideas came to me.
| |
− | # Add the information in he speaker name or place of learning. By example for : Paul Martin (Breton north) ; Paul Martin (Breton south).
| |
− | # Add the Wikidata items following Taiwanese linguists recommendations, while no wikipedia articles nor iso639 exists.
| |
− | What do you think ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:38, 29 November 2018 (UTC)
| |
− | : As far as I uundertand, if no Wikidata item exists for a given language, we have two options: create it on Wikidata (whether it is notable) and import here after or create it by hand directly here. So for dialect, I would say they are enough notable to be created on Wikidata but I have no time to do it by myself before the end of the year (I have no regular Internet connection for now). [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 04:18, 30 November 2018 (UTC)
| |
− | ::In fact, the second option mentionned above by Pamputt won't work. For a language to be recognised by the RecordWizard, it has to have a wikidata ID. The right way to do it imho is (as also suggested by Pamputt) to create the corresponding item on wikidata, and then ask for an import here. — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 14:46, 3 December 2018 (UTC)
| |
− |
| |
− | === 3) "Is it ok to use <code>mhway su (谢谢)</code> ?" (target word + translation) ===
| |
− | * '''Technically''', both aboriginal languages and Chinese, de factor the target word together with its closest macro-language's translation, here, Chinese.
| |
− | * Keep extremely '''consistent''' in your practice, so to ease later usages (learning apps). If the rule is
| |
− | <span style="color:green;">{aboriginal}{white_space}{opening_round_braket_(}{Chinese}{closing_round_braket_)}</span>
| |
− | stick to it, and avoid round brackets in other places of your element. Early consistency makes later usages easier.
| |
− | [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:38, 29 November 2018 (UTC)
| |
− | :@x0, devs, there again we have the questions of wordlists with translations. I previously suggested that [[Help:Create_your_own_lists|words lists]] support a iso639 syntaxe or wikidata id syntax so to push the translation into a different metadata field. Example of list :
| |
− | : mhway su [cmn:谢谢,eng:Thank]
| |
− | :Then "mhway su" is the target recorded word. "谢谢" is the translation in the meta data "cmn" (Chinese). "Thank" is the translation in the meta data "eng" (English). I guess I should open a ticket on Phabricator. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:19, 29 November 2018 (UTC)
| |
− | ::Multi-lingual wordlist --wordlist including the translation of target words-- are not supported at the moment. An issue have been opened on LinguaLibre developments and bugs tracking system ([https://phabricator.wikimedia.org/T211086 T211086]). [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:29, 4 December 2018 (UTC)
| |
| | | |
− | == Key directions and task forces == | + | == Is the Record Wizard not working for anyone else? == |
− | Following the [[LinguaLibre:Hackathon_15-16_Décembre|2018.12.15-16 Paris hackathon]] and ongoing [https://github.com/wikimedia-france/Lingua-Libre/wiki/Actions github clean up], I bumped into this past structuration :
| |
− | The Dev team
| |
− | The administrative-communication team
| |
− | The linguistic research team
| |
− | The recording team
| |
− | I think the idea is interesting and worth to keep in mind. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 13:35, 19 December 2018 (UTC)
| |
| | | |
− | == Thésaurus (2) ==
| + | My mic works with [https://mictests.com/ mictests.com], but [https://lingualibre.org/wiki/Special:RecordWizard the RecordWizard] doesn't pick anything up at the "check your microphone" stage. I've tried on both my phone and my laptop, and I can record sound in both cases, and I have the appropriate permissions enabled, but this particular website isn't detecting sounds. Is anyone else having this kind of problem? [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 23:43, 24 February 2024 (UTC) |
− | J'ai [https://lingualibre.fr/index.php?title=LinguaLibre%3AChat_room&type=revision&diff=62426&oldid=62425 archivé] le coeur de la discussion de Benoit & 0x010C, mais cet autre sujet mérite une section:
| + | :Hello [[User:Grendelkhan]], |
− | :"Rien à voir. Je pensais qu'un petit outil de génération de liste depuis un thésaurus fr.wikt ce serait top. Au lieu de choisir une catégorie d'un wikiprojet, on choisirait un thésaurus. Une idée comme ça. --[[User:Benoît Prieur|Benoît]] 21:36, 20 December 2018 (UTC)" | + | :I just received a second such report. User also checked [https://mictests.com/ mictests.com] sucessfully. |
− | --[[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:41, 24 December 2018 (UTC) | + | :On Firefox, Lingua Libre recording studio step 4, the microphone is allowed (we see the red microphone image on the left of the URL address). But after clicking the record button, no recording occurs. |
| + | :* Mictests on other site : successful. |
| + | :*Device: Notebook |
| + | :*OS: ? |
| + | :*Browser: Firefox, Chrome. |
| + | :*User: [[User:Akamycoco]]. |
| + | :*Languages affected: all. |
| + | :*Dates : Worked on February 28. Stopped working on February 29. |
| + | :Let's starts an investigation. Could you let me know your OS and precise web browser version ? (Help > About Chrome or similar) |
| + | :Let me know as well if you have basic developer skills to Right-click on the staled page > Inspect > Console : are there any error message ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 07:55, 1 March 2024 (UTC) |
| | | |
− | == My recordings never turn green ==
| + | ::My laptop is using Google Chrome <tt>122.0.6261.94 (Official Build) (64-bit)</tt> on Linux (Debian Testing). No error messages in the console when I attempt the recording. My phone is using Chrome <tt>122.0.6261.90</tt> on Android 14 on a Pixel 5a. It ''does'' seem to work on Firefox <tt>115.7.0esr (64-bit)</tt> on my laptop. (I really should have checked that before.) So maybe this is solely a Chrome problem? [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 16:30, 2 March 2024 (UTC) |
| | | |
− | I don't know if I'm doing something wrong, but it only remains blue or sometimes turns red, but never green. What am I missing? [[User:NMaia|NMaia]] ([[User talk:NMaia|talk]]) 11:08, 22 December 2018 (UTC)
| + | == Automatic categorization isn't documented. == |
− | :Hello [[User:NMaia|NMaia]], thanks for using LinguaLibre and providing this informations.<br>My first idea : if your microphone is too sensitive, audio will always be saturated, too loud, and thus fail (red).<br>After what, could you provide more info : your system, links to screenshots if possible or the bugging page's url and/or content description ?<br>Did you test with recent Chrome or Firefox, most of our users use these and it work fine. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:28, 23 December 2018 (UTC)
| |
− | ::Hi Yug, thanks for reaching out. I found out the problem: I was clicking the record button before it turned green. I didn't know I had to wait 😅 [[User:NMaia|NMaia]] ([[User talk:NMaia|talk]]) 17:17, 27 December 2018 (UTC)
| |
| | | |
− | == Warn the user when they try to record a file that they already made ==
| + | So far as I can tell, this isn't documented: if, for user Foo, category <tt>Lingua Libre pronunciation by Foo</tt> exists on Commons, then all uploads will be categorized into that category. This is helpful! It's also easy to backfill after the fact using [[:commons:Help:Gadget-Cat-a-lot]]. I'm not sure where to document this, but it seems reasonable to do so ''somewhere''. [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 16:26, 3 March 2024 (UTC) |
| | | |
− | Yesterday I accidetenly recorded the Esperanto word "tuj" twice. My second upload overwrote my first upload on Commons: https://commons.wikimedia.org/wiki/File:LL-Q143_(epo)-Robin_van_der_Vliet-tuj.wav
| + | == Understanding lingua-libre == |
| | | |
− | It would be nice if this site would give me a warning if I add a word to the list of words that I want to record. [[User:Robin van der Vliet|Robin van der Vliet]] ([[User talk:Robin van der Vliet|talk]]) 17:29, 22 December 2018 (UTC)
| + | Hi, I am creating this discussion to understand lingua-libre better |
− | :There have been oral discussions on this. Ideally, when in the Record wizard, facing a words list and ready to record, the list on screen can be compared with the speakers' previously recorded words. Via a checkbox, words already recorded could be toggled grey and skipped or into display:none. This feature request emerged recently as users are coming back (<3) to record again different words lists. It is not coded so far. @Devs: what is the best way to ask a such feature ? Phabricator issue could be opened, I guess. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:24, 23 December 2018 (UTC)
| |
− | ::Hi Robin van der Vliet! This feature is planned, but I have first to solve a couple of technical issues before its implementation. I can't give you any date yet, it will be at some point in early 2019 ;).
| |
− | ::@Yug: Phabricator is the right place to do so: https://phabricator.wikimedia.org/tag/lingua_libre/
| |
− | ::Best regards — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 12:15, 23 December 2018 (UTC)
| |
− | :::Ok, super. I opened [https://phabricator.wikimedia.org/T212580 T212580] and reorganized a bit the [https://phabricator.wikimedia.org/tag/lingua_libre/ working board]. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:14, 23 December 2018 (UTC)
| |
| | | |
− | == CJK recording and Studio CSS == | + | == Uploads are failing == |
− | :{{Done}} -- fixed on github. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:38, 23 December 2018 (UTC) | + | :''TLDR: Large amount of users reporting failure to upload at step 5 : [[User:Grendelkhan|Grendelkhan]], [[User:Culex|Culex]], [[User:XANA000|XANA000]], [[User:Ardzun|Ardzun]] (Indonesian languages), [[User:Penn Zero MSSJ|Penn Zero MSSJ]], [[User:Univòc64]] (Whistled Occitan) and [[User:Akamycoco]] (Taiwanese languages). This likely only tip of iceberg. Only few users were able to [https://lingualibre.org/index.php?hidebots=1&translations=filter&hidepageedits=1&hideWikibase=1&hidelog=1&namespace=0&limit=1000&days=14&enhanced=1&title=Special:RecentChanges&urlversion=2 record in May], with atypically low number of recordings. Indonesia workshop with ~15 participants critically affected. Investigation ongoing. [[User:Hugo en résidence|Hugo en résidence]] ([[User talk:Hugo en résidence|talk]]) 14:20, 13 May 2024 (UTC)'' |
− | Where can I edit the CSS of the record wizard ? The focus-underline is confusion for Chinese character and others scripts, I would like to remove it. See [https://commons.wikimedia.org/wiki/File:LinguaLibre-Recorder-Chinese_characters_underline.png first blue word underlined via css here] for example. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:14, 23 December 2018 (UTC)
| |
− | :Fixed [https://github.com/lingua-libre/LinguaRecorder/commit/c48d2e6f6cb31acef8a39245bab1eccc5dbdb969 on github]. Effective in next deploiment. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:38, 23 December 2018 (UTC)
| |
− | ::I've reverted your commit, because what you've edited is the css of the demo studio of the LinguaRecorder library, not the studio of the RecordWizard. The right file to edit is [https://github.com/lingua-libre/RecordWizard/blob/master/modules/ext.recordWizard.css#L249 this one] ;). (and please, don't put a red border, user may think there is an error...) — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 00:14, 24 December 2018 (UTC)
| |
− | :::Thanks 0x010C !! I found the "demo" name a bit suspicious indeed XD Thanks for the correction ! [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:34, 24 December 2018 (UTC)
| |
| | | |
− | == Limitation dans le choix de catégories ==
| + | I can record words, but uploading them to Commons fails. The JavaScript console has the following message: |
| | | |
− | Bonjour,
| + | : <tt>'''Your IP address is in a range that has been [[m:Special:MyLanguage/Global blocks|blocked on all Wikimedia Foundation wikis]].''' The block was made by [[User:EPIC|EPIC]]. The reason given is ''[[m:Special:MyLanguage/NOP|Open proxy/Webhost]]: See the [[m:WM:OP/H|help page]] if you are affected''. * Start of block: 10:09, 1 May 2024 * Expiry of block: 10:09, 1 May 2027 Your current IP address is 2001:41d0:304:100::4790. The blocked range is 2001:41D0:0:0:0:0:0:0/33. Please include all above details in any queries you make. If you believe you were blocked by mistake, you can find additional information and instructions in the [[m:Special:MyLanguage/No open proxies|No open proxies]] global policy. Otherwise, to discuss the block please [[m:Steward requests/Global|post a request for review on Meta-Wiki]]. You could also send an email to the [[m:Special:MyLanguage/Stewards|stewards]] [[m:Special:MyLanguage/VRT|VRT]] queue at "stewards@wikimedia.org" including all above details.`, blockinfo: {…}, "*": "See https://commons.wikimedia.org/w/api.php for API usage. Subscribe to the mediawiki-api-announce mailing list at <https://lists.wikimedia.org/postorius/lists/mediawiki-api-announce.lists.wikimedia.org/> for notice of API deprecations and breaking changes." |
| | | |
− | Ce n'est pas possible d'enregistrer une langue donnée quand on sélectionne un lexique de cette langue dans un wikiprojet qui n'est pas dans cette langue.
| + | This is not my IP address shown in the error message, and whatismyip confirms that I'm not behind a proxy. The Global block request [https://meta.wikimedia.org/wiki/Steward_requests/Global/2024-w18#Global_block_for_Special:Contributions/2001:41D0:0:0:0:0:0:0/33 is here]. Is this affecting anyone else? I lost a heap of recordings. [[User:Grendelkhan|Grendelkhan]] ([[User talk:Grendelkhan|talk]]) 22:26, 4 May 2024 (UTC) |
| + | :Uploads are failing for me today too, even though I am recording with my account. [[User:Culex|Culex]] ([[User talk:Culex|talk]]) 15:04, 8 May 2024 (UTC) |
| + | :: Idem--[[User:XANA000|XANA000]] ([[User talk:XANA000|talk]]) 16:49, 9 May 2024 (UTC) |
| + | ::: I can record, but i couldn’t uploaded until today. I was able to upload once yesterday, but after that I couldn't upload any more. [[User:Ardzun|Ardzun]] ([[User talk:Ardzun|talk]]) 06:04, 11 May 2024 (UTC) |
| + | :I guess I'm not the only one who's been trying for weeks but could not publish audio after 1 May. Hope someone can fix it. [[User:Penn Zero MSSJ|Penn Zero MSSJ]] ([[User talk:Penn Zero MSSJ|talk]]) 20:54, 13 May 2024 (UTC) |
| + | ::[[User:Univòc64]] (Whistled occitan) and [[User:Akamycoco]] (Taiwanese languages) also reported issues. |
| + | ::It seems time to add a sitenotice warning. [[User:Hugo en résidence|Hugo en résidence]] ([[User talk:Hugo en résidence|talk]]) 14:07, 13 May 2024 (UTC) |
| + | ::In may we have mostly : 556 recordings by 7 users on May 1th, 174 recordings on May 11th ([[Special:Contributions/Austin Zhang|Austin Zhang]]), then nothing. |
| + | ::If we compare with [https://public-paws.wmcloud.org/User:Yug/QueryLingualibre-monthly.ipynb known monthly recordings], our average months recently was 30k audios, the lowest ones were 5k audios, May 2024 is heading toward 1200 audios or 5% of the average month and 20% of the lowest months. Something weird is going on indeed. |
| + | {| class=wikitable |
| + | ! Most prolific speakers for the current month || Months since 2022 |
| + | |- |
| + | | |
| + | <query _pagination="10" locutor="<translate><!--T:7--> Item (locutor Qid)</translate>" locutorLabel="<translate><!--T:8--> Speakers of the Month</translate>" nb="<translate><!--T:9--> Number of records</translate>"> |
| + | SELECT ?locutor ?locutorLabel ?nb WHERE { |
| + | { |
| + | SELECT ?locutor (COUNT(?record) as ?nb) |
| + | WHERE { |
| + | ?record prop:P2 entity:Q2 . # Q2: record, P2: instance of. |
| + | ?record prop:P5 ?locutor . # Property:P5: speaker |
| + | ?record prop:P6 ?date . |
| + | FILTER ( YEAR(?date) = YEAR(NOW()) && MONTH(?date) = MONTH(NOW()) ) |
| + | } |
| + | GROUP BY ?locutor ?locutorLabel |
| + | ORDER BY DESC(?nb) |
| + | LIMIT 50 |
| + | } |
| + | SERVICE wikibase:label { |
| + | bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en" . |
| + | ?locutor rdfs:label ?locutorLabel . |
| + | } |
| + | } |
| + | ORDER BY DESC(?nb) |
| + | </query> |
| + | | |
| + | <pre> |
| + | { date:2022-01, records: 21290, speakers: 46, languages: 28 }, |
| + | { date:2022-02, records: 3894, speakers: 40, languages: 17 }, |
| + | { date:2022-03, records: 8357, speakers: 61, languages: 21 }, |
| + | { date:2022-04, records: 5454, speakers: 34, languages: 18 }, |
| + | { date:2022-05, records: 4702, speakers: 59, languages: 30 }, |
| + | { date:2022-06, records: 7675, speakers: 41, languages: 18 }, |
| + | { date:2022-07, records: 4364, speakers: 37, languages: 22 }, |
| + | { date:2022-08, records: 9544, speakers: 45, languages: 23 }, |
| + | { date:2022-09, records: 5802, speakers: 113, languages: 30 }, |
| + | { date:2022-10, records: 6931, speakers: 74, languages: 32 }, |
| + | { date:2022-11, records: 8461, speakers: 54, languages: 34 }, |
| + | { date:2022-12, records: 11882, speakers: 54, languages: 23 }, |
| + | { date:2023-01, records: 18150, speakers: 48, languages: 29 }, |
| + | { date:2023-02, records: 32441, speakers: 65, languages: 29 }, |
| + | { date:2023-03, records: 11527, speakers: 61, languages: 30 }, |
| + | { date:2023-04, records: 8451, speakers: 58, languages: 35 }, |
| + | { date:2023-05, records: 21282, speakers: 97, languages: 49 }, |
| + | { date:2023-06, records: 17940, speakers: 56, languages: 35 }, |
| + | { date:2023-07, records: 75825, speakers: 74, languages: 38 }, |
| + | { date:2023-08, records: 32681, speakers: 54, languages: 30 }, |
| + | { date:2023-09, records: 28813, speakers: 114, languages: 30 }, |
| + | { date:2023-10, records: 60317, speakers: 167, languages: 47 }, |
| + | { date:2023-11, records: 49704, speakers: 140, languages: 55 }, |
| + | { date:2023-12, records: 42383, speakers: 114, languages: 41 }, |
| + | { date:2024-01, records: 40572, speakers: 112, languages: 40 }, |
| + | { date:2024-02, records: 22385, speakers: 197, languages: 57 }, |
| + | { date:2024-03, records: 16997, speakers: 173, languages: 48 }, |
| + | { date:2024-04, records: 8733, speakers: 117, languages: 42 }, |
| + | { date:2024-05, records: 556, speakers: 7, languages: 7 } |
| + | </pre> |
| + | |- |
| + | ! Daily recordings over April and May 2024 || |
| + | |- |
| + | | |
| + | <query _pagination="40"> |
| + | SELECT |
| + | ?yearmonthday |
| + | (COUNT(DISTINCT ?record) AS ?records) |
| + | (COUNT(DISTINCT ?speaker) AS ?speakers) |
| + | (COUNT(DISTINCT ?language) AS ?languages) |
| + | WHERE { |
| + | ?record prop:P5 ?speaker . |
| + | ?record prop:P4 ?language . |
| + | ?record prop:P6 ?date . |
| + | BIND( SUBSTR(str(?date), 0, 11) as ?yearmonthday ) |
| + | { SELECT ?record |
| + | WHERE { |
| + | ?record prop:P2 entity:Q2 . |
| + | ?record prop:P6 ?date . |
| + | FILTER(?date >= "2024-04-01T00:00:00Z"^^xsd:dateTime) |
| + | FILTER(?date < "2024-05-30T00:00:00Z"^^xsd:dateTime) |
| + | } |
| + | } |
| + | } |
| + | GROUP BY ?yearmonthday |
| + | ORDER BY (?yearmonthday) |
| + | </query> |
| + | | <= stops on 2024.05.01<br>Note: [[Special:Contributions/Austin Zhang|Austin Zhang]] recorded 174 audios on 05.11 |
| + | |} |
| + | [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:39, 14 May 2024 (UTC) |
| | | |
− | Par exemple : impossible d'enregistrer en anglais la catégorie "Catégorie:Lexique en anglais du cyclisme" issue de fr.wikt.
| + | === Fixed === |
| + | Both IP ranges 2001:41D0:0:0:0:0:0:0/32 and 2001:41D0:0:0:0:0:0:0/33 were subject to global Wikimedia block at one point (see [https://meta.wikimedia.org/w/index.php?title=Steward_requests/Global&oldid=26774369#Unregistered_users_only_block_for_the_range_2001:41D0:0:0:0:0:0:0/32 Global ban range_2001:41D0:0:0:0:0:0:0/32]). Following our request, the ban have been reconfigured and uploads from LinguaLibre are possible again. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:38, 14 May 2024 (UTC) |
| + | :I can record and upload since yesterday with my account, so that seems fixed. But it seems the stats are still not updated. [[User:Culex|Culex]] ([[User talk:Culex|talk]]) 12:08, 15 May 2024 (UTC) |
| | | |
− | Le problème - possiblement - est que la catégorie n'est pas "nativement" associée à l'anglais et qu'il faudrait sans doute vérifier préalablement que "Catégorie:anglais" est parente de la catégorie choisie pour fiabiliser les enregistrements à venir juste après.
| + | === Logs === |
| + | For references, I investigated the relevant block logs and uploads logs for May 2024.<br>Conclusion: the uploads collapse is coherent with the IP Ban. Still, given bug reports from Akamycoco in *March* and 咽頭べさ [[:c:File:Lingua_Libre_error_2024.webm|on step 4]], I suspects other bugs are lingering around. |
| + | {| class=wikitable |
| + | !width=50%| Global IP bans |
| + | ! Lingualibre uploads logs |
| + | |- |
| + | | |
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F32&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 18:46, 13 May 2024] EPIC talk contribs changed global block settings for 2001:41d0::/32 talk with an expiration time of 00:51, 10 May 2026 (anonymous users only) (No open proxies <!-- SCLT ID: Possible VPN or Colocation -->) |
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F32&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 00:51, 10 May 2024] AmandaNP talk contribs globally blocked 2001:41d0::/32 talk with an expiration time of 00:51, 10 May 2026 (No open proxies <!-- SCLT ID: Possible VPN or Colocation -->) |
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F33&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 17:02, 9 May 2024] EPIC talk contribs changed global block settings for 2001:41d0::/33 talk with an expiration time of 17:09, 1 May 2027 (anonymous users only) (Open proxy/Webhost: See the help page if you are affected) |
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F33&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 17:09, 1 May 2024] EPIC talk contribs blocked 2001:41d0::/33 talk with an expiration time of 2 years, 364 days, 12 hours, 21 minutes and 36 seconds (anonymous users only, account creation disabled) (Open proxy/Webhost: See the help page if you are affected) |
| + | * [https://meta.wikimedia.org/wiki/Special:Log?type=&user=&page=User%3A2001%3A41D0%3A%3A%2F33&wpdate=&tagfilter=&wpFormIdentifier=logeventslist 17:09, 1 May 2024] EPIC talk contribs globally blocked 2001:41d0::/33 talk with an expiration time of 17:09, 1 May 2027 (Open proxy/Webhost: See the help page if you are affected) |
| + | | |
| + | * : [https://commons.wikimedia.org/wiki/Special:RecentChanges?hidebots=1&translations=filter&hidecategorization=1&hideWikibase=1&tagfilter=OAuth+CID%3A+1735&limit=500&days=30&urlversion=2 Uploads via Lingualibre resumed]. |
| | | |
− | Bref. Ce n'est pas vraiment une question mais plutôt une piste de réflexion.
| + | 13 May 2024 |
| + | * [... Many more uploads] |
| + | * Upload log 23:39 Elwinlhq talk contribs uploaded File:LL-Q5218 (que)-Elwinlhq-apaqay.wav Tag: Lingua Libre [2.2] |
| + | * Upload log 19:05 Assassas77 talk contribs uploaded a new version of File:LL-Q9192 (cmn)-Assassas77-八角.wav Tag: Lingua Libre [2.2] |
| + | * Upload log 19:05 Assassas77 talk contribs uploaded File:LL-Q9192 (cmn)-Assassas77-八角.wav Tag: Lingua Libre [2.2] |
| + | * Upload log 16:38 Oh! Tea<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Oh!_Tea Commons > User:Oh!_Tea : « nothing » on Commons]</ref> talk contribs uploaded File:LL-Q36759-Austin Zhang-sih8 buh8 sah8 nah4.wav Tag: Lingua Libre [2.2] |
| + | 11 May 2024 |
| + | * Upload log 20:21 Oh! Tea talk contribs uploaded File:LL-Q36759-Austin Zhang-buah8.wav Tag: Lingua Libre [2.2] |
| + | * []... +172 recording by User:Oh! Tea] |
| + | * Upload log 18:56 Oh! Tea talk contribs uploaded File:LL-Q36759-Austin Zhang-a2.wav Tag: Lingua Libre [2.2] |
| + | 10 May 2024 |
| + | * Upload log 06:08 CapitainAfrika<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:CapitainAfrika Commons > User:CapitainAfrika : « IP block exempt » on Commons]</ref> talk contribs uploaded File:LL-Q36217 (lin)-CapitainAfrika-Wiki na monɔkɔ mua bísó.wav Tag: Lingua Libre [2.2] |
| + | * Upload log 00:14 Ardzun<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Ardzun Commons > User:Ardzun : « nothing »]</ref> talk contribs uploaded File:LL-Q13324 (min)-Ardzun-mada.wav Tag: Lingua Libre [2.2] |
| + | 9 May 2024 |
| + | * Upload log 17:08 Àncilu<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Àncilu Commons > User:Àncilu : « Autopatroller » on Commons]</ref> talk contribs uploaded File:LL-Q652 (ita)-XANA000-orsù.wav Tag: Lingua Libre [2.2] |
| + | * Upload log 17:05 Àncilu talk contribs uploaded File:LL-Q652 (ita)-XANA000-frac.wav Tag: Lingua Libre [2.2] |
| + | 5 May 2024 |
| + | * Upload log 21:15 Benoît Prieur<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Benoît_Prieur Commons > User:Benoît_Prieur : « Administrator » on Commons]</ref> talk contribs uploaded File:LL-Q8785 (hye)-Benoît Prieur-Artsakh.wav Tag: Lingua Libre [2.2] |
| + | 1 May 2024 |
| + | * Upload log 16:09 Penn Zero MSSJ<ref>[https://commons.wikimedia.org/wiki/Special:Log?page=User:Penn_Zero_MSSJ Commons > User:Penn Zero MSSJ : « nothing » on Commons]</ref> talk contribs uploaded File:LL-Q9199 (vie)-Penn Zero MSSJ-hệ số.wav Tag: Lingua Libre [2.2] |
| + | * Upload log 16:09 Penn Zero MSSJ talk contribs uploaded File:LL-Q9199 (vie)-Penn Zero MSSJ-hỗn số.wav Tag: Lingua Libre [2.2] |
| + | * Upload log 16:09 Penn Zero MSSJ talk contribs uploaded File:LL-Q9199 (vie)-Penn Zero MSSJ-hằng đẳng thức.wav Tag: Lingua Libre [2.2] |
| + | * [... Many more uploads] |
| + | |- |
| + | |colspan=2| <small><references /></small> |
| + | |} |
| + | [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 10:38, 14 May 2024 (UTC) |
| | | |
− | Bonnes fêtes à tous et à toutes. --[[User:Benoît Prieur|Benoît]] ([[User talk:Benoît Prieur|talk]]) 11:51, 25 December 2018 (UTC)
| + | == Kinyarwanda language representation == |
− | :Salut Benoît :)
| |
− | :Est-tu sûr que tu as bien sélectionné "fr - Wiktionnaire" (quand tu enregistre de l'anglais, c'est Wikipédia en anglais qui est sélectionné par défaut) et que tu n'as pas fait une faute de frappe ? Car j'arrive bien à charger la catégorie anglaise que tu indiques...
| |
− | :Si effectivement cela ne fonctionne pas chez toi, je regarderais plus en profondeur.
| |
− | :Bonnes fêtes à toi aussi ! — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 00:23, 26 December 2018 (UTC)
| |
− | ::Coucou,
| |
− | ::Non, c'est moi qui me suis emmêlé les pinceaux (du coup je vais enregistrer "s’emmêler les pinceaux").
| |
− | ::Ça marche effectivement très bien.
| |
− | ::--[[User:Benoît Prieur|Benoît]] ([[User talk:Benoît Prieur|talk]]) 08:22, 26 December 2018 (UTC)
| |
− | :::Chouette chouette chouette :) — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 11:11, 27 December 2018 (UTC)
| |
| | | |
− | == Why Commons logs me out so quickly during recording ==
| + | I'm Robert RUGAMBA from Rwanda and i belong to Wikimedia Rwanda as a volunteer and event organizer. |
| + | I'm exited to explore this platform of lingua libre and i wish my local languages to be add and represented. the wikidata rabel is: https://www.wikidata.org/wiki/Q33573 |
| | | |
− | I usually take about 100 lexemes from Wikidata to record. I do it that way:
| + | Thanks. [[User:Annick green|Annick green]] |
− | # enter record wizard, it asks me to log into commons, I do it
| + | :{{Done}} This language was already on Lingualibre as [[Q285]]. If you open [[Special:RecordWizard]], at step 2, add it to your list of known languages. Please type in « Kinyarwanda », «Ikinyarwanda » and you should find it. Only user who have declared to know Kinyarwanda can record in Kinyarwanda. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:50, 27 June 2024 (UTC) |
− | # then I quickly setup speaker
| |
− | # then I quickly setup list of words by entering url to query
| |
− | # then I record words
| |
− | # then I listen recorded words to catch any failures
| |
− | # then I upload records to commons
| |
− | I do every step of above as quick as possible. But in the middle of the upload to commons uploading stops and all failures happen. It took me a day to realise that I'm simply logged out from my OAuth Commons session. I workaround this now by opening second window of record wizard and again pass first step with logging in. Then return to window with upload session and retry. Finally everything is uploaded. Note all this time I'm not logged out from Commons, only OAuth session of Lingua Libre is logged out. Why OAuth session in Commons is so short? [[User:KaMan|KaMan]] ([[User talk:KaMan|talk]]) 12:20, 25 December 2018 (UTC)
| |
− | :Same here. I do some wiki maintenance mainly, but my login time is about 30 mins these past few days. Too short. I may be a real issue for recording sessions. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:15, 25 December 2018 (UTC) | |
− | ::Yep, I know... [[User:Jitrixis|Jitrixis]] tried during the last Lingua Libre hackathon to fix this, bit it seems that it has not changed.
| |
− | ::This issue is near the top of my personal todo-list, so I expect to have a look at it by the end of the week.
| |
− | ::Thanks KaMan for the report! — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 00:28, 26 December 2018 (UTC)
| |
− | :::Ok, after a day of research, I've found the issue. I'm currently trying a patch on the testing instance of Lingua Libre. If it works as expected, I'll deploy it here this night (UTC+2 timezone). The session will last 24 hour after that. — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 19:17, 26 December 2018 (UTC)
| |
− | ::::I've just pushed the new patch into Lingua Libre, so that you'll have to login only once as long as you keep your browser open from now on. — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 11:20, 27 December 2018 (UTC)
| |
− | :::::Many thanks! I will test it. [[User:KaMan|KaMan]] ([[User talk:KaMan|talk]]) 13:12, 27 December 2018 (UTC)
| |
| | | |
− | == Feature request: ask to reuse existing audio if there is already identical one == | + | == Rename my pseudonym == |
| | | |
− | I waste a lot of time because Lingua Libre Bot has to have new audio for every lexeme forms. For example this audio https://commons.wikimedia.org/wiki/File:LL-Q809_(pol)-KaMan-Bizancjum.wav I had to record 10 times (https://lingualibre.fr/index.php?title=Q55850&action=history). A lot of forms in Polish language is duplicated in different cases. It would be great if in word generator (+ExternalTools) in Record Wizard could be question to ask if duplicate should be recorded (identical speaker, language and lexeme), and Lingua Libre Bot propagate existing audio. It could save time. [[User:KaMan|KaMan]] ([[User talk:KaMan|talk]]) 14:28, 25 December 2018 (UTC) | + | Hello. I've renamed my account on wikimedia sites but can't log in directly from this username here. Do i have something to do ? My old username is '''ElsaBester''' and the new one is '''L'embellie'''. Thanks ! |
− | :KaMan, where does your wordlist(s?) come from ? how is it created ? You use LinguaLibre word generator ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 00:12, 27 December 2018 (UTC) | + | :Hello [[User:ElsaBester|L'embellie]], |
− | :If I understand well, you eventually have the same issue as raised in [[LinguaLibre:Chat_room#Warn_the_user_when_they_try_to_record_a_file_that_they_already_made|Warn the user when they try to record a file that they already made]]. Namely, you meet again and again words that you already recorded. If this is correct, then we started to look for technical solutions ([https://phabricator.wikimedia.org/T212580 T212580]). As of now, for long series, it is important to stick to large frequency list, so to not re-record similar words multiple times. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 00:17, 27 December 2018 (UTC) | + | :I may ping [[User:WikiLucas00|WikiLucas00]], but I think we don't currently have solution for your issue. |
− | :I took a look online for available frequency lists in polish.
| + | :We are phasing out this wiki, we hope to release a new Lingualibre this winter or early 2025. So this issue will be irrelevant by then. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 20:00, 22 August 2024 (UTC) |
− | :* Subtlex-pl : [http://crr.ugent.be/papers/subtlex-pl.pdf article], [http://crr.ugent.be/papers/subtlex-pl.pdf http://crr.ugent.be/programs-data/subtitle-frequencies/subtlex-pl data], available but "for research usage".
| + | ::Hey there {{ping|ElsaBester|Yug}}. Sorry I don't have a solution, but I found this in the Chat Room's archives: [[LinguaLibre:Chat_room/Archives/2023#Update_my_username]]. Good luck — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 18:46, 26 August 2024 (UTC) |
− | :* Worldlex : [https://link.springer.com/article/10.3758/s13428-015-0621-0 article], [http://worldlex.lexique.org data], available but unstated license
| + | :::Hello {{ping|ElsaBester}} you may also look at my latest reply on [[User talk:Yug]], it's not a great option but maybe you'll want to try it. All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 12:24, 31 August 2024 (UTC) |
− | :* Hermit Dave, 2016 : [https://invokeit.wordpress.com/frequency-word-lists/ page], [https://github.com/hermitdave/FrequencyWords/tree/master/content/2016/pl data], CC-by-sa
| |
− | :So Hermit Dave's data would do. We have tutorials on [[Help:How_to_create_a_wordlist_%3F#Command|how to clean up frequency lists]],[[Help:How_to_create_a_wordlist_%3F#Splitting_a_very_long_file|how to split such long file]], other [[Help:How_to_create_a_wordlist_%3F#From_corpus_to_frequency_data_.60.7Boccurences.7D_.7Bitem.7D.60|tricks]], and [[Help:Create_your_own_lists#Create_a_new_list|how to create a list on LinguaLibre]] to help. | |
− | :Some command will need minor changes if your input differs. If you have some basic shell skills, you can do it and learn the exact commands needed quickly. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 01:30, 27 December 2018 (UTC)
| |
− | ::No Yug. He's talking about word lists generated with a SPARQL query from Lexemes on Wikidata, and from the fact that Lingua Libre Bot only associate audio recordings on the Lexeme when there is a direct link, causing him to re-record many times homograph words that are also homonym. | |
− | ::But the main issue I pointed out in [https://phabricator.wikimedia.org/T212580 T212580] apply here too, I don't have any idea of easy and effective implementation right now.
| |
− | ::(and no Yug, it is not "''important to stick to large frequency list''", we have other —more simple— solutions yet as Wikimedia categories or external tools imports).
| |
− | ::Best regards — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 11:10, 27 December 2018 (UTC)
| |
− | ::: 0x010C is right. It's not problem of wrong list, list of words is correct. If there is no easy solution to it I can work with it as is but I admit I feel pain ;) before recording of 14 identical forms of https://www.wikidata.org/wiki/Lexeme:L19356 :) [[User:KaMan|KaMan]] ([[User talk:KaMan|talk]]) 13:22, 27 December 2018 (UTC)
| |
− | ::::"Who doesnt try cannot be wrong." It really needs to read between lines to find the Wikidata reference. "Lexeme" is lexicology term before being a Wikidata item type. The current SPARQL query doesnt seems time savy. | |
− | ::::And yes, generally speaking frequency list of unique words save our speakers energy. First, each form is recorded only once : this is why human speakers are for, and they shouldn't have to record multiple times a same form. Second, in natural language, words frequency follow the [[:wikipedia:Zipf's law|Zipf's law]]. Thus, the 135 most frequent English items represent 50% coverage of written text. On the opposite side, recording Wikipedia categories is not representative of human language and thus not time efficient. One volunteer can audio record 2000 categories it will still barely account for 1% of this human language. This only has internal value, by wikipedians for wikipedians, which is positive but sub-optimal.
| |
− | ::::As of KaMan's case, I would still recommend using frequency list : it would save valuable human time. A later bot could dispatch the audios upon the various wikidata items of this language and form. So I just used Hermit Dave CC-by-sa data to create Polish language frequency lists on LinguaLibre for the first 20k words, they are now availale to in the [[Special:RecordWizard|Record Studio]] > Details step : Local list > "pol". [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 13:51, 27 December 2018 (UTC)
| |
− | ::::: Yug, it's not a problem of frequency list but feature of language. I record all FORMS of words. Every noun in Polish has at least 14 forms, every adjective has 30-80 forms, same for verbs. Every form has entry in Wikidata and needs recording. But many of these forms are identical so in the end I have to record the same audio several times. It is independent from the fact the word is from frequency list. In other words word from frequency list has the same problem in Wikidata. BTW: I already follow frequency list in creating lexemes in Wikidata, but thanks :) [[User:KaMan|KaMan]] ([[User talk:KaMan|talk]]) 16:27, 27 December 2018 (UTC)
| |
| | | |
− | == Homonymy == | + | == Two French words that are impossible to record == |
| | | |
− | How homonyms are treated? Will they be overwritten with new recordings? [[User:Infovarius|Infovarius]] ([[User talk:Infovarius|talk]]) 17:42, 27 December 2018 (UTC)
| + | Hi, |
− | :Yes, if a new word has the same transcription, the same language and the same speaker as an old one, it will be override. If you want to record two homonym words that have a different pronunciation, you can add a small qualifier into brakets just after the word when you type it in the 3rd step of the RecordWizard. Everything that is inside brackets will be put aside, like on this record [[:File:LL-Q150 (fra)-0x010C-fils (enfant).wav]]. — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 21:26, 27 December 2018 (UTC)
| |
− | :: It is good that this is possible in principle. But how can I know that I am recording a homonym of something already recorded? [[User:Infovarius|Infovarius]] ([[User talk:Infovarius|talk]]) 21:51, 27 December 2018 (UTC)
| |
| | | |
− | == Categories ==
| + | Two words are impossible to record (even before uploading): ''esclavesse'' and ''scribesse'' (all my attempts with other words work). [[User:Avatea|Avatea]] ([[User talk:Avatea|talk]]) 18:49, 30 August 2024 (UTC) |
| + | :Hi {{ping|Avatea}}. Sorry for the late reply. I couldn't reproduce the issue on my side, as you can see ({{Q|1385666}}, {{Q|1385667}}) I just recorded a few words ending with -esse, including the two words you mention, without encountering any issue. Did you try again recently? All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 14:42, 21 September 2024 (UTC) |
| + | :: Hi {{ping| WikiLucas00}} |
| + | :: No. I just tried, I was able to record another word, but still not those two. [[User:Avatea|Avatea]] ([[User talk:Avatea|talk]]) 19:06, 21 September 2024 (UTC) |
| + | :: After several dozen new recordings (and I had made hundreds of others before), still unable to record these two words. Tested on macOS and Windows. [[User:Avatea|Avatea]] ([[User talk:Avatea|talk]]) 21:26, 7 October 2024 (UTC) |
| | | |
− | How can I automatically add some categories to new Commons uploads (like "Russian pronunciation" and others)? [[User:Infovarius|Infovarius]] ([[User talk:Infovarius|talk]]) 17:44, 27 December 2018 (UTC)
| + | == Supprimer deux enregistrements incorrects. == |
− | :Currently this is not possible but it is a planned feature, see [https://phabricator.wikimedia.org/T201135 task T201135 on phabricator]. — [[User:0x010C|'''0'''x'''010<span style="color: #00C41C;">C</span>''']] <sup>[[User_talk:0x010C|~talk~]]</sup> 21:46, 27 December 2018 (UTC)
| |
| | | |
− | == How to properly credit lists ==
| + | Bonjour! À cause d'une erreur lors d'écriture et parce que je l'ai fait pressé, j'ai enregistré par erreur deux termes: *"[[Q1387394|escaramón]]" et son pluriel *"[[Q1387395|escaramones]]". Serait-il possible de supprimer ces fichiers enregistrés ? J'ai déjà fait les enregistrements corrects de ces mots bien écrits et avec la prononciation correcte: "[[Q1387396|escamarón]]", "[[Q1387397|escamarones]]". Vous pouvez vérifier l’exactitude de ce terme [https://diccionariu.alladixital.org/index.php?cod=21008 ici]. Désolé pour le dérangement. --[[User:Limotecariu|Limotecariu]] ([[User talk:Limotecariu|talk]]) 20:31, 28 September 2024 (UTC) |
− | ([https://phabricator.wikimedia.org/T212671 T212671]) I attempted this [[https://lingualibre.fr/wiki/List:Pol/words-by-frequency-2001-to-4000#Source]], but loading the list in the Record Studio keeps the source section as a word to record. Is there a know trick to hide this source section in the Record Studio ? [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 16:56, 28 December 2018 (UTC)
| |