User talk
Difference between revisions of "Olaf"
(→Bot) |
WikiLucas00 (talk | contribs) (→French list: new section) |
||
(24 intermediate revisions by 7 users not shown) | |||
Line 8: | Line 8: | ||
== Bot == | == Bot == | ||
Olafbot got the bot status from Pamputt. I moved our conversation to [[LinguaLibre:Technical board]], I didn't answer there to leave this opportunity to others (you may want to ping Poslovitch and VIGNERON on that one). But I think we get a good filter tool in line with your vision via either per-language commons category or/and sparql queries. The implemetation is to design, and they will see details better than me, but it's sounds possible to me. Anyway, let's see on the T-board how does the conversations goes ! [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 12:24, 25 February 2021 (UTC) | Olafbot got the bot status from Pamputt. I moved our conversation to [[LinguaLibre:Technical board]], I didn't answer there to leave this opportunity to others (you may want to ping Poslovitch and VIGNERON on that one). But I think we get a good filter tool in line with your vision via either per-language commons category or/and sparql queries. The implemetation is to design, and they will see details better than me, but it's sounds possible to me. Anyway, let's see on the T-board how does the conversations goes ! [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 12:24, 25 February 2021 (UTC) | ||
+ | :As for your question about other non-Lili audios, I think we should ignore them. Lili has the firepower and methodology to roll on them, replace them, and we should do so. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 13:34, 26 February 2021 (UTC) | ||
+ | ::Olaf hello, | ||
+ | ::Is your bot simple or a large multitask beast ? I would be interested to add its code to our github.com/lingua-libre'. I'am also personally interested to edit UNILEX's lists and talkpages programatically. So I'am looking for code samples, NodeJS first.... but Python seems accessible as well. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 11:44, 1 March 2021 (UTC) | ||
+ | :::Ok, thank you. So I will explore [[Help:Bots]]'s Wiki API, a JS based project. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 17:02, 1 March 2021 (UTC) | ||
+ | :::Note: I also therefore assume that your bot should not be migrated on our LinguaLibre repository. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 17:27, 1 March 2021 (UTC) | ||
+ | |||
+ | == OlafBot == | ||
+ | Hello Olaf, I'am pausing few days before running my Dragons Bot to import UNILEX's lists. Doing so I wrote down [[LinguaLibre:Chat_room#Pause_before_running|the rational]], can you review it and edit to add the number of lists you created ? I believe its 72. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 14:43, 3 March 2021 (UTC) | ||
+ | :Olaf, did you take a look to my [[LinguaLibre:Chat_room#Marathi_women_speakers_celebrate_.27Women.27s_Day.27_.26_.27Women_History_Month.27_on_Lingua_Libre|Marathi bash code]] on the chatroom. While I think unilex lists should be constant, as a methodical 101 straight path, I think the Marathi "by letter" requests may gain from your bot. If I understand well your bot is the only current way we have to avoid a recorder B to redo a word already done by a contributor A. I also see the Marathi community is not that fast so far : 8000 in 2 months for 20+ participants. So It could be better for them to work on the 30,000 most frequent words first, split by letters, reduced by your script. Then if they as us for more rerun the bash command to get the 50,000 most frequent. I would prefer the Marathi community to find some local dev resources to handle this list creation, but maybe we will have to carry it to the finish line. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 22:07, 5 March 2021 (UTC) | ||
+ | :On a discrete side, I started to map the maintenance functions Lili will need as specific js files to codes. For now, only one is coded. See [https://github.com/hugolpz/Dragons_Bot#current-script Dragons bot]. The gain here would be to have a maintenance bot dedicated to Lingualibre's tasks. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 22:43, 5 March 2021 (UTC) | ||
+ | ::Given the largest letters and calm contributions of the group I assumed at one point they will have to do relays on a same letter. But if we assume consistency, then no need for updates indeed. Thanks for seeing that angles. I will upload their lists. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 11:45, 6 March 2021 (UTC) | ||
+ | |||
+ | == Lists == | ||
+ | {{GT|Olaf bonjour, je pense que tu devrais aimer ce modèle : {{tl|Wikt-list}} ;).}} [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 18:57, 5 March 2021 (UTC) | ||
+ | :Olaf bonjour, I'am creating this {{tl|Recommended lists}} template. The template will be added to 1000 [Category:Speakers in {iso639-3}] to create awareness about best available `paths` to speakers. If we continue to put {#babel:records-mar} on userpages, then these users will be categorized and have a gentle link to [Category:Speakers in {iso639-3}] where they can be reminded of the best paths. I sorted the list by "easiness", first the frequency list of 200, 800 items. Then the wiktionary list of 1000. The lists show up only if they indeed exist. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 11:35, 6 March 2021 (UTC) | ||
+ | |||
+ | == common.js == | ||
+ | |||
+ | Hi Olaf, I saw you have created [[User:Olaf/common.js]]. For your information, common.js (and common.css) of users are not enabled for now (see [[phab:T273709|T273709]]) so it should not work for you. [[User:Pamputt|Pamputt]] ([[User talk:Pamputt|talk]]) 12:05, 9 March 2021 (UTC) | ||
+ | :Hi Olaf. Fyi, I split the ratelimit issue [https://lingualibre.org/index.php?title=LinguaLibre:Technical_board&diff=479546&oldid=479541 in two tickets]. Cheers ! [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 23:25, 9 March 2021 (UTC) | ||
+ | ::Your common.js should now works :D [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 22:25, 25 April 2021 (UTC) | ||
+ | |||
+ | == Suggestion for OlafBot == | ||
+ | |||
+ | Hi, | ||
+ | |||
+ | I love the lists of 380 missing words. I use them in French and Esperanto. However, I have a suggestion. There is currently a problem with the record wizard which I exposed [[LinguaLibre:Technical_board#Remove_duplicates_bug|here]]. Is it possible to not include words with capital letters? It would be way more effective. For example, as I had the problem before, I always skip words with capital letters. | ||
+ | |||
+ | Thanks, [[User:Lepticed7|Lepticed7]] ([[User talk:Lepticed7|talk]]) 16:47, 21 May 2021 (UTC) | ||
+ | |||
+ | == Problem on your 380 words list in french == | ||
+ | |||
+ | Hello ! First of all, I thank you very well for the creation of these 380 words lists. It’s very helpfull for the projet ! | ||
+ | |||
+ | Most part of time that I use it in french there is no problem. But actually, there is a world (banket) who are not french (just old french - not the same language). Why this world is on the list and how I can put out it of the list ? Because, if I don’t record it, the word appear on the new list along that he don’t have a recording… | ||
+ | |||
+ | Can you help ? [[User:Lyokoï|Lyokoï]] ([[User talk:Lyokoï|talk]]) 09:59, 15 June 2021 (UTC) | ||
+ | :Thank for your response. I will tell you if I find anther similar problematic words. And I think an exclusion list by language is a good idea. [[User:Lyokoï|Lyokoï]] ([[User talk:Lyokoï|talk]]) 20:12, 15 June 2021 (UTC) | ||
+ | |||
+ | == Lemmas-without-audio-sorted-by-number-of-wiktionaries == | ||
+ | |||
+ | Hi {{ping|Olaf}} I hope you are doing well! I'm using your list quite often and I just realized that is not exactly a list of '''lemmas''', rather a list of '''forms'''. For instance, I find more and more verbal inflections in your list in French, which are useful to cover but not always a priority. Would it be possible for you to create a similar list, but dedicated to real '''lemmas''' (i.e. in French: singular for nouns, sg. masculine for adjectives, infinitive tense for verbs)? I don't know if this issue only happens to your list in French or also in other languages (we seem to have digged deeper with French lemmas haha). All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 10:21, 21 July 2021 (UTC) | ||
+ | |||
+ | == Translation of [[LinguaLibre:Explore the sound library]] == | ||
+ | |||
+ | Hello, | ||
+ | |||
+ | I modified the page [[LinguaLibre:Explore the sound library]] and its JavaScript so that all messages can now be translated directly on the page. I saw you translated it to polski, you can finish to translate the remaining messages if you want. | ||
+ | |||
+ | ~ [[User:Seb35|Seb35]] [[User talk:Seb35|<small>[^_^]</small>]] 09:16, 4 October 2021 (UTC) | ||
+ | |||
+ | == You put up a very good competition! :D == | ||
+ | |||
+ | Hey, I wanted to say that you put up a very good competition. :) I'd not have managed to upload over 10K recordings in a month that was almost otherwise over with a lot of pending work and personal time off. This was fun, especially the last minute rush. Happy New Year! :) | ||
+ | |||
+ | == French list == | ||
+ | |||
+ | Hello Olaf, I hope you are doing well. | ||
+ | |||
+ | I recorded a few hundred words from [[List:Fra/Lemmas-without-audio-sorted-by-number-of-wiktionaries|your Wiktionary-based French list]] yesterday, and I have noticed that your bot did not update it, even after an entire day. Have you changed the frequency of updates on the Wiktionary lists, or may there be an issue with the bot? | ||
+ | |||
+ | All the best — '''[[User:WikiLucas00|WikiLucas]]''' [[User talk:WikiLucas00|(🖋️)]] 22:44, 4 March 2022 (UTC) |
Latest revision as of 22:44, 4 March 2022
Welcome
Lingua Libre is a project which aims to build a collaborative multilingual audiovisual corpus under free licence in order to expand knowledge about languages and help online language communities to develop.
You can help us!
You can visit this page if you want to learn more about the project.
You can create your User page by clicking on it. We recommend that you integrate the babel template onto it. It is very useful to indicate to others the languages you speak, and to facilitate finding other persons speaking your languages. If you are not familiar with wikicode, you can go to this demo User page, read the instructions and copy the prepared babel template, and then paste it onto your user page before adapting it to your information. You can then publish your User page!
- Follow the steps of the Record Wizard
- Think of the words you want to record. You may enter them one by one (live list), use an existing category from a Wiktionary or Wikipedia project, or create your own list.
You can visit this help page where you will find advice for beginning your contributions on Lingua Libre. If you did not find the answer to your question, please ask it in the Chat room.
- Try to avoid background noises during the recording
- Please listen to the pronunciations before uploading them
- Consider using an external microphone
Need more ?
Hello Olaf, your current userrights on Commons limits you to 380 recordings per 72mins. We can upgrade you rapidly via a request on Commons:Requests for rights. Interested ? Yug (talk) 23:07, 17 February 2021 (UTC)
- Hoy, I see you are also a bot master. We actually have a bot who injects LL's audio into Wikimedia's wikis, but not all wikis localizations (languages) are authorized. We look for help on this side. But if you have a bot, I think we could also be interested to check it a bit too. Yug (talk) 01:55, 18 February 2021 (UTC)
Bot
Olafbot got the bot status from Pamputt. I moved our conversation to LinguaLibre:Technical board, I didn't answer there to leave this opportunity to others (you may want to ping Poslovitch and VIGNERON on that one). But I think we get a good filter tool in line with your vision via either per-language commons category or/and sparql queries. The implemetation is to design, and they will see details better than me, but it's sounds possible to me. Anyway, let's see on the T-board how does the conversations goes ! Yug (talk) 12:24, 25 February 2021 (UTC)
- As for your question about other non-Lili audios, I think we should ignore them. Lili has the firepower and methodology to roll on them, replace them, and we should do so. Yug (talk) 13:34, 26 February 2021 (UTC)
- Olaf hello,
- Is your bot simple or a large multitask beast ? I would be interested to add its code to our github.com/lingua-libre'. I'am also personally interested to edit UNILEX's lists and talkpages programatically. So I'am looking for code samples, NodeJS first.... but Python seems accessible as well. Yug (talk) 11:44, 1 March 2021 (UTC)
OlafBot
Hello Olaf, I'am pausing few days before running my Dragons Bot to import UNILEX's lists. Doing so I wrote down the rational, can you review it and edit to add the number of lists you created ? I believe its 72. Yug (talk) 14:43, 3 March 2021 (UTC)
- Olaf, did you take a look to my Marathi bash code on the chatroom. While I think unilex lists should be constant, as a methodical 101 straight path, I think the Marathi "by letter" requests may gain from your bot. If I understand well your bot is the only current way we have to avoid a recorder B to redo a word already done by a contributor A. I also see the Marathi community is not that fast so far : 8000 in 2 months for 20+ participants. So It could be better for them to work on the 30,000 most frequent words first, split by letters, reduced by your script. Then if they as us for more rerun the bash command to get the 50,000 most frequent. I would prefer the Marathi community to find some local dev resources to handle this list creation, but maybe we will have to carry it to the finish line. Yug (talk) 22:07, 5 March 2021 (UTC)
- On a discrete side, I started to map the maintenance functions Lili will need as specific js files to codes. For now, only one is coded. See Dragons bot. The gain here would be to have a maintenance bot dedicated to Lingualibre's tasks. Yug (talk) 22:43, 5 March 2021 (UTC)
Lists
Olaf bonjour, je pense que tu devrais aimer ce modèle : {{Wikt-list}} ;). (🗨️ translate) Yug (talk) 18:57, 5 March 2021 (UTC)
- Olaf bonjour, I'am creating this {{Recommended lists}} template. The template will be added to 1000 [Category:Speakers in {iso639-3}] to create awareness about best available `paths` to speakers. If we continue to put {#babel:records-mar} on userpages, then these users will be categorized and have a gentle link to [Category:Speakers in {iso639-3}] where they can be reminded of the best paths. I sorted the list by "easiness", first the frequency list of 200, 800 items. Then the wiktionary list of 1000. The lists show up only if they indeed exist. Yug (talk) 11:35, 6 March 2021 (UTC)
common.js
Hi Olaf, I saw you have created User:Olaf/common.js. For your information, common.js (and common.css) of users are not enabled for now (see T273709) so it should not work for you. Pamputt (talk) 12:05, 9 March 2021 (UTC)
- Hi Olaf. Fyi, I split the ratelimit issue in two tickets. Cheers ! Yug (talk) 23:25, 9 March 2021 (UTC)
Suggestion for OlafBot
Hi,
I love the lists of 380 missing words. I use them in French and Esperanto. However, I have a suggestion. There is currently a problem with the record wizard which I exposed here. Is it possible to not include words with capital letters? It would be way more effective. For example, as I had the problem before, I always skip words with capital letters.
Thanks, Lepticed7 (talk) 16:47, 21 May 2021 (UTC)
Problem on your 380 words list in french
Hello ! First of all, I thank you very well for the creation of these 380 words lists. It’s very helpfull for the projet !
Most part of time that I use it in french there is no problem. But actually, there is a world (banket) who are not french (just old french - not the same language). Why this world is on the list and how I can put out it of the list ? Because, if I don’t record it, the word appear on the new list along that he don’t have a recording…
Can you help ? Lyokoï (talk) 09:59, 15 June 2021 (UTC)
- Thank for your response. I will tell you if I find anther similar problematic words. And I think an exclusion list by language is a good idea. Lyokoï (talk) 20:12, 15 June 2021 (UTC)
Lemmas-without-audio-sorted-by-number-of-wiktionaries
Hi @Olaf I hope you are doing well! I'm using your list quite often and I just realized that is not exactly a list of lemmas, rather a list of forms. For instance, I find more and more verbal inflections in your list in French, which are useful to cover but not always a priority. Would it be possible for you to create a similar list, but dedicated to real lemmas (i.e. in French: singular for nouns, sg. masculine for adjectives, infinitive tense for verbs)? I don't know if this issue only happens to your list in French or also in other languages (we seem to have digged deeper with French lemmas haha). All the best — WikiLucas (🖋️) 10:21, 21 July 2021 (UTC)
Translation of LinguaLibre:Explore the sound library
Hello,
I modified the page LinguaLibre:Explore the sound library and its JavaScript so that all messages can now be translated directly on the page. I saw you translated it to polski, you can finish to translate the remaining messages if you want.
~ Seb35 [^_^] 09:16, 4 October 2021 (UTC)
You put up a very good competition! :D
Hey, I wanted to say that you put up a very good competition. :) I'd not have managed to upload over 10K recordings in a month that was almost otherwise over with a lot of pending work and personal time off. This was fun, especially the last minute rush. Happy New Year! :)
French list
Hello Olaf, I hope you are doing well.
I recorded a few hundred words from your Wiktionary-based French list yesterday, and I have noticed that your bot did not update it, even after an entire day. Have you changed the frequency of updates on the Wiktionary lists, or may there be an issue with the bot?