LinguaLibre
Difference between revisions of "Events/2021 Wikimedia Wikimeet India"
< LinguaLibre:Events
Line 50: | Line 50: | ||
== Review and improvements == | == Review and improvements == | ||
− | [[File:Lingua_Libre_presentation_WMFr_for_the_Wikimedia_Meet_India_2021.pdf|thumb]] | + | |
+ | [[File:Lingua_Libre_presentation_WMFr_for_the_Wikimedia_Meet_India_2021.pdf|thumb|Notes are available on [https://etherpad.wikimedia.org/p/wmwmd03s10 etherpad.wikimedia.org/p/wmwmd03s10]]] | ||
'''General part''' | '''General part''' | ||
* Remind possible usages : Wikimedia Websites, Language preservation, Language learning, Natural language processing (text2speech, speech2text) | * Remind possible usages : Wikimedia Websites, Language preservation, Language learning, Natural language processing (text2speech, speech2text) | ||
Line 56: | Line 57: | ||
'''Demonstration :''' | '''Demonstration :''' | ||
* (Was so complete and systematic ! Merci Adélaide !) | * (Was so complete and systematic ! Merci Adélaide !) | ||
+ | * by default, the limit is 380 every three days '''for newly created accounts'''. See [[LinguaLibre:Userrights]] for details. But! 380/hour for 4 days old accounts and an easy process to raise this significantly. | ||
+ | * When you change the "stop threshold" <code>5</code> to record sentences, you have to move it '''down''' if you want longer audios. | ||
+ | * But mainly, the "silence length" should be the single element to change. | ||
'''Technical side :''' | '''Technical side :''' | ||
* Add lingualibre.org/datasets/ & talk about external usages | * Add lingualibre.org/datasets/ & talk about external usages | ||
+ | * Remind possible usages : Wikimedia websites via bots, Language learning apps, Natural language processing (text2speech, speech2text). | ||
* "We think about more features and improvements that we can develop" (therefore we need devs). See Phabricator. | * "We think about more features and improvements that we can develop" (therefore we need devs). See Phabricator. | ||
+ | * Do we have a long term strategy. What is our development vision (it's redundant with Adelaide request to define a long term ) | ||
+ | |||
+ | == Other interesting presentations == | ||
+ | === Content Translation Tool === | ||
+ | Lead by the WMF's Language team. See [[:mw:Wikimedia Language engineering]], [[:meta:Category:Language research]] & [[:meta:Research:Section_Translation_Design_Research]]. | ||
+ | * Hugo: Do Content Translations organizes IRL "Translathon" and collaboration with language universities ? Any feedbacks on such event ? what are the key element to kickstart such event ? -- Q by fr:Yug | ||
+ | ** Amir: We are a product development team: we design and development software features, and we don't organize events in our Wikimedia Foundation capacity. However, many chapters and individual community members organize such events, in many countries and for many languages. Some tips about organizing such events can be found at this link: https://meta.wikimedia.org/wiki/Best_practices_for_Content_Translation_events | ||
+ | ** Runa: We did a survey last year in partnership with an editathon event in India (https://outreachdashboard.wmflabs.org/courses/CT_University/Syberthon_2020_Indian_Languages/home) to understand how newer editors interacted with Content Translation. The report is here: https://meta.wikimedia.org/wiki/Research:Content_Translation_Newcomer_Survey,_India_2020 . | ||
+ | |||
</noinclude> | </noinclude> |
Revision as of 16:10, 21 February 2021
- Description: Presentations on Lingualibre recording contribution, and on Github opensource assets.
- Place: online.
- Time: 2021/02/19-21 > Sunday 21: 13h25 UTC/14h25 UTC+1.
- Organisation: Indian Wikipedia
- Presenters: User:Adélaïde Calais WMFr, user:Poslovitch, User:Yug.
- Participants:
Conference details
- Schedule:meta:Wikimedia_Wikimeet_India_2021/Program
- Place: Zoom.us with pre-conference trial session.
- Checklist: meta:Wikimedia_Wikimeet_India_2021/Checklist
- Telegram group for presenters: https://t.me/joinchat/VJ78lQaYYQTDK_Dh
- Other forms of communication: wmwm@cis-india.org (Indian Standard Time, UTC+0530 hours).
Lingualibre : why & how to contribute
Technical presentations
1. Status du Github : Need you to spread the word about language conservation
- maintenu propre
- 11 repositories
- Referees on nearly each repository
- JS, Python, NodeJS, PHP, MediaWiki modules,
- Welcome volunteer devs
2. Key repositories
Repository | Technologies (Stack) | Definition and impact |
---|---|---|
Lingua-Libre-Bot | Python, autorization | Spread 400,000 audios in your wiki |
RecordWizard | JS, VueJS, CSS, PHP MW-module | Mediawiki UI module to record audios/video (sign language) |
LinguaRecorder | JS, NodeJS | Js library controlling audio recordings |
QueryViz | SparQL | Helps extract data, files, meaning from Lili |
SignIt | JS, OOJS-UI, CSS, NodeJS | Helps teach sign language |
Notes: |
3. How to help LinguaLibre via tech
- Authorize LinguaLibre Bot on your wiki
- Brief summary of the process.
- Tell your FOSS community about LinguaLibre
- ...
4. Q&A
Review and improvements
General part
- Remind possible usages : Wikimedia Websites, Language preservation, Language learning, Natural language processing (text2speech, speech2text)
- The basics: this is a "recording project". We currently focus on words, but we can think and expand to more orality and knowledge related services.
Demonstration :
- (Was so complete and systematic ! Merci Adélaide !)
- by default, the limit is 380 every three days for newly created accounts. See LinguaLibre:Userrights for details. But! 380/hour for 4 days old accounts and an easy process to raise this significantly.
- When you change the "stop threshold"
5
to record sentences, you have to move it down if you want longer audios. - But mainly, the "silence length" should be the single element to change.
Technical side :
- Add lingualibre.org/datasets/ & talk about external usages
- Remind possible usages : Wikimedia websites via bots, Language learning apps, Natural language processing (text2speech, speech2text).
- "We think about more features and improvements that we can develop" (therefore we need devs). See Phabricator.
- Do we have a long term strategy. What is our development vision (it's redundant with Adelaide request to define a long term )
Other interesting presentations
Content Translation Tool
Lead by the WMF's Language team. See mw:Wikimedia Language engineering, meta:Category:Language research & meta:Research:Section_Translation_Design_Research.
- Hugo: Do Content Translations organizes IRL "Translathon" and collaboration with language universities ? Any feedbacks on such event ? what are the key element to kickstart such event ? -- Q by fr:Yug
- Amir: We are a product development team: we design and development software features, and we don't organize events in our Wikimedia Foundation capacity. However, many chapters and individual community members organize such events, in many countries and for many languages. Some tips about organizing such events can be found at this link: https://meta.wikimedia.org/wiki/Best_practices_for_Content_Translation_events
- Runa: We did a survey last year in partnership with an editathon event in India (https://outreachdashboard.wmflabs.org/courses/CT_University/Syberthon_2020_Indian_Languages/home) to understand how newer editors interacted with Content Translation. The report is here: https://meta.wikimedia.org/wiki/Research:Content_Translation_Newcomer_Survey,_India_2020 .