LinguaLibre

Difference between revisions of "Events/2021 Wikimedia Wikimeet India"

< LinguaLibre:Events

Line 50: Line 50:
  
 
== Review and improvements ==
 
== Review and improvements ==
[[File:Lingua_Libre_presentation_WMFr_for_the_Wikimedia_Meet_India_2021.pdf|thumb]]
+
 
 +
[[File:Lingua_Libre_presentation_WMFr_for_the_Wikimedia_Meet_India_2021.pdf|thumb|Notes are available on [https://etherpad.wikimedia.org/p/wmwmd03s10 etherpad.wikimedia.org/p/wmwmd03s10]]]
 
'''General part'''
 
'''General part'''
 
* Remind possible usages : Wikimedia Websites, Language preservation, Language learning, Natural language processing (text2speech, speech2text)
 
* Remind possible usages : Wikimedia Websites, Language preservation, Language learning, Natural language processing (text2speech, speech2text)
Line 56: Line 57:
 
'''Demonstration :'''
 
'''Demonstration :'''
 
* (Was so complete and systematic ! Merci Adélaide !)
 
* (Was so complete and systematic ! Merci Adélaide !)
 +
* by default, the limit is 380 every three days '''for newly created accounts'''. See [[LinguaLibre:Userrights]] for details. But! 380/hour for 4 days old accounts and an easy process to raise this significantly.
 +
* When you change the "stop threshold" <code>5</code> to record sentences, you have to move it '''down''' if you want longer audios.
 +
* But mainly, the "silence length" should be the single element to change.
 
'''Technical side :'''
 
'''Technical side :'''
 
* Add lingualibre.org/datasets/ & talk about external usages
 
* Add lingualibre.org/datasets/ & talk about external usages
 +
* Remind possible usages : Wikimedia websites via bots, Language learning apps, Natural language processing (text2speech, speech2text).
 
* "We think about more features and improvements that we can develop" (therefore we need devs). See Phabricator.
 
* "We think about more features and improvements that we can develop" (therefore we need devs). See Phabricator.
 +
*  Do we have a long term strategy. What is our development vision (it's redundant with Adelaide request to define a long term )
 +
 +
== Other interesting presentations ==
 +
=== Content Translation Tool ===
 +
Lead by the WMF's Language team. See [[:mw:Wikimedia Language engineering]], [[:meta:Category:Language research]] & [[:meta:Research:Section_Translation_Design_Research]].
 +
* Hugo: Do Content Translations organizes IRL "Translathon" and collaboration with language universities ? Any feedbacks on such event ? what are the key element to kickstart such event ? -- Q by fr:Yug
 +
** Amir: We are a product development team: we design and development software features, and we don't organize events in our Wikimedia Foundation capacity. However, many chapters and individual community members organize such events, in many countries and for many languages. Some tips about organizing such events can be found at this link: https://meta.wikimedia.org/wiki/Best_practices_for_Content_Translation_events
 +
** Runa: We did a survey last year in partnership with an editathon event in India (https://outreachdashboard.wmflabs.org/courses/CT_University/Syberthon_2020_Indian_Languages/home) to understand how newer editors interacted with Content Translation. The report is here: https://meta.wikimedia.org/wiki/Research:Content_Translation_Newcomer_Survey,_India_2020 .
 +
 
</noinclude>
 
</noinclude>

Revision as of 16:10, 21 February 2021

Conference details

  1. Schedule:meta:Wikimedia_Wikimeet_India_2021/Program
  2. Place: Zoom.us with pre-conference trial session.
  3. Checklist: meta:Wikimedia_Wikimeet_India_2021/Checklist
  4. Telegram group for presenters: https://t.me/joinchat/VJ78lQaYYQTDK_Dh
  5. Other forms of communication: wmwm@cis-india.org (Indian Standard Time, UTC+0530 hours).

Lingualibre : why & how to contribute

Technical presentations

1. Status du Github : Need you to spread the word about language conservation

  • maintenu propre
  • 11 repositories
  • Referees on nearly each repository
  • JS, Python, NodeJS, PHP, MediaWiki modules,
  • Welcome volunteer devs

2. Key repositories

Github.com/Lingualibre/
Repository Technologies (Stack) Definition and impact
Lingua-Libre-Bot Python, autorization Spread 400,000 audios in your wiki
RecordWizard JS, VueJS, CSS, PHP MW-module Mediawiki UI module to record audios/video (sign language)
LinguaRecorder JS, NodeJS Js library controlling audio recordings
QueryViz SparQL Helps extract data, files, meaning from Lili
SignIt JS, OOJS-UI, CSS, NodeJS Helps teach sign language
Notes:

3. How to help LinguaLibre via tech

  • Authorize LinguaLibre Bot on your wiki
    • Brief summary of the process.
  • Tell your FOSS community about LinguaLibre
  • ...

4. Q&A

Review and improvements

General part

  • Remind possible usages : Wikimedia Websites, Language preservation, Language learning, Natural language processing (text2speech, speech2text)
  • The basics: this is a "recording project". We currently focus on words, but we can think and expand to more orality and knowledge related services.

Demonstration :

  • (Was so complete and systematic ! Merci Adélaide !)
  • by default, the limit is 380 every three days for newly created accounts. See LinguaLibre:Userrights for details. But! 380/hour for 4 days old accounts and an easy process to raise this significantly.
  • When you change the "stop threshold" 5 to record sentences, you have to move it down if you want longer audios.
  • But mainly, the "silence length" should be the single element to change.

Technical side :

  • Add lingualibre.org/datasets/ & talk about external usages
  • Remind possible usages : Wikimedia websites via bots, Language learning apps, Natural language processing (text2speech, speech2text).
  • "We think about more features and improvements that we can develop" (therefore we need devs). See Phabricator.
  • Do we have a long term strategy. What is our development vision (it's redundant with Adelaide request to define a long term )

Other interesting presentations

Content Translation Tool

Lead by the WMF's Language team. See mw:Wikimedia Language engineering, meta:Category:Language research & meta:Research:Section_Translation_Design_Research.