LinguaLibre

Difference between revisions of "Events/2021 Wikimedia Wikimeet India"

< LinguaLibre:Events

 
(19 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
 +
[[File:Lingua_Libre_presentation_WMFr_for_the_Wikimedia_Meet_India_2021.pdf|thumb|Notes are available on [https://etherpad.wikimedia.org/p/wmwmd03s10 etherpad.wikimedia.org/p/wmwmd03s10]]]
 +
[[File:Lingua_Libre_-_record_your_language_-_Lingua_Libre_bot_and_recording_wizard.webm|thumb|Video]]
 
* '''Description''': Presentations on Lingualibre recording contribution, and on Github opensource assets.
 
* '''Description''': Presentations on Lingualibre recording contribution, and on Github opensource assets.
* '''Place''': online.
+
* '''Place''': Zoom > https://zoom.us/j/97614383815?pwd=OW9QazdjeVFaOTF1VXQxZ0U2ZkpCUT09
 
* '''Time''': 2021/02/19-21 > Sunday 21: 13h25 UTC/14h25 UTC+1.
 
* '''Time''': 2021/02/19-21 > Sunday 21: 13h25 UTC/14h25 UTC+1.
 
* '''Organisation''': Indian Wikipedia
 
* '''Organisation''': Indian Wikipedia
Line 6: Line 9:
 
*# [[:meta:Wikimedia_Wikimeet_India_2021/Submissions/Lingua_Libre_-_record_your_language|/Lingua_Libre_-_record_your_language]]
 
*# [[:meta:Wikimedia_Wikimeet_India_2021/Submissions/Lingua_Libre_-_record_your_language|/Lingua_Libre_-_record_your_language]]
 
*# [[:meta:Wikimedia_Wikimeet_India_2021/Submissions/Lingua_Libre_bot_and_recording_wizard|/Lingua_Libre_bot_and_recording_wizard]]
 
*# [[:meta:Wikimedia_Wikimeet_India_2021/Submissions/Lingua_Libre_bot_and_recording_wizard|/Lingua_Libre_bot_and_recording_wizard]]
* '''Participants:'''
+
* '''Participants:''' ~60 watchers, largely Indians leading contributors, movement leaders and developers.
 +
* '''Outcome:''' outreach successful, to continue.
 +
* '''Review & improvements:''' continue work on the presentation. Some proposals [[LinguaLibre:Events/2021 Wikimedia Wikimeet India#Post-event_review|here]].
 +
* '''Program:''' [[:meta:Wikimedia_Wikimeet_India_2021/Program]]
 
<noinclude>
 
<noinclude>
 
== Conference details ==
 
== Conference details ==
Line 15: Line 21:
 
# Other forms of communication: wmwm@cis-india.org (Indian Standard Time, UTC+0530 hours).
 
# Other forms of communication: wmwm@cis-india.org (Indian Standard Time, UTC+0530 hours).
  
== Lingualibre : why & how to contribute ==
+
== Presentation ==
 +
Presentation was mainly in 4 sections :
 +
* Lingualibre : why & what. By Adelaide.
 +
* Lingualibre : how to contribute / demonstration. By Adelaide.
 +
* Lingualibre Technical side : Generalities. By Yug.
 +
* Lingualibre Technical side : [[LinguaLibre:Bot]] roles and request process. By Poslovitch.
 +
* Q&A. By team.
  
== Technical presentations ==
+
'''Repositories'''
'''1. Status du Github : Need you to spread the word about language conservation'''
 
* maintenu propre
 
* 11 repositories
 
* Referees on nearly each repository
 
* JS, Python, NodeJS, PHP, MediaWiki modules,
 
* Welcome volunteer devs
 
'''2. Key repositories'''
 
 
{| class="wikitable" style="background:#FFFFDD"
 
{| class="wikitable" style="background:#FFFFDD"
 
!colspan=3|  <center>'''Github.com/Lingualibre/'''</center>
 
!colspan=3|  <center>'''Github.com/Lingualibre/'''</center>
Line 36: Line 41:
 
| LinguaRecorder  || JS, NodeJS                    || Js library controlling audio recordings
 
| LinguaRecorder  || JS, NodeJS                    || Js library controlling audio recordings
 
|-
 
|-
| QueryViz        || SparQL                        || Helps extract data, files, meaning from Lili
+
| SignIt          || JS, OOJS-UI, CSS, NodeJS      || Helps teach sign language
 
|-
 
|-
| SignIt          || JS, OOJS-UI, CSS, NodeJS      || Helps teach sign language
+
| QueryViz (not presented) || SparQL                        || Helps extract data, files, meaning from Lili
 
|-
 
|-
 
|colspan=3| <small>Notes:</small>
 
|colspan=3| <small>Notes:</small>
 
|}
 
|}
'''3. How to help LinguaLibre via tech'''
+
 
* Authorize LinguaLibre Bot on your wiki
+
== Post-event review ==
** Brief summary of the process.
+
Here are possible improvements.
* Tell your FOSS community about LinguaLibre
+
 
* ...
+
'''General part'''
'''4. Q&A'''
+
* Remind possible usages : Wikimedia Websites, Language preservation, Language learning, Natural language processing (text2speech, speech2text)
 +
* The basics: this is a "recording project". We currently focus on words, but we can think and expand to more orality and knowledge related services.
 +
'''Demonstration :'''
 +
* (Was so complete and systematic ! Merci Adélaide !)
 +
* by default, the limit is <s>380 every three days</s> 380 per 72mins. for newly created accounts and normal accounts. See [[LinguaLibre:User_rights#User_rights_on_LinguaLibre]] for details. But! 380/hour for 4 days old accounts and an easy process to raise this significantly.
 +
* When you change the "stop threshold" <code>5</code> to record sentences, I think (?) it's the volume level under which the audio will stop. So we have to move it '''down''' if you want longer audios.
 +
* The "silence length" should be the single element to change.
 +
'''Technical side :'''
 +
* Add lingualibre.org/datasets/ & talk about external usages
 +
* Remind possible usages : Wikimedia websites via bots, Language learning apps, Natural language processing (text2speech, speech2text).
 +
* Referees on nearly each repository
 +
* Welcome volunteer devs
 +
* "We think about more features and improvements that we can develop" (therefore we need devs). See Phabricator.
 +
* RecordWizard : a point on the Video recording variation.
 +
* Do we have a long term strategy. What is our development vision (it's redundant with Adelaide request to define a long term )
 +
 
 +
Also:
 +
* Improve calls for action and when we write/say them. "Come and try it out".
 +
* Visually thanks the community. (nothing in the slides)
 +
* A final slide with the Recording Wizard recording button's screenshot.
 +
 
 +
== Networking ==
 +
=== Content Translation Tool ===
 +
Lead by the WMF's Language team. See [[:mw:Wikimedia Language engineering]], [[:meta:Category:Language research]] & [[:meta:Research:Section_Translation_Design_Research]].
 +
* Hugo: Do Content Translations organizes IRL "Translathon" and collaboration with language universities ? Any feedbacks on such event ? what are the key element to kickstart such event ? -- Q by fr:Yug
 +
** Amir: We are a product development team: we design and development software features, and we don't organize events in our Wikimedia Foundation capacity. However, many chapters and individual community members organize such events, in many countries and for many languages. Some tips about organizing such events can be found at this link: https://meta.wikimedia.org/wiki/Best_practices_for_Content_Translation_events
 +
** Runa: We did a survey last year in partnership with an editathon event in India (https://outreachdashboard.wmflabs.org/courses/CT_University/Syberthon_2020_Indian_Languages/home) to understand how newer editors interacted with Content Translation. The report is here: https://meta.wikimedia.org/wiki/Research:Content_Translation_Newcomer_Survey,_India_2020 .
  
 
</noinclude>
 
</noinclude>

Latest revision as of 21:46, 23 February 2021

Video

Conference details

  1. Schedule:meta:Wikimedia_Wikimeet_India_2021/Program
  2. Place: Zoom.us with pre-conference trial session.
  3. Checklist: meta:Wikimedia_Wikimeet_India_2021/Checklist
  4. Telegram group for presenters: https://t.me/joinchat/VJ78lQaYYQTDK_Dh
  5. Other forms of communication: wmwm@cis-india.org (Indian Standard Time, UTC+0530 hours).

Presentation

Presentation was mainly in 4 sections :

  • Lingualibre : why & what. By Adelaide.
  • Lingualibre : how to contribute / demonstration. By Adelaide.
  • Lingualibre Technical side : Generalities. By Yug.
  • Lingualibre Technical side : LinguaLibre:Bot roles and request process. By Poslovitch.
  • Q&A. By team.

Repositories

Github.com/Lingualibre/
Repository Technologies (Stack) Definition and impact
Lingua-Libre-Bot Python, autorization Spread 400,000 audios in your wiki
RecordWizard JS, VueJS, CSS, PHP MW-module Mediawiki UI module to record audios/video (sign language)
LinguaRecorder JS, NodeJS Js library controlling audio recordings
SignIt JS, OOJS-UI, CSS, NodeJS Helps teach sign language
QueryViz (not presented) SparQL Helps extract data, files, meaning from Lili
Notes:

Post-event review

Here are possible improvements.

General part

  • Remind possible usages : Wikimedia Websites, Language preservation, Language learning, Natural language processing (text2speech, speech2text)
  • The basics: this is a "recording project". We currently focus on words, but we can think and expand to more orality and knowledge related services.

Demonstration :

  • (Was so complete and systematic ! Merci Adélaide !)
  • by default, the limit is 380 every three days 380 per 72mins. for newly created accounts and normal accounts. See LinguaLibre:User_rights#User_rights_on_LinguaLibre for details. But! 380/hour for 4 days old accounts and an easy process to raise this significantly.
  • When you change the "stop threshold" 5 to record sentences, I think (?) it's the volume level under which the audio will stop. So we have to move it down if you want longer audios.
  • The "silence length" should be the single element to change.

Technical side :

  • Add lingualibre.org/datasets/ & talk about external usages
  • Remind possible usages : Wikimedia websites via bots, Language learning apps, Natural language processing (text2speech, speech2text).
  • Referees on nearly each repository
  • Welcome volunteer devs
  • "We think about more features and improvements that we can develop" (therefore we need devs). See Phabricator.
  • RecordWizard : a point on the Video recording variation.
  • Do we have a long term strategy. What is our development vision (it's redundant with Adelaide request to define a long term )

Also:

  • Improve calls for action and when we write/say them. "Come and try it out".
  • Visually thanks the community. (nothing in the slides)
  • A final slide with the Recording Wizard recording button's screenshot.

Networking

Content Translation Tool

Lead by the WMF's Language team. See mw:Wikimedia Language engineering, meta:Category:Language research & meta:Research:Section_Translation_Design_Research.