LinguaLibre

Workshops

LinguaLibre's workshops gathers various kind of real-life and online events promoting the creation of audios via LinguaLibre rapid recording, by demoing it, mentoring hand-on usage of Special:RecordWizard Studio or via other ways. This page introduces and provides resources for each type of those workshops. Both conceptual and practical resources will be covered.


Draft
Twemoji12 1f3d7.svg
Twemoji12 1f3d7.svg

This page is a work in progress.

Rapid demo

User Interface and teaching principles.

On site workshop

Example: 2021 INALCO.

Online workshop

Example: ContribuLing 2021.
Workshop at ContribuLing 2021 (in French)

Personal challenge

"One rooster does not weave a morning,
he will always need the other roosters…"

—— João Cabral de Melo Neto - Weaving the Morning, Brazilian poem cited by Marreromarco.

User:Titodutta early 2021's 50,000 words challenge was a personal challenge lead by Bengali speaker Titodutta to record 50,000 words in his native language, Bengali. Titodutta contributed nearly daily for 2 months, fast catching up and passing over the most productive speakers. His effort alone rose Bengali among the top 2 languages of LinguaLibre.

User:Marreromarco's year long challenge project is a personal goal to record 400 words daily in his “local” variety of Spanish. After 365 days, we will have 146,000 Spanish words on Lingua Libre ! There are ~22 million learners of Spanish that could benefit from this "Personal" Challenge on Lingua Libre!

Sponsorship

Example: Cantonese 2020.

A private citizen paying an audio-voice professional to records a list of words in his language is also a de facto option. Wikimedia cannot, by rule, pay for content. But given the high productivity of Lingualibre's studio (Special:RecordWizard), it becomes affordable, efficient, and profitable for private citizen or institutions to lead such operation, hire an audio professional who will mass record. A first experiment was made with Cantonese. A modest 300€ lead to the creation of 5,000+ audios files within 3 weeks. These 5,000 words make up around 95% of daily Cantonese conversations. Cantonese can therefore be considered as largely covered, on the teaching side at least. For a more exhaustive coverage, 30,000 would be a better target.

This approach is especially needed and promising for rare languages, where volunteer contributors are and will stay hard to find. Sometimes, sponsoring contribution could boost a language. With such low amounts involved, a private or institutional donor can likely be found.

Editathon

Example: n.a.

Editathons aim to gather and synchronize a group of interested users into editing Lingualibre's content pages. It could be to improve our documentations (Help:*), project space (LinguaLibre:*), create or curate words lists for your language (List:*/*), communication materials, prepare a funding request (Grants table) or any one full day or more sprint organized by the community.

Hackathon

Lingualibre:Hackathon
Example: 2018 at Wikimedia France's Paris headquarter.

Hackathon aim to gather and synchronize a group of interested users into taking over Lingualibre's codes (see github.com/lingua-libre) and features/bugs tickets (phabricator:tag/lingua_libre).

Skills we are looking for are mainly organizational skills for events management, then JS and VueJS (web developments) ; PHP (mediawiki) ; Python (bots).

Field linguist workshops

Example 2022 LinguaLibre-Surui workshop.

Field linguists are the people who study a language by going into the target community and describing real world usages. By extension, in the context of LinguaLibre, field workshops are all efforts where a LinguaLibrist actively contact a target community to build lexicographic and audio-lexicographic content with them. The volunteer(s) takes a wide range of roles such as:

  • advocate for open content
  • ambassador for lingualibre
  • coordinator
  • guide for lexical list management : picking the right list or list translation
  • event organizer
  • Lingualibre app trainer
  • List review and clean up
  • translator between LinguaLibre's English resources and local community's language
  • traveler

This global mission will likely include IRL travels, for which some refunds options are available, see Template:Grants table and Wikimedia Micro-fi. One such 2022 training session in Paris had a budget of 1,000€ for one trainer and one speaker meeting up from 1,000km away and working on Surui language. In this case, both the trainer from Bayonna and the speaker from Brazil met in an intermediate city, Paris. A better heuristic could be pre-arranged field linguistic journeys, one week long and well pre-planned, where a LinguaLibre volunteer-trainer visit the target community and ready-to-contribute locals to constitute within a week a 5,000+ words dictionary on a 1,000€ budget.

Outreach workshops

Existing document may be reprinted and sent to various local groups.
Example: n.a. See also :Commons:Category:Lingua Libre outreach.

A vastly overlooked aspect of Lingualibre is the networking and communication side. Following April 2021 site-wide upgrade, LinguaLibre's rapid-recording-as-a-service is technically stable, mature, and satisfying. The real need is in letting the outside world know how the recording studio makes rapidly and clean audio recording possible and easy. Early outreach has been limited to French Wikimedians and as since expanded toward European and Indic Wikimedians. These Wikimedia groups returns are unbalanced by geography and gender.

  1. On demographic, the simplicity of the recording studio actually make contributing to LinguaLibre accessible to all public, 7-99 years old. Outreach needs and opportunities are toward older, younger, and more feminine groups.
  2. On geography, an effort is underway to translate the recording studio, but similarly, outreach to local communities abroad is missing.
  3. On institutional side, various existing community have been working to document the diversity of languages for years already. These volunteer or academic communities have better field experience, trust, contacts with diverse languages. On these sides we can cite the Endangered Language Alliances of New York City (elalliance.org), the National Institute of Languages and Civilisations in Paris (100+ languages), the School of Oriental and Asian Studies in London (~35 languages), the UNESCO, etc.

Each of these groups has various subgroups, not aware of how « LinguaLibre makes rapid, clean, mass recording possible ». There are opportunities to gently ping these communities via multiples entries points, assess these communities actual appetite for rapid recording, send them flyers ? The field is actually quite new, with custom and hybrid solutions to find.

Funding editathon

Example: Wikimédia France's Microfi. See also LinguaLibre:Grants.

Funding editathons are sprints aiming to apply to available fundings in order to fund medium to heavy projects, which needs more than a regular volunteer and segmented small pushes to be properly achieve. It first requires defining your project and potential funding avenue. Define your project (1), scooting for available funding (2), deep reading of the required application format (3) and mostly, the writing of a solid and decent project draft can be the focus of a one or two days editathon. If involving several contributors IRL, such event allows community building and various strategic discussions. Also, reasonable transportation, food, and venue costs can be funded by WMFR or WMF's Rapid Funds as well.

Funded project's phases: Phases 1 to 4 are suitable for editathons.

  1. Define: core project design with problem to solve, actions to lead (objectives), type (community, research, technology), actors and budget.
  2. Orientation: identify the relevant funding program to apply to ?
  3. Format: read their requirements, deadlines, application format ; identify a previous application able to best serve as model.
  4. Write: write down a solid project draft.
  5. Iterate and polish.
  6. Apply: submit and follow process
  7. Work
  8. Report
  9. Close

Major funding avenues: The table below is an indicative summary, for authoritative values, please check each programs' pages. To clarify if your idea fits your target Fund's requirement and rules, please read their documentation and engage with their team. Grants can be allocated to organizing forces: individuals, groups or institutions.

Program Budget Next deadline Delay Scope: who & what Application length Comments
WMFR Rapid Grant "Microfi" Indiv: <2,000 EUR
Group: <5,000 EUR
Year round 2~4 weeks Hardware ; Books ; Organization ; Travels fees ; Food (?) 1/2 A4 page. Limited to French Wikimedians ? ; 2020-2021 budget is 50kEUR : ~20kEUR spent.
Wikimedia Community Fund Rapid Fund
(2022)
500 - 5K USD Year round 4 weeks - Individuals, groups, organizations
- Promoting knowledge equity in line with WM movement
- Underrepresented and marginalized communities
- Editathon ; Contest ; General promotion ; Video campaign ; Translation ; Others
- Transportation ; Food ; Venue, etc.
n.a.
Conference & Event Fund 10K - 90K USD Round 1: Sept.-Nov.
Round 2: Feb.-Mar.
60 days n.a. (likely: 2~4 pages A4)
General Support Fund 10K - 300K+ USD
(flexible)
Two round / year, per region.
* Group A: Central Europe & Asia ; N. Americas ; Latin Americas ; Middle East & Africa : Oct. ; Feb.
* Group B: Western Europe ; South. Asia (SAARC) ; East. Asia (ESEAP) : Nov. April.
60 days n.a. (likely: 2~4 pages A4)
Wikimedia Alliances Fund <100k USD September 8 or 9 or 29, 2022 60 days Activities: edit-a-thons, hack-a-thons, meet-ups, celebrations, campaigns. n.a. (likely: 2~4 pages A4)
Wikimedia Research & Technology Fund 2k-50k USD 3 January 2023 ? 6 months - Individuals, groups, organizations
- Research interests on Wikimedia projects
- Define clear road/abstract to further understanding of WM's project or community
- Define budget
n.a. (likely: 2~4 pages A4) Research fund: open.
Technology fund: early 2022.
mw:Google Summer of Code/2022 12 weeks dev ca. 1st Feb., 2023 ? ? ? n.a.
Notes: 1) Covid19 pandemic and Wikimedia awareness requires small or carefully assessed IRL events. 2) More spending items are covered but wee need to ask questions and increase our understanding for specific cases relevant to us. 3) n.a. : need to find this information.


See also