LinguaLibre

Difference between revisions of "Events/2022 LREC"


(~40 language communities contacted and informed. Contacts taken: see Etherpad. Needs follow-up email, see Lingualibre:Mailing.)
Line 8:
 
* '''Objectives:''' 1) Advocate for "Lingualibre: rapid audio recording tool for lexicons and more". 2) Investigate possible technical sponsorship or collaboration.
 
* '''Etherpad:''' https://etherpad.wikimedia.org/p/LREC
 
* '''Program:''' https://lrec2022.lrec-conf.org/en/workshops-and-tutorials/ws-tut-schedule/
+ * '''Program:'''
+ ** https://lrec2022.lrec-conf.org/en/workshops-and-tutorials/ws-tut-schedule/
 
* '''Report:''' See Etherpad or [[:meta:User:Yug/Marseille]].
 
 
* '''Outcome:''' ~40 language communities contacted and informed. Contacts taken: see Etherpad. Needs follow-up email, see [[Lingualibre:Mailing]].
 
* '''Lessons:''' Need more flyers, namecards (?), poster (?). Working as a duo was a good idea. Discussions should be kept shorter so as to reach more participants.
+
+ <noinclude>
+ With pleasure! Here are the papers:
+ - https://research.google/pubs/pub47206/ for mining wordlists (Unilex-style) from 2,000+ languages
+ - https://research.google/pubs/pub46952/ cleaning them up; open-sourced in https://arxiv.org/abs/2103.15845
+ - https://research.google/pubs/pub49814/ using these wordlists to find sentences using our web crawler
+ - https://research.google/pubs/pub50211/ cleaning up web-crawled text
+ - https://arxiv.org/abs/2205.03983 building machine translation systems from them; blog post https://ai.googleblog.com/2022/05/24-new-languages-google-translate.html
+ </noinclude>

Revision as of 14:51, 23 June 2022

The 2022 Language Resources and Evaluation Conference (LREC) is a major international academic and professional conference on language resources. The dominant topics are discourse evaluation and the management of abusive discourse, but the field is committed to expanding its reach in terms of language diversity. Orality remains marginal, which gives Lingualibre value and opportunities.

