LinguaLibre talk

Difference between revisions of "Citations"

Line 1: Line 1:
 
== Massively hyper lingual projects ==
 
== Massively hyper lingual projects ==
Avec plaisir! Here are the papers:
+
=== Network ===
 +
* [https://www.connectedpapers.com/main/767dcc48c7ad2c943f3c1a25c46b873e7b8b3bc8/Glot500%3A-Scaling-Multilingual-Corpora-and-Language-Models-to-500-Languages/graph Glot500]
 +
=== Alphabet ===
 
* https://research.google/pubs/pub47206/ for mining wordlists (Unilex-style) from 2,000+ languages
 
* https://research.google/pubs/pub47206/ for mining wordlists (Unilex-style) from 2,000+ languages
 
* https://research.google/pubs/pub46952/ cleaning them up; open-sourced in https://arxiv.org/abs/2103.15845
 
* https://research.google/pubs/pub46952/ cleaning them up; open-sourced in https://arxiv.org/abs/2103.15845
Line 8: Line 10:
 
* https://arxiv.org/abs/2305.13516 https://huggingface.co/spaces/mms-meta/MMS
 
* https://arxiv.org/abs/2305.13516 https://huggingface.co/spaces/mms-meta/MMS
 
[[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:29, 11 August 2023 (UTC)
 
[[User:Yug|Yug]] ([[User talk:Yug|talk]]) 09:29, 11 August 2023 (UTC)
 +
 +
=== Facebook ===
 +
* https://ai.meta.com/blog/multilingual-model-speech-recognition/ Introducing speech-to-text, text-to-speech, and more for 1,100+ languages
 +
* https://arxiv.org/abs/2305.13516 Scaling Speech Technology to 1,000+ Languages
 +
* https://arxiv.org/abs/2305.12182 Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
 +
 +
=== Others ===
 +
* https://www.semanticscholar.org/paper/e4aa101556fc5b238a88d99c07c1055fe3bc4764 Taxi1500: A Multilingual Dataset for Text Classification in 1500 Languages

Revision as of 11:51, 17 December 2023

Massively hyper lingual projects

Network

Alphabet

Yug (talk) 09:29, 11 August 2023 (UTC)

Facebook

Others