Difference between revisions of "Homographs"

Latest revision as of 09:56, 24 September 2024

Euler diagram showing the relationships between heteronyms and related linguistic concepts.

Rules

If one pronunciation is clearly the norm, no suffix is needed.
For equal rank or rare pronunciations, add to that word a suffix within brackets, example:
# word (suffix).
This suffix should hint at the difference between two homographs or more.
The suffix must be consistent and stable, e.g. if you start with (noun), (verb), keep that exact convention for all your recordings. If you start with a transcription, keep on that transcription. Etc.
The suffix is in the same language as the word, e.g. red (noun), အနီရောင် (နာမ်).
Abbreviated suffixes should be avoided. Prefer full suffix adjective, verb, noun, casual, formal, ...

Homographs homophones

Given one language and one speaker, one recording for them all. Even if meaning or role (part of speech) diverge.

Homographs non-homophones

The following are homographs non-homophones (heterophone), the part between brackets is not read aloud in LinguaLibre but is used to distinguish those recordings' files.

Distinction via semantic synonyms. In English :

# crooked (injured), pronounced and recorded `crookaid` /ˈkrʊkɪd/
# crooked (corrupt), pronounced and recorded `crookt` /ˈkrʊkt/
# axes (weapon), pronounced and recorded `akses` /?/ <------- Example to review by English native speaker
# axes (geometry), pronounced and recorded `aksis` /ˈæk.sɪs/ <------- Example to review by English native speaker

Distinction via pronunciation. In Mandarin Chinese, using toned Hanyu pinyin:

# 雨 (yǔ), noun, pronounced and recorded `/yː3/`
# 雨 (yù), verb, pronounced and recorded `/y:4/`

Distinction via the part of speech. In French :

# excellent (verb), pronounced and recorded `excel` /ɛk.sɛl/
# excellent (adjective), pronounced and recorded `excellant` /ɛk.sɛ.lɑ̃/

Distinction via pronunciation. In English, using IPA:

# crooked (/ˈkrʊkɪd/), pronounced and recorded `crookaid` /ˈkrʊkɪd/
# crooked (/ˈkrʊkt/), pronounced and recorded `crookt` /ˈkrʊkt/

Distinction via cultural dimension, depending on the public (hierarchy, age, seniority). In Japanese :

# 昨日 (default), pronounced and recorded `きのう` (kinō)
# 昨日 (polite), pronounced and recorded `さくじつ` (sakujitsu)
# 明日 (default), pronounced and recorded `あした` (ashita)
# 明日 (polite), pronounced and recorded `あす` (asu), `みょうにち` (myōnichi)
# 私 (default), pronounced and recorded `わたし` (watashi)
# 私 (polite), pronounced and recorded `わたくし` (watakushi)

In practice

Within your list such as List:mnw/Commons, transform :

# ကစေံ1
# ကစေံ2
# ကစေံ3
# ကစေံ4

into

# ကစေံ (read)
# ကစေံ (speak)
# ကစေံ (Tang)
# ကစေံ (Te)

You can now record your words, without reading the suffix.

Technical details

The suffix is not part of the word and is stored with the property qualifier (P18) in the Wikibase. See fils (enfant) (Q1686) and fils (pluriel de fil) (Q1685) for example. It is then possible to query recordings without mixing words and suffixes.

Lingua Libre Help pages
General help pages	Help:Interface • Help:Your first record • Help:Choosing a microphone • Help:Configure your microphone • Help:Translate • Help:Langtags • LinguaLibre:Language codes systems used across LinguaLibre • LinguaLibre:List of languages
Linguistic help pages	Help:Add a new language • Help:Homographs • Help:List translation • Help:Ethics
Lists help pages	Help:Create your own lists • Help:How to create a frequency list? • Help:Why wordlists matter? • Help:Swadesh lists • Help:Lists • Help:Create a new generator
Events, Outreach	Lingualibre:Events • Lingualibre:Roles • Lingualibre:Workshops • Lingualibre:Hackathon • Lingualibre:Interested communities • Lingualibre:Events/2022 Public Relations Campaign • Lingualibre:Mailing • Lingualibre:Jargon • Lingualibre:Apps • Lingualibre:Citations • Service civique 2022-2023
Strategy	Lingualibre 2022 Review (including outreach) • 2022-2023 Lingualibre wishlist • {{Wikimedia Language Diversity/Projects}} • Speakers map • Voices gender • Stats • Lingua Libre SignIt/2022 report • {{Grants}}

@@ Line 6: / Line 6: @@
 # For equal rank or rare pronunciations, add to that word a suffix within brackets, example:<br><code># word (suffix)</code>.
 # This suffix should hint at the difference between two homographs or more.
-# The suffix must be consistent and stable, ex: if you start with <code>(noun)</code>, <code>(verb)</code>, keep that exact convention <u>for all</u> your recordings. If you start with a transcription, keep on that transcription. Etc.
+# The suffix must be consistent and stable, e.g. if you start with <code>(noun)</code>, <code>(verb)</code>, keep that exact convention <u>for all</u> your recordings. If you start with a transcription, keep on that transcription. Etc.
-# The suffix is in the same language as the word, ex : <code>red (noun)</code>, <code>အနီရောင် (နာမ်)</code>.
+# The suffix is in the same language as the word, e.g. <code>red (noun)</code>, <code>အနီရောင် (နာမ်)</code>.
-# Abbreviated suffixes should be avoided. Prefer full suffix <code>adjective</code>, <code>verb</code>, <code>noun</code>, <code>casual</code>, <code>formal</code>, …
+# Abbreviated suffixes should be avoided. Prefer full suffix <code>adjective</code>, <code>verb</code>, <code>noun</code>, <code>casual</code>, <code>formal</code>, ...
 == Homographs homophones ==
@@ Line 14: / Line 14: @@
 == Homographs non-homophones ==
-The following are homographs non-homophones, the part between brackets is not read aloud in LinguaLibre but is used to distinguish those recordings.
+The following are homographs non-homophones (heterophone), the part between brackets is not read aloud in LinguaLibre but is used to distinguish those recordings' files.
-Distinction via semantic synonyms :
+Distinction via semantic synonyms. In English :
 * <code># crooked (injured)</code>, pronounced and recorded `crookaid` /ˈkrʊkɪd/
 * <code># crooked (corrupt)</code>, pronounced and recorded `crookt` /ˈkrʊkt/
+* <code># axes (weapon)</code>, pronounced and recorded `akses` /?/  <------- Example to review by English native speaker
+* <code># axes (geometry)</code>, pronounced and recorded `aksis` /ˈæk.sɪs/  <------- Example to review by English native speaker
-Distinction via pronunciation, using toned [[:en:Hanyu pinyin|Hanyu pinyin]]:
+Distinction via pronunciation. In Mandarin Chinese, using toned [[:en:Hanyu pinyin|Hanyu pinyin]]:
-* <code># 雨 (yǚ)</code>, noun, pronounced and recorded `/yː3/`
+* <code># 雨 (yǔ)</code>, noun, pronounced and recorded `/yː3/`
 * <code># 雨 (yù)</code>, verb, pronounced and recorded `/y:4/`
-Distinction via the part of speech :
+Distinction via the part of speech. In French :
-* <code># excellent (verb)</code>, pronounced and recorded `excel`
+* <code># excellent (verb)</code>, pronounced and recorded `excel` /ɛk.sɛl/
-* <code># excellent (adjective)</code>, pronounced and recorded `excellant`
+* <code># excellent (adjective)</code>, pronounced and recorded `excellant` /ɛk.sɛ.lɑ̃/
-Distinction via pronunciation, using [[:en:IPA|IPA]]:
+Distinction via pronunciation. In English, using [[:en:IPA|IPA]]:
 * <code># crooked (/ˈkrʊkɪd/)</code>, pronounced and recorded `crookaid` /ˈkrʊkɪd/
 * <code># crooked (/ˈkrʊkt/)</code>, pronounced and recorded `crookt` /ˈkrʊkt/
-Distinction via cultural dimension :
+Distinction via cultural dimension, depending on the public (hierarchy, age, seniority). In Japanese :
-* <code># kon</code> (equals), pronounced and recorded `kon`
+* <code># 昨日 (default)</code>, pronounced and recorded `きのう` (kinō)
-* <code># kon (young to old)</code>, pronounced and recorded `kon-ee`
+* <code># 昨日 (polite)</code>, pronounced and recorded `さくじつ` (sakujitsu)
-* <code># kon (old to young)</code>, pronounced and recorded `ko-on`
+* <code># 明日 (default)</code>, pronounced and recorded `あした` (ashita)
+* <code># 明日 (polite)</code>, pronounced and recorded `あす` (asu), `みょうにち` (myōnichi)
-Distinction via language level, depending on the public (hierarchy, age, seniority) :
+* <code># 私 (default)</code>, pronounced and recorded `わたし` (watashi)
-* <code># 昨日</code> (neutral), pronounced and recorded `きのう`
+* <code># 私 (polite)</code>, pronounced and recorded `わたくし` (watakushi)
-* <code># 昨日 (polite)</code>, pronounced and recorded `さくじつ`
-* <code># 明日</code> (neutral), pronounced and recorded `あした `
-* <code># 明日 (polite)</code>, pronounced and recorded `あす`,`みょうにち`
 == In practice ==
@@ Line 65: / Line 64: @@
 == See also ==
 * [[Help:Lists]]
+* [[Help:List translation]]
-[[Category:Lingua Libre:Help]]
+{{Helps}}