Ukrainian MFA dictionary v3.0.0

  • Maintainer: Montreal Forced Aligner

  • Language: Ukrainian

  • Dialect: N/A

  • Phone set: MFA

  • Number of words: 55,548

  • Phones: b bʲː c dzʲ dʲː d̪z̪ d̪z̪ː d̪ː e f i j k l m mʲː n̪ː o p pʲː sʲː s̪ː tsʲ tsʲː tʃʲ tʃʲː tʃː tʲː t̪s̪ t̪s̪ː t̪ː u x zʲː z̪ː ç ɐ ɑ ɔ ɛ ɟ ɡ ɦ ɪ ɲ ɲː ɾ ɾʲ ɾʲː ɾː ʃ ʃʲ ʊ ʋ ʋʲ ʋʲː ʋː ʎ ʎː ʒ ʒʲ ʒʲː ʝ

  • License: CC BY 4.0

  • Compatible MFA version: v3.0.0

  • Citation:

@techreport{mfa_ukrainian_mfa_dictionary_2024,
	author={McAuliffe, Michael and Sonderegger, Morgan},
	title={Ukrainian MFA dictionary v3.0.0},
	address={\url{https://mfa-models.readthedocs.io/pronunciation dictionary/Ukrainian/Ukrainian MFA dictionary v3_0_0.html}},
	year={2024},
	month={Mar},
}

Installation

Install from the MFA command line:

mfa model download dictionary ukrainian_mfa

Or download from the release page.

The dictionary available from the release page and command line installation has pronunciation and silence probabilities estimated as part of acoustic model training (see Silence probability format and training pronunciation probabilities for more information). If you would like to use the version of this dictionary without probabilities, please see the [plain dictionary](https://raw.githubusercontent.com/MontrealCorpusTools/mfa-models/main/dictionary/ukrainian/mfa/Ukrainian MFA dictionary v3_0_0.dict).

Intended use

This dictionary is intended for forced alignment of Ukrainian transcripts.

This dictionary uses the MFA phone set for Ukrainian, and was used in training the Ukrainian MFA acoustic model. Pronunciations can be added on top of the dictionary, as long as no additional phones are introduced.
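The "no additional phones" constraint can be checked before merging new entries. The following is a minimal sketch, not part of MFA itself: it assumes the plain dictionary format with one entry per line, the word and its space-separated phones divided by a tab. The helper names (`parse_entry`, `phone_inventory`, `unknown_phones`) and the two candidate entries are hypothetical; the three existing entries are taken from the examples on this page.

```python
def parse_entry(line):
    """Split one plain dictionary line into (word, phones).

    Assumes the layout 'word<TAB>phone phone ...' (an assumption about
    the plain dictionary file, without probability columns).
    """
    word, pronunciation = line.rstrip("\n").split("\t", 1)
    return word, pronunciation.split()


def phone_inventory(lines):
    """Collect every phone used by the existing dictionary entries."""
    phones = set()
    for line in lines:
        if line.strip():
            phones.update(parse_entry(line)[1])
    return phones


def unknown_phones(new_lines, inventory):
    """Map each new word to the phones that are not in the inventory."""
    problems = {}
    for line in new_lines:
        if not line.strip():
            continue
        word, phones = parse_entry(line)
        extra = [p for p in phones if p not in inventory]
        if extra:
            problems[word] = extra
    return problems


# Existing entries (taken from the examples listed on this page).
existing = ["нюх\tɲ u x", "кіш\tc i ʃ", "шапка\tʃ ɑ p k ɐ"]
# Hypothetical candidate entries, for illustration only.
candidates = ["шах\tʃ ɑ x", "нюанс\tɲ u a n s"]

inventory = phone_inventory(existing)
result = unknown_phones(candidates, inventory)
print(result)
```

In practice the inventory would be built from the full downloaded dictionary rather than three entries, so that every phone the acoustic model knows is included. Any word reported by `unknown_phones` would need its pronunciation rewritten with existing phones before being added.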

Performance Factors

When trying to get better alignment accuracy, adding pronunciations is generally helpful, especially for different styles and dialects. The most impactful improvements will generally be seen when adding reduced variants that involve deleting segments/syllables common in spontaneous speech. Alignment must include all phones specified in the pronunciation of a word, and each phone has a minimum duration (by default 10 ms). If a speaker pronounces a multisyllabic word as a single syllable, MFA can struggle to fit all the segments into the available time, which causes alignment errors on adjacent words as well.
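The minimum-duration constraint can be illustrated with simple arithmetic. The sketch below is only an illustration of that one constraint, not of how MFA performs alignment: it assumes the default 10 ms minimum per phone stated above, and the function names and the reduced variant are hypothetical. The full pronunciation of кохаю comes from the examples on this page.

```python
MIN_PHONE_MS = 10  # default minimum duration per phone, as stated above


def minimum_word_ms(phones, min_phone_ms=MIN_PHONE_MS):
    """Shortest interval that can hold every phone of a pronunciation."""
    return len(phones) * min_phone_ms


def variants_that_fit(variants, available_ms):
    """Keep only the variants whose minimum duration fits the interval."""
    return [v for v in variants if minimum_word_ms(v) <= available_ms]


# Full pronunciation of кохаю from the examples on this page.
full = ["k", "ɔ", "x", "ɑ", "j", "ʊ"]
# Hypothetical reduced variant with two segments deleted.
reduced = ["k", "x", "ɑ", "j"]

# Suppose the speaker produced the word in only 50 ms.
fitting = variants_that_fit([full, reduced], available_ms=50)
print(minimum_word_ms(full), minimum_word_ms(reduced), fitting)
```

With only the full form in the dictionary, the six phones need at least 60 ms, so the word would have to take time from its neighbours. A reduced variant gives the aligner a pronunciation that fits inside the interval the speaker actually produced.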

Ethical considerations

Deploying any Speech-to-Text model into any production setting has ethical implications. You should consider these implications before use.

Demographic Bias

You should assume every machine learning model has demographic bias unless proven otherwise. For pronunciation dictionaries, it is often the case that transcription accuracy and lexicon coverage are better for the prestige variety modeled in this dictionary than for other variants. If you are using this dictionary in production, you should acknowledge this as a potential issue.

IPA Charts

Consonants

Where obstruent symbols appear in pairs, the one on the left is unvoiced and the one on the right is voiced.

Manner | Labial | Labiodental | Dental | Alveolar | Alveopalatal | Palatal | Velar | Glottal

Nasal

Occurrences:
11,729
Examples:
* ману:
[m ɐ ʊ]
* матки:
[m ɑ k ɪ]
* мирні:
[m ɪ ɾ ɲ i]
* многі:
[m ɔ ʝ i]
Occurrences:
1,313
Examples:
* мізку:
[ i k ʊ]
* місіс:
[ i i ]
* міс:
[ i ]
* німі:
[ɲ i i]
Occurrences:
8
Examples:
Occurrences:
17,447
Examples:
* вікон:
[ʋʲ i k ɔ ]
* дивну:
[ ɪ ʋ ʊ]
* бощан:
[b ɔ ʃ ɑ ]
* інший:
[i ʃ e i]
Occurrences:
378
Examples:
* панно:
[p ɑ n̪ː ɔ]
* денне:
[ e n̪ː e]
* сонно:
[ ɔ ɔ n̪ː ɔ]
* ганно:
[ɦ ɑ n̪ː ɔ]
Occurrences:
5,049
Examples:
* кузню:
[k u ɲ ʊ]
* нюх:
[ɲ u x]
* ніжці:
[ɲ i ʒ tsʲ i]
* нього:
[ɲ ɔ ɦ ɔ]
Occurrences:
1,021
Examples:
* тінню:
[ i ɲː ʊ]
* кінні:
[c i ɲː i]
* хінні:
[ç i ɲː i]
* рання:
[ɾ ɑ ɲː ɐ]

Stop

Occurrences:
13,843
Examples:
* поти:
[p ɔ ɪ]
* випив:
[ʋ ɪ p ɪ ʋ]
* плай:
[p l ɑ i]
* шапка:
[ʃ ɑ p k ɐ]
Occurrences:
1,388
Examples:
* спір:
[ i ɾ]
* співу:
[ i ʋ ʊ]
* пірка:
[ i ɾ k ɐ]
* спід:
[ i ]
Occurrences:
1
Examples:
Occurrences:
6,027
Examples:
* блиск:
[b l ɪ k]
* буцім:
[b ʊ tsʲ i m]
* абрам:
[ɐ b ɾ ɐ m]
* баня:
[b ɑ ɲ ɐ]
Occurrences:
774
Examples:
* дубі:
[ ʊ i]
* обіді:
[ɔ i i]
* небіж:
[ ɛ i ʒ]
* бік:
[ i k]
Occurrences:
3
Examples:
Occurrences:
1
Examples:
Occurrences:
15,354
Examples:
* тісто:
[ i ɔ]
* гатку:
[ɦ ɑ k ʊ]
* хат:
[x ɑ ]
* ост:
[ɔ ]
Occurrences:
16
Examples:
Occurrences:
9,899
Examples:
* вход:
[ʋ x ɔ ]
* ридав:
[ɾ ɪ ɑ ʋ]
* ядру:
[j ɐ ɾ ʊ]
* вадим:
[ʋ ɑ ɪ m]
Occurrences:
105
Examples:
* оддам:
[ɔ d̪ː ɐ m]
* оддає:
[ɔ d̪ː ɑ j e]
* оддай:
[ɔ d̪ː ɑ i]
* оддав:
[o d̪ː ɑ ʋ]
Occurrences:
0
Examples:
Occurrences:
5,220
Examples:
* ідуть:
[i u ]
* втіха:
[ʋ i x ɐ]
* дасть:
[ ɐ ]
* ждуть:
[ʒ u ]
Occurrences:
138
Examples:
* пуття:
[p ʊ tʲː ɑ]
* виття:
[ʋ e tʲː ɑ]
* життє:
[ʒ e tʲː ɛ]
* шиття:
[ʃ e tʲː ɑ]
Occurrences:
0
Examples:
Occurrences:
2,199
Examples:
* діво:
[ i ʋ ɔ]
* радіо:
[ɾ ɑ i ɔ]
* буді:
[b u i]
* дню:
[ ɲ ʊ]
Occurrences:
45
Examples:
* міддю:
[ i dʲː ʊ]
* суддю:
[ ʊ dʲː u]
* суддя:
[ ʊ dʲː ɑ]
* судді:
[ ʊ dʲː i]
Occurrences:
970
Examples:
* якім:
[j ɑ c i m]
* кінні:
[c i ɲː i]
* кіш:
[c i ʃ]
* меткі:
[m ɛ ɔ c i]
Occurrences:
1
Examples:
Occurrences:
3
Examples:
* ґміни:
[ɟ i ɪ]
* ґніт:
[ɟ ɲ i ]
Occurrences:
14,664
Examples:
* кимсь:
[k ɪ m ]
* литку:
[l e k ʊ]
* дудка:
[ u k ɐ]
* кохаю:
[k ɔ x ɑ j ʊ]
Occurrences:
7
Examples:
* мекка:
[m ɛ ɐ]
* мекку:
[m ɛ ʊ]
Occurrences:
81
Examples:
* манґи:
[m ɐ ɡ ɪ]
* ґрунт:
[ɡ ɾ u ]
* ґвалт:
[ɡ ʋ ɑ l ]
* дзиґа:
[d̪z̪ ɪ ɡ ɐ]

Affricate

Occurrences:
723
Examples:
* цапів:
[t̪s̪ ɑ i ʋ]
* отцем:
[ɔ t̪s̪ e m]
* оцей:
[ɔ t̪s̪ ɛ i]
* цапи:
[t̪s̪ ɑ p ɪ]
Occurrences:
2
Examples:
* цска:
[t̪s̪ː k ɐ]
Occurrences:
346
Examples:
* відси:
[ʋʲ i d̪z̪ ɪ]
* ґудзь:
[ɡ u d̪z̪]
* дззи:
[d̪z̪ ɪ]
* дзиґа:
[d̪z̪ ɪ ɡ ɐ]
Occurrences:
2
Examples:
* дзз:
[d̪z̪ː]
Occurrences:
0
Examples:
Occurrences:
2,158
Examples:
* борці:
[b ɔ ɾ tsʲ i]
* місці:
[ i tsʲ i]
* плець:
[p l e tsʲ]
* ціпок:
[tsʲ i p ɔ k]
Occurrences:
143
Examples:
* гудця:
[ɦ ʊ tsʲː ɐ]
* сітці:
[ i ɔ tsʲː i]
* кітці:
[c i ɔ tsʲː i]
* річці:
[ɾʲ i tsʲː i]
Occurrences:
0
Examples:
Occurrences:
71
Examples:
* дзьоб:
[dzʲ ɔ b]
* дзвін:
[dzʲ ʋʲ i ]
Occurrences:
6,504
Examples:
* кучми:
[k u m ɪ]
* січ:
[ i ]
* дещо:
[ e ʃ ɔ]
* чуб:
[ ʊ b]
Occurrences:
395
Examples:
* чітку:
[tʃʲ i k ʊ]
* онучі:
[o u tʃʲ i]
* мощі:
[m ɔ ʃ tʃʲ i]
* утечі:
[ʊ e tʃʲ i]
Occurrences:
24
Examples:
* рітчі:
[ɾʲ i tʃʲː i]
* віччю:
[ʋʲ i tʃʲː ʊ]
* ніччю:
[ɲ i tʃʲː ʊ]
* матчі:
[m ɐ tʃʲː i]
Occurrences:
71
Examples:
* матчу:
[m ɐ tʃː ʊ]
* лучче:
[l u tʃː e]
* одчай:
[ɔ tʃː ɐ i]
* матч:
[m ɐ tʃː]
Occurrences:
444
Examples:
* джуді:
[ ʊ i]
* імідж:
[i i ]
* бджіл:
[b i l]
* джерю:
[ e ɾʲ ʊ]

Sibilant

Occurrences:
11,279
Examples:
* савки:
[ ɐ ʋ k ɪ]
* цар:
[t̪s̪ ɑ ɾ]
* сало:
[ ɐ l ɔ]
* осені:
[ɔ ɛ ɲ i]
Occurrences:
19
Examples:
* ссати:
[s̪ː ɑ ɪ]
* ссала:
[s̪ː ɐ l ɐ]
* руссю:
[ɾ u s̪ː ʊ]
* улісс:
[ʊ ʎ i s̪ː]
Occurrences:
8,333
Examples:
* низом:
[ ɪ ɔ m]
* заким:
[ ɐ k ɪ m]
* збити:
[ b ɪ ɪ]
* зона:
[ ɔ ɐ]
Occurrences:
15
Examples:
* дзз:
[d̪z̪ː]
* ззаду:
[z̪ː ɑ ʊ]
* ззаді:
[z̪ː ɑ i]
Occurrences:
0
Examples:
Occurrences:
9,211
Examples:
* сіра:
[ i ɾ ɐ]
* якось:
[j ɐ k ɔ ]
* уся:
[u ɐ]
* сіяв:
[ i j ɑ ʋ]
Occurrences:
104
Examples:
* нісся:
[ɲ i sʲː ɐ]
* россю:
[ɾ o sʲː u]
* отся:
[ɔ sʲː ɐ]
* отсі:
[ɔ sʲː i]
Occurrences:
0
Examples:
Occurrences:
1,388
Examples:
* зятю:
[ ɑ tʲː ʊ]
* князі:
[k ɲ ɑ i]
* зняті:
[ ɲ ɑ i]
* разів:
[ɾ ɑ i ʋ]
Occurrences:
8
Examples:
Occurrences:
5,024
Examples:
* груша:
[ɦ ɾ u ʃ ɐ]
* шити:
[ʃ ɪ ɪ]
* душив:
[ u ʃ ɪ ʋ]
* шапку:
[ʃ ɑ p k ʊ]
Occurrences:
134
Examples:
* довші:
[ ɔ ʋ ʃʲ i]
* нашій:
[ ɑ ʃʲ ]
* нашім:
[ ɑ ʃʲ i m]
* миші:
[m e ʃʲ i]
Occurrences:
2,866
Examples:
* враже:
[ʋ ɾ ɑ ʒ e]
* жадаю:
[ʒ ɑ ɐ j ʊ]
* межа:
[m e ʒ ɑ]
* кожне:
[k ɔ ʒ ɛ]
Occurrences:
62
Examples:
* жінку:
[ʒʲ i k ʊ]
* етажі:
[e ɐ ʒʲ i]
* жінці:
[ʒʲ i ɲ tsʲ i]
* жінка:
[ʒʲ i k ɐ]
Occurrences:
17
Examples:

Fricative

Occurrences:
568
Examples:
* шофер:
[ʃ ɔ f ɛ ɾ]
* фронт:
[f ɾ ɔ ]
* флоті:
[f l ɔ i]
* фойє:
[f ɔ i e]
Occurrences:
214
Examples:
* фіно:
[ i ɔ]
* фін:
[ i ]
* фільм:
[ i ʎ m]
* шафі:
[ʃ ɑ i]
Occurrences:
2
Examples:
* моффа:
[m ɔ ɐ]
Occurrences:
189
Examples:
* ляхів:
[ʎ ɑ ç i ʋ]
* хід:
[ç i ]
* лихі:
[l e ç i]
* рахів:
[ɾ ɐ ç i ʋ]
Occurrences:
388
Examples:
* гілля:
[ʝ i ʎː ɐ]
* убогі:
[ʊ b ɔ ʝ i]
* гілці:
[ʝ i tsʲ i]
* легіт:
[l e ʝ i ]
Occurrences:
7,031
Examples:
* голка:
[ɦ ɔ l k ɐ]
* граба:
[ɦ ɾ ɐ b ɐ]
* смуга:
[ m u ɦ ɐ]
* нього:
[ɲ ɔ ɦ ɔ]

Approximant

Occurrences:
21,243
Examples:
* сиваш:
[ e ʋ ɑ ʃ]
* вид:
[ʋ ɪ ]
* кусав:
[k ʊ ɑ ʋ]
* весло:
[ʋ ɛ l ɔ]
Occurrences:
3,043
Examples:
* лаві:
[l ɑ ʋʲ i]
* візу:
[ʋʲ i ʊ]
* навіс:
[ ɑ ʋʲ i ]
* віри:
[ʋʲ i ɾ ɪ]
Occurrences:
17
Examples:
* вві:
[ʋʲː i]
* ввів:
[ʋʲː i ʋ]
Occurrences:
46
Examples:
* ввело:
[ʋː e l ɔ]
* вверх:
[ʋː e ɾ x]
* ввесь:
[ʋː ɛ ]
* вволю:
[ʋː ɔ ʎ ʊ]
Occurrences:
8,529
Examples:
* хомою:
[x ɔ m ɔ j ʊ]
* яєць:
[j ɐ j e tsʲ]
* ясів:
[j ɑ i ʋ]
* дії:
[ i j i]

Tap

Occurrences:
19,999
Examples:
* ромен:
[ɾ ɔ m ɛ ]
* русі:
[ɾ u i]
* хрест:
[x ɾ ɛ ]
* орля:
[ɔ ɾ l ɑ]
Occurrences:
2,410
Examples:
* пріск:
[p ɾʲ i k]
* зорі:
[ ɔ ɾʲ i]
* рівну:
[ɾʲ i ʋ ʊ]
* тріо:
[ ɾʲ i ɔ]
Occurrences:
6
Examples:
* гаррі:
[ɦ ɐ ɾʲː i]
Occurrences:
4
Examples:
* гурра:
[ɦ ʊ ɾː ɐ]
* ферро:
[f ɛ ɾː ɔ]

Lateral

Occurrences:
14,044
Examples:
* скали:
[ k ɑ l ɪ]
* столи:
[ ɔ l ɪ]
* лихе:
[l e x ɛ]
* цілує:
[tsʲ i l ʊ j e]
Occurrences:
34
Examples:
* халлу:
[x ɐ ʊ]
* волл:
[ʋ ɔ ]
* аллен:
[ɐ ɛ ]
* халл:
[x ɐ ]
Occurrences:
4,894
Examples:
* лягла:
[ʎ ɐ ɦ l ɐ]
* шлюбу:
[ʃ ʎ u b ʊ]
* лім:
[ʎ i m]
* любку:
[ʎ ʊ b k ʊ]
Occurrences:
72
Examples:
* валль:
[ʋ ɐ ʎː]
* сіллю:
[ i ʎː ʊ]
* заллє:
[ ɐ ʎː ɛ]
* гілля:
[ʝ i ʎː ɐ]

Vowels

Where vowel symbols appear in pairs, the one on the left is unrounded and the one on the right is rounded.

Front | Near-Front | Central | Near-Back | Back

Close

Occurrences:
24,363
Examples:
* заїр:
[ ɐ j i ɾ]
* іде:
[i ɛ]
* фіри:
[ i ɾ ɪ]
* мішку:
[ i ʃ k ʊ]
Occurrences:
1,084
Examples:
* пійло:
[ l ɔ]
* бійся:
[ ɐ]
* копій:
[k o ]
* лівій:
[ʎ i ʋʲ ]
Occurrences:
5,020
Examples:
* юного:
[j u ɔ ɦ ɔ]
* круп:
[k ɾ u p]
* людей:
[ʎ u ɛ i]
* юшка:
[j u ʃ k ɐ]
Occurrences:
21,505
Examples:
* рикав:
[ɾ ɪ k ɐ ʋ]
* замки:
[ ɑ m k ɪ]
* мине:
[m ɪ e]
* миски:
[m ɪ k ɪ]
Occurrences:
14,771
Examples:
* суне:
[ ʊ e]
* новою:
[ ɔ ʋ ɔ j ʊ]
* краму:
[k ɾ ɐ m ʊ]
* дурну:
[ ʊ ɾ ʊ]

Close-Mid

Occurrences:
25,968
Examples:
* серед:
[ ɛ ɾ e ]
* миль:
[m e ʎ]
* себе:
[ e b ɛ]
* каноє:
[k ɐ ɔ j e]
Occurrences:
2,120
Examples:
* поміг:
[p o i ɦ]
* окуня:
[o k u ɲ ɐ]
* софія:
[ o i j ɐ]
* щоці:
[ʃ o tsʲ i]

Open-Mid

Occurrences:
6,129
Examples:
* своєю:
[ ʋ ɔ j ɛ j ʊ]
* густе:
[ɦ ʊ ɛ]
* вєм:
[ʋ j ɛ m]
* щезне:
[ʃ ɛ e]
Occurrences:
34,471
Examples:
* колу:
[k ɔ l ʊ]
* обоз:
[ɔ b ɔ ]
* тої:
[ ɔ j i]
* діяло:
[ i j ɑ l ɔ]
Occurrences:
27,661
Examples:
* гайда:
[ɦ ɑ i ɐ]
* марії:
[m ɐ ɾʲ i j i]
* воля:
[ʋ ɔ ʎ ɐ]
* лада:
[l ɑ ɐ]

Open

Occurrences:
19,839
Examples:
* марку:
[m ɑ ɾ k ʊ]
* вдача:
[ʋ ɑ ɐ]
* п'ядь:
[p j ɑ ]
* хадід:
[x ɑ i ]