
Montreal Forced Aligner

Kazakh

ID                        Language  Dialect  License
Common Voice Kazakh v7_0  Kazakh    N/A      CC-0


© Copyright 2018-2024, Montreal Corpus Tools.
