Source: Common Voice English
Language: English
Dialects: General American English, British English, Nigerian English, Indian English
Number of hours: 2,322.80
Number of utterances: 1,625,987
Number of speakers: 71,160
Female speakers: 3,750
Male speakers: 14,586
Unknown speakers: 52,824
License: CC-0
Version: 17.0
Citation:
@article{ardila2019common,
title = {Common voice: A massively-multilingual speech corpus},
author = {Ardila, Rosana and Branson, Megan and Davis, Kelly and Henretty, Michael and Kohler, Michael and
Meyer, Josh and Morais, Reuben and Saunders, Lindsay and Tyers, Francis M and Weber, Gregor},
journal = {arXiv preprint arXiv:1912.06670},
year = {2019}
}
Please note that MFA does not host any corpora; see the link above to access the data.
If you have comments or questions about using this corpus for MFA, you can check previous MFA model discussion posts or create a new one.
Pronunciation dictionaries
|