VoxPopuli Croatian#

  • Source: VoxPopuli Croatian

  • Language: Serbo-Croatian

  • Dialects: N/A

  • Number of hours: 41.27

  • Number of utterances: 12,938

  • Number of speakers: 28

    • Female speakers: 8

    • Male speakers: 19

    • Unknown speakers: 1

  • License: CC-0

  • Citation:

@inproceedings{wang-etal-2021-voxpopuli,
	title = "{V}ox{P}opuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation",
	author = "Wang, Changhan  and
		Riviere, Morgane  and
		Lee, Ann  and
		Wu, Anne  and
		Talnikar, Chaitanya  and
		Haziza, Daniel  and
		Williamson, Mary  and
		Pino, Juan  and
		Dupoux, Emmanuel",
	booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)",
	month = aug,
	year = "2021",
	address = "Online",
	publisher = "Association for Computational Linguistics",
	url = "https://aclanthology.org/2021.acl-long.80",
	pages = "993--1003",
}
  • Please, note that no corpora are hosted by MFA, please see the link above for accessing the data.

  • If you have comments or questions about using this corpus for MFA, you can check previous MFA model discussion posts or create a new one.