TEDxJP-10K v1.1#

  • Source: TEDxJP-10K

  • Language: Japanese

  • Dialects: N/A

  • Number of hours: 8.85

  • Number of utterances: 9,962

  • Number of speakers: 271

    • Female speakers: 0

    • Male speakers: 0

    • Unknown speakers: 271

  • License: Apache 2.0

  • Version: 1.1

  • Citation:

@inproceedings{ando2020slp,
	author = {安藤慎太郎 and 藤原弘将},
	title = {テレビ録画とその字幕を利用した大規模日本語音声コーパスの構築},
	booktitle = {情報処理学会研究報告},
	series = {Vol.2020-SLP-134 No.8},
	date = {2020}
}
  • Please, note that no corpora are hosted by MFA, please see the link above for accessing the data.

  • If you have comments or questions about using this corpus for MFA, you can check previous MFA model discussion posts or create a new one.