Corpora#
ID |
Language |
Dialect |
License |
---|---|---|---|
English |
India |
CC BY-NC-ND 4.0 |
|
French |
N/A |
Apache 2.0 |
|
Mandarin |
China;Erhua |
CC BY-NC-ND 4.0 |
|
Mandarin |
China;Erhua |
Apache 2.0 |
|
Swahili |
N/A |
MIT |
|
English |
UK |
CC BY 3.0 |
|
Korean |
N/A |
CC BY-NC-ND 4.0 |
|
Korean |
N/A |
CC BY-NC-ND 4.0 |
|
English |
US |
Buckeye License |
|
Abkhaz |
N/A |
CC-0 |
|
Arabic |
N/A |
CC-0 |
|
Armenian |
N/A |
CC-0 |
|
Bashkir |
N/A |
CC-0 |
|
Basque |
N/A |
CC-0 |
|
Belarusian |
N/A |
CC-0 |
|
Bulgarian |
N/A |
CC-0 |
|
Bulgarian |
N/A |
CC-0 |
|
Bulgarian |
N/A |
CC-0 |
|
Bulgarian |
N/A |
CC-0 |
|
Mandarin |
China;Erhua |
CC-0 |
|
Mandarin |
China;Erhua |
CC-0 |
|
Mandarin |
China;Erhua |
CC-0 |
|
Mandarin |
Taiwan |
CC-0 |
|
Mandarin |
Taiwan |
CC-0 |
|
Mandarin |
Taiwan |
CC-0 |
|
Chuvash |
N/A |
CC-0 |
|
Czech |
N/A |
CC-0 |
|
Czech |
N/A |
CC-0 |
|
Czech |
N/A |
CC-0 |
|
Dutch |
N/A |
CC-0 |
|
English |
Nigeria;UK;US |
CC-0 |
|
French |
N/A |
CC-0 |
|
French |
N/A |
CC-0 |
|
French |
N/A |
CC-0 |
|
Georgian |
N/A |
CC-0 |
|
German |
N/A |
CC-0 |
|
German |
N/A |
CC-0 |
|
German |
N/A |
CC-0 |
|
Greek |
N/A |
CC-0 |
|
Guarani |
N/A |
CC-0 |
|
Hausa |
N/A |
CC-0 |
|
Hausa |
N/A |
CC-0 |
|
Hausa |
N/A |
CC-0 |
|
Hindi |
N/A |
CC-0 |
|
Hungarian |
N/A |
CC-0 |
|
Indonesian |
N/A |
CC-0 |
|
Italian |
N/A |
CC-0 |
|
Japanese |
N/A |
CC-0 |
|
Japanese |
N/A |
CC-0 |
|
Japanese |
N/A |
CC-0 |
|
Japanese |
N/A |
CC-0 |
|
Kazakh |
N/A |
CC-0 |
|
Korean |
N/A |
CC-0 |
|
Kurmanji |
N/A |
CC-0 |
|
Kyrgyz |
N/A |
CC-0 |
|
Maltese |
N/A |
CC-0 |
|
Polish |
N/A |
CC-0 |
|
Polish |
N/A |
CC-0 |
|
Portuguese |
Brazil;Portugal |
CC-0 |
|
Portuguese |
Brazil;Portugal |
CC-0 |
|
Punjabi |
N/A |
CC-0 |
|
Romanian |
N/A |
CC-0 |
|
Russian |
N/A |
CC-0 |
|
Russian |
N/A |
CC-0 |
|
Russian |
N/A |
CC-0 |
|
Croatian |
N/A |
CC-0 |
|
Croatian |
N/A |
CC-0 |
|
Sorbian |
Upper |
CC-0 |
|
Spanish |
Latin America;Spain |
CC-0 |
|
Swahili |
N/A |
CC-0 |
|
Swahili |
N/A |
CC-0 |
|
Swedish |
N/A |
CC-0 |
|
Swedish |
N/A |
CC-0 |
|
Tamil |
N/A |
CC-0 |
|
Tatar |
N/A |
CC-0 |
|
Thai |
N/A |
CC-0 |
|
Thai |
N/A |
CC-0 |
|
Thai |
N/A |
CC-0 |
|
Thai |
N/A |
CC-0 |
|
Turkish |
N/A |
CC-0 |
|
Turkish |
N/A |
CC-0 |
|
Turkish |
N/A |
CC-0 |
|
Turkish |
N/A |
CC-0 |
|
Ukrainian |
N/A |
CC-0 |
|
Ukrainian |
N/A |
CC-0 |
|
Ukrainian |
N/A |
CC-0 |
|
Ukrainian |
N/A |
CC-0 |
|
Urdu |
N/A |
CC-0 |
|
Uyghur |
N/A |
CC-0 |
|
Uzbek |
N/A |
CC-0 |
|
Vietnamese |
N/A |
CC-0 |
|
Vietnamese |
N/A |
CC-0 |
|
Vietnamese |
N/A |
CC-0 |
|
Vietnamese |
N/A |
CC-0 |
|
English |
US |
CC BY-NC-SA 4.0 |
|
Czech |
N/A |
CC BY-NC-ND 3.0 |
|
Korean |
N/A |
CC BY-NC-ND 4.0 |
|
Arabic |
N/A |
ELRA |
|
Bulgarian |
N/A |
ELRA |
|
Mandarin |
China;Erhua |
ELRA |
|
Croatian |
N/A |
ELRA |
|
Czech |
N/A |
ELRA |
|
French |
N/A |
ELRA |
|
German |
N/A |
ELRA |
|
Hausa |
N/A |
ELRA |
|
Japanese |
N/A |
ELRA |
|
Korean |
N/A |
ELRA |
|
Polish |
N/A |
ELRA |
|
Portuguese |
Brazil |
ELRA |
|
Russian |
N/A |
ELRA |
|
Spanish |
Latin America |
ELRA |
|
Swahili |
N/A |
ELRA |
|
Swedish |
N/A |
ELRA |
|
Thai |
N/A |
ELRA |
|
Turkish |
N/A |
ELRA |
|
Ukrainian |
N/A |
ELRA |
|
Vietnamese |
Hanoi;Ho Chi Minh City |
ELRA |
|
Spanish |
Latin America |
CC BY-SA 4.0 |
|
Spanish |
Latin America |
CC BY-SA 4.0 |
|
Spanish |
Latin America |
CC BY-SA 4.0 |
|
Spanish |
Latin America |
CC BY-SA 4.0 |
|
Spanish |
Latin America |
CC BY-SA 4.0 |
|
English |
Nigeria |
CC BY-SA 4.0 |
|
English |
UK |
CC BY-SA 4.0 |
|
Thai |
N/A |
MIT |
|
English |
Nigeria |
CC BY-NC-SA 3.0 |
|
Japanese |
N/A |
CC BY-SA 4.0 |
|
Korean |
N/A |
CC BY-NC-SA 4.0 |
|
English |
N/A |
CC BY-NC 4.0 |
|
Japanese |
N/A |
LaboroTV Non-commercial |
|
Czech |
N/A |
CC BY 4.0 |
|
English |
US |
CC BY 4.0 |
|
Thai |
N/A |
CC BY-SA-NC 3.0 |
|
Polish |
N/A |
M-AILABS License |
|
Russian |
N/A |
M-AILABS License |
|
Spanish |
Latin America;Spain |
M-AILABS License |
|
Ukrainian |
N/A |
M-AILABS License |
|
Arabic |
N/A |
CC BY 4.0 |
|
Turkish |
N/A |
CC BY 4.0 |
|
Japanese |
N/A |
Microsoft Research Data License |
|
French |
N/A |
CC BY 4.0 |
|
German |
N/A |
CC BY 4.0 |
|
Polish |
N/A |
CC BY 4.0 |
|
Portuguese |
Portugal |
CC BY 4.0 |
|
Spanish |
Spain |
CC BY 4.0 |
|
Portuguese |
Portugal |
CC BY-NC-ND 4.0 |
|
Russian |
N/A |
CC BY-NC-ND 4.0 |
|
English |
Nigeria;UK |
CC BY 3.0 |
|
Swedish |
N/A |
CC-0 |
|
Korean |
N/A |
CC BY-NC-ND 4.0 |
|
Croatian |
N/A |
CC BY-SA 4.0 |
|
Russian |
N/A |
Public domain in the USA |
|
Korean |
N/A |
CC BY-NC 2.0 |
|
Japanese |
N/A |
Apache 2.0 |
|
Thai |
N/A |
CC BY-SA 4.0 |
|
Mandarin |
China;Erhua |
Apache 2.0 |
|
English |
US |
LDC License |
|
Vietnamese |
Ho Chi Minh City |
CC BY-NC-SA 4.0 |
|
Croatian |
N/A |
CC-0 |
|
Czech |
N/A |
CC-0 |
|
Polish |
N/A |
CC-0 |
|
Korean |
N/A |
CC BY 4.0 |