English MFA ivector extractor v2.1.0#

  • Maintainer: Montreal Forced Aligner

  • Language: English

  • Model type: Ivector extractor

  • Features: MFCC

  • Architecture: ivector

  • Model version: v2.1.0

  • Trained date: 2023-01-04

  • Compatible MFA version: v2.1.0

  • License: CC BY 4.0

  • Citation:

@techreport{mfa_english_mfa_ivector_2023,
	author={McAuliffe, Michael and Sonderegger, Morgan},
	title={English MFA ivector extractor v2.1.0},
	address={\url{https://mfa-models.readthedocs.io/ivector/English/English MFA ivector extractor v2_1_0.html}},
	year={2023},
	month={Jan},
}

Training corpora

../../_images/full_logo_yellow.svg

Installation#

Install from the MFA command line:

mfa model download ivector english_mfa

Or download from the release page.

Intended use#

This model is intended for speaker diarization and clustering.

Performance Factors#

As ivector extractors are trained on a large amount of data and without reference to language-specific resources, they may be useful outside of the specific language variety being trained, however, differences in the languages may impact performance. Particularly sociolinguistic aspects related to identity presentation, the linguistic status voice quality (breathy, creaky modal), and non-linguistic factors like recording conditions and microphone response may affect diarization performance across languages.

Metrics#

Speaker diarization systems are evaluated through Equal Error Rate (EER), the error when false accept rate is equal to the false rejection rate.

  • EER: 0%

Ethical considerations#

Deploying any Speech-to-Text model into any production setting has ethical implications. You should consider these implications before use.

Demographic Bias#

You should assume every machine learning model has demographic bias unless proven otherwise. For ivector extraction models, diarization of male speakers generally has better performance than diarization of female speakers, even with equal amounts of training data.

Surveillance#

Speech-to-Text technologies may be misused to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in many countries. You should not assume consent to record and analyze private speech.