Tasks & Goals Getting started Data Results

ZeroSpeech 2019: TTS without T

Leaderboards

The leaderboard above presents the results obtained by the participants to the 2019 edition. The columns are sortable by clicking on the picture of each column header. A detailed view of the results is available by clicking on the details picture of each row, it includes audio samples of speech synthesis.

The score columns are interpreted as follows (see Evaluation Metrics for more details):

MOS:
- mean opinion score on speech synthesis
- scale is $[1, 5]$ , bigger is better
CER:
- character error rate after human transcription of speech synthesis
- scale is $[0, 1]$ , lower is better
Similarity:
- similarity to the target voice of speech synthesis
- scale is $[1, 5]$ , bigger is better
ABX:
- ABX error rate on embeddings
- scale is $[0, 100]$ , lower is better
Bitrate:
- bitrate of the embeddings
- scale is $]0, +\infty[$ , lower is better

#	Authors	Surprise language					Training language (English)
#	Authors	MOS	CER	Similarity	ABX	Bitrate	MOS	CER	Similarity	ABX	Bitrate
#	Authors	MOS	CER	Similarity	ABX	Bitrate	MOS	CER	Similarity	ABX	Bitrate
#	Authors	Surprise language					Training language (English)

Graphs

Made with

Last updated on May 24 14:53 2023