Selected article for: "low quality and open source"

Author: Mussakhojayeva, Saida; Janaliyeva, Aigerim; Mirzakhmetov, Almas; Khassanov, Yerbolat; Varol, Huseyin Atakan
Title: KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset
  • Cord-id: fy5wbx0w
  • Document date: 2021_4_17
  • ID: fy5wbx0w
    Snippet: This paper introduces a high-quality open-source speech synthesis dataset for Kazakh, a low-resource language spoken by over 13 million people worldwide. The dataset consists of about 93 hours of transcribed audio recordings spoken by two professional speakers (female and male). It is the first publicly available large-scale dataset developed to promote Kazakh text-to-speech (TTS) applications in both academia and industry. In this paper, we share our experience by describing the dataset develop
    Document: This paper introduces a high-quality open-source speech synthesis dataset for Kazakh, a low-resource language spoken by over 13 million people worldwide. The dataset consists of about 93 hours of transcribed audio recordings spoken by two professional speakers (female and male). It is the first publicly available large-scale dataset developed to promote Kazakh text-to-speech (TTS) applications in both academia and industry. In this paper, we share our experience by describing the dataset development procedures and faced challenges, and discuss important future directions. To demonstrate the reliability of our dataset, we built baseline end-to-end TTS models and evaluated them using the subjective mean opinion score (MOS) measure. Evaluation results show that the best TTS models trained on our dataset achieve MOS above 4 for both speakers, which makes them applicable for practical use. The dataset, training recipe, and pretrained TTS models are freely available.

    Search related documents:
    Co phrase search for related documents
    • additional analysis and administration education: 1
    • additional analysis and low resource: 1
    • address order and low resource: 1, 2, 3
    • address order and low resource setting: 1
    Co phrase search for related documents, hyperlinks ordered by date