مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Verion

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

video

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

sound

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Version

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View:

3,998
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Download:

0
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Cites:

3

Information Journal Paper

Title

DESIGN AND IMPLEMENTATION OF A TEXT TO SPEECH SYSTEM FOR FARSI LANGUAGE

Pages

  31-48

Abstract

 To develop a Text-To-Speech (TTS) system in Iranian Farsi language, in this paper three subsystems are designed and implemented. The first subsystem, called NLP (Natural Language Processor), consists of a Part-Of Speech (POS) tagging unit, a text to phoneme converter, and a module for disambiguation of homographs. The second subsystem is a prosody generator that uses a Recurrent Neural Network (RNN) with 289 nodes implemented in four layers. A concatenative speech synthesizer using Harmonic plus Noise Model (HNM) with new approaches in prosody modification is implemented as the third subsystem. To provide training data for prosody generator more efficiently, a couple of novel and hybrid algorithms are used for automated segmentation and labeling of speech at phoneme level with 97.2% accuracy. To evaluate the performance of system, rating scales recommended in ITU-T P.85 are used and average MOS (over six scales) of 3.59 was reached. This MOS shows that the performance of this Farsi ITS is comparable with modern English ITS systems.

Cites

References

  • No record.
  • Cite

    APA: Copy

    SHEYKHAN, M., NASIRZAD, M., & DAFTARIAN, A.. (2005). DESIGN AND IMPLEMENTATION OF A TEXT TO SPEECH SYSTEM FOR FARSI LANGUAGE. JOURNAL OF SCHOOL OF ENGINEERING, 17(2), 31-48. SID. https://sid.ir/paper/22592/en

    Vancouver: Copy

    SHEYKHAN M., NASIRZAD M., DAFTARIAN A.. DESIGN AND IMPLEMENTATION OF A TEXT TO SPEECH SYSTEM FOR FARSI LANGUAGE. JOURNAL OF SCHOOL OF ENGINEERING[Internet]. 2005;17(2):31-48. Available from: https://sid.ir/paper/22592/en

    IEEE: Copy

    M. SHEYKHAN, M. NASIRZAD, and A. DAFTARIAN, “DESIGN AND IMPLEMENTATION OF A TEXT TO SPEECH SYSTEM FOR FARSI LANGUAGE,” JOURNAL OF SCHOOL OF ENGINEERING, vol. 17, no. 2, pp. 31–48, 2005, [Online]. Available: https://sid.ir/paper/22592/en

    Related Journal Papers

    Related Seminar Papers

  • No record.
  • Related Plans

  • No record.
  • Recommended Workshops






    Move to top
    telegram sharing button
    whatsapp sharing button
    linkedin sharing button
    twitter sharing button
    email sharing button
    email sharing button
    email sharing button
    sharethis sharing button