Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Information Journal Paper

Title

EMOTIONAL SPEECH RECOGNITION AND EMOTION IDENTIFICATION IN FARSI LANGUAGE

Pages

  13-27

Abstract

Emotion in speech can convey information beyond what the textual content provides. However, it also causes problems for the speech recognition process. In a previous study, we showed that speech emotion substantially changes speech parameters. To improve the emotional speech recognition rate, the effects of emotion on speech parameters should therefore be evaluated first, and recognition accuracy should then be improved by applying suitable parameters. The changes in speech parameters, i.e. formant frequencies and pitch frequency, due to anger and grief were evaluated for the Farsi language in our former research. In this research, using those results, we try to improve emotional speech recognition accuracy using baseline models. We show that adding parameters such as formant and pitch frequencies to the speech feature vector can improve recognition accuracy. The amount of improvement depends on the parameter type, the number of mixture components, and the emotional condition. Proper identification of the emotional condition can also help improve speech recognition accuracy. To recognize the emotional condition of speech, formant and pitch frequencies were used successfully in two different approaches, namely a decision tree and a GMM.
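The two ideas in the abstract, appending pitch and formant frequencies to a feature vector and identifying the emotional condition from those parameters, can be illustrated with a minimal sketch. The abstract does not give the feature layout, model sizes, or training procedure, so everything below is an assumption: the function names are hypothetical, the numeric values are made up, and each emotion is modelled by a single diagonal-covariance Gaussian, which is only the one-component special case of a GMM and not the authors' configuration.

```python
import math

def augment(base_features, pitch_hz, formants_hz):
    """Append pitch and formant frequencies to a base feature vector."""
    return list(base_features) + [pitch_hz] + list(formants_hz)

def fit_gaussian(frames):
    """Estimate per-dimension mean and variance from a list of feature vectors."""
    n = len(frames)
    dims = len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dims)]
    variances = [
        # Floor the variance so the log-likelihood stays finite.
        max(sum((f[d] - means[d]) ** 2 for f in frames) / n, 1e-6)
        for d in range(dims)
    ]
    return means, variances

def log_likelihood(frame, model):
    """Log-density of one frame under a diagonal-covariance Gaussian."""
    means, variances = model
    return sum(
        -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
        for x, m, v in zip(frame, means, variances)
    )

def identify_emotion(frames, models):
    """Pick the emotion whose model gives the highest total log-likelihood."""
    scores = {
        label: sum(log_likelihood(f, model) for f in frames)
        for label, model in models.items()
    }
    return max(scores, key=scores.get)

# Made-up [pitch, F1, F2] values in Hz, for illustration only.
train = {
    "neutral": [[120.0, 500.0, 1500.0], [125.0, 510.0, 1480.0], [118.0, 495.0, 1520.0]],
    "anger": [[220.0, 600.0, 1700.0], [230.0, 620.0, 1680.0], [215.0, 590.0, 1720.0]],
}
models = {label: fit_gaussian(frames) for label, frames in train.items()}

test_utterance = [[225.0, 605.0, 1690.0], [218.0, 598.0, 1710.0]]
print(identify_emotion(test_utterance, models))
```

A recognizer along the lines described in the abstract would presumably use several mixture components per model and could use the identified emotion to select or adapt its acoustic models; neither step is shown here.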

Cite

    APA:

    GHARAVIAN, D., & AHADI, S.M. (2009). EMOTIONAL SPEECH RECOGNITION AND EMOTION IDENTIFICATION IN FARSI LANGUAGE. MODARES TECHNICAL AND ENGINEERING, -(34 (SPECIAL ISSUE ON ELECTRICAL ENGINEERING)), 13-27. SID. https://sid.ir/paper/25082/en

    Vancouver:

    GHARAVIAN D., AHADI S.M. EMOTIONAL SPEECH RECOGNITION AND EMOTION IDENTIFICATION IN FARSI LANGUAGE. MODARES TECHNICAL AND ENGINEERING [Internet]. 2009;-(34 (SPECIAL ISSUE ON ELECTRICAL ENGINEERING)):13-27. Available from: https://sid.ir/paper/25082/en

    IEEE:

    D. GHARAVIAN and S.M. AHADI, “EMOTIONAL SPEECH RECOGNITION AND EMOTION IDENTIFICATION IN FARSI LANGUAGE,” MODARES TECHNICAL AND ENGINEERING, vol. -, no. 34 (SPECIAL ISSUE ON ELECTRICAL ENGINEERING), pp. 13–27, 2009, [Online]. Available: https://sid.ir/paper/25082/en
