مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Verion

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

video

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

sound

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Version

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View:

857
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Download:

0
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Cites:

Information Journal Paper

Title

ROBUST RECOGNITION OF DIRECT AND TELEPHONY SPEECH USING PROPER EXTRACTION OF FEATURE VECTORS AND THEIR MODIFICATION BY NEURAL NETWORKS INVERSION

Pages

  21-29

Abstract

 A vast amount of research is going on for design of ROBUST SPEECH RECOGNITION in to alleviate speech variability conditions. One of the variability aspects is the difference between telephony speech and direct speech (recorded in noise free conditions). In this paper by using a set of experiments, it is shown that LHCB parameters are superior to traditional MFCCs for speech recognition applications when they are used in a NEURAL NETWORK based speech recognition system for both direct and telephony speech. Then by extraction of LHCBs from direct and telephony speech, and training of a MLP based speech recognition model, a direct and telephony speech recognition system is developed. Using a NEURAL NETWORK INVERSION based on gradient descent method, the telephony speech FEATURE VECTORS are modified toward to the direct speech FEATURE VECTORS and by training a second network on modified telephony and direct speech FEATURE VECTORS a 1.4% enhancement on speech recognition was achieved. Later, using general INVERSION method of NEURAL NETWORKs both telephony and direct speech FEATURE VECTORS are modified in a manner which mainly contains phonetic information and not other speech variations. Then by the training of the second NEURAL NETWORK on this dataset, the system achieved 2.98% and 1.68% higher recognition rate for direct and telephony speech, respectively. 

Cites

  • No record.
  • References

  • No record.
  • Cite

    APA: Copy

    VALI, MANSOUR, & SEYED SALEHI, A.. (2006). ROBUST RECOGNITION OF DIRECT AND TELEPHONY SPEECH USING PROPER EXTRACTION OF FEATURE VECTORS AND THEIR MODIFICATION BY NEURAL NETWORKS INVERSION. NASHRIYYAH-I MUHANDESI-I BARQ VA MUHANDESI-I KAMPYUTAR-I IRAN (PERSIAN), 4(1), 21-29. SID. https://sid.ir/paper/53747/en

    Vancouver: Copy

    VALI MANSOUR, SEYED SALEHI A.. ROBUST RECOGNITION OF DIRECT AND TELEPHONY SPEECH USING PROPER EXTRACTION OF FEATURE VECTORS AND THEIR MODIFICATION BY NEURAL NETWORKS INVERSION. NASHRIYYAH-I MUHANDESI-I BARQ VA MUHANDESI-I KAMPYUTAR-I IRAN (PERSIAN)[Internet]. 2006;4(1):21-29. Available from: https://sid.ir/paper/53747/en

    IEEE: Copy

    MANSOUR VALI, and A. SEYED SALEHI, “ROBUST RECOGNITION OF DIRECT AND TELEPHONY SPEECH USING PROPER EXTRACTION OF FEATURE VECTORS AND THEIR MODIFICATION BY NEURAL NETWORKS INVERSION,” NASHRIYYAH-I MUHANDESI-I BARQ VA MUHANDESI-I KAMPYUTAR-I IRAN (PERSIAN), vol. 4, no. 1, pp. 21–29, 2006, [Online]. Available: https://sid.ir/paper/53747/en

    Related Journal Papers

    Related Seminar Papers

  • No record.
  • Related Plans

  • No record.
  • Recommended Workshops






    Move to top
    telegram sharing button
    whatsapp sharing button
    linkedin sharing button
    twitter sharing button
    email sharing button
    email sharing button
    email sharing button
    sharethis sharing button