مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Verion

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

video

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

sound

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Version

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View:

321
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Download:

0
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Cites:

Information Journal Paper

Title

Robust sub-band speech feature extraction using multiresolution convolutional neural networks

Pages

  1393-1404

Abstract

Convolutional neural networks (CNNs), as a kind of deep neural networks, have been recently used for acoustic modeling and feature extraction along with acoustic modeling in speech recognition systems. In this paper, we propose to use CNN for robust feature extraction from the noisy speech spectrum. In the proposed manner, CNN inputs are noisy speech spectrum and its targets are denoised logarithm of Mel filter bank energies (LMFBs). Consequently, CNN extracts robust features from speech spectrum. The drawback of CNN in the proposed method is its fixed frequency resolution. Thus, we propose to use multiple CNNs with different convolution filter sizes to provide different frequency resolutions for feature extraction from the speech spectrum. We named this method as Multiresolution CNN (MRCNN). Recognition accuracy on Aurora 2 database, shows that CNNs outperform deep belief networks such that, CNN recognition accuracy has 20% relative improvement on average over DBN. However, results show that MRCNN recognition accuracy has 1% relative improvement on average over CNN.

Cites

  • No record.
  • References

  • No record.
  • Cite

    APA: Copy

    Naderi, Navid, & NASERSHARIF, BABAK. (2019). Robust sub-band speech feature extraction using multiresolution convolutional neural networks. TABRIZ JOURNAL OF ELECTRICAL ENGINEERING, 49(3 (89) ), 1393-1404. SID. https://sid.ir/paper/256522/en

    Vancouver: Copy

    Naderi Navid, NASERSHARIF BABAK. Robust sub-band speech feature extraction using multiresolution convolutional neural networks. TABRIZ JOURNAL OF ELECTRICAL ENGINEERING[Internet]. 2019;49(3 (89) ):1393-1404. Available from: https://sid.ir/paper/256522/en

    IEEE: Copy

    Navid Naderi, and BABAK NASERSHARIF, “Robust sub-band speech feature extraction using multiresolution convolutional neural networks,” TABRIZ JOURNAL OF ELECTRICAL ENGINEERING, vol. 49, no. 3 (89) , pp. 1393–1404, 2019, [Online]. Available: https://sid.ir/paper/256522/en

    Related Journal Papers

    Related Seminar Papers

  • No record.
  • Related Plans

  • No record.
  • Recommended Workshops






    Move to top
    telegram sharing button
    whatsapp sharing button
    linkedin sharing button
    twitter sharing button
    email sharing button
    email sharing button
    email sharing button
    sharethis sharing button