مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Verion

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

video

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

sound

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Version

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View:

1,154
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Download:

0
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Cites:

Information Journal Paper

Title

ADAPTIVE COMPRESSION OF WIDE-BAND SPEECH AND AUDIO USING WAVELET TRANSFORM

Pages

  63-67

Abstract

 The design of a new codec at 32 kb/s for audio and high quality speech (bandwidth limited to 7 kHz and sampled at 16 kHz with 16b/sample) is presented in this paper. This codec is a good substitute for the G721 ITU Standard and its 64 kb/s variant G722 that are based on ADPCM and dating from the late 1980s. This new codec comprises adaptive wavelet transform coding, PSYCHO-ACOUSTIC MODELing, quantization and variable length entropy and run-length coding. The novelty here is the use of a parametric wavelet kernel and the way the WAVELET PACKET tree (WPT) is expanded so that better matching is achieved with critical acoustic bands. The explicit kernel permits to control the sharpness of the basic half-band filter of which the filter used in the Fast Wavelet Transform (FWT) coding are derived. The PSYCHO-ACOUSTIC MODELing of MPEG1-Audio is used but instead of employing power spectrum for calculating the Signal-to-Mask ratio (S/M), we have directly used the energies of WPT output signals. As a consequence, the computation cost is reduced. The number of quantization bits in each band is controlled by the corresponding S/M ratio. The Variable Length Coding (VLC) used here is an extension of JPEG Huffman coding where some modifications are made to adapt this scheme to speech characteristics. The developed codec has the capability of reducing the bit-rate and controlling the required quality by changing the S/M ratios. Therefore, it can be used for fixed capacity channels by the same token. It is shown that this scheme has a very good quality at 32 kb/s and that the coded signal is quite indistinguishable from the PCM signal digitized at 16 kHz and 16b/sample.

Cites

  • No record.
  • References

  • No record.
  • Cite

    APA: Copy

    MORTAZAVI, T., & SAVOJI, H.. (2004). ADAPTIVE COMPRESSION OF WIDE-BAND SPEECH AND AUDIO USING WAVELET TRANSFORM. NASHRIYYAH-I MUHANDESI-I BARQ VA MUHANDESI-I KAMPYUTAR-I IRAN (PERSIAN), 2(2), 63-67. SID. https://sid.ir/paper/53668/en

    Vancouver: Copy

    MORTAZAVI T., SAVOJI H.. ADAPTIVE COMPRESSION OF WIDE-BAND SPEECH AND AUDIO USING WAVELET TRANSFORM. NASHRIYYAH-I MUHANDESI-I BARQ VA MUHANDESI-I KAMPYUTAR-I IRAN (PERSIAN)[Internet]. 2004;2(2):63-67. Available from: https://sid.ir/paper/53668/en

    IEEE: Copy

    T. MORTAZAVI, and H. SAVOJI, “ADAPTIVE COMPRESSION OF WIDE-BAND SPEECH AND AUDIO USING WAVELET TRANSFORM,” NASHRIYYAH-I MUHANDESI-I BARQ VA MUHANDESI-I KAMPYUTAR-I IRAN (PERSIAN), vol. 2, no. 2, pp. 63–67, 2004, [Online]. Available: https://sid.ir/paper/53668/en

    Related Journal Papers

    Related Seminar Papers

  • No record.
  • Related Plans

  • No record.
  • Recommended Workshops






    Move to top
    telegram sharing button
    whatsapp sharing button
    linkedin sharing button
    twitter sharing button
    email sharing button
    email sharing button
    email sharing button
    sharethis sharing button