مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

video

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

sound

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Version

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View:

148
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Download:

89
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Cites:

Information Seminar Paper

Title

SPEECH/MUSIC SEPARATION USING NON-NEGATIVE MATRIX FACTORIZATION WITH COMBINATION OF COST FUNCTIONS

Pages

  -

Abstract

 A SOLUTION FOR SEPARATING SPEECH FROM MUSIC SIGNAL AS A SINGLE CHANNEL SOURCE SEPARATION IS NON-NEGATIVE MATRIX FACTORIZATION (NMF). IN THIS APPROACH SPECTROGRAM OF EACH SOURCE SIGNAL IS FACTORIZED AS MULTIPLICATION OF TWO MATRICES WHICH ARE KNOWN AS BASIS AND WEIGHT MATRICES. TO ACHIEVE PROPER ESTIMATION OF SIGNAL SPECTROGRAM, WEIGHT AND BASIS MATRICES ARE UPDATED ITERATIVELY. TO ESTIMATE DISTANCE BETWEEN SIGNAL AND ITS ESTIMATION A COST FUNCTION IS USED USUALLY. DIFFERENT COST FUNCTIONS HAVE BEEN INTRODUCED BASED ON KULLBACK-LEIBLER (KL) AND ITAKURA-SAITO (IS) DIVERGENCES. IS DIVERGENCE IS SCALE-INVARIANT AND SO IT IS SUITABLE FOR THE CONDITIONS IN WHICH THE COEFFICIENTS OF SIGNAL HAVE A LARGE DYNAMIC RANGE, FOR EXAMPLE IN MUSIC SHORT-TERM SPECTRA. BASED ON THIS IS PROPERTY, IN THIS PAPER, WE PROPOSE TO USE IS DIVERGENCE AS COST FUNCTION OF NMF IN THE TRAINING STAGE FOR MUSIC AND ON THE OTHER HAND WE SUGGEST TO USE KL DIVERGENCE AS NMF COST FUNCTION IN THE TRAINING STAGE FOR SPEECH. MOREOVER, IN THE DECOMPOSITION STAGE, WE PROPOSE TO USE A LINEAR COMBINATION OF THESE TWO DIVERGENCES IN ADDITION TO A REGULARIZATION TERM WHICH CONSIDERS TEMPORAL CONTINUITY INFORMATION AS A PRIOR KNOWLEDGE. EXPERIMENTAL RESULTS ON ONE HOUR OF SPEECH AND MUSIC, SHOWS A GOOD TRADE-OFF BETWEEN SIGNAL TO INFERENCE RATIO (SIR) OF SPEECH AND MUSIC IN COMPARISON TO CONVENTIONAL NMF METHODS. ...

Multimedia

  • No record.
  • Cites

  • No record.
  • References

  • No record.
  • Cite

    APA: Copy

    NASERSHARIF, BABAK, & Abdali, Sara. (2015). SPEECH/MUSIC SEPARATION USING NON-NEGATIVE MATRIX FACTORIZATION WITH COMBINATION OF COST FUNCTIONS. INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP). SID. https://sid.ir/paper/927543/en

    Vancouver: Copy

    NASERSHARIF BABAK, Abdali Sara. SPEECH/MUSIC SEPARATION USING NON-NEGATIVE MATRIX FACTORIZATION WITH COMBINATION OF COST FUNCTIONS. 2015. Available from: https://sid.ir/paper/927543/en

    IEEE: Copy

    BABAK NASERSHARIF, and Sara Abdali, “SPEECH/MUSIC SEPARATION USING NON-NEGATIVE MATRIX FACTORIZATION WITH COMBINATION OF COST FUNCTIONS,” presented at the INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP). 2015, [Online]. Available: https://sid.ir/paper/927543/en

    Related Journal Papers

  • No record.
  • Related Seminar Papers

  • No record.
  • Related Plans

  • No record.
  • Recommended Workshops






    Move to top
    telegram sharing button
    whatsapp sharing button
    linkedin sharing button
    twitter sharing button
    email sharing button
    email sharing button
    email sharing button
    sharethis sharing button