Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Information Seminar Paper

Title

Speaker Weight Estimation from Speech Signals Using a Fusion of the i-Vector and NFA Frameworks

Abstract

This paper proposes a novel approach for automatic speaker weight estimation from spontaneous telephone speech signals. Each utterance is modeled using two frameworks: the i-vector framework, which is based on factor analysis of Gaussian mixture model (GMM) mean supervectors, and the non-negative factor analysis (NFA) framework, which is based on a constrained factor analysis of GMM weights. The information available in both the Gaussian means and the Gaussian weights is then exploited through a feature-level fusion of the i-vectors and the NFA vectors. Finally, least-squares support vector regression (LS-SVR) is employed to estimate the weight of speakers from given utterances. The proposed approach is evaluated on the telephone speech signals of the National Institute of Standards and Technology (NIST) 2008 and 2010 Speaker Recognition Evaluation (SRE) corpora. Experimental results over 2339 utterances show that the correlation coefficients between actual and estimated weights of male and female speakers are 0.56 and 0.49, respectively, which indicates the effectiveness of the proposed method for speaker weight estimation.
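The back end of the pipeline described in the abstract (fusion, LS-SVR, correlation scoring) can be illustrated with a minimal sketch. It is not the authors' implementation. Several things are assumptions made only for illustration: feature-level fusion is taken to be plain concatenation, the kernel is an RBF kernel, the hyperparameters `gamma` and `sigma` are arbitrary, and the i-vectors and NFA vectors are replaced by random synthetic arrays, since extracting them requires a trained GMM and factor analysis models that the abstract does not detail. The LS-SVR part follows the standard least-squares SVM formulation, in which training reduces to solving one linear system.

```python
import numpy as np


def rbf_kernel(A, B, sigma):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    sq_dists = (
        np.sum(A**2, axis=1)[:, None]
        + np.sum(B**2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    return np.exp(-np.maximum(sq_dists, 0.0) / (2.0 * sigma**2))


def fuse(ivectors, nfa_vectors):
    """Feature-level fusion, ASSUMED here to be per-utterance concatenation."""
    return np.hstack([ivectors, nfa_vectors])


def lssvr_fit(X, y, gamma, sigma):
    """Solve the LS-SVR system [[0, 1^T], [1, K + I/gamma]] [b; alpha] = [0; y]."""
    n = X.shape[0]
    A = np.zeros((n + 1, n + 1))
    A[0, 1:] = 1.0
    A[1:, 0] = 1.0
    A[1:, 1:] = rbf_kernel(X, X, sigma) + np.eye(n) / gamma
    rhs = np.concatenate([[0.0], y])
    solution = np.linalg.solve(A, rhs)
    return solution[0], solution[1:]  # bias b, dual weights alpha


def lssvr_predict(X_train, b, alpha, X_new, sigma):
    """Prediction f(x) = sum_i alpha_i * k(x, x_i) + b."""
    return rbf_kernel(X_new, X_train, sigma) @ alpha + b


def pearson_r(actual, estimated):
    """Pearson correlation coefficient between actual and estimated values."""
    return float(np.corrcoef(actual, estimated)[0, 1])


# Synthetic stand-ins for real i-vectors and NFA vectors (illustration only).
rng = np.random.default_rng(0)
ivectors = rng.normal(size=(200, 8))
nfa_vectors = rng.normal(size=(200, 4))
weights_kg = (
    75.0 + 5.0 * ivectors[:, 0] + 3.0 * nfa_vectors[:, 0]
    + rng.normal(scale=2.0, size=200)
)

X = fuse(ivectors, nfa_vectors)
X_train, y_train = X[:150], weights_kg[:150]
X_test, y_test = X[150:], weights_kg[150:]

b, alpha = lssvr_fit(X_train, y_train, gamma=100.0, sigma=10.0)
estimated = lssvr_predict(X_train, b, alpha, X_test, sigma=10.0)
print("correlation on held-out synthetic data:", pearson_r(y_test, estimated))
```

Because the data here is synthetic, the printed correlation says nothing about the 0.56 and 0.49 figures reported in the abstract; the sketch only shows how the fused vectors, the regressor, and the evaluation metric fit together.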

  • Cite

    APA:

    Poorjam, Amir Hossein, Bahari, Mohamad Hasan, & Van hamme, Hugo. (2015). Speaker weight estimation from speech signals using a fusion of the i-vector and NFA frameworks. International Symposium on Artificial Intelligence and Signal Processing (AISP). SID. https://sid.ir/paper/927552/en

    Vancouver:

    Poorjam Amir Hossein, Bahari Mohamad Hasan, Van hamme Hugo. Speaker weight estimation from speech signals using a fusion of the i-vector and NFA frameworks. 2015. Available from: https://sid.ir/paper/927552/en

    IEEE:

    Amir Hossein Poorjam, Mohamad Hasan Bahari, and Hugo Van hamme, “Speaker weight estimation from speech signals using a fusion of the i-vector and NFA frameworks,” presented at the International Symposium on Artificial Intelligence and Signal Processing (AISP), 2015. [Online]. Available: https://sid.ir/paper/927552/en
