مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Information Journal Paper

Title

LANGUAGE MODEL ADAPTATION USING DIRICHLET CLASS LANGUAGE MODEL BASED ON PART-OF-SPEECH

Pages

  41-46

Abstract

Language modeling has applications in a wide variety of domains. The performance of a language model depends on how well it is adapted to a particular style of data. Accordingly, adaptation methods try to exploit syntactic and semantic characteristics of the language. Previous adaptation methods, such as the Dirichlet class language model (DCLM) family, extract the class of the history words. Because they lack syntactic information, these methods are not well suited to morphologically rich languages such as Farsi. In this paper, we present an idea for using syntactic information such as part-of-speech (POS) tags in DCLM and combining it with a language model from the n-gram family. In our work, word clustering is based on the POS of the previous words and on the history words in DCLM. The performance of the language models is evaluated on the BijanKhan corpus using a hidden Markov model based ASR system. The results show that using POS information along with the history words and the class of the history words improves the performance of the language model and decreases the perplexity on our corpus. Exploiting POS information along with DCLM, the word error rate of the ASR system decreases by 1.2% compared to DCLM.
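The abstract reports results in terms of perplexity and word error rate and mentions combining a class-based model with an n-gram model, but gives no formulas. The following is a minimal sketch of the standard textbook definitions of those two metrics, plus linear interpolation as one common way to combine two models. Linear interpolation is an assumption here, not necessarily the combination used in the paper, and the names `interpolate`, `perplexity`, `word_error_rate`, and the weight `lam` are hypothetical.

```python
import math


def interpolate(p_ngram, p_class, lam):
    """Linearly interpolate two probability estimates for the same word.

    `lam` is a hypothetical mixing weight in [0, 1]; the abstract does not
    say how the n-gram and class-based models are actually combined.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    return lam * p_ngram + (1.0 - lam) * p_class


def perplexity(word_probs):
    """Perplexity: exp of the average negative log-probability per word."""
    if not word_probs:
        raise ValueError("need at least one word probability")
    log_sum = sum(math.log(p) for p in word_probs)
    return math.exp(-log_sum / len(word_probs))


def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_word in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hypothesis, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(reference)
```

Under these definitions, lower perplexity means the model assigns higher probability to the test words on average, and lower word error rate means the recognizer output needs fewer word edits to match the reference transcript.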


Cite

APA:

HATAMI, ALI, AKBARI, AHMAD, & NASERSHARIF, BABAK. (2014). LANGUAGE MODEL ADAPTATION USING DIRICHLET CLASS LANGUAGE MODEL BASED ON PART-OF-SPEECH. JOURNAL OF INFORMATION SYSTEMS AND TELECOMMUNICATION (JIST), 2(1 (5)), 41-46. SID. https://sid.ir/paper/332625/en

Vancouver:

HATAMI ALI, AKBARI AHMAD, NASERSHARIF BABAK. LANGUAGE MODEL ADAPTATION USING DIRICHLET CLASS LANGUAGE MODEL BASED ON PART-OF-SPEECH. JOURNAL OF INFORMATION SYSTEMS AND TELECOMMUNICATION (JIST)[Internet]. 2014;2(1 (5)):41-46. Available from: https://sid.ir/paper/332625/en

IEEE:

ALI HATAMI, AHMAD AKBARI, and BABAK NASERSHARIF, “LANGUAGE MODEL ADAPTATION USING DIRICHLET CLASS LANGUAGE MODEL BASED ON PART-OF-SPEECH,” JOURNAL OF INFORMATION SYSTEMS AND TELECOMMUNICATION (JIST), vol. 2, no. 1 (5), pp. 41–46, 2014, [Online]. Available: https://sid.ir/paper/332625/en
