مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources


Information Journal Paper

Title

PERSIAN NAME ENTITY RECOGNITION AND CLASSIFICATION

Pages

  77-88

Abstract

 Named entity recognition (NER) identifies one or more kinds of names in a text and classifies them into specified categories. These categories can include names of people, organizations, companies, and places (country, city, street, etc.), time-related names (date and time), financial values, percentages, etc. Although a great deal of research has been done on NER in different languages over the past decade, the lack of a system with acceptable performance on Farsi texts is quite noticeable. In this paper, the corpus of the Research Center of Intelligent Signal Processing is used to build a Farsi NER system. The proposed system has three stages: preprocessing, feature extraction, and classification. In the preprocessing stage, names are extracted from the text using the part-of-speech (POS) feature, and then infinitives, time-related names, counting names, and numbers are removed from the data. This gives a more balanced data set for learning and classification. In the feature extraction stage, N-grams are computed as features, and in the classification stage four classifiers (linear, KNN, Bayesian, and neural network) are trained. Because time-related names show little variety and are rarely mixed with names in the other categories, an auxiliary list is used to identify them. The results show that the neural network performs best (99%) at distinguishing names of places from names of people. In general, the KNN and linear classifiers achieve 91% on the F-measure when classifying names of places, names of people, and general names. For time-related names, identified with the auxiliary list, an F-measure of 96% was obtained.
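 The three stages described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say whether the N-grams are character-level or word-level (character bigrams are assumed here), the cosine similarity measure and the value of k are arbitrary choices, and the transliterated word lists are hypothetical toy data rather than entries from the corpus.

```python
from collections import Counter
from math import sqrt

# Hypothetical auxiliary list of time-related names (transliterated toy data).
TIME_WORDS = {"shanbeh", "farvardin", "mehr"}

# Hypothetical labelled names standing in for the preprocessed corpus.
TRAINING = [
    ("hasani", "PERSON"), ("rezaei", "PERSON"), ("karimi", "PERSON"),
    ("tehran", "PLACE"), ("isfahan", "PLACE"), ("kerman", "PLACE"),
]

def char_ngrams(word, n=2):
    """Bag of character n-grams, with ^ and $ marking the word boundaries."""
    padded = "^" + word + "$"
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))

def cosine(a, b):
    """Cosine similarity between two n-gram count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items() if gram in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def knn_predict(word, training, k=3, n=2):
    """Majority label among the k training names most similar to `word`."""
    query = char_ngrams(word, n)
    ranked = sorted(
        training,
        key=lambda item: cosine(query, char_ngrams(item[0], n)),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

def classify(word, training=TRAINING, k=3):
    """Check the auxiliary time list first, then fall back to KNN."""
    if word in TIME_WORDS:
        return "TIME"
    return knn_predict(word, training, k=k)

if __name__ == "__main__":
    for name in ["farvardin", "tehran", "hasani"]:
        print(name, classify(name, k=1))
```

 Checking the auxiliary list before the classifier mirrors the abstract's observation that time-related names vary little, so a lookup is enough for them and the classifier only has to separate the remaining categories.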

Cite

    APA:

    ESFAHANI, SEYYED ABDOLHAMID, RAHATI GHOOCHANI, SAEED, & JAHANGIRI, NADER. (2010). PERSIAN NAME ENTITY RECOGNITION AND CLASSIFICATION. SIGNAL AND DATA PROCESSING, -(1 (SERIAL 13)), 77-88. SID. https://sid.ir/paper/160693/en

    Vancouver:

    ESFAHANI SEYYED ABDOLHAMID, RAHATI GHOOCHANI SAEED, JAHANGIRI NADER. PERSIAN NAME ENTITY RECOGNITION AND CLASSIFICATION. SIGNAL AND DATA PROCESSING[Internet]. 2010;-(1 (SERIAL 13)):77-88. Available from: https://sid.ir/paper/160693/en

    IEEE:

    SEYYED ABDOLHAMID ESFAHANI, SAEED RAHATI GHOOCHANI, and NADER JAHANGIRI, “PERSIAN NAME ENTITY RECOGNITION AND CLASSIFICATION,” SIGNAL AND DATA PROCESSING, vol. -, no. 1 (SERIAL 13), pp. 77–88, 2010, [Online]. Available: https://sid.ir/paper/160693/en
