Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Journal Paper

Paper Information

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Verion

Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

video

Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

sound

Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Version

Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View:

999
Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Download:

0
Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Cites:

Information Journal Paper

Title

INCREASING PERFORMANCE OF TEXT CLASSIFICATION BASED ON IMPROVING FEATURE SELECTION

Pages

  313-328

Keywords

Not Registered.

Abstract

 In text classification, words are usually used as features. Since text classification methods are faced to too many features, here we propose several methods to improve feature selection in text classification, and we show merits of the proposed methods in comparison to the relevant ones. First, we propose two methods that just concentrate on all positive and negative correlation factors between features and document categories. The first method uses all correlation factors with a positive effect while the second one uses positive factors with a positive effect and negative factors with a negative effect in feature selection criteria. Our experiments show that the positive factors are more effective than the negative factors in feature selection criteria. Next, we propose a hybrid method. First, it uses Relief-F filtering method to select a number of features spending a low computation cost. Then, the selected features are further reduced using an SFS or SBS wrapper method. The hybrid method has a better performance in comparison to filtering methods. Our experiment results show that applying this method to SVM Light classifier on Reuters-21578 corpora we can remove up to 94% of features while improving classification accuracy. Moreover, it is worth to mention that Relief-F by itself has shown a good result where it is the first time it is applied to text domain.

Cites

  • No record.
  • References

  • No record.
  • Cite

    APA: Copy

    JALILI, S., & BITARAFAN, M.. (2006). INCREASING PERFORMANCE OF TEXT CLASSIFICATION BASED ON IMPROVING FEATURE SELECTION. JOURNAL OF FACULTY OF ENGINEERING (UNIVERSITY OF TEHRAN), 40(3 (97)), 313-328. SID. https://sid.ir/paper/14467/en

    Vancouver: Copy

    JALILI S., BITARAFAN M.. INCREASING PERFORMANCE OF TEXT CLASSIFICATION BASED ON IMPROVING FEATURE SELECTION. JOURNAL OF FACULTY OF ENGINEERING (UNIVERSITY OF TEHRAN)[Internet]. 2006;40(3 (97)):313-328. Available from: https://sid.ir/paper/14467/en

    IEEE: Copy

    S. JALILI, and M. BITARAFAN, “INCREASING PERFORMANCE OF TEXT CLASSIFICATION BASED ON IMPROVING FEATURE SELECTION,” JOURNAL OF FACULTY OF ENGINEERING (UNIVERSITY OF TEHRAN), vol. 40, no. 3 (97), pp. 313–328, 2006, [Online]. Available: https://sid.ir/paper/14467/en

    Related Journal Papers

    Related Seminar Papers

  • No record.
  • Related Plans

  • No record.
  • Recommended Workshops






    Move to top