مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Information Journal Paper

Title

ANALYZING CONTENT-BASED HEURISTICS FOR PERSIAN WEB SPAM DETECTION

Pages

  25-39

Abstract

 The rapid growth of web spam on the World Wide Web has motivated researchers to propose algorithms for combating it. Despite these techniques, search engines still perform poorly at detecting Persian spam websites. In this paper, we analyze the effectiveness of many previously proposed content-based features for detecting Persian spam websites and present a number of new content-based features. As a second approach, we describe and evaluate our Bag-Of-Spam-Words (BOSW) method for web spam detection, in which each document is represented as a vector of specific words selected from a spam corpus. Finally, we apply a number of feature selection methods and use several classification algorithms to classify Persian websites. For this purpose, we have created a dataset of Persian hosts. Our results show that the BOSW method combined with an SVM classifier performs best at detecting Persian spam websites.
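The document representation described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the spam words are selected or weighted, so the sketch assumes the k most frequent tokens in the spam corpus and raw term counts. The function names (`tokenize`, `select_spam_words`, `bosw_vector`) and the toy English corpus are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split into word tokens (\\w is Unicode-aware in Python 3)."""
    return re.findall(r"\w+", text.lower())


def select_spam_words(spam_corpus, k):
    """Return the k most frequent tokens in the spam corpus.

    Frequency-based selection is an assumption made for this sketch;
    the abstract does not state the selection criterion.
    """
    counts = Counter()
    for doc in spam_corpus:
        counts.update(tokenize(doc))
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, _ in ranked[:k]]


def bosw_vector(document, spam_words):
    """Represent a document as raw counts of each selected spam word."""
    counts = Counter(tokenize(document))
    return [counts[word] for word in spam_words]


# Toy spam corpus (hypothetical data, for illustration only).
spam_corpus = [
    "cheap pills cheap casino",
    "casino bonus cheap",
]
vocabulary = select_spam_words(spam_corpus, k=3)
vector = bosw_vector("cheap casino news about cheap travel", vocabulary)
print(vocabulary)
print(vector)
```

Vectors built this way could then be passed to any standard classifier, such as the SVM mentioned in the abstract; the classifier itself is left out of the sketch.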

  • Cite

    APA:

    RABBANI, ELAHE, & SHAKERY, AZADEH. (2014). ANALYZING CONTENT-BASED HEURISTICS FOR PERSIAN WEB SPAM DETECTION. INTERNATIONAL JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY RESEARCH, 6(3), 25-39. SID. https://sid.ir/paper/315128/en

    Vancouver:

    RABBANI ELAHE, SHAKERY AZADEH. ANALYZING CONTENT-BASED HEURISTICS FOR PERSIAN WEB SPAM DETECTION. INTERNATIONAL JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY RESEARCH[Internet]. 2014;6(3):25-39. Available from: https://sid.ir/paper/315128/en

    IEEE:

    ELAHE RABBANI and AZADEH SHAKERY, “ANALYZING CONTENT-BASED HEURISTICS FOR PERSIAN WEB SPAM DETECTION,” INTERNATIONAL JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY RESEARCH, vol. 6, no. 3, pp. 25–39, 2014. [Online]. Available: https://sid.ir/paper/315128/en
