مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Seminar Paper

Paper Information

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

video

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

sound

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Version

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View:

98
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Download:

67
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Cites:

Information Seminar Paper

Title

Hierarchical Three-module Method of Text Classification in Web Big Data

Pages

  -

Abstract

 Text analysis is a method for extracting knowledge from text. Memory and time limitations in processing Big Data is crucial due to data sources distributed in web, search engines and socials network sites. In addition, due to automatizing search process, summarizing and finding the interests of users, immediate Classification of various texts in a streaming manner has gained attention in industrial and scientific fields. Hierarchical Classification of text is among common issues which is simply possible in traditional methods using bag of words; however, while talking about Big Data and when there are a lot of labels of classes, employing traditional methods will not meet the needs of societies. With the improvement of data in internet and social networks, more powerful methods are needed which can classify the data closely and immediately. Through abstraction in textual data, deep learning can deal with these challenges. In this paper a deep learning method will be introduced which is based on hierarchical Classification (HAN) named HAN-MODI and which can classify texts from social networks and web sites with an accuracy of 98. 81% at the real time bilingually in English and Farsi. This paper also shows that this complex network with three modules word, sentence and document can work better at word level and there is no need to know syntactic or semantics structure of language. The novelty of the proposed method is adding a third level to the hierarchical structure for general detection and for more exact detection of the class. In addition, Classification using this method will be multi-level Classification and finally with a change in HAN, this method can be used with Farsi texts. Model improvement is done by adding a new layer above the architecture HAN. We called it as segmentation of sentences into expressions Bag of Sentences and added a dynamicity window in any stage that applied attention mechanism simultaneously.

Video

Cites

  • No record.
  • References

  • No record.
  • Cite

    APA: Copy

    REZAEI, ZAHRA, Eslami, Behnaz, Amini, Mohammad Amin, & ESLAMI, MOHAMMAD. (2020). Hierarchical Three-module Method of Text Classification in Web Big Data. INTERNATIONAL CONFERENCE ON WEB RESEARCH. SID. https://sid.ir/paper/949215/en

    Vancouver: Copy

    REZAEI ZAHRA, Eslami Behnaz, Amini Mohammad Amin, ESLAMI MOHAMMAD. Hierarchical Three-module Method of Text Classification in Web Big Data. 2020. Available from: https://sid.ir/paper/949215/en

    IEEE: Copy

    ZAHRA REZAEI, Behnaz Eslami, Mohammad Amin Amini, and MOHAMMAD ESLAMI, “Hierarchical Three-module Method of Text Classification in Web Big Data,” presented at the INTERNATIONAL CONFERENCE ON WEB RESEARCH. 2020, [Online]. Available: https://sid.ir/paper/949215/en

    Related Journal Papers

  • No record.
  • Related Seminar Papers

  • No record.
  • Related Plans

  • No record.
  • Recommended Workshops






    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    File Not Exists.
    Move to top
    telegram sharing button
    whatsapp sharing button
    linkedin sharing button
    twitter sharing button
    email sharing button
    email sharing button
    email sharing button
    sharethis sharing button