مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Seminar Paper

Paper Information

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

video

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

sound

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Version

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View:

53
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Download:

53
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Cites:

Information Seminar Paper

Title

A Semi-Automated Labeled Data Generation Approach Based on Deep Learning to Improve Sentiment Analysis in the Persian Language

Pages

  -

Abstract

 Opinions play an essential role in human life. With the ease of sharing opinions, ideas, and feelings on various topics through the web and social networks, the analysis of opinions and emotions has become increasingly important. As social networks continue to expand, the importance of Sentiment Analysis will only grow. While much research has been conducted on Sentiment Analysis in the Persian Language, its accuracy still falls short compared to available English methods, and it faces several challenges. One of the most significant challenges is the lack of labeled datasets. To improve Sentiment Analysis, numerous datasets have been collected during various research projects. Despite these efforts, the volume of labeled data remains insignificant because labeling unlabeled data is a costly and time-consuming process due to its manual and human nature. This research presents a semi-automatic method for generating labeled datasets. The proposed method combines pre-trained Deep Learning models with a human agent, allowing more labeled data to be obtained while spending less money, time, and manpower and using the power of Deep Learning models. Some unlabeled data were labeled based on this method and added to the basic dataset to create a new dataset called the “, proposed dataset”, . To evaluate the effectiveness of the proposed method, both the basic and proposed datasets were tested on the ParsBERT language model using the same test dataset. The results showed a 4% improvement in ParsBERT 's F1 score on the proposed dataset compared to the basic dataset. Notably, fine-tuning ParsBERT with the new dataset also made it more general and removed one of its weaknesses, i. e., overfitting.

Cites

  • No record.
  • References

  • No record.
  • Cite

    APA: Copy

    Ghannadan Shirazi, Golnaz, AZMI, REZA, & Shakibian, Hadi. (). . . SID. https://sid.ir/paper/1046867/en

    Vancouver: Copy

    Ghannadan Shirazi Golnaz, AZMI REZA, Shakibian Hadi. . . Available from: https://sid.ir/paper/1046867/en

    IEEE: Copy

    Golnaz Ghannadan Shirazi, REZA AZMI, and Hadi Shakibian, “,” presented at the . , [Online]. Available: https://sid.ir/paper/1046867/en

    Related Journal Papers

  • No record.
  • Related Seminar Papers

  • No record.
  • Related Plans

  • No record.
  • Recommended Workshops






    مرکز اطلاعات علمی SID
    strs
    دانشگاه امام حسین
    بنیاد ملی بازیهای رایانه ای
    کلید پژوه
    ایران سرچ
    ایران سرچ
    File Not Exists.
    Move to top
    telegram sharing button
    whatsapp sharing button
    linkedin sharing button
    twitter sharing button
    email sharing button
    email sharing button
    email sharing button
    sharethis sharing button