Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Information Journal Paper

Title

MACHINE LEARNING APPROACHES TO TEXT SEGMENTATION

Pages

  395-403

Keywords

Not Registered.

Abstract

Two machine learning approaches to text segmentation are introduced. The first is based on inductive learning in the form of a decision tree, and the second uses the Naive Bayes technique. A set of training data is generated from a wide range of compound text-image documents to train both the decision tree and the Naive Bayes Classifier (NBC). The compound documents used to generate the training data include both machine-printed and handwritten text in different fonts and sizes. Eighteen Discrete Cosine Transform (DCT) coefficients are used as the main features to distinguish text from images. The trained decision tree and Naive Bayes classifier are tested on unseen documents, and very promising results are obtained; the latter method is more accurate and computationally faster. Finally, the results of the proposed approaches are compared with those of a wavelet-based approach, and both methods presented in this paper are shown to be more effective.
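As a rough illustration of the pipeline the abstract describes (DCT coefficients as features, followed by a Naive Bayes classifier), the following minimal sketch may help. It is not the authors' implementation. The 8x8 block size, the zigzag-style choice of which 18 coefficients to keep, the Gaussian form of the Naive Bayes model, and the synthetic "text" and "image" blocks are all assumptions made for illustration; the abstract specifies none of them. The decision tree variant is omitted.

```python
import math
import random

def dct2(block):
    """Orthonormal 2-D DCT-II of a square block (list of lists of floats)."""
    n = len(block)
    scale = [math.sqrt(1.0 / n)] + [math.sqrt(2.0 / n)] * (n - 1)
    # cos_table[k][x] = cos((2x + 1) * k * pi / (2n))
    cos_table = [[math.cos((2 * x + 1) * k * math.pi / (2 * n)) for x in range(n)]
                 for k in range(n)]
    return [[scale[u] * scale[v] * sum(block[x][y] * cos_table[u][x] * cos_table[v][y]
                                       for x in range(n) for y in range(n))
             for v in range(n)] for u in range(n)]

def zigzag_indices(n):
    """(row, col) pairs ordered by anti-diagonal, lowest frequencies first."""
    cells = [(u, v) for u in range(n) for v in range(n)]
    return sorted(cells, key=lambda p: (p[0] + p[1],
                                        p[0] if (p[0] + p[1]) % 2 else p[1]))

def dct_features(block, k=18):
    """First k DCT coefficients in zigzag order (the selection rule is an assumption)."""
    coeffs = dct2(block)
    return [coeffs[u][v] for u, v in zigzag_indices(len(block))[:k]]

class GaussianNaiveBayes:
    """Tiny Gaussian Naive Bayes: one mean and variance per feature per class."""

    def fit(self, X, y):
        self.stats = {}
        for label in set(y):
            rows = [x for x, t in zip(X, y) if t == label]
            means = [sum(col) / len(rows) for col in zip(*rows)]
            # Small floor keeps the variance strictly positive.
            variances = [sum((v - m) ** 2 for v in col) / len(rows) + 1e-6
                         for col, m in zip(zip(*rows), means)]
            self.stats[label] = (math.log(len(rows) / len(X)), means, variances)
        return self

    def predict(self, x):
        def log_posterior(label):
            log_prior, means, variances = self.stats[label]
            return log_prior + sum(
                -0.5 * math.log(2 * math.pi * s) - (v - m) ** 2 / (2 * s)
                for v, m, s in zip(x, means, variances))
        return max(self.stats, key=log_posterior)

def make_block(kind, rng, n=8):
    """Synthetic stand-in data (NOT from the paper): 'text' blocks are
    high-contrast black/white pixels, 'image' blocks are smooth gray levels."""
    if kind == "text":
        return [[rng.choice((0.0, 255.0)) for _ in range(n)] for _ in range(n)]
    base = rng.uniform(100.0, 150.0)
    return [[base + rng.uniform(-5.0, 5.0) for _ in range(n)] for _ in range(n)]

# Train on 20 synthetic blocks per class.
rng = random.Random(0)
labels = ["text", "image"] * 20
X = [dct_features(make_block(label, rng)) for label in labels]
model = GaussianNaiveBayes().fit(X, labels)
```

On real documents, `make_block` would be replaced by blocks cut from labelled page images, and `model.predict(dct_features(block))` would be called for each block of an unseen page to mark it as text or image.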

  • Cite

    APA:

    HAJI, M.M., & KATEBI, S.A.D. (2006). MACHINE LEARNING APPROACHES TO TEXT SEGMENTATION. SCIENTIA IRANICA, 13(4), 395-403. SID. https://sid.ir/paper/289367/en

    Vancouver:

    HAJI M.M., KATEBI S.A.D. MACHINE LEARNING APPROACHES TO TEXT SEGMENTATION. SCIENTIA IRANICA [Internet]. 2006;13(4):395-403. Available from: https://sid.ir/paper/289367/en

    IEEE:

    M.M. HAJI and S.A.D. KATEBI, “MACHINE LEARNING APPROACHES TO TEXT SEGMENTATION,” SCIENTIA IRANICA, vol. 13, no. 4, pp. 395–403, 2006. [Online]. Available: https://sid.ir/paper/289367/en
