مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Verion

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

video

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

sound

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Version

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View:

1,095
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Download:

0
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Cites:

1

Information Journal Paper

Title

EVALUATING TWO APPROACHES FOR FARSI OCR BASED ON SUB-WORD SHAPE RECOGNITION

Pages

  267-280

Abstract

 Two approaches for the recognition of printed Farsi documents based on SUB-WORD SHAPE recognition is proposed. First approach is based on recognition of SUB-WORD SHAPE as a whole and the second is based on the recognition of the body of sub-words. Sub-word body is constructed via removing dots and signs of the sub word. In second approach, information of dots and signs will be added after recognition of the body. Both approaches have two phases: training and test. In training phase, sub-words are clustered based on ISODATA algorithm. Initial centers of the clusters are computed through a hierarchical CLUSTERING algorithm. In first approach, sub-word recognition is performed in two stages: finding clusters close to the input sub-word and then finding the best match within the sub-words of these clusters. In the second approach another stage is required to find the final sub-word including dots and signs. Experimental results show that on clean images the first algorithm have better performance; 94% versus 93% in word level. But when dealing with low quality and noisy images, both algorithms are suffering from reduced accuracy. Sometimes this reduction is significant. The reasons of this behavior are inspected and some solutions are presented. Finally we compared both methods and inspected pros and cons of FARSI OCR based on SUB-WORD SHAPE.

Cites

References

  • No record.
  • Cite

    APA: Copy

    KHOSRAVI, H., & KABIR, E.A.. (2010). EVALUATING TWO APPROACHES FOR FARSI OCR BASED ON SUB-WORD SHAPE RECOGNITION. NASHRIYYAH-I MUHANDESI-I BARQ VA MUHANDESI-I KAMPYUTAR-I IRAN (PERSIAN), 7(4), 267-280. SID. https://sid.ir/paper/53709/en

    Vancouver: Copy

    KHOSRAVI H., KABIR E.A.. EVALUATING TWO APPROACHES FOR FARSI OCR BASED ON SUB-WORD SHAPE RECOGNITION. NASHRIYYAH-I MUHANDESI-I BARQ VA MUHANDESI-I KAMPYUTAR-I IRAN (PERSIAN)[Internet]. 2010;7(4):267-280. Available from: https://sid.ir/paper/53709/en

    IEEE: Copy

    H. KHOSRAVI, and E.A. KABIR, “EVALUATING TWO APPROACHES FOR FARSI OCR BASED ON SUB-WORD SHAPE RECOGNITION,” NASHRIYYAH-I MUHANDESI-I BARQ VA MUHANDESI-I KAMPYUTAR-I IRAN (PERSIAN), vol. 7, no. 4, pp. 267–280, 2010, [Online]. Available: https://sid.ir/paper/53709/en

    Related Journal Papers

    Related Seminar Papers

  • No record.
  • Related Plans

  • No record.
  • Recommended Workshops






    Move to top
    telegram sharing button
    whatsapp sharing button
    linkedin sharing button
    twitter sharing button
    email sharing button
    email sharing button
    email sharing button
    sharethis sharing button