مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

video

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

sound

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Version

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View:

466
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Download:

123
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Cites:

Information Journal Paper

Title

TREE WRAP-DATA EXTRACTION USING TREE MATCHING ALGORITHM

Pages

  43-55

Abstract

 In this paper, we develop a non-visual AUTOMATIC WRAPPER to extract data records from SEARCH ENGINE results pages which contain important information for computer users. Our wrapper consists of a series of data filter to detect and remove irrelevant data from the web page. In the filtering stages, we incorporate two main algorithms which are able to check the similarity of data records and to detect and extract the correct data region based on their component sizes. To evaluate the performance of our algorithm, we carry out experimental and deletion tests. Experimental tests show that our wrapper outperforms the existing state of the art wrappers such as ViNT and DEPTA. Deletion studies by replacing our novel techniques with state of the art conventional techniques show that our wrapper design is efficient and could robustly extract data records from SEARCH ENGINE results pages. With the speed advantages, our wrapper could be beneficial in processing large amount of web sites data, which could be helpful in meta SEARCH ENGINE development.

Cites

  • No record.
  • References

    Cite

    APA: Copy

    CHONG, J.L., & FAUZI, F.. (2010). TREE WRAP-DATA EXTRACTION USING TREE MATCHING ALGORITHM. MAJLESI JOURNAL OF ELECTRICAL ENGINEERING, 4(2 (13)), 43-55. SID. https://sid.ir/paper/572649/en

    Vancouver: Copy

    CHONG J.L., FAUZI F.. TREE WRAP-DATA EXTRACTION USING TREE MATCHING ALGORITHM. MAJLESI JOURNAL OF ELECTRICAL ENGINEERING[Internet]. 2010;4(2 (13)):43-55. Available from: https://sid.ir/paper/572649/en

    IEEE: Copy

    J.L. CHONG, and F. FAUZI, “TREE WRAP-DATA EXTRACTION USING TREE MATCHING ALGORITHM,” MAJLESI JOURNAL OF ELECTRICAL ENGINEERING, vol. 4, no. 2 (13), pp. 43–55, 2010, [Online]. Available: https://sid.ir/paper/572649/en

    Related Journal Papers

  • No record.
  • Related Seminar Papers

  • No record.
  • Related Plans

  • No record.
  • Recommended Workshops






    Move to top
    telegram sharing button
    whatsapp sharing button
    linkedin sharing button
    twitter sharing button
    email sharing button
    email sharing button
    email sharing button
    sharethis sharing button