مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Seminar Paper

Paper Information

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

video

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

sound

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Version

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View:

316
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Download:

187
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Cites:

Information Seminar Paper

Title

FAST AND SCALABLE PROTEIN MOTIF SEQUENCE CLUSTERING BASED ON HADOOP FRAMEWORK

Pages

  -

Abstract

 IN RECENT YEARS, WE ARE FACED WITH LARGE AMOUNTS OF SPORADIC UNSTRUCTURED DATA ON THE WEB. WITH THE EXPLOSIVE GROWTH OF SUCH DATA, THERE IS A GROWING NEED FOR EFFECTIVE METHODS SUCH AS CLUSTERING TO ANALYZE AND EXTRACT INFORMATION. BIOLOGICAL DATA FORMS AN IMPORTANT PART OF UNSTRUCTURED DATA ON THE WEB. PROTEIN SEQUENCE DATABASES ARE CONSIDERED AS A PRIMARY SOURCE OF BIOLOGICAL DATA. CLUSTERING CAN HELP TO ORGANIZE SEQUENCES INTO HOMOLOGOUS AND FUNCTIONALLY SIMILAR GROUPS AND CAN IMPROVE THE SPEED OF DATA PROCESSING AND ANALYSIS. PROTEINS ARE RESPONSIBLE FOR MOST OF THE ACTIVITIES IN CELLS. THE MAJORITY OF PROTEINS SHOW THEIR FUNCTION THROUGH INTERACTION WITH OTHER PROTEINS. HENCE, PREDICTION OF PROTEIN INTERACTIONS IS AN IMPORTANT RESEARCH AREA IN THE BIOMEDICAL SCIENCES. MOTIFS ARE FRAGMENTS FREQUENTLY OCCURRED IN PROTEIN SEQUENCES. A WELL-KNOWN METHOD TO SPECIFY THE PROTEIN INTERACTION IS BASED ON MOTIF CLUSTERING. EXISTING WORKS ON MOTIF CLUSTERING METHODS SHARE THE PROBLEM OF LIMITATION IN THE NUMBER OF CLUSTERS. HOWEVER, REGARDING THE VAST AMOUNT OF MOTIFS AND THE NECESSITY OF A LARGE NUMBER OF CLUSTERS, IT SEEMS THAT AN EFFICIENT, SCALABLE AND FAST METHOD IS NECESSARY TO CLUSTER SUCH LARGE NUMBER OF SEQUENCES. IN THIS PAPER, WE PROPOSE A NOVEL APPROACH TO CLUSTER A LARGE NUMBER OF MOTIFS. OUR APPROACH INCLUDES EXTRACTING MOTIFS WITHIN PROTEIN SEQUENCES, FEATURE SELECTION, PREPROCESSING, DIMENSION REDUCTION AND UTILIZING BIGFCM (A LARGE-SCALE FUZZY CLUSTERING) ON SEVERAL DISTRIBUTED NODES WITH HADOOP FRAMEWORK TO TAKE THE ADVANTAGE OF MAPREDUCE PROGRAMMING. EXPERIMENTAL RESULTS SHOW VERY GOOD PERFORMANCE OF OUR APPROACH.

Cites

  • No record.
  • References

  • No record.
  • Cite

    APA: Copy

    Farhangi, Erfan, GHADIRI, NASSER, Asadi, Mahsa, Nikbakht, Mohammad Amin, & Pitre, Sylvain. (2017). FAST AND SCALABLE PROTEIN MOTIF SEQUENCE CLUSTERING BASED ON HADOOP FRAMEWORK. INTERNATIONAL CONFERENCE ON WEB RESEARCH. SID. https://sid.ir/paper/946840/en

    Vancouver: Copy

    Farhangi Erfan, GHADIRI NASSER, Asadi Mahsa, Nikbakht Mohammad Amin, Pitre Sylvain. FAST AND SCALABLE PROTEIN MOTIF SEQUENCE CLUSTERING BASED ON HADOOP FRAMEWORK. 2017. Available from: https://sid.ir/paper/946840/en

    IEEE: Copy

    Erfan Farhangi, NASSER GHADIRI, Mahsa Asadi, Mohammad Amin Nikbakht, and Sylvain Pitre, “FAST AND SCALABLE PROTEIN MOTIF SEQUENCE CLUSTERING BASED ON HADOOP FRAMEWORK,” presented at the INTERNATIONAL CONFERENCE ON WEB RESEARCH. 2017, [Online]. Available: https://sid.ir/paper/946840/en

    Related Journal Papers

  • No record.
  • Related Seminar Papers

  • No record.
  • Related Plans

  • No record.
  • Recommended Workshops






    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources
    File Not Exists.
    Move to top
    telegram sharing button
    whatsapp sharing button
    linkedin sharing button
    twitter sharing button
    email sharing button
    email sharing button
    email sharing button
    sharethis sharing button