مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Verion

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

video

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

sound

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Version

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View:

1,118
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Download:

0
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Cites:

Information Journal Paper

Title

GENE EXPRESSION DATA CLUSTERING WITH RANDOM FOREST DISSIMILARITY

Pages

  109-118

Abstract

 Background: The CLUSTERING of GENE EXPRESSION DATA plays an important role in the diagnosis and treatment of cancer. These kinds of data are typically involve in a large number of variables (genes), in comparison with number of samples (patients). Many CLUSTERING methods have been built based on the dissimilarity among observations that are calculated by a distance function. As increasing the dimensions reduces the performance of distance functions, most of the methods provide low accuracy. In this paper a new dissimilarity measure is introduced based on a classification method, called Random forests (RF). The performance of this new measure has been evaluated in the GENE EXPRESSION DATA.Methods: In this article, the CLUSTERING problem of Chowdary data set, using the RF dissimilarity measure, is under consideration. At the first step, the CLUSTERING problem is converted to classification problem, thereafter; the new dissimilarity is calculated using the classification method of random forests. Finally, the data are clustered with a partition around mediod algorithm and the results are then evaluated by adjusted rand index. All the analysis is implemented with R software.Results: The value of adjusted rand index (0.8149) represents an acceptable agreement between clusters and true groups. The most effective gene in constructing the clusters was gene no.31 which was detected by using the unique ability of RF that is identifying the importance of variables.Conclusion: The RANDOM FOREST DISSIMILARITY is an efficient criterion for measuring dissimilarity in GENE EXPRESSION DATA CLUSTERING. Detection of effective genes in CLUSTERING that is done with RF, helps the researcher in the diagnosing and treatment of the cancers.

Cites

  • No record.
  • References

  • No record.
  • Cite

    APA: Copy

    FARHADI, ZOHREH, & SHAHSAVANI, DAVOOD. (2015). GENE EXPRESSION DATA CLUSTERING WITH RANDOM FOREST DISSIMILARITY. RAZI JOURNAL OF MEDICAL SCIENCES (JOURNAL OF IRAN UNIVERSITY OF MEDICAL SCIENCES), 22(136), 109-118. SID. https://sid.ir/paper/10872/en

    Vancouver: Copy

    FARHADI ZOHREH, SHAHSAVANI DAVOOD. GENE EXPRESSION DATA CLUSTERING WITH RANDOM FOREST DISSIMILARITY. RAZI JOURNAL OF MEDICAL SCIENCES (JOURNAL OF IRAN UNIVERSITY OF MEDICAL SCIENCES)[Internet]. 2015;22(136):109-118. Available from: https://sid.ir/paper/10872/en

    IEEE: Copy

    ZOHREH FARHADI, and DAVOOD SHAHSAVANI, “GENE EXPRESSION DATA CLUSTERING WITH RANDOM FOREST DISSIMILARITY,” RAZI JOURNAL OF MEDICAL SCIENCES (JOURNAL OF IRAN UNIVERSITY OF MEDICAL SCIENCES), vol. 22, no. 136, pp. 109–118, 2015, [Online]. Available: https://sid.ir/paper/10872/en

    Related Journal Papers

    Related Seminar Papers

  • No record.
  • Related Plans

  • No record.
  • Recommended Workshops






    Move to top
    telegram sharing button
    whatsapp sharing button
    linkedin sharing button
    twitter sharing button
    email sharing button
    email sharing button
    email sharing button
    sharethis sharing button