مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources



Information Journal Paper

Title

RANDOM FORESTS ANALYSIS: A MODERN STATISTICAL METHOD FOR SCREENING IN HIGH-DIMENSIONAL STUDIES AND ITS APPLICATION IN A POPULATION-BASED GENETIC ASSOCIATION STUDY

Pages

  93-102

Abstract

 Background & Objectives: Technological advances in this century, especially in molecular genetics, have yielded high-volume, high-dimensional data. This creates many unprecedented challenges for the statisticians responsible for analyzing such data. Although logistic regression is popular for association analysis in medical research, it has serious limitations in handling high-dimensional data. The goal of the present study is to introduce a modern, model-free statistical method called random forests, which we believe can overcome the difficulties that classical statistical methods face in finding associations between predictors and a trait.

 Material & Methods: In this study, the nonparametric random forests technique was employed to determine the important factors associated with ankylosing spondylitis (AS). Genetic data, including HLA-B27 status (positive/negative) and 12 polymorphisms of the ERAP-1 gene, were collected from 401 patients and 316 healthy controls. The data were analyzed with both logistic regression and random forests, and the results were compared.

 Results: Based on stepwise logistic regression, HLA-B27 and the rs28096 polymorphism were significantly associated with the disease. Using the random forests technique, however, HLA-B27 and rs1065407 were the main factors associated with the disease, and the rs28096 polymorphism ranked third in importance.

 Conclusion: The results of our study indicate some discrepancies between logistic regression and random forests analyses of high-dimensional data such as the genetic data considered here. Although logistic regression is popular, easy to employ, and the predominant statistical method among researchers, it has serious limitations. More modern statistical methods such as random forests, on the other hand, are methodologically more sophisticated and yield more accurate and reliable results. Researchers should therefore be aware of such alternatives and use them as the situation arises in screening, especially in genetic data analyses.
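As a rough illustration of how a random forest can rank predictors by importance, the sketch below grows a small forest on synthetic binary markers and scores each marker by the drop in out-of-bag accuracy when its values are permuted. This is not the authors' analysis or data: the simulated cohort, the effect sizes, the tree depth, and all function names (`simulate`, `build_tree`, `forest_importance`) are assumptions made for the example. Marker 0 is given a strong effect, marker 1 a weak one, and the rest are noise.

```python
import random

def gini(labels):
    """Gini impurity of a non-empty list of 0/1 labels."""
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)

def build_tree(rows, labels, n_try, rng, depth=0, max_depth=4):
    """Grow a tree on binary features, trying n_try random features per split."""
    majority = round(sum(labels) / len(labels))
    if depth == max_depth or len(set(labels)) < 2:
        return majority  # leaf: majority class (0 or 1)
    best = None
    for f in rng.sample(range(len(rows[0])), n_try):
        left = [y for x, y in zip(rows, labels) if x[f] == 0]
        right = [y for x, y in zip(rows, labels) if x[f] == 1]
        if not left or not right:
            continue  # feature is constant in this node
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
        if best is None or score < best[0]:
            best = (score, f)
    if best is None:
        return majority
    f = best[1]
    children = []
    for v in (0, 1):
        idx = [i for i, x in enumerate(rows) if x[f] == v]
        children.append(build_tree([rows[i] for i in idx], [labels[i] for i in idx],
                                   n_try, rng, depth + 1, max_depth))
    return (f, children[0], children[1])

def predict(tree, x):
    """Follow splits until a leaf (an int class label) is reached."""
    while isinstance(tree, tuple):
        tree = tree[1 + x[tree[0]]]
    return tree

def forest_importance(rows, labels, n_trees=60, seed=0):
    """Mean drop in out-of-bag accuracy after permuting each feature."""
    rng = random.Random(seed)
    n, p = len(rows), len(rows[0])
    n_try = max(1, round(p ** 0.5))
    importance = [0.0] * p
    for _ in range(n_trees):
        boot = [rng.randrange(n) for _ in range(n)]  # bootstrap sample
        in_bag = set(boot)
        oob = [i for i in range(n) if i not in in_bag]
        tree = build_tree([rows[i] for i in boot], [labels[i] for i in boot], n_try, rng)
        base = sum(predict(tree, rows[i]) == labels[i] for i in oob)
        for f in range(p):
            shuffled = [rows[i][f] for i in oob]
            rng.shuffle(shuffled)
            hits = 0
            for i, v in zip(oob, shuffled):
                x = list(rows[i])
                x[f] = v  # replace feature f with a permuted value
                hits += predict(tree, x) == labels[i]
            importance[f] += (base - hits) / max(1, len(oob)) / n_trees
    return importance

def simulate(n=400, p=13, seed=1):
    """Synthetic case-control data (assumed effect sizes, not the study's)."""
    rng = random.Random(seed)
    rows, labels = [], []
    for _ in range(n):
        x = [int(rng.random() < 0.3) for _ in range(p)]
        risk = 0.15 + 0.6 * x[0] + 0.2 * x[1]
        rows.append(x)
        labels.append(int(rng.random() < risk))
    return rows, labels

rows, labels = simulate()
scores = forest_importance(rows, labels)
ranking = sorted(range(len(scores)), key=lambda f: scores[f], reverse=True)
print("markers ranked by permutation importance:", ranking)
```

Because each leaf predicts only a majority class, a weak additive effect like that of marker 1 may barely register in this kind of importance score. In practice an established random forest implementation, with its own importance measures, would be used rather than a hand-rolled one.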

  • Cite

    APA:

    NOORI, S., NOURIJELYANI, K., MOHAMMAD, K., NIKNAM, M.H., MAHMOUDI, M., ANDONIAN, L., & AKABERI, A. (2011). RANDOM FORESTS ANALYSIS: A MODERN STATISTICAL METHOD FOR SCREENING IN HIGH-DIMENSIONAL STUDIES AND ITS APPLICATION IN A POPULATION-BASED GENETIC ASSOCIATION STUDY. JOURNAL OF NORTH KHORASAN UNIVERSITY OF MEDICAL SCIENCES, 3(BIOSTATISTICS AND EPIDEMIOLOGY SUPPLEMENT), 93-102. SID. https://sid.ir/paper/187070/en

    Vancouver:

    NOORI S., NOURIJELYANI K., MOHAMMAD K., NIKNAM M.H., MAHMOUDI M., ANDONIAN L., AKABERI A. RANDOM FORESTS ANALYSIS: A MODERN STATISTICAL METHOD FOR SCREENING IN HIGH-DIMENSIONAL STUDIES AND ITS APPLICATION IN A POPULATION-BASED GENETIC ASSOCIATION STUDY. JOURNAL OF NORTH KHORASAN UNIVERSITY OF MEDICAL SCIENCES [Internet]. 2011;3(BIOSTATISTICS AND EPIDEMIOLOGY SUPPLEMENT):93-102. Available from: https://sid.ir/paper/187070/en

    IEEE:

    S. NOORI, K. NOURIJELYANI, K. MOHAMMAD, M.H. NIKNAM, M. MAHMOUDI, L. ANDONIAN, and A. AKABERI, “RANDOM FORESTS ANALYSIS: A MODERN STATISTICAL METHOD FOR SCREENING IN HIGH-DIMENSIONAL STUDIES AND ITS APPLICATION IN A POPULATION-BASED GENETIC ASSOCIATION STUDY,” JOURNAL OF NORTH KHORASAN UNIVERSITY OF MEDICAL SCIENCES, vol. 3, no. BIOSTATISTICS AND EPIDEMIOLOGY SUPPLEMENT, pp. 93–102, 2011, [Online]. Available: https://sid.ir/paper/187070/en
