Search Results/Filters    

Filters

Year

Banks



Expert Group










Full-Text


Issue Info: 
  • Year: 

    1402
  • Volume: 

    12
  • Issue: 

    45
  • Pages: 

    47-66
Measures: 
  • Citations: 

    0
  • Views: 

    55
  • Downloads: 

    0
Abstract: 

1پیش بینی تقاضای محصولات زنجیره تأمین برای تعیین استراتژی ها و تصمیم گیری ها موضوعی بسیار با اهمیت و پرچالش است. با افزایش تنوع و تعداد محصولات، این چالش ها نیز افزایش می یابد. ارائه چارچوب ها و روش هایی که با وجود تنوع محصولی، تفاوت در کاربردها و ویژگی ها و حجم داده های مختلف، از انعطاف پذیری، دقت و مزیت های لازم برای پیش بینی همه دسته های محصولی برخوردار باشد، برای مدیران حیاتی است. در این راستا، دو مدل یادگیری با نظارت، XGBoost Regressor (XGBR) و Gradient Boosting Regressor (GBR)، بر روی مجموعه داده های Global Superstore، در سایت Kaggle پیاده‎سازی شده است. این مجموعه داده شامل 3788 محصول در سه Category محصولی متنوع، هفده Sub Category و51،290 سفارش است. حجم داده های محدود محصولات سبب می گردد پیش بینی بسیاری از محصولات و کسب نتیجه مناسب از روش ها میسر و مفید نگردد. با توجه به اینکه در این تحقیق تجربی هدف پیش بینی تقاضا، بکارگیری در تصمیمات استراتژیک است، رویکردی تجمیع محصولی برای این مسئله پیشنهاد شده که با توجه به مشابهت در محصولات Sub Categoryها پیش بینی آنها به صورت تفکیک شده صورت گیرد. به منظور بررسی اثر میزان داده بر عملکرد مدل ها، داده های مجموعه داده با استفاده از تکنیک Augmentation Data افزایش یافته و با اجرای مجدد مدل ها، نتایج پیش بینی دو مدل با هم مقایسه شده اند. براساس ارزیابی نتایج پیش بینی با داده های افزایش یافته با دو معیار MSE و MAE، مدل XGBR در کمترین مقدار به ترتیب به 12/0 و 10/0، و مدل GBR نیز به مقادیر 13/0 و 15/0 دست یافته است. همچنین، نتیجه معیار D2 Score در مدل XGBR در بیشترین مقدار 97/0 و در مدل GBR مقدار 96/0 است. با افزایش داده ها، مقادیر معیارهای اندازه گیری خطای به صورت چشمگیری و تا بیش از 80 درصد کاهش یافته و در داده های با حجم بیشتر، XGBR برتری نسبی دارد. چارچوب و مدل های پیشنهادی می تواند در صنایع با مسائل مشابه در سطح استراتژی استفاده شود.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 55

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Author(s): 

Issue Info: 
  • Year: 

    2021
  • Volume: 

    17
  • Issue: 

    2
  • Pages: 

    0-0
Measures: 
  • Citations: 

    1
  • Views: 

    29
  • Downloads: 

    0
Keywords: 
Abstract: 

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 29

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 1 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Issue Info: 
  • Year: 

    2023
  • Volume: 

    9
  • Issue: 

    3
  • Pages: 

    378-387
Measures: 
  • Citations: 

    0
  • Views: 

    30
  • Downloads: 

    0
Abstract: 

Introduction: The present study discusses the importance of having a predictive method to determine the prognosis of patients with diseases like Covid-19. This method can assist physicians in making treatment decisions that improve survival rates and avoid unnecessary treatments. This research also highlights the importance of calibration, which is often overlooked in model evaluation. Without proper calibration, incorrect decisions can be made in disease treatment and preventive care. Therefore, the current study compares two highly accurate machine learning algorithms, Gradient Boosting and Extreme Gradient Boosting, not only in terms of prediction accuracy but also in terms of model calibration and speed. Methods: This study involved analyzing data from Covid-19 patients who were admitted to two hospitals in Mashhad city, Razavi Khorasan province, over a span of 18 months. The k-fold cross-validation method was employed on the training dataset (K=5) to conduct the study. The accuracy and calibration of two methods (Gradient Boosting and Extreme Gradient Boosting) in predicting survival were compared using the Concordance Index and calibration. Results: The Concordance Index values obtained for Gradient Boosting and Extreme Gradient Boosting models were 0. 734 and 0. 736, in the imbalanced and In the balanced data, the Concordance Index values were 0. 893 for Gradient Boosting and 0. 894 for Extreme Gradient Boosting. The surv. calib_beta index, the Gradient Boosting model had an estimated value of 0. 59 in the imbalanced data and 0. 66 in the balanced data. The Extreme Gradient Boosting model had an estimated value of 0. 86 in the balanced data and 0. 853 in the imbalanced data. The Extreme Gradient Boosting model was faster in the learning process compared to the Gradient Boosting model. Conclusion: The Gradient Boosting and Extreme Gradient Boosting models exhibited similar prediction accuracy and discrimination power, but the Extreme Gradient Boosting model demonstrated relatively good calibration compare to Gradient Boosting model.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 30

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 3
Author(s): 

Ghafouri Kesbi Farhad

Issue Info: 
  • Year: 

    621
  • Volume: 

    12
  • Issue: 

    1
  • Pages: 

    31-37
Measures: 
  • Citations: 

    0
  • Views: 

    24
  • Downloads: 

    48
Abstract: 

AbstractThe aim of this study was to study the performance of xgboost algorithm in genomic evaluation of complex traits as an alternative for Gradient Boosting algorithm (GBM). To this end, genotypic matrices containing genotypic information for, respectively, 5,000 (S1), 10,000 (S2) and 50,000 (S3) single nucleotide polymorphisms (SNP) for 1000 individuals was simulated. Beside xgboost and GBM, the GBLUP which is known as an efficient algorithm in terms of accuracy, computing time and memory requirement was also used to predict genomic breeding values. xgboost, GBM and GBLUP were run in R software using xgboost, gbm and synbreed packages. All the analyses were done using a machine equipped with a Core i7-6800K CPU which had 6 physical cores. In addition, 32 gigabyte of memory was installed on the machine. The Person's correlation between predicted and true breeding values (rp,t) and the mean squared error (MSE) of prediction were computed to compare predictive performance of different methods. While GBLUP was the most efficient user of memory, GBM required a considerably high amount of memory to run. By increasing size of data from S1 to S3, GBM went out from the competition mainly due to its high demand for memory. Parallel computing with xgboost reduced running time by %99 compared to GBM. The speedup ratios (the ratio of the GBM runtime to the time taken by the parallel computing by xgboost) were 444 and 554 for the S1 and S2 scenarios, respectively. In addition, parallelization efficiency (speed up ratio/number of cores) were, respectively, 74 and 92 for the S1 and S2 scenarios, indicating that by increasing the size of data, the efficiency of parallel computing increased. The xgboost was considerably faster than GBLUP in all the scenarios studied. Accuracy of genomic breeding values predicted by xgboost was similar to those predicted by GBM. While the accuracy of prediction in terms of rp,t was higher for GBLUP, the MSE of prediction was lower for xgboost, specially for larger datasets. Our results showed that xgboost could be an efficient alternative for GBM as it had the same accuracy of prediction, was extremely fast and needed significantly lower memory requirement to predict the genomic breeding values.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 24

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 48 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Issue Info: 
  • Year: 

    2023
  • Volume: 

    56
  • Issue: 

    1
  • Pages: 

    159-172
Measures: 
  • Citations: 

    0
  • Views: 

    31
  • Downloads: 

    2
Abstract: 

Applications of machine learning techniques in concrete properties' prediction have great interest to many researchers worldwide. Indeed, some of the most common machine learning methods are those based on adopting Boosting algorithms. A new approach, Histogram-Based Gradient Boosting, was recently introduced to the literature. It is a technique that buckets continuous feature values into discrete bins to speed up the computations and reduce memory usage. Previous studies have discussed its efficiency in various scientific disciplines to save computational time and memory. However, the algorithm's accuracy is still unclear, and its application in concrete properties estimation has not yet been considered. This paper is devoted to evaluating the capability of Histogram-Based Gradient Boosting in predicting concrete's compressive strength and comparing its accuracy to other Boosting methods. Generally, the results of the study have shown that the Histogram-Based Gradient Boosting approach is capable of achieving reliable prediction of concrete compressive strength. Additionally, it showed the effects of each model's parameters on the accuracy of the estimation.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 31

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 2 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Issue Info: 
  • Year: 

    2023
  • Volume: 

    10
  • Issue: 

    1
  • Pages: 

    24-32
Measures: 
  • Citations: 

    0
  • Views: 

    58
  • Downloads: 

    23
Abstract: 

In this study, the differential pulse voltammetry (DPV) method was used to simultaneously determine bismuth and copper concentrations. A 25 bismuth and copper mixtures at the designed ratio were measured using the DPV technique. However, the overlapping differential pulse voltammograms obtained made it difficult to quantitatively analyze the concentrations based on adaptive peak current selection. To address this issue, the voltammograms were preprocessed using derivatization and peak subtraction. The second derivative voltammogram was found to be highly correlated with the copper-bismuth concentration ratio, resulting in improved fit and prediction accuracy. To further improve the accuracy and precision of the training and prediction results, XGBoost and Gradient Boosting regression models were applied. The XGBoost and Gradient Boosting regression models showed high accuracy and precision with r-squared values of 0.877 and 0.993 for copper, and 0.879 and 0.993 for bismuth, respectively. The mean recoveries of copper were 99.84% and 98.07%, while bismuth recoveries were 93.17% and 90.85% for XGBoost and Gradient Boosting, respectively. Additionally, cross-validation using 10 splits produced a mean score of 45.565 and a mean absolute error of 13.051 for copper, and a mean score of 13.600 and a mean absolute error of 10.920 for bismuth. Overall, the results indicate that the proposed method is an accurate and precise way to simultaneously determine bismuth and copper concentrations.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 58

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 23 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Issue Info: 
  • Year: 

    2023
  • Volume: 

    15
  • Issue: 

    3
  • Pages: 

    41-60
Measures: 
  • Citations: 

    0
  • Views: 

    70
  • Downloads: 

    13
Abstract: 

With the development of remote sensing science, the use of hyperspectral images is becoming more widespread. Classification is one of the most popular topics in hyperspectral remote sensing. In the last two decades, a number of methods have been proposed to address the problem of hyperspectral data classification.In the present study, a structure based on learning capsule networks has been used to classify hyperspectral images, so that the network structure can have the most optimal generation of features by using a convolution layer and a capsule layer, and at the same time Avoid overfitting the on training data. The obtained results show the high quality of production features in the proposed structure. With the development of remote sensing science, the use of hyperspectral images is becoming more widespread. Classification is one of the most popular topics in hyperspectral remote sensing. In the last two decades, a number of methods have been proposed to address the problem of hyperspectral data classification.In the present study, a structure based on learning capsule networks has been used to classify hyperspectral images, so that the network structure can have the most optimal generation of features by using a convolution layer and a capsule layer, and at the same time Avoid overfitting the on training data. The obtained results show the high quality of production features in the proposed structure.In order to improve the classification accuracy, the feature extraction approach through the designed network and the classification by the Extreme Gradient Boosting was compared with the classification method by the global deep network. The proposed capsule approach consists of 3 basic layers: 1) Prime caps, which are capsules of size 8 and 32 with 9 × 9 filters and movement step 2, 2) Digitcaps with 10 16-dimensional capsules, and 3) fully connected layer. The results of examining two approaches for deep networking as well as combining capsule networks with XGBoost reinforcement tree algorithm were compared. Approaches such as SVM, RF-200, LSTM, GRU and GRU-Pretanh were considered to compare the proposed approach based on the configurations mentioned in their research.Up in addition to the study and quality measurement of production vector deep features by the proposed method in different classifiers, the ability of deep global networks in the application of classification should also be examined. The results of examining two approaches for deep network and also combining CapsNet with XGBoost show that by using the proposed combined method, images are classified with 99% accuracy on training data and 97.5% accuracy on test data.Up in addition to the study and quality measurement of production vector deep features by the proposed method in different classifiers, the ability of deep global networks in the application of classification should also be examined.The results of examining two approaches for deep network and also combining CapsNet with XGBoost show that by using the proposed combined method, images are classified with 99% accuracy on training data and 97.5% accuracy on test data.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 70

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 13 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Author(s): 

Issue Info: 
  • Year: 

    2022
  • Volume: 

    208
  • Issue: 

    -
  • Pages: 

    0-0
Measures: 
  • Citations: 

    1
  • Views: 

    5
  • Downloads: 

    0
Keywords: 
Abstract: 

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 5

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 1 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Author(s): 

Journal: 

SCIENTIFIC REPORTS

Issue Info: 
  • Year: 

    2022
  • Volume: 

    12
  • Issue: 

    1
  • Pages: 

    0-0
Measures: 
  • Citations: 

    1
  • Views: 

    16
  • Downloads: 

    0
Keywords: 
Abstract: 

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 16

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 1 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Issue Info: 
  • Year: 

    2024
  • Volume: 

    10
Measures: 
  • Views: 

    34
  • Downloads: 

    2
Abstract: 

This paper explores the capability of various machine learning algorithms, including Random Forest and advanced Gradient Boosting techniques such as XGBoost, LightGBM, and CatBoost, to predict customer churn in the telecommunications sector. For this analysis, a dataset available to the public was employed. The performance of these algorithms was assessed using recognized metrics, including Accuracy, Precision, Recall, F1-score, and the Receiver Operating Characteristic Area Under Curve (ROC AUC). These metrics were evaluated at different phases: subsequent to data preprocessing and feature selection,following the application of SMOTE and ADASYN sampling methods,and after conducting hyperparameter tuning on the data that had been adjusted by SMOTE and ADASYN. The outcomes underscore the notable efficiency of upsampling techniques such as SMOTE and ADASYN in addressing the imbalance inherent in customer churn prediction. Notably, the application of random grid search for hyperparameter optimization did not significantly alter the results, which remained comparatively unchanged. The algorithms' performance post-ADASYN application marginally surpassed that observed after SMOTE application. Remarkably, LightGBM achieved an exceptional F1-score of 89% and an ROC AUC of 95% subsequent to the ADASYN sampling technique. This underlines the effectiveness of advanced Boosting algorithms and upsampling methods like SMOTE and ADASYN in navigating the complexities of imbalanced datasets and intricate feature interdependencies.

Yearly Impact:   مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 34

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 2
litScript
telegram sharing button
whatsapp sharing button
linkedin sharing button
twitter sharing button
email sharing button
email sharing button
email sharing button
sharethis sharing button