Background/Objective: A major problem in the treatment of cancer is the lack of an appropriate method for early diagnosis of the disease. Breast cancer is widespread among women, and its early diagnosis can greatly reduce mortality. At present, there is no appropriate tumor marker for early diagnosis of the disease. Chemical reactions within an organ may be reflected as proteomic patterns in the serum, sputum, or urine. Surface-enhanced laser desorption/ionization time-of-flight mass spectrometry is a valuable tool for extracting proteomic patterns from biological samples. A major challenge in the analysis of such patterns is the development of a data mining algorithm that selects appropriate biomarkers to distinguish between healthy and cancer cases. Materials and Methods: In this research, data corresponding to the serum proteomic patterns of patients with breast cancer were analyzed. Using a mathematical model and the discrete wavelet transform, baseline and electrical noise were removed in the preprocessing stage, and the mass spectra were then normalized. Our hybrid data mining algorithm is based on a statistical test, a class separability measure, and peak scoring. With our method, the best protein subset was selected from 13488 data points while preserving the valuable information and discriminative power. The selected feature subset was then used for the detection of biomarkers. Results: Using k-fold cross-validation, the samples under study were divided randomly into two sets, namely the learning set and the test set. We identified a minimum threshold value of 1.96. The data mining algorithm was applied to the data points remaining after the thresholding step. The best feature subset, which included biomarkers with high discriminatory power, was then selected.
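The thresholding step can be illustrated with a minimal sketch. The abstract does not name the statistical test, so the Welch two-sample t-statistic used here is an assumption, as are the function names `welch_t` and `threshold_features` and the toy intensities. Only the threshold value of 1.96 comes from the text.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Absolute Welch two-sample t-statistic for one data point (assumed test)."""
    va, vb = variance(a), variance(b)
    denom = sqrt(va / len(a) + vb / len(b))
    if denom == 0.0:
        return 0.0  # no spread in either group: treat the point as uninformative
    return abs(mean(a) - mean(b)) / denom


def threshold_features(healthy, cancer, threshold=1.96):
    """Return indices of data points whose |t| reaches the threshold.

    healthy, cancer: lists of spectra, each a list of intensities of equal length.
    """
    n_points = len(healthy[0])
    kept = []
    for j in range(n_points):
        a = [spectrum[j] for spectrum in healthy]
        b = [spectrum[j] for spectrum in cancer]
        if welch_t(a, b) >= threshold:
            kept.append(j)
    return kept


# Toy spectra with three data points each (illustrative values, not study data).
healthy = [[1.0, 5.0, 2.0], [1.2, 5.1, 2.4], [0.9, 4.9, 1.7]]
cancer = [[1.1, 8.0, 2.1], [1.0, 8.3, 1.8], [1.05, 7.9, 2.3]]
selected = threshold_features(healthy, cancer)
print(selected)
```

In this toy case only the second data point differs clearly between the two groups, so it is the only index that survives thresholding. In the workflow described above, the surviving points would then be passed to the class separability and peak scoring stages.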
Using linear discriminant analysis (LDA), 19 proteins were selected as biomarkers that discriminated between healthy and cancer samples with an accuracy of 100%, a specificity of 100%, and a sensitivity of 100%. Conclusion: With complete information generated from biological specimens, diseases that lack good markers, such as cancer, can be diagnosed. Disease diagnosis is an example of pattern recognition. In this paper, we have introduced a data mining algorithm that selects the best feature subset from protein patterns. Our proposed method has been shown to have good discriminative power while reducing the number of biomarkers. Our results suggest that the appropriate selection of significant proteins has an important effect on biomarker identification and on the correct diagnosis of the disease.
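The three reported performance measures follow directly from the confusion matrix of a classifier on the test set. The sketch below shows the standard definitions. The function name `diagnostic_metrics`, the label coding (1 for cancer, 0 for healthy), and the example labels are assumptions made for illustration and are not taken from the study.

```python
def diagnostic_metrics(y_true, y_pred, positive=1):
    """Accuracy, sensitivity, and specificity from true and predicted labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    def ratio(num, den):
        # Undefined when a class is absent from the test set.
        return num / den if den else float("nan")

    return {
        "accuracy": ratio(tp + tn, len(pairs)),
        "sensitivity": ratio(tp, tp + fn),  # true positive rate
        "specificity": ratio(tn, tn + fp),  # true negative rate
    }


# Illustrative labels only: 1 = cancer, 0 = healthy.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]
metrics = diagnostic_metrics(y_true, y_pred)
print(metrics)
```

A result of 100% on all three measures, as reported above, corresponds to a confusion matrix with no false positives and no false negatives on the test set.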