dc.contributor.author | Alfian, Ganjar | |
dc.contributor.author | Syafrudin, Muhammad | |
dc.contributor.author | Fahrurrozi, Imam | |
dc.contributor.author | Fitriyani, Norma Latif | |
dc.contributor.author | Atmaji, Fransiskus Tatas Dwi | |
dc.contributor.author | Widodo, Tri | |
dc.contributor.author | Bahiyah, Nurul | |
dc.contributor.author | Beneš, Filip | |
dc.contributor.author | Rhee, Jongtae | |
dc.date.accessioned | 2022-11-07T15:12:05Z | |
dc.date.available | 2022-11-07T15:12:05Z | |
dc.date.issued | 2022 | |
dc.identifier.citation | Computers. 2022, vol. 11, issue 9, art. no. 136. | cs |
dc.identifier.issn | 2073-431X | |
dc.identifier.uri | http://hdl.handle.net/10084/148864 | |
dc.description.abstract | Developing a prediction model from risk factors can provide an efficient method to recognize breast cancer. Machine learning (ML) algorithms have been applied to increase the efficiency of diagnosis at the early stage. This paper studies a support vector machine (SVM) combined with an extremely randomized trees classifier (extra-trees) to provide a diagnosis of breast cancer at the early stage based on risk factors. The extra-trees classifier was used to remove irrelevant features, while SVM was utilized to diagnose the breast cancer status. A breast cancer dataset consisting of 116 subjects was utilized by machine learning models to predict breast cancer, while the stratified 10-fold cross-validation was employed for the model evaluation. Our proposed combined SVM and extra-trees model reached the highest accuracy up to 80.23%, which was significantly better than the other ML model. The experimental results demonstrated that by applying extra-trees-based feature selection, the average ML prediction accuracy was improved by up to 7.29% as contrasted to ML without the feature selection method. Our proposed model is expected to increase the efficiency of breast cancer diagnosis based on risk factors. In addition, we presented the proposed prediction model that could be employed for web-based breast cancer prediction. The proposed model is expected to improve diagnostic decision-support systems by predicting breast cancer disease accurately. | cs |
dc.language.iso | en | cs |
dc.publisher | MDPI | cs |
dc.relation.ispartofseries | Computers | cs |
dc.relation.uri | https://doi.org/10.3390/computers11090136 | cs |
dc.rights | © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. | cs |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0 | cs |
dc.subject | breast cancer | cs |
dc.subject | support vector machine | cs |
dc.subject | extra-trees | cs |
dc.subject | risk factors | cs |
dc.title | Predicting breast cancer from risk factors using SVM and extra-trees-based feature selection method | cs |
dc.type | article | cs |
dc.identifier.doi | 10.3390/computers11090136 | |
dc.rights.access | openAccess | cs |
dc.type.version | publishedVersion | cs |
dc.type.status | Peer-reviewed | cs |
dc.description.source | Web of Science | cs |
dc.description.volume | 11 | cs |
dc.description.issue | 9 | cs |
dc.description.firstpage | art. no. 136 | cs |
dc.identifier.wos | 000856323500001 | |