Identification of triple negative breast cancer genes using rough set based feature selection algorithm & ensemble classifier
dc.contributor.author | Patil, Sujata | |
dc.contributor.author | Balmuri, Kavitha Rani | |
dc.contributor.author | Frnda, Jaroslav | |
dc.contributor.author | Parameshachari, B. D. | |
dc.contributor.author | Konda, Srinivas | |
dc.contributor.author | Nedoma, Jan | |
dc.date.accessioned | 2023-02-07T06:37:36Z | |
dc.date.available | 2023-02-07T06:37:36Z | |
dc.date.issued | 2022 | |
dc.identifier.citation | Human-Centric Computing and Information Sciences. 2022, vol. 12, art. no. 54. | cs |
dc.identifier.issn | 2192-1962 | |
dc.identifier.uri | http://hdl.handle.net/10084/149070 | |
dc.description.abstract | In recent decades, microarray datasets have played an important role in triple negative breast cancer (TNBC) detection. Microarray data classification is a challenging process due to the presence of numerous redundant and irrelevant features. Therefore, feature selection becomes irreplaceable in this research field that eliminates non-required feature vectors from the system. The selection of an optimal number of features significantly reduces the NP hard problem, so a rough set-based feature selection algorithm is used in this manuscript for selecting the optimal feature values. Initially, the datasets related to TNBC are acquired from gene expression omnibuses like GSE45827, GSE76275, GSE65194, GSE3744, GSE21653, and GSE7904. Then, a robust multi-array average technique is used for eliminating the outlier samples of TNBC/non-TNBC which helps enhancing classification performance. Further, the pre-processed microarray data are fed to a rough set theory for optimal gene selection, and then the selected genes are given as the inputs to the ensemble classification technique for classifying low-risk genes (non-TNBC) and high-risk genes (TNBC). The experimental evaluation showed that the ensemble-based rough set model obtained a mean accuracy of 97.24%, which superior related to other comparative machine learning techniques. | cs |
dc.language.iso | en | cs |
dc.publisher | Korea Information Processing Society | cs |
dc.relation.ispartofseries | Human-Centric Computing and Information Sciences | cs |
dc.relation.uri | https://doi.org/10.22967/HCIS.2022.12.054 | cs |
dc.rights | This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. | cs |
dc.rights.uri | http://creativecommons.org/licenses/by-nc/3.0/ | cs |
dc.subject | ensemble classifier | cs |
dc.subject | machine-learning technique | cs |
dc.subject | microarray data | cs |
dc.subject | robust multi-array average technique | cs |
dc.subject | rough set theory | cs |
dc.subject | triple negative breast cancer | cs |
dc.title | Identification of triple negative breast cancer genes using rough set based feature selection algorithm & ensemble classifier | cs |
dc.type | article | cs |
dc.identifier.doi | 10.22967/HCIS.2022.12.054 | |
dc.rights.access | openAccess | cs |
dc.type.version | publishedVersion | cs |
dc.type.status | Peer-reviewed | cs |
dc.description.source | Web of Science | cs |
dc.description.volume | 12 | cs |
dc.description.firstpage | art. no. 54 | cs |
dc.identifier.wos | 000890282100001 |
Files in this item
This item appears in the following Collection(s)
-
Články z časopisů s impakt faktorem / Articles from Impact Factor Journals [6377]
Články z časopisů (od roku 2008), které v době vydání článku měly impakt faktor. -
Publikační činnost Katedry telekomunikačních technologií / Publications of Department of Telecommunications (440) [369]
Kolekce obsahuje bibliografické záznamy publikační činnosti (článků) akademických pracovníků Katedry telekomunikačních technologií (440) v časopisech registrovaných ve Web of Science od roku 2003 po současnost. -
Publikační činnost VŠB-TUO ve Web of Science / Publications of VŠB-TUO in Web of Science [7798]
Kolekce obsahuje bibliografické záznamy článků akademických pracovníků VŠB-TUO publikovaných v časopisech indexovaných ve Web of Science od roku 1990 po současnost.
Except where otherwise noted, this item's license is described as This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.