dc.contributor.author | Pacheco Bonrostro, Joaquín | |
dc.contributor.author | Sáiz Vázquez, Olalla | |
dc.contributor.author | Casado Yusta, Silvia | |
dc.contributor.author | Ubillos Landa, Silvia | |
dc.date.accessioned | 2024-12-20T12:19:39Z | |
dc.date.available | 2024-12-20T12:19:39Z | |
dc.date.issued | 2023 | |
dc.identifier.uri | http://hdl.handle.net/10259/9823 | |
dc.description.abstract | In the design of classification models, irrelevant or noisy features are often generated. In some cases, there may even be negative interactions among features. These weaknesses can degrade the performance of the models. Feature selection is a task that searches for a small subset of relevant features from the original set that generate the most efficient models possible. In addition to improving the efficiency of the models, feature selection confers other advantages, such as greater ease in the generation of the necessary data as well as clearer and more interpretable models. In the case of medical applications, feature selection may help to distinguish which characteristics, habits, and factors have the greatest impact on the onset of diseases. However, feature selection is a complex task due to the large number of possible solutions. In the last few years, methods based on different metaheuristic strategies, mainly evolutionary algorithms, have been proposed. The motivation of this work is to develop a method that outperforms previous methods, with the benefits that this implies especially in the medical field. More precisely, the present study proposes a simple method based on tabu search and multistart techniques. The proposed method was analyzed and compared to other methods by testing their performance on several medical databases. Specifically, eight databases belong to the well-known repository of the University of California in Irvine and one of our own design were used. In these computational tests, the proposed method outperformed other recent methods as gauged by various metrics and classifiers. The analyses were accompanied by statistical tests, the results of which showed that the superiority of our method is significant and therefore strengthened these conclusions. In short, the contribution of this work is the development of a method that, on the one hand, is based on different strategies than those used in recent methods, and on the other hand, improves the performance of these methods. | en |
dc.description.sponsorship | This work was partially supported by FEDER funds and the Spanish State Research Agency (Projects PID2019-104263RB-C44 and PDC2021–121021-C22); the Regional Government of “Castilla y León”, Spain (Project BU071G19); the Regional Government of “Castilla y León”; and FEDER funds (Project BU056P20). | en |
dc.format.mimetype | application/vnd.openxmlformats-officedocument.wordprocessingml.document | |
dc.format.mimetype | application/zip | |
dc.format.mimetype | text/plain | |
dc.language.iso | eng | es |
dc.publisher | Universidad de Burgos | es |
dc.relation.isreferencedby | http://hdl.handle.net/10259/8409 | es |
dc.rights | Atribución-NoComercial 4.0 Internacional | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc/4.0/ | * |
dc.subject | Classification | en |
dc.subject | Feature selection | en |
dc.subject | Medical diagnosis | en |
dc.subject | Tabu search | en |
dc.subject.other | Investigación operativa | es |
dc.subject.other | Operations research | en |
dc.subject.other | Modelos matemáticos | es |
dc.subject.other | Mathematical models | en |
dc.title | Dataset of the paper “A multistart tabu search–based method for feature selection in medical applications". Scientific Reports, 13, 17140 | en |
dc.type | dataset | es |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | es |
dc.identifier.doi | 10.36443/10259/9823 | |
dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-104263RB-C44/ES/MEJORA EN LA TOMA DE DECISIONES EN EL AMBITO DE LA LOGISTICA Y PROBLEMAS RELACIONADOS. ENFOQUE MULTI-OBJETIVO/ | es |
dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PDC2021-121021-C22/ES/SISTEMAS DE APOYO A LA TOMA DE DECISIONES EFICIENTES: PLANIFICACION DE LA LOGISTICA EXTERNA E INTERNA Y SELECCION DE CARTERAS/ | es |
dc.relation.projectID | info:eu-repo/grantAgreement/Junta de Castilla y León//BU071G19//Métodos heurísticos para problemas de optimización de recursos sanitarios con varios objetivos/ | es |
dc.relation.projectID | info:eu-repo/grantAgreement/Junta de Castilla y León//BU056P20//Análisis de problemas de logística sanitaria: Enfoque multi-objetivo y uso de metaheurísticas/ | es |
dc.type.hasVersion | info:eu-repo/semantics/acceptedVersion | es |
dc.publication.year | 2023 | |