Universidad de Burgos RIUBU

    Please use this identifier to cite or link to this item: https://hdl.handle.net/10259/11282

    Title
    An Extensive Performance Comparison between Feature Reduction and Feature Selection Preprocessing Algorithms on Imbalanced Wide Data
    Authors
    Ramos Pérez, Ismael
    Barbero Aparicio, José Antonio
    Canepa Oneto, Antonio Jesús
    Arnaiz González, Álvar
    Maudes Raedo, Jesús M.
    Published in
    Information. 2024, vol. 15, no. 4, 223
    Publisher
    MDPI
    Publication date
    2024-04
    ISSN
    2078-2489
    DOI
    10.3390/info15040223
    Abstract
    The most common preprocessing techniques used to deal with datasets having high dimensionality and a low number of instances—or wide data—are feature reduction (FR), feature selection (FS), and resampling. This study explores the use of FR and resampling techniques, expanding the limited comparisons between FR and filter FS methods in the existing literature, especially in the context of wide data. We compare the optimal outcomes from a previous comprehensive study of FS against new experiments conducted using FR methods. Two specific challenges associated with the use of FR are outlined in detail: finding FR methods that are compatible with wide data and the need for a reduction estimator of nonlinear approaches to process out-of-sample data. The experimental study compares 17 techniques, including supervised, unsupervised, linear, and nonlinear approaches, using 7 resampling strategies and 5 classifiers. The results demonstrate which configurations are optimal, according to their performance and computation time. Moreover, the best configuration—namely, k Nearest Neighbor (KNN) + the Maximal Margin Criterion (MMC) feature reducer with no resampling—is shown to outperform state-of-the-art algorithms.
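
    The abstract names k Nearest Neighbor (KNN) combined with the Maximal Margin Criterion (MMC) feature reducer, with no resampling, as the best configuration. The sketch below illustrates that kind of pipeline under stated assumptions: it uses one common formulation of MMC (projecting onto the top eigenvectors of the between-class scatter minus the within-class scatter) and a brute-force KNN. It is not the authors' implementation. The function names `mmc_fit` and `knn_predict`, the synthetic data, and the values of `n_components` and `k` are all hypothetical. Because MMC is linear, out-of-sample instances are reduced by multiplying with the learned matrix, with no separate reduction estimator.

```python
import numpy as np

def mmc_fit(X, y, n_components):
    """Return a (d, n_components) projection matrix (assumed MMC formulation)."""
    n, d = X.shape
    overall_mean = X.mean(axis=0)
    S_b = np.zeros((d, d))  # between-class scatter
    S_w = np.zeros((d, d))  # within-class scatter
    for label in np.unique(y):
        X_c = X[y == label]
        class_mean = X_c.mean(axis=0)
        diff = (class_mean - overall_mean)[:, None]
        S_b += (len(X_c) / n) * (diff @ diff.T)   # weighted by class prior
        centered = X_c - class_mean
        S_w += centered.T @ centered / n          # prior * class covariance
    # S_b - S_w is symmetric, so eigh applies; keep the largest eigenvalues.
    eigvals, eigvecs = np.linalg.eigh(S_b - S_w)
    top = np.argsort(eigvals)[::-1][:n_components]
    return eigvecs[:, top]

def knn_predict(X_train, y_train, X_test, k=3):
    """Brute-force KNN with majority vote; labels must be non-negative ints."""
    sq_dists = ((X_test[:, None, :] - X_train[None, :, :]) ** 2).sum(axis=2)
    nearest = np.argsort(sq_dists, axis=1)[:, :k]
    return np.array([np.bincount(y_train[row]).argmax() for row in nearest])

# Hypothetical imbalanced wide data: 30 instances, 50 features, 24 vs 6.
rng = np.random.default_rng(0)
X = rng.normal(size=(30, 50))
y = np.array([0] * 24 + [1] * 6)
X[y == 1, :5] += 3.0  # shift the minority class in a few features

X_train, y_train = X[2:28], y[2:28]
X_test, y_test = X[[0, 1, 28, 29]], y[[0, 1, 28, 29]]

W = mmc_fit(X_train, y_train, n_components=2)
Z_train = X_train @ W   # reduce the training data
Z_test = X_test @ W     # same linear map handles out-of-sample data
y_pred = knn_predict(Z_train, y_train, Z_test, k=3)
print("predicted:", y_pred, "true:", y_test)
```

    The dense `d x d` scatter matrices keep the sketch short; with very high dimensionality a formulation that avoids building them explicitly would likely be needed.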
    Keywords
    Feature selection
    Feature reduction
    Wide data
    High dimensional data
    Imbalanced data
    Machine learning
    Subject
    Computer science
    Artificial intelligence
    URI
    https://hdl.handle.net/10259/11282
    Publisher's version
    https://doi.org/10.3390/info15040223
    Collections
    • ADMIRABLE Articles
    License
    This document is subject to a Creative Commons Attribution 4.0 International license
    Files in this item
    Name:
    Ramos-information_2024.pdf
    Size:
    540.8Kb
    Format:
    Adobe PDF