Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10259/7425
Título
Simultaneous class-modelling in chemometrics: A generalization of Partial Least Squares class modelling for more than two classes by using error correcting output code matrices
Autor
Publicado en
Chemometrics and Intelligent Laboratory Systems. 2022, V. 227, 104614
Editorial
Elsevier
Fecha de publicación
2022-08
ISSN
0169-7439
DOI
10.1016/j.chemolab.2022.104614
Resumo
The paper presents a new methodology within the framework of the so-called compliant class-models, PLS2-CM,
designed with the purpose of improving the performance of class-modelling in a setting with more than two
classes. The improvement in the class-models is achieved through the use of multi-response PLS models with the
classes encoded via Error-Correcting Output Codes (ECOC), instead of the traditional class indicator variables
used in chemometrics.
The proposed PLS2-CM entails a decomposition of a class-modelling problem into a series of binary learners,
based on a family of code matrices with different code length, which are evaluated to obtain simultaneous
compliant class-models with the best performance.
The methodology develops both a new encoding system, based on multi-criteria optimization to search for
optimal coding matrices, and a new decoding system, based on probability thresholds to assign objects to classmodels. The whole procedure implies that the characteristics of the dataset at hand affect the final selection of the
coding matrix and therefore of built class-models, thus giving rise to a data-driven strategy.
The application of PLS2-CM to a variety of cases (controlled data, experimental data and repository datasets)
results in an enhanced class-modelling performance by means of the suggested procedure, as measured by the
DMCEN (Diagonal Modified Confusion Entropy) index and by sensitivity-specificity matrices. The predictive
ability of the compliant class-models has been evaluated.
Palabras clave
Compliant class-models
Error-correcting output code (ECOC)
Partial least squares multi-response regression (PLS2)
Sensitivity
Specificity
Diagonal modified confusion entropy (DMCEN)
Materia
Química
Chemistry
Matemáticas
Mathematics
Versión del editor
Aparece en las colecciones
Documento(s) sujeto(s) a una licencia Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 Internacional