Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10259/7337
Título
Nonlinear physics opens a new paradigm for accurate transcription start site prediction
Autor
Publicado en
BMC Bioinformatics. 2022, V. 23, n. 1, 565
Editorial
Springer Nature
Fecha de publicación
2022-12
DOI
10.1186/s12859-022-05129-4
Abstract
There is evidence that DNA breathing (spontaneous opening of the DNA strands)
plays a relevant role in the interactions of DNA with other molecules, and in particular
in the transcription process. Therefore, having physical models that can predict these
openings is of interest. However, this source of information has not been used before
either in transcription start sites (TSSs) or promoter prediction. In this article, one such
model is used as an additional information source that, when used by a machine learn‑
ing (ML) model, improves the results of current methods for the prediction of TSSs. In
addition, we provide evidence on the validity of the physical model, as it is able by itself
to predict TSSs with high accuracy. This opens an exciting avenue of research at the
intersection of statistical mechanics and ML, where ML models in bioinformatics can be
improved using physical models of DNA as feature extractors.
Palabras clave
DNA modelling
DNA breathing
Machine learning
TSS prediction
SVM
String kernels
Materia
Informática
Computer science
Versión del editor
Collections