Universidad de Burgos RIUBU Principal Default Universidad de Burgos RIUBU Principal Default
  • español
  • English
  • français
  • Deutsch
  • português (Brasil)
  • italiano
Universidad de Burgos RIUBU Principal Default
  • Ayuda
  • Contacto
  • Sugerencias
  • Acceso abierto
    • Archivar en RIUBU
    • Acuerdos editoriales para la publicación en acceso abierto
    • Controla tus derechos, facilita el acceso abierto
    • Sobre el acceso abierto y la UBU
    • español
    • English
    • français
    • Deutsch
    • português (Brasil)
    • italiano
    • español
    • English
    • français
    • Deutsch
    • português (Brasil)
    • italiano
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Listar

    Todo RIUBUComunidadesFechaAutor / DirectorTítuloMateria / AsignaturaEsta colecciónFechaAutor / DirectorTítuloMateria / Asignatura

    Mi cuenta

    AccederRegistro

    Estadísticas

    Ver Estadísticas de uso

    Compartir

    Ver ítem 
    •   RIUBU Principal
    • E-Prints y Datos de investigación
    • Departamentos y Centros
    • Departamento de Historia, Geografía y Comunicación
    • Área de Paleontología
    • Artículos Paleontología
    • Ver ítem
    •   RIUBU Principal
    • E-Prints y Datos de investigación
    • Departamentos y Centros
    • Departamento de Historia, Geografía y Comunicación
    • Área de Paleontología
    • Artículos Paleontología
    • Ver ítem

    Por favor, use este identificador para citar o enlazar este ítem: https://hdl.handle.net/10259/11433

    Título
    Fast and accurate estimation of multidimensional site frequency spectra from low-coverage high-throughput sequencing data
    Autor
    Mas Sandoval, Alex
    Pope, Nathaniel S
    Nielsen, Knud Nor
    Altinkaya, Isin
    Fumagalli, Matteo
    Korneliussen, Thorfinn Sand
    Publicado en
    GigaScience. 2022, V. 11, giac032
    Editorial
    Oxford University Press
    Fecha de publicación
    2022-05
    ISSN
    2047-217X
    DOI
    10.1093/gigascience/giac032
    Resumen
    Background: The site frequency spectrum summarizes the distribution of allele frequencies throughout the genome, and it is widely used as a summary statistic to infer demographic parameters and to detect signals of natural selection. The use of high-throughput low-coverage DNA sequencing data can lead to biased estimates of the site frequency spectrum due to high levels of uncertainty in genotyping. Results: Here we design and implement a method to efficiently and accurately estimate the multidimensional joint site frequency spectrum for large numbers of haploid or diploid individuals across an arbitrary number of populations, using low-coverage sequencing data. The method maximizes a likelihood function that represents the probability of the sequencing data observed given a multidimensional site frequency spectrum using genotype likelihoods. Notably, it uses an advanced binning heuristic paired with an accelerated expectation-maximization algorithm for a fast and memory-efficient computation, and can generate both unfolded and folded spectra and bootstrapped replicates for haploid and diploid genomes. On the basis of extensive simulations, we show that the new method requires remarkably less storage and is faster than previous implementations whilst retaining the same accuracy. When applied to low-coverage sequencing data from the fungal pathogen Neonectria neomacrospora, results recapitulate the patterns of population differentiation generated using the original high-coverage data. Conclusion: The new implementation allows for accurate estimation of population genetic parameters from arbitrarily large, low-coverage datasets, thus facilitating cost-effective sequencing experiments in model and non-model organisms.
    Palabras clave
    Site frequency spectrum
    High-throughput sequencing
    Genotype likelihoods
    Next-generation sequencing
    Maximum likelihood
    Population genetics
    Threading
    Materia
    Genética de poblaciones
    Population genetics
    Bioinformática
    Bioinformatics
    URI
    https://hdl.handle.net/10259/11433
    Versión del editor
    https://doi.org/10.1093/gigascience/giac032
    Aparece en las colecciones
    • Artículos Paleontología
    Atribución 4.0 Internacional
    Documento(s) sujeto(s) a una licencia Creative Commons Atribución 4.0 Internacional
    Ficheros en este ítem
    Nombre:
    Mas-GigaScience_2022.pdf
    Tamaño:
    3.910Mb
    Formato:
    Adobe PDF
    Thumbnail
    Visualizar/Abrir

    Métricas

    Citas

    Ver estadísticas de uso

    Exportar

    RISMendeleyRefworksZotero
    • edm
    • marc
    • xoai
    • qdc
    • ore
    • ese
    • dim
    • uketd_dc
    • oai_dc
    • etdms
    • rdf
    • mods
    • mets
    • didl
    • premis
    Mostrar el registro completo del ítem

    Universidad de Burgos

    Powered by MIT's. DSpace software, Version 5.10