Efficient ancestry and mutation simulation with msprime 1.0
Abstract Stochastic simulation is a key tool in population genetics, since the models involved are often analytically intractable and simulation is usually the only way of obtaining ground-truth data to evaluate inferences. Because of this necessity, a large number of specialised simulation programs have been developed, each filling a particular niche, but with largely overlapping functionality and a substantial duplication of effort. Here, we introduce<jats:monospace>msprime</jats:monospace>version 1.0, which efficiently implements ancestry and mutation simulations based on the succinct tree sequence data structure and<jats:monospace>tskit</jats:monospace>library. We summarise<jats:monospace>msprime</jats:monospace>’s many features, and show that its performance is excellent, often many times faster and more memory efficient than specialised alternatives. These high-performance features have been thoroughly tested and validated, and built using a collaborative, open source development model, which reduces duplication of effort and promotes software quality via community engagement..
Medienart: |
Preprint |
---|
Erscheinungsjahr: |
2023 |
---|---|
Erschienen: |
2023 |
Enthalten in: |
bioRxiv.org - (2023) vom: 08. Nov. Zur Gesamtaufnahme - year:2023 |
---|
Sprache: |
Englisch |
---|
Links: |
---|
Themen: |
---|
doi: |
10.1101/2021.08.31.457499 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
XBI032496664 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | XBI032496664 | ||
003 | DE-627 | ||
005 | 20231205150317.0 | ||
007 | cr uuu---uuuuu | ||
008 | 210902s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1101/2021.08.31.457499 |2 doi | |
035 | |a (DE-627)XBI032496664 | ||
035 | |a (biorXiv)10.1101/2021.08.31.457499 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Baumdicker, Franz |e verfasserin |0 (orcid)0000-0001-9106-7259 |4 aut | |
245 | 1 | 0 | |a Efficient ancestry and mutation simulation with msprime 1.0 |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a Abstract Stochastic simulation is a key tool in population genetics, since the models involved are often analytically intractable and simulation is usually the only way of obtaining ground-truth data to evaluate inferences. Because of this necessity, a large number of specialised simulation programs have been developed, each filling a particular niche, but with largely overlapping functionality and a substantial duplication of effort. Here, we introduce<jats:monospace>msprime</jats:monospace>version 1.0, which efficiently implements ancestry and mutation simulations based on the succinct tree sequence data structure and<jats:monospace>tskit</jats:monospace>library. We summarise<jats:monospace>msprime</jats:monospace>’s many features, and show that its performance is excellent, often many times faster and more memory efficient than specialised alternatives. These high-performance features have been thoroughly tested and validated, and built using a collaborative, open source development model, which reduces duplication of effort and promotes software quality via community engagement. | ||
650 | 4 | |a Biology |7 (dpeaa)DE-84 | |
650 | 4 | |a 570 |7 (dpeaa)DE-84 | |
700 | 1 | |a Bisschop, Gertjan |0 (orcid)0000-0001-8327-0142 |4 aut | |
700 | 1 | |a Goldstein, Daniel |0 (orcid)0000-0002-5967-5569 |4 aut | |
700 | 1 | |a Gower, Graham |0 (orcid)0000-0002-6197-3872 |4 aut | |
700 | 1 | |a Ragsdale, Aaron P. |0 (orcid)0000-0003-0715-3432 |4 aut | |
700 | 1 | |a Tsambos, Georgia |0 (orcid)0000-0001-7001-2275 |4 aut | |
700 | 1 | |a Zhu, Sha |0 (orcid)0000-0001-7566-2787 |4 aut | |
700 | 1 | |a Eldon, Bjarki |0 (orcid)0000-0001-9354-2391 |4 aut | |
700 | 1 | |a Ellerman, E. Castedo |0 (orcid)0000-0002-5014-4809 |4 aut | |
700 | 1 | |a Galloway, Jared G. |0 (orcid)0000-0001-5838-7840 |4 aut | |
700 | 1 | |a Gladstein, Ariella L. |0 (orcid)0000-0001-7735-2336 |4 aut | |
700 | 1 | |a Gorjanc, Gregor |0 (orcid)0000-0001-8008-2787 |4 aut | |
700 | 1 | |a Guo, Bing |0 (orcid)0000-0002-3998-5981 |4 aut | |
700 | 1 | |a Jeffery, Ben |0 (orcid)0000-0002-1982-6801 |4 aut | |
700 | 1 | |a Kretzschmar, Warren W. |0 (orcid)0000-0002-2575-0807 |4 aut | |
700 | 1 | |a Lohse, Konrad |0 (orcid)0000-0001-9918-058X |4 aut | |
700 | 1 | |a Matschiner, Michael |0 (orcid)0000-0003-4741-3884 |4 aut | |
700 | 1 | |a Nelson, Dominic |0 (orcid)0000-0003-2496-8588 |4 aut | |
700 | 1 | |a Pope, Nathaniel S. |0 (orcid)0000-0001-8409-7812 |4 aut | |
700 | 1 | |a Quinto-Cortés, Consuelo D. |0 (orcid)0000-0002-0338-3321 |4 aut | |
700 | 1 | |a Rodrigues, Murillo F. |0 (orcid)0000-0001-7508-1384 |4 aut | |
700 | 1 | |a Saunack, Kumar |0 (orcid)0000-0003-0502-1778 |4 aut | |
700 | 1 | |a Sellinger, Thibaut |0 (orcid)0000-0002-8538-7800 |4 aut | |
700 | 1 | |a Thornton, Kevin |0 (orcid)0000-0003-0743-4445 |4 aut | |
700 | 1 | |a van Kemenade, Hugo |0 (orcid)0000-0001-5715-8632 |4 aut | |
700 | 1 | |a Wohns, Anthony W. |0 (orcid)0000-0001-7353-1177 |4 aut | |
700 | 1 | |a Wong, Yan |0 (orcid)0000-0002-3536-6411 |4 aut | |
700 | 1 | |a Gravel, Simon |0 (orcid)0000-0002-9183-964X |4 aut | |
700 | 1 | |a Kern, Andrew D. |0 (orcid)0000-0003-4381-4680 |4 aut | |
700 | 1 | |a Koskela, Jere |0 (orcid)0000-0002-2836-8777 |4 aut | |
700 | 1 | |a Ralph, Peter L. |0 (orcid)0000-0002-9459-6866 |4 aut | |
700 | 1 | |a Kelleher, Jerome |0 (orcid)0000-0002-7894-5253 |4 aut | |
773 | 0 | 8 | |i Enthalten in |t bioRxiv.org |g (2023) vom: 08. Nov. |
773 | 1 | 8 | |g year:2023 |g day:08 |g month:11 |
856 | 4 | 0 | |u https://doi.org/10.1093/genetics/iyab229 |z lizenzpflichtig |3 Volltext |
856 | 4 | 0 | |u http://dx.doi.org/10.1101/2021.08.31.457499 |z kostenfrei |3 Volltext |
912 | |a GBV_XBI | ||
951 | |a AR | ||
952 | |j 2023 |b 08 |c 11 |