GLMsim: a GLM-based single cell RNA-seq simulator incorporating batch and biological effects
Abstract With development of the single cell RNA-seq technologies, large numbers of cells can now be routinely sequenced by different platforms. This requires us to choose an efficient integration tool to merge those cells, and computational simulators to help benchmark and assess the performance of these tools. Although existing single cell RNA-seq simulators can simulate library size, biological and batch effects separately, they currently do not capture associations among these three factors. Here we present GLMsim, the first single cell RNA-seq simulator to simultaneously capture the library size, biology and unwanted variation and their associations via a generalized linear model, and to simulate data resembling the original experimental data in these respects. GLMsim is capable of quantitatively benchmarking different single cell integration methods, and assessing their abilities to retain biology and remove library size and batch effects..
Medienart: |
Preprint |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
bioRxiv.org - (2024) vom: 26. März Zur Gesamtaufnahme - year:2024 |
---|
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Wang, Jianan [VerfasserIn] |
---|
Links: |
Volltext [kostenfrei] |
---|
Themen: |
---|
doi: |
10.1101/2024.03.20.586030 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
XBI043014763 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | XBI043014763 | ||
003 | DE-627 | ||
005 | 20240327090711.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240324s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1101/2024.03.20.586030 |2 doi | |
035 | |a (DE-627)XBI043014763 | ||
035 | |a (biorXiv)10.1101/2024.03.20.586030 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Wang, Jianan |e verfasserin |0 (orcid)0000-0002-5213-8330 |4 aut | |
245 | 1 | 0 | |a GLMsim: a GLM-based single cell RNA-seq simulator incorporating batch and biological effects |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a Abstract With development of the single cell RNA-seq technologies, large numbers of cells can now be routinely sequenced by different platforms. This requires us to choose an efficient integration tool to merge those cells, and computational simulators to help benchmark and assess the performance of these tools. Although existing single cell RNA-seq simulators can simulate library size, biological and batch effects separately, they currently do not capture associations among these three factors. Here we present GLMsim, the first single cell RNA-seq simulator to simultaneously capture the library size, biology and unwanted variation and their associations via a generalized linear model, and to simulate data resembling the original experimental data in these respects. GLMsim is capable of quantitatively benchmarking different single cell integration methods, and assessing their abilities to retain biology and remove library size and batch effects. | ||
650 | 4 | |a Biology |7 (dpeaa)DE-84 | |
650 | 4 | |a 570 |7 (dpeaa)DE-84 | |
700 | 1 | |a Chen, Lizhong |0 (orcid)0000-0002-8326-6781 |4 aut | |
700 | 1 | |a Thijssen, Rachel |0 (orcid)0000-0002-4493-306X |4 aut | |
700 | 1 | |a Phipson, Belinda |0 (orcid)0000-0002-1711-7454 |4 aut | |
700 | 1 | |a Speed, Terence P. |0 (orcid)0000-0002-5403-7998 |4 aut | |
773 | 0 | 8 | |i Enthalten in |t bioRxiv.org |g (2024) vom: 26. März |
773 | 1 | 8 | |g year:2024 |g day:26 |g month:03 |
856 | 4 | 0 | |u http://dx.doi.org/10.1101/2024.03.20.586030 |z kostenfrei |3 Volltext |
912 | |a GBV_XBI | ||
951 | |a AR | ||
952 | |j 2024 |b 26 |c 03 |