Details der Publikation - Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution

Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution

Next-generation sequencing technologies have enabled many advances across diverse areas of biology, with many benefiting from increased sample size. Although the cost of running next-generation sequencing instruments has dropped substantially over time, the cost of sample preparation methods has lagged behind. To counter this, researchers have adapted library miniaturization protocols and large sample pools to maximize the number of samples that can be prepared by a certain amount of reagents and sequenced in a single run. However, due to high variability of sample quality, over and underrepresentation of samples in a sequencing run has become a major issue in high-throughput sequencing. This leads to misinterpretation of results due to increased noise, and additional time and cost rerunning underrepresented samples. To overcome this problem, we present a normalization method that uses shallow iSeq sequencing to accurately inform pooling volumes based on read distribution. This method is superior to the widely used fluorometry methods, which cannot specifically target adapter-ligated molecules that contribute to sequencing output. Our normalization method not only quantifies adapter-ligated molecules but also allows normalization of feature space; for example, we can normalize to reads of interest such as non-ribosomal reads. As a result, this normalization method improves the efficiency of high-throughput next-generation sequencing by reducing noise and producing higher average reads per sample with more even sequencing depth. IMPORTANCE High-throughput next generation sequencing (NGS) has significantly contributed to the field of genomics; however, further improvements can maximize the potential of this important tool. Uneven sequencing of samples in a multiplexed run is a common issue that leads to unexpected extra costs or low-quality data. To mitigate this problem, we introduce a normalization method based on read counts rather than library concentration. This method allows for an even distribution of features of interest across samples, improving the statistical power of data sets and preventing the financial loss associated with resequencing libraries. This method optimizes NGS, which already has huge importance across many areas of biology.

Medienart:	E-Artikel

Erscheinungsjahr:	2023
Erschienen:	2023

Enthalten in:	Zur Gesamtaufnahme - volume:8
Enthalten in:	mSystems - 8(2023), 4 vom: 31. Aug., Seite e0000623

Sprache:	Englisch

Beteiligte Personen:	Brennan, Caitriona [VerfasserIn] Salido, Rodolfo A [VerfasserIn] Belda-Ferre, Pedro [VerfasserIn] Bryant, MacKenzie [VerfasserIn] Cowart, Charles [VerfasserIn] Tiu, Maria D [VerfasserIn] González, Antonio [VerfasserIn] McDonald, Daniel [VerfasserIn] Tribelhorn, Caitlin [VerfasserIn] Zarrinpar, Amir [VerfasserIn] Knight, Rob [VerfasserIn]

Links:	Volltext

Themen:	Automation High-throughput sequencing Journal Article Large-scale studies Metagenomics Multiplexing NGS normalization Quantification Research Support, N.I.H., Extramural

Anmerkungen:	Date Completed 01.09.2023 Date Revised 23.03.2024 published: Print-Electronic Citation Status MEDLINE

doi:	10.1128/msystems.00006-23

funding:
Förderinstitution / Projekttitel:

PPN (Katalog-ID):	NLM358519926

Internformat


LEADER	01000caa a22002652 4500
001	NLM358519926
003	DE-627
005	20240323234612.0
007	cr uuu---uuuuu
008	231226s2023 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1128/msystems.00006-23 \|2 doi
028	5	2	\|a pubmed24n1342.xml
035			\|a (DE-627)NLM358519926
035			\|a (NLM)37350611
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Brennan, Caitriona \|e verfasserin \|4 aut
245	1	0	\|a Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution
264		1	\|c 2023
336			\|a Text \|b txt \|2 rdacontent
337			\|a ƒaComputermedien \|b c \|2 rdamedia
338			\|a ƒa Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Completed 01.09.2023
500			\|a Date Revised 23.03.2024
500			\|a published: Print-Electronic
500			\|a Citation Status MEDLINE
520			\|a Next-generation sequencing technologies have enabled many advances across diverse areas of biology, with many benefiting from increased sample size. Although the cost of running next-generation sequencing instruments has dropped substantially over time, the cost of sample preparation methods has lagged behind. To counter this, researchers have adapted library miniaturization protocols and large sample pools to maximize the number of samples that can be prepared by a certain amount of reagents and sequenced in a single run. However, due to high variability of sample quality, over and underrepresentation of samples in a sequencing run has become a major issue in high-throughput sequencing. This leads to misinterpretation of results due to increased noise, and additional time and cost rerunning underrepresented samples. To overcome this problem, we present a normalization method that uses shallow iSeq sequencing to accurately inform pooling volumes based on read distribution. This method is superior to the widely used fluorometry methods, which cannot specifically target adapter-ligated molecules that contribute to sequencing output. Our normalization method not only quantifies adapter-ligated molecules but also allows normalization of feature space; for example, we can normalize to reads of interest such as non-ribosomal reads. As a result, this normalization method improves the efficiency of high-throughput next-generation sequencing by reducing noise and producing higher average reads per sample with more even sequencing depth. IMPORTANCE High-throughput next generation sequencing (NGS) has significantly contributed to the field of genomics; however, further improvements can maximize the potential of this important tool. Uneven sequencing of samples in a multiplexed run is a common issue that leads to unexpected extra costs or low-quality data. To mitigate this problem, we introduce a normalization method based on read counts rather than library concentration. This method allows for an even distribution of features of interest across samples, improving the statistical power of data sets and preventing the financial loss associated with resequencing libraries. This method optimizes NGS, which already has huge importance across many areas of biology
650		4	\|a Journal Article
650		4	\|a Research Support, N.I.H., Extramural
650		4	\|a NGS normalization
650		4	\|a automation
650		4	\|a high-throughput sequencing
650		4	\|a large-scale studies
650		4	\|a metagenomics
650		4	\|a multiplexing
650		4	\|a quantification
700	1		\|a Salido, Rodolfo A \|e verfasserin \|4 aut
700	1		\|a Belda-Ferre, Pedro \|e verfasserin \|4 aut
700	1		\|a Bryant, MacKenzie \|e verfasserin \|4 aut
700	1		\|a Cowart, Charles \|e verfasserin \|4 aut
700	1		\|a Tiu, Maria D \|e verfasserin \|4 aut
700	1		\|a González, Antonio \|e verfasserin \|4 aut
700	1		\|a McDonald, Daniel \|e verfasserin \|4 aut
700	1		\|a Tribelhorn, Caitlin \|e verfasserin \|4 aut
700	1		\|a Zarrinpar, Amir \|e verfasserin \|4 aut
700	1		\|a Knight, Rob \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t mSystems \|d 2016 \|g 8(2023), 4 vom: 31. Aug., Seite e0000623 \|w (DE-627)NLM260868094 \|x 2379-5077 \|7 nnns
773	1	8	\|g volume:8 \|g year:2023 \|g number:4 \|g day:31 \|g month:08 \|g pages:e0000623
856	4	0	\|u http://dx.doi.org/10.1128/msystems.00006-23 \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a GBV_NLM
951			\|a AR
952			\|d 8 \|j 2023 \|e 4 \|b 31 \|c 08 \|h e0000623

Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution

Zugang & Verfügbarkeit

Zugehörige Publikationen/Bände