Structural variant analysis of a cancer reference cell line sample using multiple sequencing technologies
© 2022. The Author(s)..
BACKGROUND: The cancer genome is commonly altered with thousands of structural rearrangements including insertions, deletions, translocation, inversions, duplications, and copy number variations. Thus, structural variant (SV) characterization plays a paramount role in cancer target identification, oncology diagnostics, and personalized medicine. As part of the SEQC2 Consortium effort, the present study established and evaluated a consensus SV call set using a breast cancer reference cell line and matched normal control derived from the same donor, which were used in our companion benchmarking studies as reference samples.
RESULTS: We systematically investigated somatic SVs in the reference cancer cell line by comparing to a matched normal cell line using multiple NGS platforms including Illumina short-read, 10X Genomics linked reads, PacBio long reads, Oxford Nanopore long reads, and high-throughput chromosome conformation capture (Hi-C). We established a consensus SV call set of a total of 1788 SVs including 717 deletions, 230 duplications, 551 insertions, 133 inversions, 146 translocations, and 11 breakends for the reference cancer cell line. To independently evaluate and cross-validate the accuracy of our consensus SV call set, we used orthogonal methods including PCR-based validation, Affymetrix arrays, Bionano optical mapping, and identification of fusion genes detected from RNA-seq. We evaluated the strengths and weaknesses of each NGS technology for SV determination, and our findings provide an actionable guide to improve cancer genome SV detection sensitivity and accuracy.
CONCLUSIONS: A high-confidence consensus SV call set was established for the reference cancer cell line. A large subset of the variants identified was validated by multiple orthogonal methods.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2022 |
---|---|
Erschienen: |
2022 |
Enthalten in: |
Zur Gesamtaufnahme - volume:23 |
---|---|
Enthalten in: |
Genome biology - 23(2022), 1 vom: 13. Dez., Seite 255 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Talsania, Keyur [VerfasserIn] |
---|
Links: |
---|
Anmerkungen: |
Date Completed 15.12.2022 Date Revised 19.01.2023 published: Electronic Citation Status MEDLINE |
---|
doi: |
10.1186/s13059-022-02816-6 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM350261296 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM350261296 | ||
003 | DE-627 | ||
005 | 20231226044617.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2022 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1186/s13059-022-02816-6 |2 doi | |
028 | 5 | 2 | |a pubmed24n1167.xml |
035 | |a (DE-627)NLM350261296 | ||
035 | |a (NLM)36514120 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Talsania, Keyur |e verfasserin |4 aut | |
245 | 1 | 0 | |a Structural variant analysis of a cancer reference cell line sample using multiple sequencing technologies |
264 | 1 | |c 2022 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 15.12.2022 | ||
500 | |a Date Revised 19.01.2023 | ||
500 | |a published: Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © 2022. The Author(s). | ||
520 | |a BACKGROUND: The cancer genome is commonly altered with thousands of structural rearrangements including insertions, deletions, translocation, inversions, duplications, and copy number variations. Thus, structural variant (SV) characterization plays a paramount role in cancer target identification, oncology diagnostics, and personalized medicine. As part of the SEQC2 Consortium effort, the present study established and evaluated a consensus SV call set using a breast cancer reference cell line and matched normal control derived from the same donor, which were used in our companion benchmarking studies as reference samples | ||
520 | |a RESULTS: We systematically investigated somatic SVs in the reference cancer cell line by comparing to a matched normal cell line using multiple NGS platforms including Illumina short-read, 10X Genomics linked reads, PacBio long reads, Oxford Nanopore long reads, and high-throughput chromosome conformation capture (Hi-C). We established a consensus SV call set of a total of 1788 SVs including 717 deletions, 230 duplications, 551 insertions, 133 inversions, 146 translocations, and 11 breakends for the reference cancer cell line. To independently evaluate and cross-validate the accuracy of our consensus SV call set, we used orthogonal methods including PCR-based validation, Affymetrix arrays, Bionano optical mapping, and identification of fusion genes detected from RNA-seq. We evaluated the strengths and weaknesses of each NGS technology for SV determination, and our findings provide an actionable guide to improve cancer genome SV detection sensitivity and accuracy | ||
520 | |a CONCLUSIONS: A high-confidence consensus SV call set was established for the reference cancer cell line. A large subset of the variants identified was validated by multiple orthogonal methods | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Research Support, N.I.H., Intramural | |
650 | 4 | |a Research Support, U.S. Gov't, P.H.S. | |
650 | 4 | |a Research Support, Non-U.S. Gov't | |
650 | 4 | |a Research Support, N.I.H., Extramural | |
650 | 4 | |a Cancer | |
650 | 4 | |a Multiple platforms | |
650 | 4 | |a Next-generation sequencing technology | |
650 | 4 | |a Reference call set | |
650 | 4 | |a Structural variant calling algorithm | |
650 | 4 | |a Structural variation | |
700 | 1 | |a Shen, Tsai-Wei |e verfasserin |4 aut | |
700 | 1 | |a Chen, Xiongfong |e verfasserin |4 aut | |
700 | 1 | |a Jaeger, Erich |e verfasserin |4 aut | |
700 | 1 | |a Li, Zhipan |e verfasserin |4 aut | |
700 | 1 | |a Chen, Zhong |e verfasserin |4 aut | |
700 | 1 | |a Chen, Wanqiu |e verfasserin |4 aut | |
700 | 1 | |a Tran, Bao |e verfasserin |4 aut | |
700 | 1 | |a Kusko, Rebecca |e verfasserin |4 aut | |
700 | 1 | |a Wang, Limin |e verfasserin |4 aut | |
700 | 1 | |a Pang, Andy Wing Chun |e verfasserin |4 aut | |
700 | 1 | |a Yang, Zhaowei |e verfasserin |4 aut | |
700 | 1 | |a Choudhari, Sulbha |e verfasserin |4 aut | |
700 | 1 | |a Colgan, Michael |e verfasserin |4 aut | |
700 | 1 | |a Fang, Li Tai |e verfasserin |4 aut | |
700 | 1 | |a Carroll, Andrew |e verfasserin |4 aut | |
700 | 1 | |a Shetty, Jyoti |e verfasserin |4 aut | |
700 | 1 | |a Kriga, Yuliya |e verfasserin |4 aut | |
700 | 1 | |a German, Oksana |e verfasserin |4 aut | |
700 | 1 | |a Smirnova, Tatyana |e verfasserin |4 aut | |
700 | 1 | |a Liu, Tiantain |e verfasserin |4 aut | |
700 | 1 | |a Li, Jing |e verfasserin |4 aut | |
700 | 1 | |a Kellman, Ben |e verfasserin |4 aut | |
700 | 1 | |a Hong, Karl |e verfasserin |4 aut | |
700 | 1 | |a Hastie, Alex R |e verfasserin |4 aut | |
700 | 1 | |a Natarajan, Aparna |e verfasserin |4 aut | |
700 | 1 | |a Moshrefi, Ali |e verfasserin |4 aut | |
700 | 1 | |a Granat, Anastasiya |e verfasserin |4 aut | |
700 | 1 | |a Truong, Tiffany |e verfasserin |4 aut | |
700 | 1 | |a Bombardi, Robin |e verfasserin |4 aut | |
700 | 1 | |a Mankinen, Veronnica |e verfasserin |4 aut | |
700 | 1 | |a Meerzaman, Daoud |e verfasserin |4 aut | |
700 | 1 | |a Mason, Christopher E |e verfasserin |4 aut | |
700 | 1 | |a Collins, Jack |e verfasserin |4 aut | |
700 | 1 | |a Stahlberg, Eric |e verfasserin |4 aut | |
700 | 1 | |a Xiao, Chunlin |e verfasserin |4 aut | |
700 | 1 | |a Wang, Charles |e verfasserin |4 aut | |
700 | 1 | |a Xiao, Wenming |e verfasserin |4 aut | |
700 | 1 | |a Zhao, Yongmei |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Genome biology |d 2000 |g 23(2022), 1 vom: 13. Dez., Seite 255 |w (DE-627)NLM110197372 |x 1474-760X |7 nnns |
773 | 1 | 8 | |g volume:23 |g year:2022 |g number:1 |g day:13 |g month:12 |g pages:255 |
856 | 4 | 0 | |u http://dx.doi.org/10.1186/s13059-022-02816-6 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 23 |j 2022 |e 1 |b 13 |c 12 |h 255 |