RapTCR: Rapid exploration and visualization of T-cell receptor repertoires
Abstract Motivation The acquisition of T-cell receptor (TCR) repertoire sequence data has become faster and cheaper due to advancements in high-throughput sequencing. However, fully exploiting the diagnostic and clinical potential within these TCR repertoires requires a thorough understanding of the inherent repertoire structure. Hence, visualizing the full space of TCR sequences could be a key step towards enabling exploratory analysis of TCR repertoire, driving their enhanced interrogation. Nonetheless, current methods remain limited to rough profiling of TCR V and J gene distributions. Addressing this need, we developed RapTCR, a tool for rapid visualization and post-analysis of TCR repertoires.Approach To overcome computational complexity, RapTCR introduces a novel, simple embedding strategy that represents TCR amino acid sequences as short vectors while retaining their pairwise alignment similarity. RapTCR then applies efficient algorithms for indexing these vectors and constructing their nearest neighbor network. It provides multiple visualization options to map and interactively explore a TCR network as a two-dimensional representation. Benchmarking analyses using epitope-annotated datasets demonstrate that these RapTCR visualizations capture TCR similarity features on a global level (e.g., J gene) and locally (e.g., epitope reactivity). RapTCR is available as a Python package, implementing the intuitive scikit-learn syntax to easily generate insightful, publication-ready figures for TCR repertoires of any size.Availability and Implementation RapTCR was written in Python 3. It is available as an anaconda package (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://anaconda.org/vincentvandeuren/raptcr">https://anaconda.org/vincentvandeuren/raptcr</jats:ext-link>), and on github (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/vincentvandeuren/RapTCR">https://github.com/vincentvandeuren/RapTCR</jats:ext-link>). Documentation and example notebooks are available at vincentvandeuren.github.io/rapTCR_docs/.Contact <jats:email>pieter.meysmanuantwerpen.be</jats:email>.
Medienart: |
Preprint |
---|
Erscheinungsjahr: |
2023 |
---|---|
Erschienen: |
2023 |
Enthalten in: |
bioRxiv.org - (2023) vom: 21. Sept. Zur Gesamtaufnahme - year:2023 |
---|
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Van Deuren, Vincent M.L. [VerfasserIn] |
---|
Links: |
Volltext [kostenfrei] |
---|
Themen: |
---|
doi: |
10.1101/2023.09.13.557604 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
XBI040871991 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | XBI040871991 | ||
003 | DE-627 | ||
005 | 20231205144149.0 | ||
007 | cr uuu---uuuuu | ||
008 | 230918s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1101/2023.09.13.557604 |2 doi | |
035 | |a (DE-627)XBI040871991 | ||
035 | |a (biorXiv)10.1101/2023.09.13.557604 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Van Deuren, Vincent M.L. |e verfasserin |0 (orcid)0000-0002-7734-5547 |4 aut | |
245 | 1 | 0 | |a RapTCR: Rapid exploration and visualization of T-cell receptor repertoires |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a Abstract Motivation The acquisition of T-cell receptor (TCR) repertoire sequence data has become faster and cheaper due to advancements in high-throughput sequencing. However, fully exploiting the diagnostic and clinical potential within these TCR repertoires requires a thorough understanding of the inherent repertoire structure. Hence, visualizing the full space of TCR sequences could be a key step towards enabling exploratory analysis of TCR repertoire, driving their enhanced interrogation. Nonetheless, current methods remain limited to rough profiling of TCR V and J gene distributions. Addressing this need, we developed RapTCR, a tool for rapid visualization and post-analysis of TCR repertoires.Approach To overcome computational complexity, RapTCR introduces a novel, simple embedding strategy that represents TCR amino acid sequences as short vectors while retaining their pairwise alignment similarity. RapTCR then applies efficient algorithms for indexing these vectors and constructing their nearest neighbor network. It provides multiple visualization options to map and interactively explore a TCR network as a two-dimensional representation. Benchmarking analyses using epitope-annotated datasets demonstrate that these RapTCR visualizations capture TCR similarity features on a global level (e.g., J gene) and locally (e.g., epitope reactivity). RapTCR is available as a Python package, implementing the intuitive scikit-learn syntax to easily generate insightful, publication-ready figures for TCR repertoires of any size.Availability and Implementation RapTCR was written in Python 3. It is available as an anaconda package (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://anaconda.org/vincentvandeuren/raptcr">https://anaconda.org/vincentvandeuren/raptcr</jats:ext-link>), and on github (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/vincentvandeuren/RapTCR">https://github.com/vincentvandeuren/RapTCR</jats:ext-link>). Documentation and example notebooks are available at vincentvandeuren.github.io/rapTCR_docs/.Contact <jats:email>pieter.meysmanuantwerpen.be</jats:email> | ||
650 | 4 | |a Biology |7 (dpeaa)DE-84 | |
650 | 4 | |a 570 |7 (dpeaa)DE-84 | |
700 | 1 | |a Valkiers, Sebastiaan |0 (orcid)0000-0002-4940-1310 |4 aut | |
700 | 1 | |a Laukens, Kris |0 (orcid)0000-0002-8217-2564 |4 aut | |
700 | 1 | |a Meysman, Pieter |0 (orcid)0000-0001-5903-633X |4 aut | |
773 | 0 | 8 | |i Enthalten in |t bioRxiv.org |g (2023) vom: 21. Sept. |
773 | 1 | 8 | |g year:2023 |g day:21 |g month:09 |
856 | 4 | 0 | |u http://dx.doi.org/10.1101/2023.09.13.557604 |z kostenfrei |3 Volltext |
912 | |a GBV_XBI | ||
951 | |a AR | ||
952 | |j 2023 |b 21 |c 09 |