RapTCR: Rapid exploration and visualization of T-cell receptor repertoires

Abstract Motivation The acquisition of T-cell receptor (TCR) repertoire sequence data has become faster and cheaper due to advancements in high-throughput sequencing. However, fully exploiting the diagnostic and clinical potential within these TCR repertoires requires a thorough understanding of the inherent repertoire structure. Hence, visualizing the full space of TCR sequences could be a key step towards enabling exploratory analysis of TCR repertoire, driving their enhanced interrogation. Nonetheless, current methods remain limited to rough profiling of TCR V and J gene distributions. Addressing this need, we developed RapTCR, a tool for rapid visualization and post-analysis of TCR repertoires.Approach To overcome computational complexity, RapTCR introduces a novel, simple embedding strategy that represents TCR amino acid sequences as short vectors while retaining their pairwise alignment similarity. RapTCR then applies efficient algorithms for indexing these vectors and constructing their nearest neighbor network. It provides multiple visualization options to map and interactively explore a TCR network as a two-dimensional representation. Benchmarking analyses using epitope-annotated datasets demonstrate that these RapTCR visualizations capture TCR similarity features on a global level (e.g., J gene) and locally (e.g., epitope reactivity). RapTCR is available as a Python package, implementing the intuitive scikit-learn syntax to easily generate insightful, publication-ready figures for TCR repertoires of any size.Availability and Implementation RapTCR was written in Python 3. It is available as an anaconda package (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://anaconda.org/vincentvandeuren/raptcr">https://anaconda.org/vincentvandeuren/raptcr</jats:ext-link>), and on github (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/vincentvandeuren/RapTCR">https://github.com/vincentvandeuren/RapTCR</jats:ext-link>). Documentation and example notebooks are available at vincentvandeuren.github.io/rapTCR_docs/.Contact <jats:email>pieter.meysmanuantwerpen.be</jats:email>.

Medienart:

Preprint

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

bioRxiv.org - (2023) vom: 21. Sept. Zur Gesamtaufnahme - year:2023

Sprache:

Englisch

Beteiligte Personen:

Van Deuren, Vincent M.L. [VerfasserIn]
Valkiers, Sebastiaan [VerfasserIn]
Laukens, Kris [VerfasserIn]
Meysman, Pieter [VerfasserIn]

Links:

Volltext [kostenfrei]

Themen:

570
Biology

doi:

10.1101/2023.09.13.557604

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

XBI040871991