anndata: Annotated data

Summary anndata is a Python package for handling annotated data matrices in memory and on disk (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="http://github.com/theislab/anndata">github.com/theislab/anndata</jats:ext-link>), positioned between pandas and xarray. anndata offers a broad range of computationally efficient features including, among others, sparse data support, lazy operations, and a PyTorch interface.Statement of need Generating insight from high-dimensional data matrices typically works through training models that annotate observations and variables via low-dimensional representations. In exploratory data analysis, this involvesiterativetraining and analysis using original and learned annotations and task-associated representations. anndata offers a canonical data structure for book-keeping these, which is neither addressed by pandas (McKinney, 2010), nor xarray (Hoyer &amp; Hamman, 2017), nor commonly-used modeling packages like scikit-learn (Pedregosa et al., 2011)..

Medienart:

Preprint

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

bioRxiv.org - (2023) vom: 19. Jan. Zur Gesamtaufnahme - year:2023

Sprache:

Englisch

Beteiligte Personen:

Virshup, Isaac [VerfasserIn]
Rybakov, Sergei [VerfasserIn]
Theis, Fabian J. [VerfasserIn]
Angerer, Philipp [VerfasserIn]
Wolf, F. Alexander [VerfasserIn]

Links:

Volltext [kostenfrei]

Themen:

570
Biology

doi:

10.1101/2021.12.16.473007

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

XBI033256462