GenArk : Towards a million UCSC Genome Browsers

Interactive graphical genome browsers are essential tools for biologists working with DNA sequences. Although tens of thousands of new genome assemblies have become available over the last decade, accessibility is limited by the work involved in manually creating browsers and curating annotations. The results can push the limits of data storage infrastructure. To facilitate managing this increasing number of genome assemblies, we created the Genome Archive (GenArk) collection of UCSC Genome Browsers from assemblies hosted at NCBI(1). Built on our established assembly hub system, this collection enables fast, on-demand visualization of chromosome regions without requiring a database server. Available annotations include gene models, some mapped through whole-genome alignments, repeat masks, GC content, and others. We also modified our popular BLAT(2) aligner and in-silico PCR to support a large number of genomes using limited RAM. Users can upload additional annotations themselves via track hubs(3) and custom tracks. We can import more annotations in bulk from third-party resources, demonstrated here with TOGA(4) gene models. 2,430 GenArk assemblies are listed at https://hgdownload.soe.ucsc.edu/hubs/ and can be found by searching on the main UCSC gateway page. We will continue to add human high-quality assemblies and for other organisms, we are looking forward to receiving requests from the research community for ever more browsers and whole-genome alignments via http://genome.ucsc.edu/assemblyRequest.html.

Errataetall:

UpdateIn: Genome Biol. 2023 Oct 2;24(1):217. - PMID 37784172

Medienart:

E-Artikel

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

Zur Gesamtaufnahme - year:2023

Enthalten in:

Research square - (2023) vom: 03. Apr.

Sprache:

Englisch

Beteiligte Personen:

Clawson, Hiram [VerfasserIn]
Lee, Brian T [VerfasserIn]
Raney, Brian J [VerfasserIn]
Barber, Galt P [VerfasserIn]
Casper, Jonathan [VerfasserIn]
Diekhans, Mark [VerfasserIn]
Fischer, Clay [VerfasserIn]
Gonzalez, Jairo Navarro [VerfasserIn]
Hinrichs, Angie S [VerfasserIn]
Lee, Christopher M [VerfasserIn]
Nassar, Luis R [VerfasserIn]
Perez, Gerardo [VerfasserIn]
Wick, Brittney [VerfasserIn]
Schmelter, Daniel [VerfasserIn]
Speir, Matthew L [VerfasserIn]
Armstrong, Joel [VerfasserIn]
Zweig, Ann S [VerfasserIn]
Kuhn, Robert M [VerfasserIn]
Kirilenko, Bogdan M [VerfasserIn]
Hiller, Michael [VerfasserIn]
Haussler, David [VerfasserIn]
Kent, W James [VerfasserIn]
Haeussler, Maximilian [VerfasserIn]

Links:

Volltext

Themen:

Preprint

Anmerkungen:

Date Revised 23.02.2024

published: Electronic

UpdateIn: Genome Biol. 2023 Oct 2;24(1):217. - PMID 37784172

Citation Status PubMed-not-MEDLINE

doi:

10.21203/rs.3.rs-2697398/v1

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM355704773