PanCake: A Data Structure for Pangenomes

Ernst, Corinna; Rahmann, Sven

doi:10.4230/OASIcs.GCB.2013.35

Document

PanCake: A Data Structure for Pangenomes

Authors Corinna Ernst, Sven Rahmann

Part of: Volume: German Conference on Bioinformatics 2013 (GCB 2013)
Part of: Series: Open Access Series in Informatics (OASIcs)
License: Creative Commons Attribution 3.0 Unported license
Publication Date: 2013-09-09

PDF

File

PDF

OASIcs.GCB.2013.35.pdf

Filesize: 2.4 MB
11 pages

Document Identifiers

DOI: 10.4230/OASIcs.GCB.2013.35
URN: urn:nbn:de:0030-drops-42314

Subject Classification

Keywords

pangenome
data structure
core genome
comparative genomics

Metrics

Access Statistics
Total Accesses (updated on a weekly basis)

0

Document

0

Metadata

Abstract

We present a pangenome data structure ("PanCake") for sets of related genomes, based on bundling similar sequence regions into shared features, which are derived from genome-wide pairwise sequence alignments.
We discuss the design of the data structure, basic operations on it and methods to predict core genomes and singleton regions.
In contrast to many other pangenome analysis tools, like EDGAR or PGAT, PanCake is independent of gene annotations.
Nevertheless, comparison of identified core and singleton regions shows good agreements.
The PanCake data structure requires significantly less space than the sum of individual sequence files.

Cite As Get BibTex

Corinna Ernst and Sven Rahmann. PanCake: A Data Structure for Pangenomes. In German Conference on Bioinformatics 2013. Open Access Series in Informatics (OASIcs), Volume 34, pp. 35-45, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2013) https://doi.org/10.4230/OASIcs.GCB.2013.35

Author Details

Corinna Ernst

Sven Rahmann

Any Issues?

Feedback on the Current Page

Thanks for your feedback!

Feedback submitted to Dagstuhl Publishing

Could not send message

Please try again later or send an E-mail