Building and Documenting Workflows with Python-Based Snakemake

Authors Johannes Köster, Sven Rahmann




File

OASIcs.GCB.2012.49.pdf
  • Filesize: 398 kB
  • 8 pages



Cite As

Johannes Köster and Sven Rahmann. Building and Documenting Workflows with Python-Based Snakemake. In German Conference on Bioinformatics 2012. Open Access Series in Informatics (OASIcs), Volume 26, pp. 49-56, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2012) https://doi.org/10.4230/OASIcs.GCB.2012.49

Abstract

Snakemake is a novel workflow engine with a simple Python-derived workflow definition language and an optimizing execution environment. It is the first system that supports multiple named wildcards (or variables) in the input and output filenames of each rule definition. It also makes it possible to write human-readable workflows that document themselves. We have found Snakemake especially useful for building high-throughput sequencing data analysis pipelines and present examples from this area. Snakemake exemplifies a generic way to implement a domain-specific language in Python without writing a full parser or introducing syntactic overhead by overloading language features.
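
To make the idea of multiple named wildcards concrete, the following is a minimal Python sketch of how an output filename pattern might be matched against a requested file and how the extracted values might then be substituted into an input pattern. It is an illustration only, not Snakemake's actual implementation: the function names `pattern_to_regex`, `match_wildcards` and `fill_pattern`, the file layout, and the assumption that each wildcard name occurs at most once per pattern are all hypothetical.

```python
import re


def pattern_to_regex(pattern):
    """Compile a pattern such as 'mapped/{sample}.{ref}.bam' into a regex.

    Each '{name}' becomes a named group; all other text is matched literally.
    Assumption for this sketch: a wildcard name occurs at most once per pattern.
    """
    parts = []
    pos = 0
    for m in re.finditer(r"\{(\w+)\}", pattern):
        parts.append(re.escape(pattern[pos:m.start()]))  # literal text before the wildcard
        parts.append("(?P<%s>.+)" % m.group(1))          # the named wildcard itself
        pos = m.end()
    parts.append(re.escape(pattern[pos:]))               # trailing literal text
    return re.compile("".join(parts))


def match_wildcards(pattern, filename):
    """Return a dict of wildcard values if filename fits pattern, else None."""
    m = pattern_to_regex(pattern).fullmatch(filename)
    return m.groupdict() if m else None


def fill_pattern(pattern, wildcards):
    """Substitute wildcard values into a pattern; unused values are ignored."""
    return pattern.format(**wildcards)


# Hypothetical rule: the output depends on two wildcards, each input on one.
output_pattern = "mapped/{sample}.{ref}.bam"
input_patterns = ["reads/{sample}.fastq", "genomes/{ref}.fasta"]

values = match_wildcards(output_pattern, "mapped/a.hg19.bam")
inputs = [fill_pattern(p, values) for p in input_patterns]
print(values)
print(inputs)
```

In this sketch, requesting `mapped/a.hg19.bam` binds `sample` and `ref` from the output pattern, and the same bindings determine which input files the hypothetical rule needs. A real engine would additionally have to deal with ambiguous matches and with repeated wildcard names.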

Subject Classification

Keywords
  • workflow engine
  • dependency graph
  • knapsack problem
  • Python
  • high-throughput sequencing
  • next-generation sequencing
