Carregant...
Fitxers
Tipus de document
ArticleVersió
Versió publicadaData de publicació
Llicència de publicació
Si us plau utilitzeu sempre aquest identificador per citar o enllaçar aquest document: https://hdl.handle.net/2445/227061
DeBasher: a flow-based programming bash extension for the implementation of complex and interactive workflows with stateful processes
Títol de la revista
Autors
Director/Tutor
ISSN de la revista
Títol del volum
Recurs relacionat
Resum
Background: Bioinformatics data analysis faces significant challenges. As data analysis
often takes the form of pipelines or workflows, workflow managers (WfMs) have
become essential. Data flow programming constitutes the preferred approach in WfMs,
enabling parallel processes activated reactively based on input availability. While this
paradigm typically follows a linear, acyclic progression, cyclic workflows are sometimes
necessary in bioinformatics analyses. These cyclic workflows also present an oppor‑
tunity to explore workflow interactivity, a feature not widely implemented in existing
WfMs.
Results: We propose DeBasher, a tool that adopts the flow‑based programming (FBP)
paradigm, in which the workflow components are in control of their life cycle and can
store state information, allowing the execution of complex workflows that include
cycles. DeBasher also incorporates a powerful model of interactivity, where the user
can alter the behavior of a running workflow. Additionally, DeBasher allows the user
to define triggers so as to initiate the execution of a complete workflow or a part
of it. The ability to execute processes with state and in control of their life cycle
also has applications in dynamic scheduling tasks. Furthermore, DeBasher presents
a series of extra features, including the combination of multiple workflows at runt‑
ime through a feature we have called runtime piping, switching to static scheduling
to increase scalability, or implementing processes in multiple languages. DeBasher
has been successfully used to process 131.7 TB of genomic data by means of a variant
calling pipeline.
Conclusions: DeBasher is an FBP Bash extension that can be useful in a wide range
of situations and in particular when implementing complex workflows, workflows
with interactivity or triggers, or when a high scalability is required.
Keywords: Workflow, Pipeline, Workflow manager, Data flow programming, Flow‑
based programming, Bash, Workflow interactivity, Workflow triggers
Matèries (anglès)
Citació
Citació
ORTIZ MARTÍNEZ, Daniel. DeBasher: a flow-based programming bash extension for the implementation of complex and interactive workflows with stateful processes. _BMC Bioinformatics_. 2025. Vol. 26. [consulta: 21 de febrer de 2026]. ISSN: 1471-2105. [Disponible a: https://hdl.handle.net/2445/227061]