About Us +
The Energy Storage and Distributed Resources Division (ESDR) works on developing advanced batteries and fuel cells for transportation and stationary energy storage, grid-connected technologies for a cleaner, more reliable, resilient, and cost-effective future, and demand responsive and distributed energy technologies for a dynamic electric grid.
Research +
We work closely with academic, government and industry partners to conduct foundational and applied research that provides the groundwork for the development of transformative new energy technologies in the areas of energy storage and conversion, electrical grid, advanced materials for the energy infrastructure, science of manufacturing and water-energy nexus.

Visit our focus areas and research groups at the right to find out more.
Broad Challenges We Face +
Research Groups +
Publications
News
Seminars

FireWorks: a dynamic workflow system designed for high-throughput applications

Publication Type

Journal Article

Date Published

12/2015

Authors

Jain, Anubhav, Shyue Ping Ong, Wei Chen, Bharat Medasan, Xiaohui Qu, Michael Kocher, Miriam Brafman, Guido Petretto, Gian-Marco Rignanese, Geoffroy Hautier, Daniel Gunter, Kristin A Persson

DOI

10.1002/cpe.3505

Abstract

This paper introduces FireWorks, a workflow software for running high-throughput calculation workflows at supercomputing centers. FireWorks has been used to complete over 50 million CPU-hours worth of computational chemistry and materials science calculations at the National Energy Research Supercomputing Center. It has been designed to serve the demanding high-throughput computing needs of these applications, with extensive support for (i) concurrent execution through job packing, (ii) failure detection and correction, (iii) provenance and reporting for long-running projects, (iv) automated duplicate detection, and (v) dynamic workflows (i.e., modifying the workflow graph during runtime). We have found that these features are highly relevant to enabling modern data-driven and high-throughput science applications, and we discuss our implementation strategy that rests on Python and NoSQL databases (MongoDB). Finally, we present performance data and limitations of our approach along with planned future work.

Journal

Concurrency and Computation: Practice and Experience

Volume

Year of Publication

2015

Issue

Organization

Applied Energy Materials Group, Energy Storage and Distributed Resources Division

Research Areas

No Research Area