In line with our commitment to
#openscience, we are excited to introduce Dflow, an open-source Python toolkit designed for scientists to construct workflows with simple programming interfaces.
Github:
lnkd.in/geYpt5me
Paper:
arxiv.org/abs/2404.18392
Dflow enables complex process control and task scheduling across a distributed, heterogeneous infrastructure, leveraging containers and Kubernetes for flexibility. Dflow is highly observable and can scale to thousands of concurrent nodes per workflow, enhancing the efficiency of complex scientific computing tasks. The basic unit in Dflow, known as an Operation (OP), is reusable and independent of the underlying infrastructure or context. Dozens of workflow projects have been developed based on Dflow, spanning a wide range of projects. We anticipate that the reusability of Dflow and its components will encourage more scientists to publish their workflows and OP components. These components, in turn, can be adapted and reused in various contexts, fostering greater collaboration and innovation in the scientific community.
Below are a few
#openscience application built on Dflow already:
-- FPOP: A collection of OPs for first-principle calculation
lnkd.in/g7WTctC4
-- APEX: Alloy Property EXplorer
lnkd.in/gEg9BPCR
-- Rid-kit: Reinforced dynamics
lnkd.in/gZAGeaQ6
-- DeePKS flow: Machine learning functional of generalized Kohn–Sham density functional theory (DFT)
lnkd.in/gWRMugEh
and more ...
#AI4Science