Resources | Reproducibility of Computational Research Community of Practice

This page lists tools and resources relevant to reproducibility of computational research, organized by category. Inclusion here does not constitute endorsement, this is a curated reference list, not an exhaustive catalog. Resources are provided for reference only and are maintained by their respective developers and content owners.

Reproducibility Resources

Provenance capture tools

ReproZip

Traces system calls to bundle all files, libraries, and environment details needed to reproduce a computation into a portable archive.

reprozip.org ↗

Sciunit

Versioned execution history system that allows individual runs to be inspected, compared, and shared across branches.

GitHub ↗

noWorkflow

Captures fine-grained provenance from Python scripts without requiring a workflow definition, recording function calls and file accesses.

GitHub ↗

Workflow engines

Nextflow

Defines pipelines as DAGs of named processes with declared inputs and outputs, with strong incremental execution and container integration.

nextflow.io ↗

Snakemake

Python-based workflow management system that creates reproducible and scalable data analyses using a readable, domain-specific language.

snakemake.github.io ↗

DVC

Data version control system providing incremental execution, data versioning, and experiment comparison for explicitly defined workflows.

dvc.org ↗

CWL / cwltool

An open standard for writing portable, reproducible workflows. Tools that support CWL can run the same pipeline across different systems without modification.

commonwl.org ↗

Pegasus WMS

Runs large scientific workflows across HPC, cloud, and grid systems. Handles job scheduling, data movement, and failure recovery, and logs provenance automatically as jobs run.

pegasus.isi.edu ↗

Galaxy

A web-based platform where you can build, run, and share analysis workflows without writing code. Keeps a full history of every step, making results easy to reproduce and share.

galaxyproject.org ↗

WDL / Cromwell

A workflow language paired with an execution engine from the Broad Institute. Widely used in genomics for writing pipelines that run the same way locally, on a cluster, or in the cloud.

openwdl.org ↗

Notebook environments

Jupyter Lab

Next-generation web-based interface for Project Jupyter, providing a flexible environment for interactive and reproducible computing.

jupyterlab.readthedocs.io ↗

Jupyter Notebook

Original web application for creating and sharing computational documents with live code, equations, visualizations, and narrative text.

jupyter.org ↗

Papermill

Tool for parameterizing, executing, and analyzing Jupyter notebooks, enabling reproducible notebook workflows at scale.

papermill.readthedocs.io ↗

Binder / repo2docker

Point it at a GitHub repo and it spins up a live, reproducible environment in the browser. No setup required, anyone can run your notebooks exactly as you did.

mybinder.org ↗

Data versioning and management

DataLad

Git for data. Tracks versions of large datasets alongside code, records what commands produced what outputs, and lets you re-run any prior analysis from a single command.

datalad.org ↗

Experiment tracking

MLflow

Tracks what you ran, with what parameters, and what it produced. Useful for keeping experiments organized and being able to go back and reproduce any past result.

mlflow.org ↗

Weights & Biases

Logs every detail of a training or compute run (hyperparameters, metrics, code version, system info) so you can compare runs and reproduce any result later.

wandb.ai ↗

Sacred

A lightweight Python library that captures the configuration, code state, and outputs of each experiment run. Low overhead, easy to add to existing scripts.

GitHub ↗

Sumatra

Lightweight experiment tracking built for simulation workflows. Records parameters, code version, and outputs for each run with minimal changes to your existing scripts.

neuralensemble.org ↗

OpenML

A shared platform for reproducible ML experiments. Upload datasets, tasks, and results so others can benchmark against them or build on your work directly.

openml.org ↗

Publishing & packaging formats

RO-Crate

Machine-readable format for describing research objects and their provenance using JSON-LD, designed for interoperability at publication.

researchobject.org ↗

Whole Tale

Platform that packages code, data, and environments into shareable, executable units for frictionless reproducibility at publication.

wholetale.org ↗

Code Ocean

Helps computational teams provision compute, set up environments, scale pipelines, and get a reproducible, traceable record of their work.

codeocean.com ↗

Renku

An open platform from EPFL that ties together GitLab, Jupyter, and workflow tracking in one place. Designed for collaborative research where you want code, data, and results to stay connected.

renkulab.io ↗

Artifact repositories

Zenodo

Free, open repository from CERN for archiving code, data, and software. Assigns a DOI to everything you deposit, making it citable and permanently accessible.

zenodo.org ↗

Figshare

A repository for sharing research outputs, datasets, figures, code, and preprints with DOIs. Commonly required as the artifact submission platform at CS venues including SC and DSN.

figshare.com ↗

OSF

A free platform for managing research projects from start to finish. Supports preregistration, file storage, versioning, and sharing, all in one place and built around open science norms.

osf.io ↗

Containers & environments

Apptainer / Singularity

Rootless container platform designed for HPC environments, enabling portable and reproducible execution of software across systems.

apptainer.org ↗

Docker

Platform for developing, shipping, and running applications in containers, ensuring consistent environments across development and production.

docker.com ↗

Conda

Open-source package and environment management system that allows reproducible installation of software dependencies across platforms.

docs.conda.io ↗

Computational testbeds

Chameleon Cloud

An NSF-funded bare-metal cloud testbed for CS research, run by UChicago and TACC. Used as the default platform for SC artifact evaluation, with tools for packaging and sharing reproducible experiments.

chameleoncloud.org ↗

MINCER

A Chameleon Cloud appliance for standardized hardware performance measurement. Wraps PAPI in Docker containers so you can collect CPU, GPU, and energy metrics consistently across architectures.

REPETO webinar ↗

Standards, reports & guidelines

NASEM 2019 Report

The 2019 National Academies consensus report that set the definitions for reproducibility and replicability used across science today. The starting point for understanding why this matters and what to do about it.

nationalacademies.org ↗

FAIR Principles

Four principles: Findable, Accessible, Interoperable, Reusable, for how research data should be shared. Now referenced by most journals, funders, and artifact evaluation processes.

go-fair.org ↗

W3C PROV-DM

The W3C standard for representing provenance: who did what, when, and with what inputs. The common vocabulary that most provenance tools and formats are built on top of.

w3.org ↗

ProvONE

An extension of W3C PROV-DM designed specifically for scientific workflows. Adds concepts like programs, ports, and channels to better capture how data moves through a pipeline.

provone.org ↗

ACM Artifact Badging

ACM's official policy for recognizing reproducible research. Defines what "Artifacts Available," "Artifacts Evaluated," and "Results Reproduced" mean and how badges are awarded across ACM venues.

acm.org ↗

SC Reproducibility Initiative

SC's program requiring an Artifact Description appendix with every submitted paper, with optional Artifact Evaluation for accepted work. The main reproducibility process for the HPC and systems community.

sc25.supercomputing.org ↗

Community & initiatives

REPETO

NSF-funded reproducibility initiative based at UChicago. Runs the annual Summer of Reproducibility student program and hosts webinars and community events focused on practical reproducibility in HPC.

repeto.cs.uchicago.edu ↗

cTuning / CK

A framework for packaging and sharing ML and systems experiments so others can reproduce and build on them. Used by MLCommons and several conference artifact evaluation processes.

GitHub ↗

Lifecycle-Spanning Tool

Developed at RCC UChicago

Tracey

A lightweight, end-to-end reproducibility tool that captures computational workflows automatically from real executions covering the full arc from exploration to publication. No pipeline pre-specification required.

View on GitHub ↗View Website ↗

✓ Automatic background capture, no per-command annotation
✓ Derives workflow DAGs from execution history, not declarations
✓ Supports HPC environments with native Slurm integration
✓ Packages workflows into portable Apptainer/Singularity containers
✓ Minimal measurable overhead for polling-based capture mode

We are interested in suggestions of additional content for this section. Please share additional resources and tools that are missing on this list by emailing reproducibility@rcc.uchicago.edu.