Welcome to the MGHPCC Virtual Booth

Harvard University

Reproducible Data Pipelines in Environmental Health Analysis

Tuesday, November 17 1:30 pm EST
Passcode: 759941
Reproducibility is important in all research, but is especially important in environmental health studies that can play a role in determining EPA standards and other policy changes. This talk will discuss some of the challenges involved in working with large health data sets and in creating reproducible data pipelines, as well as reviewing the structure and thinking behind one pipeline preparing public data for use with our other data sources.
Harvard School of Public Health
Department of Biostatistics
Ben Sabath
Chat with us @
Attend Consortium Talks @


Featured projects

Air Force Arcade
Airborne Optical Systems Test Bed (AOSTB)
Black Hole Initiative
Center for Scientific Computing and Visualization Research (CSCVR)
Data Centric Low Emission Mobility
Fast Accurate NURBS Optimization (FANO)
Fusarium Pathogenomics: Understanding Fungal Pathogenicity through Genomics
GLEAM: Global Epidemic and Mobility project
Lichtman Lab - Center for Brain Science
Lincoln Laboratory Supercomputing Center (LLSC)
LLSC Articles and Publications
MIT Initiative on the Digital Economy
MIT Laboratory for Financial Engineering
Northeast Cyberteam
Northeastern University and Williams College SC20 Student Cluster Competition Team
Quantum Architectures at Goodwill Computing Lab
Simulating Large Biomolecular Assemblies
The Center for Information and Systems Engineering (CISE)
The Mass Open Cloud and Open Cloud Initiative
Video and Imagery Dataset to Drive Public Safety
Visibility Estimation through Image Analytics (VEIA)
linkedin facebook pinterest youtube rss twitter instagram facebook-blank rss-blank linkedin-blank pinterest youtube twitter instagram