The GREENER Geospatial Data Science Platform

GREENER bridges the gap between raw geospatial data and actionable research. We curate, process, and link decades of environmental exposure data to research cohorts.

Dataset Catalog Preview

Explore our comprehensive collection of environmental, health, and social datasets.

Dataset Name Source Agency Resolution Temporal Coverage

Visualizing Complex Ecosystems

Transforming raw environmental data into actionable, easy-to-understand insights.

A Curated Library of Environmental Exposure Data

GREENER maintains a continuously growing catalog of standardized environmental and social datasets — from satellite-derived air quality and greenspace indices to federal vulnerability and housing indicators. Every dataset is harmonized to CONUS census geographies and validated for spatial and temporal completeness before publication.

PUBLIC PM2.5 Harvard · 1km PUBLIC NDVI Landsat · 30m PUBLIC SVI CDC · Tract PUBLIC COI 3.0 BU · Tract PUBLIC Smoke PM2.5 Stanford · 10km COMING SOON EPA AQS EPA · Monitor

Secure, Institutional-Grade Geocoding

Participant addresses are geocoded entirely on-server within Mount Sinai's institutional infrastructure — no protected health information leaves our network. Each address is resolved to a precise coordinate and can be matched to any standard geography — census tract, block group, block, or ZCTA — or used directly for point-level residential extraction. Every geocoded record includes a match score, match type, and quality flag for transparent downstream QC.

Address File De-identified On-Server No PII exits network Census Tract Match score + flag

From Participant Address to Linked Exposure Dataset

GREENER links participant cohorts to multiple environmental exposure layers simultaneously — air quality, greenspace, social vulnerability, climate, wildfire smoke, and more. Linkage is performed at the census tract level or at residence-level resolution, depending on the dataset and your study design.

Air Quality Greenspace Social Climate Wildfire Satellite

Define Any Study Area, At Any Scale

Researchers can define their geographic area of interest by drawing a polygon, entering a bounding box, or selecting standard geographies — state, county, ZIP code, or census tract. Extraction is available at multiple spatial resolutions, from ZCTA-level aggregates to fine-scale grid exposures.

Region Selected Supports: State · County · ZCTA · Tract · Custom Polygon · Bounding Box

Transparent Data Quality, Every Delivery

Every dataset delivery includes a complete data dictionary documenting variable definitions, units, and sources. Cohort linkage requests also include a match rate report and QA flag columns. Temporal gaps, geographic mismatches, and missing values are documented before data leaves the platform — so your methods section reflects the actual state of the data.

DATA QUALITY REPORT Geocoding Match Rate Address → Census Tract assignment 99.4% Data Dictionary Variable names, units, sources Included Temporal Coverage Verified per variable per year Validated QA Flags Temporal gaps documented before delivery 2 flagged

How GREENER Works

From Participant Address to Linked Exposure Dataset

A secure, four-step pipeline — run on institutional infrastructure

STEP 1 STEP 2 STEP 3 STEP 4 OUTPUT 📍 Participant Address De-identified list 🔒 Secure Geocoding GIS · On-server 🗺️ Census Tract Assignment 2020 TIGER/Line Exposure Layers PM2.5 · NO₂ · O₃ NDVI · Greenspace SVI · COI · PLACES Climate · Fire · More 📄 Linked Dataset CSV · Parquet · Dict
Step 1 — Submit Your Cohort

Submit Your Participant List

Upload a de-identified participant list with address fields through your secure GREENER account. The platform deduplicates records, flags formatting issues, and prepares the data for on-server geocoding. No protected health information is transmitted or stored externally.

Step 2 — Geocoding & Geography Assignment

Secure Geocoding & Spatial Assignment

Addresses are geocoded using the Mount Sinai institutional GIS deployment. Each participant is matched to a geography (e.g. census tract, zip code, or custom polygon) and assigned a match score and quality flag. The geocoding process is entirely server-side and produces no outbound data transfer.

Step 3 — Select Your Exposures

Browse & Select Datasets

Explore the GREENER Data Library and add datasets to your request cart. Choose from air quality, vegetation indices, social vulnerability, climate, and built environment indicators. Specify the years, temporal resolution, and spatial aggregation level that fit your study design.

Step 4 — Receive Your Linked Dataset

Delivery with Full Documentation

Your request is processed by the GREENER team and delivered as a CSV or Apache Parquet file. Every delivery includes a data dictionary, match rate report, QA flag columns, and metadata describing the source, processing steps, and spatial resolution of each variable.

Step 5 — Analyze & Publish

Accelerate Your Research

By abstracting away the complex geospatial joins and harmonizing diverse datasets, GREENER enables you to move directly to statistical modeling and visualization. Build reproducible pipelines and accelerate your environmental health research.

Data Sources

GREENER curates data from peer-reviewed models and federal agencies, processed on institutional HPC infrastructure and validated before publication.

Data Source
Dataset Categories
EPA
Air quality Environmental justice
NASA
Greenspace (MODIS NDVI) Active fire (FIRMS) Climate
CDC / ATSDR
Social vulnerability Health indicators
USGS
Greenspace (Landsat NDVI) Elevation and terrain
Harvard University
PM2.5 NO2 O3
ECMWF
Climate reanalysis
NOAA
Climate observation data Smoke plumes
HUD
Housing affordability Built environment
Boston University
Social opportunity indicators
Stanford University
Wildfire smoke PM2.5
USDA Economic Research Service
Food access Built environment
All data is sourced from publicly available records provided by [NASA/Harvard/Yale/etc.]. Use of these names is for informational purposes only and does not imply any affiliation with, sponsorship by, or endorsement from these institutions. All trademarks are the property of their respective owners.

Our Team

Ready to Link Your Cohort?

GREENER is available to Mount Sinai investigators and CTSA-affiliated institutions. Register for an account to browse the full Data Library, or contact the team to discuss a cohort linkage request.

Request Access Contact The Team

This platform is supported by the Clinical and Translational Science Awards (CTSA) program, Grant UL1TR004419, from the National Center for Advancing Translational Sciences (NCATS), National Institutes of Health. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. Icahn School of Medicine at Mount Sinai | ConduITS — Conduits for the Integration of Translational Science

CTSA Logo Mount Sinai Logo