# DS-00008 — EPA Air Quality System (AQS)
**Environmental Health & Quality of Life Indicators**
## Overview
The EPA Air Quality System (AQS) is the **authoritative source** for ambient air quality measurements in the United States. This data source provides regulatory-grade air quality data from 4,000+ monitoring stations nationwide, with a focus on parameters most critical to human health and wellbeing.
**Key Insight:** Air quality is a **structural determinant of wellbeing**. You cannot "self-care" your way out of breathing toxic air. PM2.5 exposure reduces life expectancy by months to years in polluted areas, and the burden is unevenly distributed: low-income communities and communities of color are disproportionately exposed.
## Why This Matters for Substrate
### Human Progress & Wellbeing Focus
Air quality is a fundamental structural constraint on human flourishing:
- **Life Expectancy:** PM2.5 shortens average life expectancy worldwide by an estimated 1.8 years (Air Quality Life Index)
- **Involuntary Exposure:** You breathe ~20,000 times per day — exposure is unavoidable
- **Environmental Injustice:** ZIP code determines exposure — structural inequality
- **Health Impacts:** Cardiovascular disease, respiratory disease, cognitive decline, pregnancy outcomes
- **Quality of Life:** Restricted outdoor activity on high pollution days, healthcare costs, lost productivity
**Unlike individual health behaviors (diet, exercise), air quality is a collective problem requiring structural solutions.**
## Data Source Details
### Authority
- **Organization:** U.S. Environmental Protection Agency (EPA)
- **Office:** Office of Air Quality Planning and Standards (OAQPS)
- **Legal Mandate:** Clean Air Act (1970, amended 1990)
- **Data Quality:** Federal Reference/Equivalent Methods (FRM/FEM) — regulatory-grade
- **Established:** 1971 (50+ years of air quality monitoring)
### Coverage
- **Geographic:** United States (50 states, DC, territories)
- **Temporal:** 1980-present (45+ years of validated data)
- **Granularity:** Monitoring site level (latitude/longitude)
- **Network Size:** 4,000+ active monitoring stations
- **Update Frequency:** Continuous monitoring; 6-12 month validation lag for finalized data
### Key Parameters (Health Priority)
| Code | Parameter | Health Impact | Priority |
|------|-----------|---------------|----------|
| **88101** | **PM2.5** | Mortality, cardiovascular disease, respiratory disease, cognitive decline, reduced life expectancy | **CRITICAL** |
| **44201** | **Ozone (O3)** | Respiratory irritant, asthma exacerbation, lung damage | **HIGH** |
| 42401 | SO2 | Respiratory irritant | Medium |
| 42101 | CO | Cardiovascular stress | Medium |
| 42602 | NO2 | Respiratory irritant, ozone precursor | Medium |
| 81102 | PM10 | Respiratory health | Medium |
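
The parameter names used on the command line later in this README (`PM25`, `OZONE`, and so on) presumably correspond to the AQS codes in the table above. A minimal sketch of such a lookup follows; `AQS_PARAMETER_CODES` and `resolveParameterCodes` are hypothetical names, and `update.ts` may organize this differently.

```typescript
// Hypothetical lookup table: the codes come from the table above, and the
// keys mirror the --parameters values shown elsewhere in this README.
const AQS_PARAMETER_CODES = {
  PM25: "88101",
  OZONE: "44201",
  SO2: "42401",
  CO: "42101",
  NO2: "42602",
  PM10: "81102",
} as const;

type AqsParameterName = keyof typeof AQS_PARAMETER_CODES;

// Turn a comma-separated list such as "PM25,OZONE" into AQS codes,
// rejecting unknown names instead of silently skipping them.
function resolveParameterCodes(list: string): string[] {
  return list.split(",").map((raw) => {
    const name = raw.trim().toUpperCase();
    if (!Object.prototype.hasOwnProperty.call(AQS_PARAMETER_CODES, name)) {
      throw new Error(`Unknown parameter name: ${raw}`);
    }
    return AQS_PARAMETER_CODES[name as AqsParameterName];
  });
}
```

Failing loudly on an unknown name avoids spending a rate-limited request on a parameter that was mistyped.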
## Repository Structure
```
DS-00008—EPA_Air_Quality_System/
├── README.md # This file (overview and usage guide)
├── source.md # Comprehensive cataloging (authority, methodology, limitations)
├── update.ts # TypeScript data fetcher with rate limiting
├── .env.example # Environment variable template (API credentials)
├── .gitignore # Git ignore patterns (protects API keys, data files)
└── data/ # Air quality data (JSON files)
    └── README.md # Data structure documentation
```
## Quick Start
### Prerequisites
- **Bun** (JavaScript runtime): https://bun.sh/
- **EPA AQS API Key** (free, immediate approval)
### 1. Register for API Access
**Option A: Email Registration**
```
To: aqs.support@epa.gov
Subject: AQS API Access Request
Body: Please provide API key for email: your_email@example.com
```
**Option B: Automated Signup**
```bash
curl "https://aqs.epa.gov/data/api/signup?email=your_email@example.com"
```
You will receive your API key via email (typically within minutes).
### 2. Configure Environment Variables
```bash
# Copy example environment file
cp .env.example .env
# Edit .env with your credentials
# Replace your_email@example.com and your_api_key_here
nano .env
```
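
The variable names defined in `.env.example` are not shown in this README. The sketch below assumes two hypothetical names, `AQS_EMAIL` and `AQS_API_KEY`, and shows one way a script could validate credentials before sending any request; check `.env.example` for the real names.

```typescript
// AQS_EMAIL and AQS_API_KEY are assumed names, not confirmed by this README.
interface AqsCredentials {
  email: string;
  key: string;
}

// Validate credentials from an environment-like object so that a missing
// or placeholder value fails fast, before any API call is made.
function loadAqsCredentials(env: Record<string, string | undefined>): AqsCredentials {
  const email = env["AQS_EMAIL"]?.trim();
  const key = env["AQS_API_KEY"]?.trim();
  if (!email || !key) {
    throw new Error("Set AQS_EMAIL and AQS_API_KEY in .env (names assumed here)");
  }
  if (email === "your_email@example.com" || key === "your_api_key_here") {
    throw new Error("Replace the placeholder values copied from .env.example");
  }
  return { email, key };
}
```

Bun reads `.env` into `process.env` at startup, so a call such as `loadAqsCredentials(process.env)` should work without an extra dotenv dependency.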
### 3. Fetch Air Quality Data
**Default: Fetch PM2.5 and Ozone for California (last year)**
```bash
bun update.ts
```
**Custom: Specify year, states, parameters**
```bash
# Multiple states, specific year
bun update.ts --year 2023 --states CA,NY,TX
# Focus on PM2.5 only (most health-critical)
bun update.ts --year 2023 --states CA --parameters PM25
# Full criteria pollutants
bun update.ts --year 2023 --states CA,NY,TX,FL --parameters PM25,OZONE,SO2,CO,NO2,PM10
```
**Get help**
```bash
bun update.ts --help
```
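
The internals of `update.ts` are not reproduced here. For orientation, the AQS Data API exposes a `dailyData/byState` endpoint that takes `email`, `key`, `param`, `bdate`, `edate`, and `state` (a two-digit FIPS code). The sketch below builds such a URL; `buildDailyDataUrl` and its input type are hypothetical helpers, not necessarily what the script does.

```typescript
// Hypothetical helper; update.ts may build its requests differently.
interface DailyByStateQuery {
  email: string;
  key: string;
  parameterCode: string; // e.g. "88101" for PM2.5
  stateFips: string; // two-digit FIPS code, e.g. "06" for California
  year: number;
}

// Build a full-calendar-year request URL for daily summary data in one state.
function buildDailyDataUrl(q: DailyByStateQuery): string {
  const url = new URL("https://aqs.epa.gov/data/api/dailyData/byState");
  url.search = new URLSearchParams({
    email: q.email,
    key: q.key,
    param: q.parameterCode,
    bdate: `${q.year}0101`, // begin date, YYYYMMDD
    edate: `${q.year}1231`, // end date, kept within the same calendar year
    state: q.stateFips,
  }).toString();
  return url.toString();
}
```

Note that the API expects FIPS codes rather than postal abbreviations, so a value like `CA` from `--states` would need to be translated to `06` somewhere before the request is made.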
### 4. View Results
Data files are saved in the `data/` directory:
```bash
ls -lh data/
# aqs_2023_CA_2025-10-27.json
# aqs_2023_CA_stats_2025-10-27.json
```
## API Rate Limits (CRITICAL)
**EPA enforces strict rate limits:**
- ⚠️ **10 requests per minute** (HARD LIMIT)
- ⚠️ **Account suspension if violated**
**The update.ts script automatically enforces 6-second delays between requests.**
**Do NOT bypass rate limiting.** EPA will suspend your account.
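
The 6-second figure follows from the limit: 60 seconds / 10 requests = 6 seconds between request starts. The sketch below shows one way to enforce that spacing when writing additional fetch code; `delayBeforeNext` and `runRateLimited` are hypothetical names, and the actual mechanism in `update.ts` may differ.

```typescript
const MIN_INTERVAL_MS = 6000; // 60 s / 10 requests per minute

// Pure helper: how long to wait before the next request may start.
function delayBeforeNext(
  lastStartMs: number | null,
  nowMs: number,
  minIntervalMs: number = MIN_INTERVAL_MS
): number {
  if (lastStartMs === null) return 0; // first request goes out immediately
  return Math.max(0, lastStartMs + minIntervalMs - nowMs);
}

const sleep = (ms: number): Promise<void> =>
  new Promise<void>((resolve) => setTimeout(resolve, ms));

// Run tasks strictly one after another, spacing their start times so that
// no more than one request begins per interval.
async function runRateLimited<T>(
  tasks: Array<() => Promise<T>>,
  minIntervalMs: number = MIN_INTERVAL_MS
): Promise<T[]> {
  const results: T[] = [];
  let lastStart: number | null = null;
  for (const task of tasks) {
    const wait = delayBeforeNext(lastStart, Date.now(), minIntervalMs);
    if (wait > 0) await sleep(wait);
    lastStart = Date.now();
    results.push(await task());
  }
  return results;
}
```

Running requests sequentially, rather than in parallel with a shared timer, keeps the request count easy to reason about: a slow response only widens the gap and never narrows it.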
## Data Validation Lag
- **Real-time to preliminary:** <1 hour (via AirNow API)
- **Preliminary to validated:** 6-12 months (quality assurance)
- **AQS finalized data:** 6-12 months after collection
**For real-time air quality, use AirNow API instead:** https://www.airnow.gov/
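
When an analysis mixes recent and historical dates, the lag above can be checked mechanically. The following is a minimal sketch with a hypothetical helper, `isLikelyFinalized`; it conservatively assumes the upper end of the 6-12 month window and says nothing about the status of any particular record.

```typescript
// Conservative assumption: treat data as finalized only once the upper end
// of the 6-12 month validation window has passed.
const VALIDATION_LAG_MONTHS = 12;

// Returns true when `collected` is at least `lagMonths` before `now`.
function isLikelyFinalized(
  collected: Date,
  now: Date,
  lagMonths: number = VALIDATION_LAG_MONTHS
): boolean {
  const cutoff = new Date(now.getTime());
  // setUTCMonth accepts negative values and rolls the year back as needed.
  cutoff.setUTCMonth(cutoff.getUTCMonth() - lagMonths);
  return collected.getTime() <= cutoff.getTime();
}
```

Dates that fail this check are better served by AirNow, or should at least be flagged as preliminary in any output.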
## Environmental Health Context
### Why Air Quality is a Structural Wellbeing Determinant
1. **Involuntary Exposure**
   - You breathe ~20,000 times per day
   - Cannot avoid ambient air pollution without relocating
   - Relocation requires economic resources (not "personal choice")
2. **Life Expectancy Impact**
   - PM2.5 reduces longevity by months to years in polluted areas
   - Comparable to the life expectancy loss from smoking in highly polluted regions
   - Measurable, quantifiable health burden
3. **Environmental Injustice**
   - Low-income communities disproportionately exposed (NEJM 2021)
   - Communities of color exposed to higher pollution even controlling for income
   - Proximity to highways, industrial facilities, ports (structural inequality)
   - **Monitoring gap:** Low-income communities historically undermonitored (data invisibility → policy neglect)
4. **Health Equity**
   - Cardiovascular disease: PM2.5 linked to stroke, heart attack, atherosclerosis
   - Respiratory disease: Asthma, COPD, lung cancer (IARC Group 1 carcinogen)
   - Cognitive decline: Dementia, Alzheimer's, childhood cognitive impairment
   - Pregnancy outcomes: Low birth weight, preterm birth
5. **Quality of Life**
   - Outdoor activity restrictions on high pollution days
   - Healthcare costs (emergency visits, hospitalizations)
   - Lost work/school days (respiratory illness)
   - Mental health impacts (environmental degradation stress)
**You cannot "self-care" your way out of this. It requires collective action, policy change, and structural intervention.**
## Use Cases
### 1. Environmental Justice Research
**Research Question:** Which communities are disproportionately exposed to PM2.5?
```bash
# Fetch PM2.5 data for multiple states
bun update.ts --year 2023 --states CA,NY,TX,IL --parameters PM25
# Cross-reference with Census demographic data (DS-00006)
# Identify exposure disparities by race, income, ZIP code
```
### 2. Life Expectancy Modeling
**Research Question:** How does PM2.5 exposure impact life expectancy across U.S. counties?
```bash
# Fetch PM2.5 data for all states (repeat with other --year values to build a multi-year series)
bun update.ts --year 2023 --states ALL --parameters PM25
# Link to CDC mortality data (DS-00005)
# Calculate life expectancy impact using AQLI conversion factors
# (1 µg/m³ PM2.5 increase = ~0.1 year life expectancy loss)
```
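
The conversion in the comment above is simple arithmetic: excess concentration over a reference level, multiplied by the per-unit factor. The sketch below uses the approximate factor quoted above (AQLI's published figure is 0.98 years per 10 µg/m³); `estimateYearsLost` is a hypothetical helper, and the reference level is left as a parameter because the appropriate baseline depends on the analysis.

```typescript
// Approximate factor quoted above: ~0.1 year lost per 1 µg/m³ of PM2.5.
const YEARS_LOST_PER_UG_M3 = 0.1;

// Estimate life expectancy loss from sustained exposure above a reference
// concentration. Both inputs are annual mean PM2.5 in µg/m³.
function estimateYearsLost(annualMeanPm25: number, referencePm25: number): number {
  // Concentrations at or below the reference contribute no loss.
  const excess = Math.max(0, annualMeanPm25 - referencePm25);
  return excess * YEARS_LOST_PER_UG_M3;
}
```

For instance, a county with an annual mean of 12 µg/m³ measured against a reference of 5 µg/m³ (the WHO 2021 annual guideline, used here only as an example baseline) has 7 µg/m³ of excess exposure, or roughly 0.7 years.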
### 3. Policy Evaluation
**Research Question:** Did Clean Air Act regulations reduce ozone levels?
```bash
# Fetch historical data (multiple years)
bun update.ts --year 2020 --states CA --parameters OZONE
bun update.ts --year 2015 --states CA --parameters OZONE
bun update.ts --year 2010 --states CA --parameters OZONE
# Analyze trends over time
# Evaluate regulatory effectiveness
```
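
Once annual means have been computed from the fetched files, "analyze trends over time" can start with an ordinary least-squares slope. The sketch below assumes a hypothetical `AnnualMean` shape; the actual JSON layout is described in `data/README.md` and is not reproduced here.

```typescript
// Hypothetical shape for one year's summary value (e.g. mean ozone in ppm).
interface AnnualMean {
  year: number;
  mean: number;
}

// Ordinary least-squares slope of mean against year,
// in concentration units per year. Negative values indicate a decline.
function trendPerYear(series: AnnualMean[]): number {
  if (series.length < 2) throw new Error("Need at least two years of data");
  const n = series.length;
  const meanX = series.reduce((sum, p) => sum + p.year, 0) / n;
  const meanY = series.reduce((sum, p) => sum + p.mean, 0) / n;
  let sxy = 0;
  let sxx = 0;
  for (const p of series) {
    const dx = p.year - meanX;
    sxy += dx * (p.mean - meanY);
    sxx += dx * dx;
  }
  if (sxx === 0) throw new Error("All points share the same year");
  return sxy / sxx;
}
```

A negative slope is consistent with regulatory effectiveness but does not establish it; weather, economic activity, and changes in the monitoring network can all move the trend.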
### 4. Health Impact Assessment
**Research Question:** What are the health costs of air pollution in California?
```bash
# Fetch PM2.5 and Ozone
bun update.ts --year 2023 --states CA --parameters PM25,OZONE
# Link to health outcomes data (hospitalizations, mortality)
# Calculate attributable burden using EPA BenMAP tools
```
## Known Limitations
### Coverage Gaps
- **Urban bias:** 85% of monitors in metropolitan areas; rural areas undermonitored
- **Environmental justice monitoring gap:** Low-income communities historically excluded
- **Tribal lands:** Limited tribal monitoring (improving)
- **Territories:** Limited coverage in Puerto Rico, U.S. Virgin Islands
### Methodological Limitations
- **Point measurements:** Monitors represent ~1-10 km radius (not every location monitored)
- **24-hour averages for PM:** Daily averages mask hour-to-hour variability
- **Spatial scale mismatch:** Within-neighborhood gradients missed
- **Indoor air quality:** Not measured (people spend 90% of time indoors)
### Temporal Limitations
- **6-12 month validation lag:** Not suitable for real-time analysis (use AirNow API)
- **Historical data:** Digital records begin 1980 (pre-1980 limited)
### Inappropriate Uses
1. **DO NOT use for real-time alerts** → Use AirNow API
2. **DO NOT use for individual exposure** → Use personal monitors, exposure modeling
3. **DO NOT assume unmonitored = clean** → Absence of data ≠ absence of pollution
4. **DO NOT ignore monitoring gaps** → Undermonitoring = data invisibility
## Related Data Sources
| Source | Relationship | Use Case |
|--------|--------------|----------|
| **DS-00005** — CDC WONDER Mortality | Health outcomes | Air pollution-attributable deaths |
| **DS-00006** — Census ACS Social Wellbeing | Demographics | Environmental justice analysis |
| **DS-00001** — WHO Global Health Observatory | Global context | International air quality comparisons |
| **DS-00003** — World Bank Open Data | Economic indicators | Air quality and economic development |
## External Resources
### Official Documentation
- **EPA AQS Homepage:** https://aqs.epa.gov/
- **API Documentation:** https://aqs.epa.gov/aqsweb/documents/data_api.html
- **40 CFR Part 58 (Monitoring Requirements):** https://www.ecfr.gov/current/title-40/chapter-I/subchapter-C/part-58
### Research & Analysis Tools
- **Air Quality Life Index (AQLI):** https://aqli.epic.uchicago.edu/
- **EPA BenMAP (Health Impact Assessment):** https://www.epa.gov/benmap
- **AirNow (Real-time Data):** https://www.airnow.gov/
### Key Research
- **Harvard Six Cities Study:** Seminal air pollution epidemiology (PM2.5 and mortality)
- **American Cancer Society CPS-II:** Air pollution and life expectancy
- **Environmental Justice Literature:** Exposure disparities by race, income (NEJM 2021)
## Citation
**APA 7th:**
```
U.S. Environmental Protection Agency. (2025). Air Quality System (AQS).
https://aqs.epa.gov/aqsweb/
```
**Data Citation (Specific):**
```
U.S. Environmental Protection Agency. (2024). "PM2.5 Daily Average Concentrations,
2020-2023" [Parameter Code: 88101]. Air Quality System.
https://aqs.epa.gov/aqsweb/. Accessed October 27, 2025.
```
## Contributing
### Report Issues
- Data quality concerns: aqs.support@epa.gov
- Script bugs/improvements: Create issue in Substrate repository
### Extend Functionality
Contributions welcome:
- Additional data processing utilities
- Integration with Census demographic data
- Environmental justice analysis tools
- Visualization dashboards
## License
**Data:** Public Domain (U.S. Government Work) — CC0 1.0 Universal
**Code:** (Inherit from Substrate project license)
## Contact
**Data Source Cataloger:** DM-001
**Created:** 2025-10-27
**Last Updated:** 2025-10-27
**Status:** Reviewed
---
**Remember:** Air quality is not an individual choice — it's a structural determinant of wellbeing. This data enables us to measure environmental injustice, evaluate policy effectiveness, and advocate for cleaner air as a human right.