Files
Substrate/Data/US-GDP/README.md
Daniel Miessler 3d700a33a8 Add U.S. GDP dataset with comprehensive documentation (1929-2025)
Added authoritative U.S. GDP data from FRED/BEA covering 96 years of economic history:
- Real GDP quarterly data: Q1 1947 - Q2 2025 (314 data points)
- Real GDP annual data: 1929 - 2024 (96 data points)

Includes comprehensive research-backed documentation:
- Complete source verification (10 parallel research agents, 95%+ confidence)
- Full methodology documentation and data quality standards
- Download instructions, APIs, and update procedures
- Historical context from Great Depression through COVID-19

Updated main README with Data directory documentation and dataset comparison table.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-16 21:44:42 -07:00

241 lines
10 KiB
Markdown
Raw Permalink Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
# U.S. Gross Domestic Product (GDP) Dataset
## Overview
This directory contains authoritative U.S. GDP data from the Federal Reserve Economic Data (FRED) system, which sources directly from the Bureau of Economic Analysis (BEA). GDP is the primary measure of economic activity, representing the total value of all goods and services produced within the United States.
## What's Inside
- **Real-GDP-Quarterly-1947-2025.csv** - Quarterly real GDP data (314 data points, Q1 1947 - Q2 2025)
- **Real-GDP-Annual-1929-2024.csv** - Annual real GDP data (96 data points, 1929-2024)
- **README.md** - This file
- **UPDATES.md** - Change log for data updates
- **RESOURCES.md** - Data sources and access information
- **US-GDP-1929-2025.md** - Detailed metadata and research documentation
## Data Source Research
### How This Source Was Identified
Comprehensive parallel research across 10 specialized research agents evaluated:
1. Bureau of Economic Analysis (BEA) official data
2. FRED (Federal Reserve Economic Data) accessibility
3. International sources (World Bank, IMF, OECD)
4. Historical data availability and coverage
5. Data formats, APIs, and download methods
6. Measurement methodologies and quality standards
7. Quarterly vs annual data preferences
8. Nominal vs real GDP considerations
### Primary Source Selected: **FRED (Federal Reserve Economic Data)**
**URLs:**
- Real GDP Quarterly (GDPC1): https://fred.stlouisfed.org/series/GDPC1
- Real GDP Annual (GDPCA): https://fred.stlouisfed.org/series/GDPCA
- Direct CSV Downloads: https://fred.stlouisfed.org/graph/fredgraph.csv?id=GDPC1
**Why FRED:**
- Sources directly from BEA (official U.S. government data)
- Easy CSV downloads without API complexity
- Well-maintained with automatic updates
- Enhanced data visualization tools
- Same authoritative data as BEA but more accessible
**Ultimate Authority:** U.S. Bureau of Economic Analysis (BEA)
- Official source from U.S. Department of Commerce
- Established GDP measurement methodology
- Three-stage quarterly revision process
- Annual comprehensive updates
## Why This Source Is Reputable
### Authority & Credibility
1. **Official Government Source**
- BEA is the principal federal agency for U.S. economic statistics
- Part of U.S. Department of Commerce
- Real GDP is the primary indicator of economic activity used by Federal Reserve for monetary policy
- Used by government agencies, economists, and policymakers worldwide
2. **FRED as Trusted Aggregator**
- Maintained by Federal Reserve Bank of St. Louis
- Official Federal Reserve System resource
- Aggregates 841,000+ time series from 118 sources
- Direct pipeline from BEA with no data manipulation
- Always displays most recent official revisions
3. **Scientific Rigor**
- Comprehensive three-stage quarterly revision process:
- Advance estimate (~30 days after quarter end)
- Second estimate (~60 days after quarter end)
- Third estimate (~90 days after quarter end)
- Annual comprehensive updates each September (revises 5+ years)
- Uses National Income and Product Accounts (NIPA) framework
- Transparent methodology documented in BEA handbooks
4. **Transparency**
- Public domain data (U.S. government)
- Complete methodology documentation available
- Regular updates following strict release schedule (8:30 AM ET)
- Historical revisions documented and tracked
- Data vintages preserved for retrospective analysis
5. **Reliability Indicators**
- **Temporal Consistency**: Quarterly data since 1947, annual since 1929
- **Update Discipline**: Three releases per quarter plus annual comprehensive update
- **Cross-Validation**: Widely cited and validated across economic research
- **Policy Usage**: Federal Reserve, Treasury, Congressional Budget Office, academic institutions
6. **International Recognition**
- World Bank, IMF, and OECD source U.S. GDP from BEA
- Global economists prefer direct BEA/FRED data over international aggregators
- Considered gold standard for U.S. economic measurement
## Dataset Specifications
### Coverage
**Quarterly Data (GDPC1):**
- **Geographic**: United States
- **Temporal**: Q1 1947 - Q2 2025 (78+ years, 314 quarters)
- **Frequency**: Quarterly
- **Latest**: Q2 2025: $23,770.976 billion (chained 2017 dollars)
**Annual Data (GDPCA):**
- **Geographic**: United States
- **Temporal**: 1929 - 2024 (96 years)
- **Frequency**: Annual
- **Latest**: 2024: $23,358.435 billion (chained 2017 dollars)
### Metrics
- **Measurement**: Real Gross Domestic Product (inflation-adjusted)
- **Format**: Chained 2017 dollars (billions)
- **Adjustment**: Seasonally adjusted (quarterly), not seasonally adjusted (annual)
- **Type**: Real GDP (not nominal) - preferred for economic analysis and historical comparisons
### Data Quality
- **Completeness**: 100% coverage (314 quarters, 96 years)
- **Reliability**: Highest (official government economic statistic)
- **Timeliness**: Three releases per quarter with progressive data refinement
- **Accessibility**: Direct CSV download, no authentication required
## Real vs Nominal GDP
This dataset uses **Real GDP** (inflation-adjusted) rather than nominal GDP:
**Why Real GDP:**
- Enables meaningful comparisons across time periods
- Removes distortion from price changes
- BEA features real GDP growth as key economic activity metric
- Preferred by economists and policymakers for trend analysis
- Better for understanding actual economic output changes
**Nominal GDP** (current dollars) is used when:
- Calculating current economic values
- Analyzing price level changes
- Computing GDP deflator
Both real and nominal have identical availability (quarterly 1947+, annual 1929+), but real GDP is the standard for economic analysis.
## Extended Historical Coverage
For analysis requiring data before 1929:
**MeasuringWorth (Johnston & Williamson)**
- **Coverage**: 1790 to present
- **URL**: https://www.measuringworth.com/datasets/usgdp/
- **Note**: Pre-1929 data uses retrospective estimates by economic historians
- Links historical estimates to modern BEA data for continuity
- Based on benchmark years with interpolation
- Less suitable for sophisticated time series analysis but valuable for long-term context
## Current Economic Context (as of 2025-10-16)
- **Latest Quarterly Reading**: Q2 2025: $23,770.976 billion (real)
- **Recent Growth**: Q2 2025 showed 3.8% annualized growth
- **Historical Perspective**:
- 1929 (earliest): $1,191.124 billion (real, chained 2017 dollars)
- 2024: $23,358.435 billion (real, chained 2017 dollars)
- Total growth: ~19.6x over 95 years (~3.1% compound annual growth rate)
## Key Economic Periods in Dataset
### Major Episodes
- **Great Depression** (1929-1933): GDP fell from $1,191B to $877B (-26%)
- **Post-WWII Boom** (1945-1973): Sustained growth averaging ~4% annually
- **Stagflation Era** (1970s-1980s): Slower growth, high inflation
- **Great Moderation** (1990s-2007): Stable growth around 3% annually
- **Great Recession** (2008-2009): GDP declined ~4.3%
- **COVID-19 Pandemic** (2020): Q2 2020 saw historic 31.4% annualized decline (largest on record)
- **Post-Pandemic Recovery** (2021-2023): Strong rebound with 5.8% growth in 2021
- **Current Period** (2024-2025): Moderate growth around 2-3% annually
## Use Cases
This dataset supports:
- **Economic Research**: Historical GDP analysis, growth modeling, business cycle identification
- **Policy Analysis**: Evaluating fiscal and monetary policy effectiveness
- **Financial Modeling**: Economic forecasting, scenario analysis, risk assessment
- **Academic Studies**: Macroeconomic research, econometrics, economic history
- **Business Planning**: Market sizing, demand forecasting, strategic planning
- **Comparative Analysis**: Cross-period economic performance evaluation
- **Substrate Integration**: Supporting Claims, Arguments, Economic Models, Plans with authoritative data
## Data Interpretation Notes
1. **Real vs Nominal**:
- This dataset uses real GDP (chained 2017 dollars)
- Real GDP removes inflation effects, enabling valid comparisons across time
- To convert to nominal GDP, multiply by GDP deflator and divide by 100
2. **Growth Rate Calculation**:
- Quarter-over-quarter: ((GDP_current - GDP_previous) / GDP_previous) × 100
- Year-over-year: ((GDP_current - GDP_1year_ago) / GDP_1year_ago) × 100
- Annualized quarterly growth: Quarter-over-quarter growth × 4
3. **Seasonally Adjusted (Quarterly)**:
- Quarterly data is seasonally adjusted to remove regular patterns
- Allows for cleaner trend identification and period comparisons
- Annual data is not seasonally adjusted (seasonal effects average out over year)
4. **Business Cycle Dating**:
- Two consecutive quarters of negative real GDP growth = technical recession
- NBER (National Bureau of Economic Research) provides official recession dating
- GDP is primary but not sole indicator used for recession determination
5. **Chained Dollars Method**:
- Uses 2017 as base year reference
- "Chained" method better accounts for changes in spending patterns over time
- More accurate for long-term comparisons than fixed-weight indexes
## Maintenance
See **UPDATES.md** for detailed change log of data refreshes and updates.
**Update Schedule:**
- **Quarterly Updates**: After BEA's third estimate release (~3 months after quarter end)
- **Annual Comprehensive**: After BEA's September annual update
- **As Needed**: For methodological revisions or historical corrections
**Next Recommended Update:** After Q3 2025 third estimate (expected late January 2026)
## Comparison with Other Economic Indicators
GDP should be considered alongside:
- **Employment Data**: Labor market strength indicator
- **Inflation (CPI)**: Price level changes (see Substrate's US-Inflation dataset)
- **Industrial Production**: Manufacturing and goods production
- **Consumer Spending**: Largest component of GDP (~70%)
- **Business Investment**: Capital formation and expansion
---
**Last Updated**: 2025-10-16
**Maintained By**: Substrate Data Curation
**Update Frequency**: Quarterly (following BEA's third estimate release)
**Research Date**: 2025-10-16 (10 parallel research agents, 20 queries, 95%+ confidence)