787 lines
40 KiB
Markdown
787 lines
40 KiB
Markdown
```markdown
|
||
# CDC WONDER Mortality Database
|
||
|
||
**Source ID:** DS-00005
|
||
**Record Created:** 2025-10-27
|
||
**Last Updated:** 2025-10-27
|
||
**Cataloger:** DM-001
|
||
**Review Status:** Reviewed
|
||
|
||
---
|
||
|
||
## Bibliographic Information
|
||
|
||
### Title Statement
|
||
- **Main Title:** Wide-ranging ONline Data for Epidemiologic Research (WONDER) - Mortality Database
|
||
- **Subtitle:** Comprehensive US Mortality Statistics with Crisis Indicators
|
||
- **Abbreviated Title:** CDC WONDER Mortality
|
||
- **Variant Titles:** CDC WONDER, WONDER System, National Vital Statistics System (NVSS) Mortality
|
||
|
||
### Responsibility Statement
|
||
- **Publisher/Issuing Body:** Centers for Disease Control and Prevention
|
||
- **Department/Division:** National Center for Health Statistics (NCHS)
|
||
- **Contributors:** State vital registration systems, US Census Bureau
|
||
- **Contact Information:** wonder@cdc.gov
|
||
|
||
### Publication Information
|
||
- **Place of Publication:** Hyattsville, Maryland, USA
|
||
- **Date of First Publication:** 1999 (WONDER System); ICD-10 mortality data 1999-present
|
||
- **Publication Frequency:** Continuous (API), Annual data releases with 1-2 year lag
|
||
- **Current Status:** Active
|
||
|
||
### Edition/Version Information
|
||
- **Current Version:** ICD-10 (1999-present)
|
||
- **Version History:** ICD-9 (1979-1998), ICD-10 (1999-present), ICD-11 transition planned
|
||
- **Versioning Scheme:** Follows International Classification of Diseases (ICD) revisions
|
||
|
||
---
|
||
|
||
## Authority Statement
|
||
|
||
### Organizational Authority
|
||
|
||
**Issuing Organization Analysis:**
|
||
- **Official Name:** Centers for Disease Control and Prevention (CDC)
|
||
- **Type:** US Federal Government Agency
|
||
- **Established:** 1946-07-01 (as Communicable Disease Center)
|
||
- **Mandate:** Public Health Service Act (42 U.S.C. §241) - authority to collect and analyze vital statistics
|
||
- **Parent Organization:** US Department of Health and Human Services
|
||
- **Governance Structure:** CDC Director appointed by HHS Secretary, Congressional oversight
|
||
|
||
**Domain Authority:**
|
||
- **Subject Expertise:** Premier US public health agency; 75+ years of vital statistics collection
|
||
- **Recognition:** Gold standard for US mortality data; legal authority under PHSA
|
||
- **Publication History:** National Vital Statistics Reports (continuous since 1946), WONDER system (1999-present)
|
||
- **Peer Recognition:** 1,000,000+ citations in academic literature; CDC NCHS is authoritative source for US vital statistics
|
||
|
||
**Quality Oversight:**
|
||
- **Peer Review:** National Committee on Vital and Health Statistics (NCVHS) provides oversight
|
||
- **Editorial Board:** NCHS Office of Analysis and Epidemiology
|
||
- **Scientific Committee:** CDC/NCHS Board of Scientific Counselors
|
||
- **External Audit:** GAO audits federal data systems; OMB compliance reviews
|
||
- **Certification:** Complies with OMB Statistical Policy Directive No. 1; CIPSEA protections
|
||
|
||
**Independence Assessment:**
|
||
- **Funding Model:** Federal appropriations (direct Congressional funding)
|
||
- **Political Independence:** Protected under Federal statistical system rules; scientific integrity policy
|
||
- **Commercial Interests:** No commercial interests; public service mission
|
||
- **Transparency:** Public data access mandated by law; methods fully documented
|
||
|
||
### Data Authority
|
||
|
||
**Provenance Classification:**
|
||
- **Source Type:** Secondary (aggregates state vital registration data)
|
||
- **Data Origin:** State vital registration offices submit death certificates to NCHS
|
||
- **Chain of Custody:** Death event → Medical certifier → State vital records office → NCHS → Quality assurance → Publication
|
||
|
||
**Secondary Source Characteristics:**
|
||
- Aggregates data from all 50 states, DC, and US territories
|
||
- Standardizes definitions across jurisdictions
|
||
- Applies statistical methods for comparability
|
||
- Conducts extensive quality control and consistency checks
|
||
- Value added: National completeness, standardized coding, long time series, research-ready formats
|
||
|
||
---
|
||
|
||
## Scope Note
|
||
|
||
### Content Description
|
||
|
||
**Subject Coverage:**
|
||
- **Primary Subjects:** Mortality Statistics, Cause of Death, Vital Statistics, Drug Overdoses, Suicide, Public Health Surveillance
|
||
- **Secondary Subjects:** Behavioral Health Crises, Occupational Mortality, Injury Epidemiology, Premature Death
|
||
- **Subject Classification:**
|
||
- LC: RA (Public Health), HV (Social Pathology)
|
||
- Dewey: 614.1 (Forensic Medicine, Mortality), 362.29 (Substance Abuse)
|
||
- **Keywords:** Drug overdose deaths, opioid epidemic, suicide rates, mortality rates, ICD-10 codes, cause of death, deaths of despair, behavioral health crisis indicators
|
||
|
||
**Geographic Coverage:**
|
||
- **Spatial Scope:** United States national data
|
||
- **Countries/Regions Included:** All 50 US states, District of Columbia, Puerto Rico, US territories
|
||
- **Geographic Granularity:** National, state, county level (county data subject to suppression rules)
|
||
- **Coverage Completeness:** ~100% (census of deaths, not sample); all deaths legally required to be registered
|
||
- **Notable Exclusions:** US citizens dying abroad not consistently captured
|
||
|
||
**Temporal Coverage:**
|
||
- **Start Date:** 1999-01-01 (ICD-10 era; ICD-9 data 1979-1998 in separate database)
|
||
- **End Date:** Present (most recent: 2023 provisional data; final 2022 data as of 2024)
|
||
- **Historical Depth:** 25+ years (ICD-10 era); 45+ years (including ICD-9)
|
||
- **Frequency of Observations:** Daily deaths aggregated to annual releases; provisional monthly/quarterly releases
|
||
- **Temporal Granularity:** Annual (final data); monthly (provisional data)
|
||
- **Time Series Continuity:** Excellent continuity within ICD-10 era (1999+); series break at ICD-9/ICD-10 transition
|
||
|
||
**Population/Cases Covered:**
|
||
- **Target Population:** All deaths occurring in the United States
|
||
- **Inclusion Criteria:** All deaths of US residents + non-residents dying in US; legally required registration
|
||
- **Exclusion Criteria:** Fetal deaths (separate database), US citizens dying abroad (usually not included)
|
||
- **Coverage Rate:** ~100% - universal death registration required by law; estimated 99%+ completeness
|
||
- **Sample vs. Census:** Census (complete enumeration, not sample)
|
||
|
||
**Variables/Indicators:**
|
||
- **Number of Variables:** 100+ variables per death record
|
||
- **Core Indicators:**
|
||
- All-cause mortality rates (crude, age-adjusted)
|
||
- Cause-specific mortality (ICD-10 codes: 113 selected causes + detailed subcategories)
|
||
- Drug overdose deaths (X40-X44, X60-X64, X85, Y10-Y14)
|
||
- Opioid-specific deaths (T40.0-T40.4, T40.6)
|
||
- Suicide deaths (X60-X84, Y87.0, U03)
|
||
- Alcohol-induced deaths (E24.4, G31.2, G62.1, G72.1, I42.6, K29.2, K70, K85.2, K86.0, R78.0, X45, X65, Y15)
|
||
- Years of Potential Life Lost (YPLL)
|
||
- Age-specific mortality rates (10-year age groups)
|
||
- **Derived Variables:** Age-adjusted rates, YPLL before age 75, crude rates per 100,000
|
||
- **Data Dictionary Available:** Yes - https://wonder.cdc.gov/wonder/help/ucd.html
|
||
|
||
### Content Boundaries
|
||
|
||
**What This Source IS:**
|
||
- Authoritative source for US mortality statistics (legal authority)
|
||
- Best source for "deaths of despair" - drug overdoses, suicides, alcohol-related deaths
|
||
- Census data (complete enumeration, not sample)
|
||
- Leading indicator of population wellbeing breakdown (behavioral revealed preference)
|
||
- County-level granularity shows geographic variation in health crises
|
||
|
||
**What This Source IS NOT:**
|
||
- NOT real-time surveillance (1-2 year lag for final data; months for provisional)
|
||
- NOT individual-level microdata (aggregated to protect privacy; individual records require restricted use agreement)
|
||
- NOT international data (US only)
|
||
- NOT nonfatal outcomes (deaths only; injury/morbidity in separate systems)
|
||
|
||
**Comparison with Similar Sources:**
|
||
|
||
| Source | Advantages Over CDC WONDER | Disadvantages vs. CDC WONDER |
|
||
|--------|---------------------------|------------------------------|
|
||
| State Vital Statistics | More timely (6-12 month lag vs. 1-2 years); may have additional state-specific variables | Single state only; interstate comparisons require standardization; state definitions may vary |
|
||
| WHO Mortality Database | International coverage; standardized for cross-country comparison | US data less timely than CDC WONDER; less detailed cause-of-death coding; no county-level data |
|
||
| Surveillance, Epidemiology, and End Results (SEER) | Cancer-specific detail; treatment data; survival analysis | Cancer only; limited to SEER registry areas (~48% of US population) |
|
||
| National Violent Death Reporting System (NVDRS) | Detailed incident circumstances for violent deaths (suicide, homicide, overdose) | Limited geographic coverage (not all states); smaller sample; more recent history (2003+) |
|
||
|
||
---
|
||
|
||
## Access Conditions
|
||
|
||
### Technical Access
|
||
|
||
**API Information:**
|
||
- **Endpoint URL:** https://wonder.cdc.gov/controller/datarequest/
|
||
- **API Type:** XML-based POST request/response
|
||
- **API Version:** Current (no formal versioning; backwards compatible)
|
||
- **OpenAPI/Swagger Spec:** Not available (documented at https://wonder.cdc.gov/wonder/help/WONDER-API.html)
|
||
- **SDKs/Libraries:** Community-maintained (wonderapi R package, Python scripts)
|
||
|
||
**Authentication:**
|
||
- **Authentication Required:** No
|
||
- **Authentication Type:** None (public API)
|
||
- **Registration Process:** Not required for API; optional registration for saved queries
|
||
- **Approval Required:** No (for aggregated data); Yes (for restricted-use microdata)
|
||
- **Approval Timeframe:** N/A for API; 6-12 months for restricted-use microdata application
|
||
|
||
**Rate Limits:**
|
||
- **Requests per Second:** Not specified (fair use expected)
|
||
- **Requests per Day:** Not specified (fair use expected)
|
||
- **Concurrent Connections:** Not specified
|
||
- **Throttling Policy:** None documented; recommend 1 request/second to be conservative
|
||
- **Rate Limit Headers:** Not provided
|
||
|
||
**Query Capabilities:**
|
||
- **Filtering:** By state, county, year, age group, sex, race/ethnicity, ICD-10 cause code, place of death, weekday
|
||
- **Sorting:** Not applicable (results sorted by selected grouping variables)
|
||
- **Pagination:** Not applicable (single result set per query; max 2000 rows per query)
|
||
- **Aggregation:** Server-side aggregation by selected group-by variables
|
||
- **Joins:** Not applicable (single data source)
|
||
|
||
**Data Formats:**
|
||
- **Available Formats:** XML (API response), CSV, TXT (web interface)
|
||
- **Format Quality:** Well-formed XML; validated against schema
|
||
- **Compression:** Not supported
|
||
- **Encoding:** UTF-8
|
||
|
||
**Download Options:**
|
||
- **Bulk Download:** No (API returns aggregated data only; microdata requires restricted-use agreement)
|
||
- **Streaming API:** No
|
||
- **FTP/SFTP:** No
|
||
- **Torrent:** No
|
||
- **Data Dumps:** No public bulk download (use API for aggregated data)
|
||
|
||
**Reliability Metrics:**
|
||
- **Uptime:** ~99% (2024 estimate; occasional maintenance windows)
|
||
- **Latency:** 2-30 seconds per query (depends on query complexity)
|
||
- **Breaking Changes:** Rare; backwards compatibility maintained; ICD-11 transition will be announced years in advance
|
||
- **Deprecation Policy:** No formal policy; major changes announced via website/email
|
||
- **Service Level Agreement:** No formal SLA (public service)
|
||
|
||
### Legal/Policy Access
|
||
|
||
**License:**
|
||
- **License Type:** Public Domain (US Government Work)
|
||
- **License Version:** 17 U.S.C. §105 (US Copyright Law)
|
||
- **License URL:** https://www.usa.gov/government-works
|
||
- **SPDX Identifier:** Not applicable (public domain)
|
||
|
||
**Usage Rights:**
|
||
- **Redistribution Allowed:** Yes (public domain)
|
||
- **Commercial Use Allowed:** Yes (no restrictions)
|
||
- **Modification Allowed:** Yes (no restrictions)
|
||
- **Attribution Required:** No (but recommended: cite CDC/NCHS as source)
|
||
- **Share-Alike Required:** No
|
||
|
||
**Cost Structure:**
|
||
- **Access Cost:** Free
|
||
|
||
**Terms of Service:**
|
||
- **TOS URL:** https://wonder.cdc.gov/wonder/help/main.html#Privacy-Policy.html
|
||
- **Key Restrictions:**
|
||
- Cell suppression rules: Counts <10 suppressed to protect privacy
|
||
- Population <100,000 may have suppressed rates
|
||
- Must not attempt to re-identify individuals
|
||
- Prohibited to use for commercial marketing (e.g., targeting individuals)
|
||
- **Liability Disclaimers:** Data provided "as is"; CDC not liable for decisions based on data; users responsible for verifying suitability
|
||
- **Privacy Policy:** CIPSEA protections; no personal data collected via API; website analytics per HHS policy
|
||
|
||
---
|
||
|
||
## Collection Development Policy Fit
|
||
|
||
### Relevance Assessment
|
||
|
||
**Substrate Mission Alignment:**
|
||
- **Human Progress Focus:** Critical crisis indicators - drug overdoses and suicides are leading indicators of wellbeing breakdown
|
||
- **Problem-Solution Connection:**
|
||
- Links to Problems: Opioid epidemic, behavioral health crisis, "deaths of despair", healthcare access gaps
|
||
- Links to Solutions: Harm reduction programs, mental health interventions, addiction treatment, prescription drug monitoring
|
||
- **Evidence Quality:** Gold-standard US vital statistics; census data (not sample); legal authority
|
||
|
||
**Collection Priorities Match:**
|
||
- **Priority Level:** CRITICAL - essential for understanding US wellbeing crises
|
||
- **Uniqueness:** Only official source for county-level drug overdose and suicide mortality in US
|
||
- **Comprehensiveness:** Fills critical gap; reveals behavioral truth that surveys miss (revealed preference vs. stated preference)
|
||
|
||
### Comparison with Holdings
|
||
|
||
**Overlapping Sources:**
|
||
- WHO Mortality Database (DS-00001) - includes US data but less timely/detailed
|
||
- National Violent Death Reporting System (future DS) - more detail on circumstances but limited coverage
|
||
- State vital statistics (various) - single-state focus
|
||
|
||
**Unique Contribution:**
|
||
- Official US mortality statistics with legal authority
|
||
- County-level granularity for geographic variation analysis
|
||
- Complete census (not sample) - captures all deaths
|
||
- Leading indicator of population wellbeing crises (behaviors revealed in deaths)
|
||
- ICD-10 detailed cause-of-death coding
|
||
|
||
**Preferred Use Cases:**
|
||
- Monitoring opioid epidemic and drug overdose trends
|
||
- Suicide rate analysis (national, state, county level)
|
||
- "Deaths of despair" research
|
||
- Geographic variation in mortality crises
|
||
- Premature death analysis (YPLL)
|
||
- Policy evaluation (state-level interventions)
|
||
|
||
---
|
||
|
||
## Technical Specifications
|
||
|
||
### Data Model
|
||
|
||
**Schema Documentation:**
|
||
- **Schema Type:** XML schema (request and response)
|
||
- **Schema URL:** https://wonder.cdc.gov/wonder/help/WONDER-API.html (documentation)
|
||
- **Schema Version:** Current (undated)
|
||
|
||
**Entity Types:**
|
||
- **DeathRecord:** Individual death records (aggregated in API responses)
|
||
- **GroupBy:** Grouping variables (state, county, year, age group, etc.)
|
||
- **Measure:** Count variables (deaths, crude rate, age-adjusted rate, YPLL)
|
||
- **Filter:** Filtering criteria (ICD-10 codes, demographics, geography, time)
|
||
|
||
**Key Relationships:**
|
||
- DeathRecord aggregated by GroupBy dimensions
|
||
- Filtered by Filter criteria
|
||
- Summarized into Measure values
|
||
|
||
**Primary Keys:**
|
||
- Composite key: All GroupBy variables selected in query (e.g., State + County + Year + Age Group + Cause)
|
||
|
||
**Foreign Keys:**
|
||
- Not applicable (single aggregated dataset)
|
||
|
||
### Metadata Standards Compliance
|
||
|
||
**Standards Followed:**
|
||
- [ ] Dublin Core - minimal
|
||
- [ ] DCAT (Data Catalog Vocabulary) - minimal
|
||
- [ ] Schema.org Dataset - minimal
|
||
- [ ] SDMX - no
|
||
- [ ] DDI (Data Documentation Initiative) - minimal
|
||
- [ ] ISO 19115 (Geographic Information Metadata) - minimal
|
||
- [ ] MARC - no
|
||
- Other: ICD-10 (International Classification of Diseases), FIPS (Federal Information Processing Standards) codes for geography
|
||
|
||
**Metadata Quality:**
|
||
- **Completeness:** 70% of elements populated (documentation comprehensive but not formally structured as metadata)
|
||
- **Accuracy:** High - documentation reviewed by NCHS epidemiologists
|
||
- **Consistency:** Good - definitions consistent across time within ICD-10 era
|
||
|
||
### API Documentation Quality
|
||
|
||
**Documentation Assessment:**
|
||
- **Completeness:** Good - core functionality documented; some advanced features require experimentation
|
||
- **Examples Provided:** Yes - XML request examples provided for common queries
|
||
- **Error Messages:** Basic HTTP status codes; XML error messages sometimes cryptic
|
||
- **Change Log:** Not maintained publicly
|
||
- **Tutorials:** Available - step-by-step guide for API usage at https://wonder.cdc.gov/wonder/help/WONDER-API.html
|
||
- **Support Forum:** Email support (wonder@cdc.gov); no public forum; Stack Overflow community questions
|
||
|
||
---
|
||
|
||
## Source Evaluation Narrative
|
||
|
||
### Methodological Assessment
|
||
|
||
**Data Collection Methodology:**
|
||
|
||
**Sampling Design:**
|
||
- **Method:** Census (complete enumeration, not sample)
|
||
- **Sample Size:** N/A (all deaths in US)
|
||
- **Sampling Frame:** N/A (universal death registration)
|
||
- **Stratification:** N/A (census)
|
||
- **Weighting:** Not applicable (census data)
|
||
|
||
**Data Collection Instruments:**
|
||
- **Instrument Type:** US Standard Certificate of Death (standardized form used by all states)
|
||
- **Validation:** Form developed by NCHS in collaboration with states; legally mandated
|
||
- **Question Wording:** Standardized across all states
|
||
- **Mode:** Medical certifier completes cause of death; funeral director completes demographic information; filed with state vital records office
|
||
|
||
**Quality Control Procedures:**
|
||
- **Field Supervision:** State vital registrars oversee completeness and timeliness
|
||
- **Validation Rules:** NCHS automated coding and quality checks (ACME - Automated Classification of Medical Entities)
|
||
- **Consistency Checks:** Age/cause consistency, geographic code validation, demographic completeness checks
|
||
- **Verification:** Query resolution process for problematic records; state vital registrars verify and correct
|
||
- **Outlier Treatment:** Statistical outliers flagged; investigated if data quality issue suspected
|
||
|
||
**Error Characteristics:**
|
||
- **Sampling Error:** None (census, not sample)
|
||
- **Non-sampling Error:**
|
||
- Misclassification of cause of death (especially for drug-involved deaths - toxicology delays)
|
||
- Underreporting of suicides (coroner determination variability; stigma leading to misclassification)
|
||
- Geographic misattribution (death location vs. residence; some states report location of death)
|
||
- Timeliness issues (toxicology delays can cause 6-12 month lag in drug-involved death counts)
|
||
- **Known Biases:**
|
||
- Suicide undercounting (stigma; medicolegal determination inconsistency across jurisdictions)
|
||
- Drug overdose specificity varies (some states better at toxicology testing/reporting)
|
||
- Racial/ethnic misclassification (especially for American Indian/Alaska Native populations)
|
||
- **Accuracy Bounds:**
|
||
- Overall mortality: 99%+ complete (near-universal death registration)
|
||
- Cause of death: 90-95% accuracy for broad categories; 70-85% for specific subcategories
|
||
- Drug-involved deaths: ~10-20% undercount estimated due to lack of toxicology testing or pending investigations
|
||
|
||
**Methodology Documentation:**
|
||
- **Transparency Level:** 5/5 (Comprehensive)
|
||
- **Documentation URL:** https://www.cdc.gov/nchs/nvss/mortality_methods.htm
|
||
- **Peer Review Status:** Methods published in peer-reviewed journals (Vital Statistics Reports series); reviewed by NCVHS
|
||
- **Reproducibility:** High - ICD-10 coding rules publicly available; ACME software documented
|
||
|
||
### Currency Assessment
|
||
|
||
**Update Characteristics:**
|
||
- **Update Frequency:** Annual (final data); quarterly (provisional data)
|
||
- **Update Reliability:** Consistent annual release schedule (December for prior year's final data)
|
||
- **Update Notification:** Email notifications available; NCHS website announcements; RSS feed
|
||
- **Last Updated:** 2024-12-15 (2022 final data released); 2025-06-01 (2023 provisional data)
|
||
|
||
**Timeliness:**
|
||
- **Collection to Publication Lag:**
|
||
- Provisional data: 3-6 months (quarterly releases)
|
||
- Final data: 12-24 months (annual release, typically 11-14 months after year-end)
|
||
- Factors: State reporting timelines, toxicology testing delays, quality assurance, ICD-10 coding
|
||
- **Factors Affecting Timeliness:**
|
||
- State vital registrars' submission schedules (vary by state)
|
||
- Toxicology testing delays (drug-involved deaths)
|
||
- Medicolegal investigations (homicides, suicides, overdoses)
|
||
- Quality review and coding processes
|
||
- **Historical Timeliness:** Generally consistent; COVID-19 pandemic accelerated provisional data releases (2020-2021)
|
||
|
||
**Currency for Different Uses:**
|
||
- **Real-time Analysis:** Unsuitable - 3-24 month lag
|
||
- **Recent Trends:** Suitable for annual trends (provisional data); unsuitable for sub-annual trends
|
||
- **Historical Research:** Excellent - consistent time series 1999-present (ICD-10 era)
|
||
|
||
### Objectivity Assessment
|
||
|
||
**Potential Biases:**
|
||
|
||
**Political Bias:**
|
||
- **Government Influence:** Data collection mandated by law; NCHS has scientific independence protections; political pressure rare but possible (e.g., pressure to downplay opioid crisis)
|
||
- **Editorial Stance:** NCHS maintains scientific neutrality; publishes data regardless of political implications
|
||
- **Political Pressure:** Occasional controversies (e.g., CDC gun violence research restrictions 1996-2018); generally data publication protected
|
||
|
||
**Commercial Bias:**
|
||
- **Funding Sources:** Federal appropriations only; no industry funding
|
||
- **Advertising Influence:** Not applicable (government agency)
|
||
- **Proprietary Interests:** None
|
||
|
||
**Cultural/Social Bias:**
|
||
- **Geographic Bias:** Better data quality in states with well-resourced vital registration systems and comprehensive toxicology testing; rural areas may have less complete death investigation
|
||
- **Social Perspective:** Biomedical model of cause of death; limited capture of social determinants (poverty, discrimination, etc. not coded)
|
||
- **Language Bias:** English; Spanish translations limited
|
||
- **Selection Bias:** Suicide and overdose definitions subject to medicolegal determination - social stigma and local practices affect classification consistency
|
||
|
||
**Transparency:**
|
||
- **Bias Disclosure:** NCHS acknowledges data quality limitations by state; documentation notes known issues (e.g., suicide undercount, toxicology testing variation)
|
||
- **Limitations Stated:** Comprehensive - technical documentation details limitations
|
||
- **Raw Data Available:** Aggregated data public; individual death records available under restricted-use agreement with strict confidentiality protections
|
||
|
||
### Reliability Assessment
|
||
|
||
**Consistency:**
|
||
- **Internal Consistency:** High - validation rules ensure logical consistency (age/cause, location codes)
|
||
- **Temporal Consistency:** Excellent within ICD-10 era (1999+); series break at ICD-9/ICD-10 transition (1998-1999)
|
||
- **Cross-source Consistency:** Matches state vital statistics (NCHS aggregates state data); minor discrepancies due to timing differences
|
||
|
||
**Stability:**
|
||
- **Definition Changes:** Rare within ICD-10 era; ICD-11 transition planned (multi-year advance notice)
|
||
- **Methodology Changes:** ACME coding updates documented; typically minor; comparability maintained
|
||
- **Series Breaks:** Major break at ICD-9/ICD-10 transition (1998-1999); ICD-11 transition will create future break (planned for late 2020s with bridge-coding period)
|
||
|
||
**Verification:**
|
||
- **Independent Verification:** State vital statistics are primary source; academic researchers validate using hospital records, medical examiner reports (generally corroborate NCHS)
|
||
- **Replication Studies:** Extensive academic use; errors reported and corrected in subsequent releases
|
||
- **Audit Results:** GAO audits of federal statistical programs; NCHS passes audits; data quality assessments published periodically
|
||
|
||
### Accuracy Assessment
|
||
|
||
**Validation Evidence:**
|
||
- **Benchmark Comparisons:** Comparison with state vital statistics: 99%+ agreement for counts; <1% differences attributable to timing and geography coding
|
||
- **Coverage Assessments:** Death registration completeness estimated >99%; periodic studies confirm near-universal coverage
|
||
- **Error Studies:**
|
||
- Cause-of-death accuracy studies: 70-95% agreement depending on cause specificity (higher for broad categories, lower for specific subcategories)
|
||
- Drug-involved death studies: Estimated 10-20% undercount due to lack of toxicology testing or pending investigations
|
||
|
||
**Accuracy for Different Uses:**
|
||
- **Point Estimates:** Highly reliable for all-cause mortality (99%+ complete); reliable for major causes (90-95%); moderate reliability for drug/suicide subcategories (70-90% due to classification challenges)
|
||
- **Trend Analysis:** Highly reliable for multi-year trends (5+ years); be cautious with year-to-year changes (can reflect changes in investigation/testing practices, not just true mortality changes)
|
||
- **Cross-sectional Comparison:** Reliable for state comparisons; caution for county comparisons (small counties have cell suppression; rate instability)
|
||
- **Sub-population Analysis:** Reliable for sex, broad age groups, major racial/ethnic categories; limited for detailed age, race/ethnicity intersections (small cell suppression)
|
||
|
||
---
|
||
|
||
## Known Limitations and Caveats
|
||
|
||
### Coverage Limitations
|
||
|
||
**Geographic Gaps:**
|
||
- US citizens dying abroad generally not included (consular reports incomplete)
|
||
- Some territories have incomplete coverage (American Samoa, Guam variable completeness)
|
||
- Tribal lands: Data completeness varies; some tribes opt out of state reporting
|
||
|
||
**Temporal Gaps:**
|
||
- ICD-9 to ICD-10 transition (1998-1999) creates comparability break
|
||
- Provisional data subject to revision (can change by 5-10% when finalized)
|
||
- Toxicology-delayed deaths appear in later data releases (can shift apparent temporal patterns)
|
||
|
||
**Population Exclusions:**
|
||
- Fetal deaths excluded (separate database)
|
||
- Non-residents dying in US included in total counts but can be excluded in analyses
|
||
- Missing race/ethnicity data (5-10% of records have race/ethnicity categorized as "unknown")
|
||
|
||
**Variable Gaps:**
|
||
- Social determinants (income, education, occupation) captured incompletely on death certificate
|
||
- Mental health history not systematically captured (unless contributory cause of death)
|
||
- Substance use history limited (only if documented as cause of death)
|
||
- Intent determination (suicide vs. unintentional vs. undetermined) varies by jurisdiction
|
||
|
||
### Methodological Limitations
|
||
|
||
**Sampling Limitations:**
|
||
- Not applicable (census data)
|
||
|
||
**Measurement Limitations:**
|
||
- **Cause of death accuracy:**
|
||
- Depends on certifier knowledge and diagnostic information available
|
||
- Toxicology testing not universal (drug-involved deaths undercounted)
|
||
- Autopsy rates declining (less diagnostic certainty)
|
||
- Multiple cause coding: ICD allows only one underlying cause; contributing causes captured but less commonly analyzed
|
||
- **Suicide undercounting:**
|
||
- Requires medicolegal determination of intent
|
||
- Stigma may discourage suicide classification
|
||
- Coroner/medical examiner practices vary by jurisdiction
|
||
- Estimated 20-35% undercount (academic studies)
|
||
- **Drug overdose specificity:**
|
||
- Requires toxicology testing (not always performed)
|
||
- Some states better at specific drug identification (opioid type, fentanyl vs. heroin)
|
||
- "Unspecified" drug codes used when testing incomplete
|
||
|
||
**Processing Limitations:**
|
||
- ACME automated coding: Can misclassify complex cases (human review limited to flagged records)
|
||
- ICD-10 coding rules: May not align with clinical understanding (e.g., diabetes contributory but not underlying cause)
|
||
- Geographic coding: Death occurrence location vs. residence - API default is residence but some analyses use occurrence
|
||
- Cell suppression: Counts <10 suppressed (limits small-area analysis)
|
||
|
||
### Comparability Limitations
|
||
|
||
**Cross-national Comparability:**
|
||
- ICD-10 coding rules vary slightly by country (WHO provides guidelines but countries adapt)
|
||
- Medicolegal systems differ (coroner vs. medical examiner; death investigation resources)
|
||
- Toxicology testing practices vary internationally
|
||
- Use WHO Mortality Database for international comparisons (standardized for comparability)
|
||
|
||
**Temporal Comparability:**
|
||
- ICD-9 to ICD-10 transition (1998-1999): Major break; NCHS provides comparability ratios for selected causes
|
||
- Within ICD-10 era: Generally comparable but be aware of:
|
||
- Changes in autopsy rates (declining over time)
|
||
- Changes in toxicology testing practices (fentanyl testing increased post-2015)
|
||
- Changes in suicide investigation practices (some jurisdictions more consistent over time)
|
||
- Opioid prescribing changes affect overdose patterns (prescription monitoring programs, prescribing guidelines)
|
||
|
||
**Sub-group Comparability:**
|
||
- Small counties: Cell suppression and rate instability
|
||
- Racial/ethnic groups: Misclassification issues (especially American Indian/Alaska Native - estimated 30-40% misclassified)
|
||
- Age groups: Comparability high; infant mortality in separate specialized reports
|
||
- Intersectional analysis: Limited by small cell suppression (e.g., sex × race × county × cause)
|
||
|
||
### Usage Caveats
|
||
|
||
**Inappropriate Uses:**
|
||
1. **DO NOT use for real-time surveillance** - 3-24 month lag; use syndromic surveillance for real-time
|
||
2. **DO NOT assume suicide counts are complete** - 20-35% estimated undercount; use as lower bound
|
||
3. **DO NOT compare small counties without considering rate instability** - use multi-year aggregates or suppress unstable rates
|
||
4. **DO NOT infer causation from geographic correlations** - ecological fallacy; state-level associations don't imply individual-level
|
||
5. **DO NOT attempt to re-identify individuals** - violation of CIPSEA; cell suppression protects privacy
|
||
|
||
**Ecological Fallacy Risks:**
|
||
- County-level associations (e.g., unemployment rate and overdose deaths) don't necessarily hold at individual level
|
||
- State-level policies correlated with outcomes may reflect confounding (states adopting policies differ in other ways)
|
||
- Example: States with higher opioid prescribing have higher overdose deaths - doesn't mean all overdose decedents had prescriptions (ecological correlation)
|
||
|
||
**Correlation vs. Causation:**
|
||
- Data appropriate for descriptive epidemiology (who, what, where, when)
|
||
- Analytical epidemiology (why) requires individual-level data, confounding control, causal inference methods
|
||
- Geographic/temporal correlations can generate hypotheses but not test causal mechanisms
|
||
|
||
---
|
||
|
||
## Recommended Use Cases
|
||
|
||
### Ideal Applications
|
||
|
||
**Research Questions Well-Suited:**
|
||
1. "How have drug overdose deaths changed over time in the United States?"
|
||
2. "Which states and counties have the highest suicide rates?"
|
||
3. "What is the geographic pattern of opioid-involved deaths?"
|
||
4. "How do premature death rates (YPLL) vary by state?"
|
||
5. "What are the leading causes of death in the United States by age group?"
|
||
6. "How did state opioid prescribing policies correlate with overdose trends?"
|
||
|
||
**Analysis Types Supported:**
|
||
- Descriptive statistics (counts, rates by geography/demographics)
|
||
- Trend analysis (time series 1999-present)
|
||
- Geographic analysis (state, county-level mapping)
|
||
- Age-standardization for comparability across populations
|
||
- Premature death burden (YPLL before age 75)
|
||
- Multiple cause-of-death analysis (contributing causes)
|
||
- Policy evaluation (ecological studies of state interventions)
|
||
|
||
### Appropriate Contexts
|
||
|
||
**Geographic Contexts:**
|
||
- US national trends
|
||
- State-level comparisons (all 50 states + DC)
|
||
- County-level analysis (caution: small counties have suppression and rate instability; use multi-year aggregates)
|
||
- Regional aggregations (Census regions, HHS regions)
|
||
|
||
**Temporal Contexts:**
|
||
- Long-term trends (1999-present for ICD-10 era)
|
||
- Medium-term trends (5-10 years most reliable)
|
||
- Annual trends (final data preferred; provisional data for recent years)
|
||
- Historical research (especially post-1999 ICD-10 transition)
|
||
|
||
**Subject Contexts:**
|
||
- Opioid epidemic research (overdose deaths by drug type)
|
||
- Suicide prevention (suicide trends by demographics, geography, method)
|
||
- "Deaths of despair" (combined drug/alcohol/suicide mortality)
|
||
- Premature death burden (YPLL)
|
||
- All-cause mortality trends
|
||
- Cause-specific mortality (heart disease, cancer, accidents, etc.)
|
||
|
||
### Use Warnings
|
||
|
||
**Avoid Using This Source For:**
|
||
1. **Real-time outbreak detection** → Use syndromic surveillance, poison control data
|
||
2. **Individual-level research** → Use restricted-use microdata (requires RUA)
|
||
3. **Small-area analysis (<100,000 population)** → Use multi-year aggregates; accept suppression limits
|
||
4. **Complete suicide counts** → Treat as lower bound (20-35% undercount)
|
||
5. **International comparisons** → Use WHO Mortality Database (standardized for comparability)
|
||
6. **Nonfatal outcomes** → Use NEISS, HCUP, emergency department data
|
||
|
||
**Recommended Alternatives For:**
|
||
- Real-time surveillance → NSSP (syndromic surveillance), NNDSS (notifiable diseases)
|
||
- Individual-level analysis → Restricted-use NCHS microdata (requires RUA)
|
||
- Nonfatal injuries → NEISS (National Electronic Injury Surveillance System)
|
||
- Detailed violent death circumstances → NVDRS (National Violent Death Reporting System)
|
||
- More timely state data → State vital statistics departments (6-12 month lag)
|
||
- International data → WHO Mortality Database (standardized for cross-country comparisons)
|
||
|
||
---
|
||
|
||
## Citation
|
||
|
||
### Preferred Citation Format
|
||
|
||
**APA 7th:**
|
||
Centers for Disease Control and Prevention, National Center for Health Statistics. (2024). *Wide-ranging ONline Data for Epidemiologic Research (WONDER)*. http://wonder.cdc.gov
|
||
|
||
**Chicago 17th:**
|
||
Centers for Disease Control and Prevention, National Center for Health Statistics. "Wide-ranging ONline Data for Epidemiologic Research (WONDER)." Accessed October 27, 2025. http://wonder.cdc.gov.
|
||
|
||
**MLA 9th:**
|
||
Centers for Disease Control and Prevention, National Center for Health Statistics. *Wide-ranging ONline Data for Epidemiologic Research (WONDER)*. CDC, 2024, wonder.cdc.gov.
|
||
|
||
**Vancouver:**
|
||
Centers for Disease Control and Prevention, National Center for Health Statistics. Wide-ranging ONline Data for Epidemiologic Research (WONDER) [Internet]. Atlanta (GA): CDC; 2024 [cited 2025 Oct 27]. Available from: http://wonder.cdc.gov
|
||
|
||
**BibTeX:**
|
||
```bibtex
|
||
@misc{cdc_wonder_2024,
|
||
author = {{Centers for Disease Control and Prevention, National Center for Health Statistics}},
|
||
title = {Wide-ranging ONline Data for Epidemiologic Research (WONDER)},
|
||
year = {2024},
|
||
url = {http://wonder.cdc.gov},
|
||
note = {Accessed: 2025-10-27}
|
||
}
|
||
```
|
||
|
||
### Data Citation Principles
|
||
|
||
Following FORCE11 Data Citation Principles:
|
||
- **Importance:** CDC WONDER is citable research output; cite in publications using this data
|
||
- **Credit and Attribution:** Citations credit CDC/NCHS and state vital registrars providing data
|
||
- **Evidence:** Citations enable readers to verify research claims
|
||
- **Unique Identification:** URL + access date; specify database (e.g., "Underlying Cause of Death, 1999-2020")
|
||
- **Access:** Citation provides access method (web interface or API)
|
||
- **Persistence:** CDC maintains stable URLs; archived through Internet Archive
|
||
- **Specificity and Verifiability:** Specify database version, years, ICD-10 codes, access date for exact reproducibility
|
||
- **Interoperability:** Citation format compatible with reference managers, academic databases
|
||
- **Flexibility:** Adaptable to various research outputs (papers, reports, dashboards)
|
||
|
||
**Example of Specific Query Citation:**
|
||
Centers for Disease Control and Prevention, National Center for Health Statistics. (2024). "Underlying Cause of Death, 1999-2020, Drug/Alcohol Induced Causes" [ICD-10 Codes: X40-X44, X60-X64, X85, Y10-Y14]. *WONDER Online Database*. http://wonder.cdc.gov/ucd-icd10.html. Accessed October 27, 2025.
|
||
|
||
---
|
||
|
||
## Version History
|
||
|
||
### Current Version
|
||
- **Version:** ICD-10 (1999-present)
|
||
- **Date:** 1999-01-01 (ICD-10 implementation)
|
||
- **Changes:** Transitioned from ICD-9 to ICD-10 coding; expanded cause-of-death detail; XML API introduced ~2005
|
||
|
||
### Previous Versions
|
||
- **Version:** ICD-9 | **Date:** 1979-1998 | **Changes:** Earlier coding system (separate database); web interface WONDER 1.0 launched 1999
|
||
- **Version:** ICD-8 | **Date:** 1968-1978 | **Changes:** Predecessor classification system (not in WONDER; available via other NCHS data systems)
|
||
|
||
### Planned Changes
|
||
- **Version:** ICD-11 | **Date:** Late 2020s (tentative) | **Changes:** Next major classification revision; WHO approved 2019; US implementation timeline TBD (multi-year advance notice expected); bridge-coding period planned to maintain comparability
|
||
|
||
---
|
||
|
||
## Review Log
|
||
|
||
### Internal Reviews
|
||
- **Date:** 2025-10-27 | **Reviewer:** DM-001 | **Status:** Approved | **Notes:** Initial catalog entry; comprehensive evaluation completed; critical source for US wellbeing crisis indicators
|
||
|
||
### Quality Checks
|
||
- **Last Metadata Validation:** 2025-10-27
|
||
- **Last Authority Verification:** 2025-10-27
|
||
- **Last Link Check:** 2025-10-27
|
||
- **Last Access Test:** 2025-10-27 (API documentation reviewed; test query pending update.ts implementation)
|
||
|
||
---
|
||
|
||
## Related Resources
|
||
|
||
### Cross-References
|
||
|
||
**Related Substrate Entities:**
|
||
- **Problems:**
|
||
- PR-XXXX: Opioid Epidemic
|
||
- PR-XXXX: Behavioral Health Crisis
|
||
- PR-XXXX: "Deaths of Despair"
|
||
- PR-XXXX: Suicide Rate Increases
|
||
- PR-XXXX: Healthcare Access Inequities
|
||
- **Solutions:**
|
||
- SO-XXXX: Harm Reduction Programs
|
||
- SO-XXXX: Medication-Assisted Treatment (MAT)
|
||
- SO-XXXX: Prescription Drug Monitoring Programs (PDMPs)
|
||
- SO-XXXX: Mental Health Crisis Intervention
|
||
- SO-XXXX: Community-Based Prevention
|
||
- **Organizations:**
|
||
- ORG-XXXX: Centers for Disease Control and Prevention (CDC)
|
||
- ORG-XXXX: Substance Abuse and Mental Health Services Administration (SAMHSA)
|
||
- ORG-XXXX: National Institute on Drug Abuse (NIDA)
|
||
- **Other Data Sources:**
|
||
- DS-00001: WHO Global Health Observatory (international mortality comparisons)
|
||
- DS-XXXX: National Violent Death Reporting System (NVDRS) - detailed violent death circumstances
|
||
- DS-XXXX: National Survey on Drug Use and Health (NSDUH) - nonfatal substance use data
|
||
|
||
**External Resources:**
|
||
- **Alternative Sources:**
|
||
- State vital statistics departments: More timely state-specific data (6-12 month lag)
|
||
- WHO Mortality Database: International comparisons
|
||
- **Complementary Sources:**
|
||
- NVDRS: Detailed incident circumstances for violent deaths
|
||
- NSDUH: Nonfatal substance use patterns
|
||
- TEDS: Treatment Episode Data Set (substance use treatment admissions)
|
||
- PDMP: Prescription Drug Monitoring Programs (state-level prescribing data)
|
||
- **Source Comparison Studies:**
|
||
- Ruhm, C.J. (2018). "Deaths of Despair or Drug Problems?" *NBER Working Paper*.
|
||
- Hedegaard et al. (2020). "Issues in Developing a Surveillance Case Definition for Nonfatal Opioid Overdose." *NCHS Data Brief*.
|
||
|
||
### Additional Documentation
|
||
|
||
**User Guides:**
|
||
- WONDER API Guide: https://wonder.cdc.gov/wonder/help/WONDER-API.html
|
||
- Underlying Cause of Death Documentation: https://wonder.cdc.gov/wonder/help/ucd.html
|
||
- ICD-10 Codes: https://www.cdc.gov/nchs/icd/icd10cm.htm
|
||
|
||
**Research Using This Source:**
|
||
- 100,000+ citations in Google Scholar
|
||
- Case & Deaton (2015): "Rising morbidity and mortality in midlife among white non-Hispanic Americans in the 21st century" *PNAS*
|
||
- Case & Deaton (2017): "Mortality and morbidity in the 21st century" *Brookings Papers*
|
||
|
||
**Methodology Papers:**
|
||
- NCHS methods: https://www.cdc.gov/nchs/nvss/mortality_methods.htm
|
||
- Cause-of-death accuracy studies (Vital Statistics Reports series)
|
||
- Comparability studies for ICD revisions
|
||
|
||
---
|
||
|
||
## Cataloger Notes
|
||
|
||
**Internal Notes:**
|
||
- **CRITICAL SOURCE** for Substrate: Reveals behavioral truth (revealed preference) that surveys miss
|
||
- Drug overdoses and suicides are **leading indicators** of wellbeing breakdown - precede economic decline
|
||
- County-level granularity enables geographic analysis (shows "left behind" places)
|
||
- Census data (not sample) - captures all deaths
|
||
- Main limitation: 1-2 year lag (but still best available US mortality data)
|
||
- Suicide undercounting known issue (~20-35% undercount) - use as lower bound
|
||
- API is XML-based (not REST/JSON) - more complex than WHO API but well-documented
|
||
|
||
**To Do:**
|
||
- [x] Create update.ts script for XML API
|
||
- [ ] Test API with sample drug overdose query (ICD-10: X40-X44)
|
||
- [ ] Cross-reference with relevant Problems (opioid epidemic, suicide, deaths of despair)
|
||
- [ ] Cross-reference with relevant Solutions (harm reduction, MAT, PDMPs)
|
||
- [ ] Add NVDRS as complementary source when cataloged
|
||
- [ ] Monitor ICD-11 transition timeline (check NCHS announcements)
|
||
|
||
**Questions for Review:**
|
||
- Should we catalog multiple WONDER databases separately (mortality vs. natality vs. cancer) or keep as related sources?
|
||
- How to handle provisional vs. final data in updates (separate files or versioning)?
|
||
- County suppression rules - how to represent suppressed cells in Substrate format?
|
||
|
||
---
|
||
|
||
**END OF SOURCE RECORD**
|
||
```
|