Files
Daniel Miessler 9066ad477b Add Bay Area COVID wastewater and Pulitzer Prize datasets
Added two comprehensive datasets with full documentation:

1. Bay Area COVID-19 Wastewater Surveillance (2022-2025)
   - California statewide COVID-19 wastewater data
   - 161 weekly data points from CDPH
   - Leading health indicator for viral trends
   - Includes automated update scripts

2. Pulitzer Prize Winners - Arts & Letters (1918-2024)
   - 249 winners across 107 years
   - Poetry, Drama, and General/Special categories
   - High-quality curated data from Wikidata
   - CSV files for each category

Added master Data directory documentation (Data/README.md) describing:
- Data philosophy and quality standards
- All four current datasets
- Contribution guidelines
- File naming conventions

Includes utility commands:
- get-bay-area-covid-status: Analyze current COVID wastewater levels
- get-california-wastewater-data: Fetch latest surveillance data

Updated .gitignore to exclude large raw data files (278MB+).

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-16 22:09:43 -07:00

21 lines
492 B
Markdown

# Pulitzer Prize Winners Resources
## Official Source
**Pulitzer Prizes Official Website**: https://www.pulitzer.org
- Complete historical records (1917-present)
- Prize winners by year and category
- Updated annually (typically April/May)
## Data Source
**Wikidata SPARQL Query Service**: https://query.wikidata.org/
- Structured knowledge base
- Open data (CC0 public domain)
- SPARQL query language for data extraction
- Direct CSV export capability
---
**Last Updated**: 2025-10-07