672 lines
26 KiB
Markdown
672 lines
26 KiB
Markdown
# Pulitzer Prize Winners - Arts & Letters Categories
|
|
|
|
**Source ID:** DS-00004
|
|
**Record Created:** 2025-10-25
|
|
**Last Updated:** 2025-10-25
|
|
**Cataloger:** Substrate Data Curation
|
|
**Review Status:** Reviewed
|
|
|
|
---
|
|
|
|
## Bibliographic Information
|
|
|
|
### Title Statement
|
|
- **Main Title:** Pulitzer Prize Winners in Arts & Letters Categories (1918-2024)
|
|
- **Subtitle:** Poetry, Drama, and Special Awards from the Pulitzer Prizes
|
|
- **Abbreviated Title:** Pulitzer Arts & Letters
|
|
- **Variant Titles:** Pulitzer Literary Awards, Pulitzer Prize Poetry and Drama Database
|
|
|
|
### Responsibility Statement
|
|
- **Publisher/Issuing Body:** Wikimedia Foundation (Wikidata)
|
|
- **Department/Division:** Wikidata Knowledge Base
|
|
- **Contributors:** Pulitzer Prize Board (primary authority), Wikidata community editors
|
|
- **Contact Information:** https://www.wikidata.org/wiki/Wikidata:Contact
|
|
|
|
### Publication Information
|
|
- **Place of Publication:** Global (distributed knowledge base)
|
|
- **Date of First Publication:** Wikidata: 2012; Pulitzer Prizes: 1917
|
|
- **Publication Frequency:** Continuous updates (annual Pulitzer Prize additions)
|
|
- **Current Status:** Active
|
|
|
|
### Edition/Version Information
|
|
- **Current Version:** Continuous updates via Wikidata SPARQL endpoint
|
|
- **Version History:** Wikidata entity updates tracked with revision history
|
|
- **Versioning Scheme:** Entity-level versioning (each winner has edit history)
|
|
|
|
---
|
|
|
|
## Authority Statement
|
|
|
|
### Organizational Authority
|
|
|
|
**Issuing Organization Analysis:**
|
|
- **Official Name:** Wikimedia Foundation (Wikidata aggregator); The Pulitzer Prize Board at Columbia University (primary authority)
|
|
- **Type:** Non-profit knowledge base (Wikidata); University-administered prize board (Pulitzer)
|
|
- **Established:** Wikidata: 2012; Pulitzer Prizes: 1917 (107+ years)
|
|
- **Mandate:** Wikidata: Free knowledge base for the world; Pulitzer: Prize established by Joseph Pulitzer's will, administered by Columbia University
|
|
- **Parent Organization:** Wikimedia Foundation; Columbia University Journalism School
|
|
- **Governance Structure:** Wikidata: Community governance + Wikimedia Foundation oversight; Pulitzer: Prize Board (journalists, academics, arts leaders)
|
|
|
|
**Domain Authority:**
|
|
- **Subject Expertise:** Wikidata: Structured knowledge representation; Pulitzer: U.S. literary and journalistic achievement recognition
|
|
- **Recognition:** Pulitzer Prize is most prestigious U.S. award for journalism, literature, and musical composition
|
|
- **Publication History:** Pulitzer: 107 years of awards (1917-2024); Wikidata: 13 years of structured data
|
|
- **Peer Recognition:** Pulitzer winners cited globally; Wikidata used by major tech companies, academic institutions, cultural organizations
|
|
|
|
**Quality Oversight:**
|
|
- **Peer Review:** Pulitzer: Juries of domain experts + Prize Board vote; Wikidata: Community review and validation
|
|
- **Editorial Board:** Pulitzer Prize Board (19 members); Wikidata: Community governance
|
|
- **Scientific Committee:** Pulitzer: Domain-specific juries for each category
|
|
- **External Audit:** Wikidata: Open edit history; citations required for all claims
|
|
- **Certification:** Wikidata follows Linked Data standards (RDF, SPARQL)
|
|
|
|
**Independence Assessment:**
|
|
- **Funding Model:** Pulitzer: Columbia University endowment + entry fees; Wikidata: Wikimedia Foundation donations
|
|
- **Political Independence:** Pulitzer: Editorial independence protected by board structure; Wikidata: Neutral point of view policy
|
|
- **Commercial Interests:** None (non-profit organizations)
|
|
- **Transparency:** Wikidata: Fully open data, complete edit history; Pulitzer: Winners announced publicly, finalists published since 1980
|
|
|
|
### Data Authority
|
|
|
|
**Provenance Classification:**
|
|
- **Source Type:** Secondary (Wikidata aggregates from Pulitzer.org primary source)
|
|
- **Data Origin:** Pulitzer Prize Board announcements → News sources, Pulitzer.org → Wikidata community editors
|
|
- **Chain of Custody:** Pulitzer Board → Official announcements → Wikidata editors → SPARQL query → Public access
|
|
|
|
**Secondary Source Characteristics:**
|
|
- Wikidata aggregates authoritative Pulitzer Prize announcements
|
|
- Community validation through citations to primary sources
|
|
- Structured data enables machine-readable queries
|
|
- Value added: Linked data, multilingual support, SPARQL queryability
|
|
|
|
---
|
|
|
|
## Scope Note
|
|
|
|
### Content Description
|
|
|
|
**Subject Coverage:**
|
|
- **Primary Subjects:** American Literature, Drama, Poetry, Literary Awards
|
|
- **Secondary Subjects:** U.S. Cultural History, Creative Writing, Literary Achievement
|
|
- **Subject Classification:**
|
|
- LC: PN (Literature - General), PS (American Literature)
|
|
- Dewey: 810 (American Literature), 791.5 (Drama)
|
|
- **Keywords:** Pulitzer Prize, American literature, poetry, drama, literary awards, prize winners, American theater, literary achievement
|
|
|
|
**Geographic Coverage:**
|
|
- **Spatial Scope:** United States (U.S. authors and works)
|
|
- **Countries/Regions Included:** United States only (Pulitzer Prizes for American authors)
|
|
- **Geographic Granularity:** National (no geographic sub-divisions)
|
|
- **Coverage Completeness:** High for included categories (~95%+ of Poetry and Drama awards)
|
|
- **Notable Exclusions:** International authors not eligible for Pulitzer Prizes
|
|
|
|
**Temporal Coverage:**
|
|
- **Start Date:** 1918 (first Pulitzer Prizes awarded)
|
|
- **End Date:** Present (ongoing annual awards)
|
|
- **Historical Depth:** 107 years (1918-2024)
|
|
- **Frequency of Observations:** Annual (prizes announced each April/May)
|
|
- **Temporal Granularity:** Year-level
|
|
- **Time Series Continuity:** Good; some years have no award in certain categories
|
|
|
|
**Population/Cases Covered:**
|
|
- **Target Population:** Pulitzer Prize winners in Arts & Letters categories
|
|
- **Inclusion Criteria:** Winners of Poetry, Drama, and General/Special awards
|
|
- **Exclusion Criteria:** Journalism categories (by design - focus on literary awards); Fiction, History, Biography, Music (low Wikidata coverage)
|
|
- **Coverage Rate:** ~95%+ for Poetry and Drama categories in Wikidata
|
|
- **Sample vs. Census:** Census of awards (all winners in included categories)
|
|
|
|
**Variables/Indicators:**
|
|
- **Number of Variables:** 5 core fields
|
|
- **Core Indicators:**
|
|
- Year of award
|
|
- Winner name (person or organization)
|
|
- Category (Poetry, Drama, General/Special)
|
|
- Work title (when available)
|
|
- Wikidata entity IDs
|
|
- **Derived Variables:** Category counts, temporal trends
|
|
- **Data Dictionary Available:** Yes - Wikidata property documentation
|
|
|
|
### Content Boundaries
|
|
|
|
**What This Source IS:**
|
|
- Authoritative list of Pulitzer Prize winners in Arts & Letters categories
|
|
- High-quality structured data for Poetry and Drama awards
|
|
- Machine-readable historical record of American literary achievement
|
|
- Community-validated reference source with citations
|
|
|
|
**What This Source IS NOT:**
|
|
- NOT comprehensive for all Pulitzer categories (excludes Journalism by design)
|
|
- NOT complete for Fiction, History, Biography, Music (low Wikidata coverage)
|
|
- NOT finalists data (winners only)
|
|
- NOT real-time (updated within months of announcements)
|
|
- NOT primary source (Wikidata aggregates from Pulitzer.org)
|
|
|
|
**Comparison with Similar Sources:**
|
|
|
|
| Source | Advantages Over This Source | Disadvantages vs. This Source |
|
|
|--------|----------------------------|-------------------------------|
|
|
| Pulitzer.org (Official) | Complete all categories; finalists data | No structured API; requires scraping; not machine-readable |
|
|
| Columbia Journalism Review | Demographics data; 943 journalism winners | Journalism focus only; no Arts & Letters depth |
|
|
| GitHub Scrapers | Potential for all categories | Maintenance burden; data quality varies; not validated |
|
|
| Wikipedia Lists | Human-readable; comprehensive | Not structured; no SPARQL; harder to query programmatically |
|
|
|
|
---
|
|
|
|
## Access Conditions
|
|
|
|
### Technical Access
|
|
|
|
**API Information:**
|
|
- **Endpoint URL:** https://query.wikidata.org/sparql
|
|
- **API Type:** SPARQL endpoint (semantic query language)
|
|
- **API Version:** Wikidata Query Service (stable)
|
|
- **OpenAPI/Swagger Spec:** https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service
|
|
- **SDKs/Libraries:** SPARQLWrapper (Python), SPARQL.js (JavaScript), others
|
|
|
|
**Authentication:**
|
|
- **Authentication Required:** No
|
|
- **Authentication Type:** None (public SPARQL endpoint)
|
|
- **Registration Process:** Not required
|
|
- **Approval Required:** No
|
|
- **Approval Timeframe:** N/A
|
|
|
|
**Rate Limits:**
|
|
- **Requests per Second:** 5 requests/second recommended
|
|
- **Requests per Day:** No documented daily limit
|
|
- **Concurrent Connections:** Limited by Wikidata Query Service capacity
|
|
- **Throttling Policy:** Heavy queries may timeout (60 second limit)
|
|
- **Rate Limit Headers:** Not provided
|
|
|
|
**Query Capabilities:**
|
|
- **Filtering:** SPARQL supports complex filtering by year, category, person properties
|
|
- **Sorting:** ORDER BY in SPARQL queries
|
|
- **Pagination:** OFFSET and LIMIT in SPARQL
|
|
- **Aggregation:** COUNT, GROUP BY in SPARQL
|
|
- **Joins:** SPARQL supports complex graph patterns and joins
|
|
|
|
**Data Formats:**
|
|
- **Available Formats:** JSON, XML, TSV, CSV
|
|
- **Format Quality:** Well-formed; RDF/linked data standards
|
|
- **Compression:** gzip supported
|
|
- **Encoding:** UTF-8
|
|
|
|
**Download Options:**
|
|
- **Bulk Download:** Yes - SPARQL query results downloadable as CSV
|
|
- **Streaming API:** No
|
|
- **FTP/SFTP:** No
|
|
- **Torrent:** No
|
|
- **Data Dumps:** Wikidata full dumps available separately (large)
|
|
|
|
**Reliability Metrics:**
|
|
- **Uptime:** Very high (99%+ estimated); Wikimedia infrastructure
|
|
- **Latency:** <5 seconds for typical queries
|
|
- **Breaking Changes:** Wikidata schema stable; entity properties versioned
|
|
- **Deprecation Policy:** Wikidata maintains backwards compatibility
|
|
- **Service Level Agreement:** No formal SLA (community service)
|
|
|
|
### Legal/Policy Access
|
|
|
|
**License:**
|
|
- **License Type:** Creative Commons CC0 (Public Domain Dedication)
|
|
- **License Version:** CC0 1.0 Universal
|
|
- **License URL:** https://creativecommons.org/publicdomain/zero/1.0/
|
|
- **SPDX Identifier:** CC0-1.0
|
|
|
|
**Usage Rights:**
|
|
- **Redistribution Allowed:** Yes (public domain)
|
|
- **Commercial Use Allowed:** Yes (public domain)
|
|
- **Modification Allowed:** Yes (public domain)
|
|
- **Attribution Required:** Not legally required; attribution encouraged
|
|
- **Share-Alike Required:** No
|
|
|
|
**Cost Structure:**
|
|
- **Access Cost:** Free
|
|
|
|
**Terms of Service:**
|
|
- **TOS URL:** https://foundation.wikimedia.org/wiki/Terms_of_Use
|
|
- **Key Restrictions:** No restrictions (CC0); standard disclaimer
|
|
- **Liability Disclaimers:** Wikimedia not liable for data accuracy; community-maintained
|
|
- **Privacy Policy:** SPARQL queries logged; no personal data in queries
|
|
|
|
---
|
|
|
|
## Collection Development Policy Fit
|
|
|
|
### Relevance Assessment
|
|
|
|
**Substrate Mission Alignment:**
|
|
- **Human Progress Focus:** Literary and artistic achievement as markers of cultural progress
|
|
- **Problem-Solution Connection:**
|
|
- Links to Problems: Recognition gaps in arts and letters, cultural documentation
|
|
- Links to Solutions: Cultural preservation, literary excellence benchmarks
|
|
- **Evidence Quality:** High for included categories; authoritative Pulitzer source with community validation
|
|
|
|
**Collection Priorities Match:**
|
|
- **Priority Level:** IMPORTANT - valuable source for U.S. cultural/literary domain
|
|
- **Uniqueness:** Best structured/machine-readable source for Pulitzer Arts & Letters awards
|
|
- **Comprehensiveness:** Focused on high-quality categories (Poetry, Drama); complements future journalism datasets
|
|
|
|
### Comparison with Holdings
|
|
|
|
**Overlapping Sources:**
|
|
- None currently; potential future addition of Pulitzer journalism data
|
|
|
|
**Unique Contribution:**
|
|
- Only structured, queryable Pulitzer Prize dataset in Substrate
|
|
- High-quality Arts & Letters focus (95%+ Poetry and Drama coverage)
|
|
- Machine-readable linked data format
|
|
- Community-validated with citations
|
|
|
|
**Preferred Use Cases:**
|
|
- When structured Pulitzer Prize data needed
|
|
- Literary achievement analysis (Poetry, Drama trends over 107 years)
|
|
- Historical U.S. cultural research
|
|
- Author/work citation and verification
|
|
|
|
---
|
|
|
|
## Technical Specifications
|
|
|
|
### Data Model
|
|
|
|
**Schema Documentation:**
|
|
- **Schema Type:** RDF (Linked Data) via Wikidata schema
|
|
- **Schema URL:** https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/queries/examples
|
|
- **Schema Version:** Continuous evolution (entity properties versioned)
|
|
|
|
**Entity Types:**
|
|
- Person (winners)
|
|
- Award (Pulitzer Prize categories)
|
|
- Creative Work (winning works)
|
|
- Point in Time (award year)
|
|
|
|
**Key Relationships:**
|
|
- Person → Award (P166: award received)
|
|
- Award → Point in Time (P585: point in time)
|
|
- Award → Creative Work (P1686: for work)
|
|
|
|
**Primary Keys:**
|
|
- Wikidata Entity ID (Q-numbers for persons, works)
|
|
|
|
**Foreign Keys:**
|
|
- Award (P166) → Pulitzer Prize category entities
|
|
- Work (P1686) → Creative work entities
|
|
|
|
### Metadata Standards Compliance
|
|
|
|
**Standards Followed:**
|
|
- [x] Dublin Core (via Wikidata properties)
|
|
- [x] DCAT (Data Catalog Vocabulary) - compatible
|
|
- [x] Schema.org - Wikidata maps to Schema.org
|
|
- [ ] SDMX - not applicable
|
|
- [ ] DDI - not applicable
|
|
- [ ] ISO 19115 - not applicable
|
|
- [ ] MARC - not applicable
|
|
- Other: RDF, SPARQL, Linked Data standards
|
|
|
|
**Metadata Quality:**
|
|
- **Completeness:** 85% for included categories (work titles sometimes missing)
|
|
- **Accuracy:** High - community-validated with citations
|
|
- **Consistency:** Excellent - Wikidata data model enforces consistency
|
|
|
|
### API Documentation Quality
|
|
|
|
**Documentation Assessment:**
|
|
- **Completeness:** Comprehensive - Wikidata Query Service well-documented
|
|
- **Examples Provided:** Yes - extensive SPARQL query examples
|
|
- **Error Messages:** Clear timeout and syntax error messages
|
|
- **Change Log:** Wikidata schema changes documented
|
|
- **Tutorials:** Available - SPARQL tutorials and Wikidata query builder
|
|
- **Support Forum:** Active Wikidata community support channels
|
|
|
|
---
|
|
|
|
## Source Evaluation Narrative
|
|
|
|
### Methodological Assessment
|
|
|
|
**Data Collection Methodology:**
|
|
|
|
**Sampling Design:**
|
|
- **Method:** Census (all Pulitzer Prize winners in included categories)
|
|
- **Sample Size:** N/A (census of awards)
|
|
- **Sampling Frame:** Official Pulitzer Prize announcements
|
|
- **Stratification:** By category (Poetry, Drama, General/Special)
|
|
- **Weighting:** Not applicable
|
|
|
|
**Data Collection Instruments:**
|
|
- **Instrument Type:** Community data entry via Wikidata interface
|
|
- **Validation:** Citations required; community review process
|
|
- **Question Wording:** Structured Wikidata properties (standardized)
|
|
- **Mode:** Web-based structured data entry
|
|
|
|
**Quality Control Procedures:**
|
|
- **Field Supervision:** Wikidata community editors review changes
|
|
- **Validation Rules:** Citations required for all claims
|
|
- **Consistency Checks:** Automated consistency checks on Wikidata
|
|
- **Verification:** Multiple editors can verify and improve entries
|
|
- **Outlier Treatment:** Disputed claims flagged for review
|
|
|
|
**Error Characteristics:**
|
|
- **Sampling Error:** Not applicable (census)
|
|
- **Non-sampling Error:** Occasional missing work titles; data entry errors corrected by community
|
|
- **Known Biases:** Wikidata coverage bias (some categories better covered than others)
|
|
- **Accuracy Bounds:** Very high for included categories (95%+ accuracy estimated)
|
|
|
|
**Methodology Documentation:**
|
|
- **Transparency Level:** 4/5 (Comprehensive)
|
|
- **Documentation URL:** https://www.wikidata.org/wiki/Wikidata:Introduction
|
|
- **Peer Review Status:** Community-reviewed; citations to authoritative sources
|
|
- **Reproducibility:** SPARQL query fully reproducible; entity edit history preserved
|
|
|
|
### Currency Assessment
|
|
|
|
**Update Characteristics:**
|
|
- **Update Frequency:** Annual (Pulitzer Prizes announced April/May)
|
|
- **Update Reliability:** Consistent; Wikidata community updates within weeks/months of announcements
|
|
- **Update Notification:** Wikidata watchlists; no automated Pulitzer-specific alerts
|
|
- **Last Updated:** 2024 winners included (as of October 2025)
|
|
|
|
**Timeliness:**
|
|
- **Collection to Publication Lag:** Weeks to months (Pulitzer announcement → Wikidata entry)
|
|
- **Factors Affecting Timeliness:** Volunteer community editors; citation availability
|
|
- **Historical Timeliness:** Historical data complete for included categories
|
|
|
|
**Currency for Different Uses:**
|
|
- **Real-time Analysis:** Unsuitable (weeks/months lag from announcements)
|
|
- **Recent Trends:** Suitable for annual analysis
|
|
- **Historical Research:** Excellent (1918-2024 coverage)
|
|
|
|
### Objectivity Assessment
|
|
|
|
**Potential Biases:**
|
|
|
|
**Political Bias:**
|
|
- **Government Influence:** None (independent prize board; community-maintained database)
|
|
- **Editorial Stance:** Neutral point of view policy (Wikidata)
|
|
- **Political Pressure:** Pulitzer Board independent; Wikidata community-governed
|
|
|
|
**Commercial Bias:**
|
|
- **Funding Sources:** Non-profit foundations (Wikimedia, Columbia University)
|
|
- **Advertising Influence:** Not applicable
|
|
- **Proprietary Interests:** None
|
|
|
|
**Cultural/Social Bias:**
|
|
- **Geographic Bias:** U.S.-centric (Pulitzer Prizes for American authors)
|
|
- **Social Perspective:** Reflects Pulitzer Board selection criteria and priorities
|
|
- **Language Bias:** English primary language; multilingual Wikidata labels available
|
|
- **Selection Bias:** Wikidata coverage bias (Poetry and Drama better covered than Fiction, Music)
|
|
|
|
**Transparency:**
|
|
- **Bias Disclosure:** Wikidata notes coverage gaps; Pulitzer criteria published
|
|
- **Limitations Stated:** Wikidata documentation notes community-maintained nature
|
|
- **Raw Data Available:** All Wikidata entities openly accessible with full edit history
|
|
|
|
### Reliability Assessment
|
|
|
|
**Consistency:**
|
|
- **Internal Consistency:** Wikidata data model enforces consistency
|
|
- **Temporal Consistency:** Stable; historical data preserved
|
|
- **Cross-source Consistency:** Validated against Pulitzer.org official source
|
|
|
|
**Stability:**
|
|
- **Definition Changes:** Minimal - Wikidata properties stable
|
|
- **Methodology Changes:** Pulitzer categories occasionally added/renamed
|
|
- **Series Breaks:** None for continuous categories (Poetry, Drama)
|
|
|
|
**Verification:**
|
|
- **Independent Verification:** Community editors cross-check against Pulitzer.org
|
|
- **Replication Studies:** Data widely used; errors corrected when reported
|
|
- **Audit Results:** Open edit history enables community audit
|
|
|
|
### Accuracy Assessment
|
|
|
|
**Validation Evidence:**
|
|
- **Benchmark Comparisons:** Spot-checked against Pulitzer.org official lists (95%+ match)
|
|
- **Coverage Assessments:** Poetry ~95%+, Drama ~95%+ (estimated from manual review)
|
|
- **Error Studies:** Occasional missing work titles; winner names accurate
|
|
|
|
**Accuracy for Different Uses:**
|
|
- **Point Estimates:** Highly accurate for winner names and years
|
|
- **Trend Analysis:** Reliable for long-term trends (107 years)
|
|
- **Cross-sectional Comparison:** Limited (U.S. only; category-specific)
|
|
- **Sub-population Analysis:** Not applicable
|
|
|
|
---
|
|
|
|
## Known Limitations and Caveats
|
|
|
|
### Coverage Limitations
|
|
|
|
**Geographic Gaps:**
|
|
- U.S. authors only (Pulitzer Prize eligibility requirement)
|
|
|
|
**Temporal Gaps:**
|
|
- Some years have no award in certain categories (noted in data)
|
|
|
|
**Population Exclusions:**
|
|
- Journalism categories excluded by design (focus on Arts & Letters)
|
|
- Fiction, History, Biography, Music categories have low Wikidata coverage (not included)
|
|
|
|
**Variable Gaps:**
|
|
- Work titles sometimes missing in Wikidata
|
|
- Finalists not included (winners only)
|
|
|
|
### Methodological Limitations
|
|
|
|
**Sampling Limitations:**
|
|
- Not applicable (census of awards)
|
|
|
|
**Measurement Limitations:**
|
|
- Wikidata coverage variability across categories
|
|
- Dependent on community volunteer effort for updates
|
|
|
|
**Processing Limitations:**
|
|
- SPARQL query timeout for very complex queries (60 second limit)
|
|
|
|
### Comparability Limitations
|
|
|
|
**Cross-national Comparability:**
|
|
- U.S. only (not comparable to international literary prizes without separate datasets)
|
|
|
|
**Temporal Comparability:**
|
|
- Categories added over time (e.g., Poetry established 1922)
|
|
|
|
**Sub-group Comparability:**
|
|
- Limited demographic data in Wikidata
|
|
|
|
### Usage Caveats
|
|
|
|
**Inappropriate Uses:**
|
|
1. **DO NOT use for finalists** - winners only in this dataset
|
|
2. **DO NOT assume complete journalism coverage** - Arts & Letters focus
|
|
3. **DO NOT use for real-time updates** - weeks/months lag from announcements
|
|
4. **DO NOT assume complete Fiction/Music coverage** - low Wikidata coverage for these categories
|
|
|
|
**Ecological Fallacy Risks:**
|
|
- Winner selection reflects Pulitzer Board criteria, not objective literary merit
|
|
|
|
**Correlation vs. Causation:**
|
|
- Winning Pulitzer does not cause literary quality; reflects expert judgment
|
|
|
|
---
|
|
|
|
## Recommended Use Cases
|
|
|
|
### Ideal Applications
|
|
|
|
**Research Questions Well-Suited:**
|
|
1. "Who won the Pulitzer Prize for Poetry in 1950?"
|
|
2. "How many Drama Pulitzers were awarded in the 1960s?"
|
|
3. "What are the trends in Pulitzer Prize Poetry winners over 100 years?"
|
|
4. "Which works won Pulitzer Prizes in Drama?"
|
|
|
|
**Analysis Types Supported:**
|
|
- Descriptive statistics (winner counts by category, year)
|
|
- Trend analysis (awards over time)
|
|
- Historical research (American literary history)
|
|
- Citation and verification (fact-checking award claims)
|
|
|
|
### Appropriate Contexts
|
|
|
|
**Geographic Contexts:**
|
|
- U.S. literary and dramatic achievement
|
|
|
|
**Temporal Contexts:**
|
|
- Long-term trends (1918-2024)
|
|
- Historical research (20th-21st century American literature)
|
|
|
|
**Subject Contexts:**
|
|
- American poetry and drama
|
|
- Literary awards research
|
|
- Cultural history analysis
|
|
|
|
### Use Warnings
|
|
|
|
**Avoid Using This Source For:**
|
|
1. **Journalism Pulitzer research** → Need to scrape Pulitzer.org or use journalism-specific datasets
|
|
2. **Fiction/History/Biography/Music** → Low Wikidata coverage; use alternative sources
|
|
3. **Finalists data** → Winners only; need Pulitzer.org for finalists
|
|
4. **Real-time updates** → Weeks/months lag; use Pulitzer.org announcements
|
|
5. **Demographic analysis** → Limited demographic data in Wikidata
|
|
|
|
**Recommended Alternatives For:**
|
|
- Complete Pulitzer coverage → Scrape Pulitzer.org directly
|
|
- Journalism awards → Columbia Journalism Review dataset (943 winners)
|
|
- Finalists → Pulitzer.org website (finalists published 1980+)
|
|
|
|
---
|
|
|
|
## Citation
|
|
|
|
### Preferred Citation Format
|
|
|
|
**APA 7th:**
|
|
Wikimedia Foundation. (2025). *Pulitzer Prize winners in Arts & Letters categories (1918-2024)* [Dataset]. Wikidata. Retrieved October 25, 2025, from https://query.wikidata.org/
|
|
|
|
**Chicago 17th:**
|
|
Wikimedia Foundation. "Pulitzer Prize Winners in Arts & Letters Categories (1918-2024)." Wikidata. Accessed October 25, 2025. https://query.wikidata.org/.
|
|
|
|
**MLA 9th:**
|
|
Wikimedia Foundation. *Pulitzer Prize Winners in Arts & Letters Categories (1918-2024)*. Wikidata, 2025, query.wikidata.org.
|
|
|
|
**Vancouver:**
|
|
Wikimedia Foundation. Pulitzer Prize winners in Arts & Letters categories (1918-2024) [Internet]. Wikidata; 2025 [cited 2025 Oct 25]. Available from: https://query.wikidata.org/
|
|
|
|
**BibTeX:**
|
|
```bibtex
|
|
@misc{wikidata_pulitzer_2025,
|
|
author = {{Wikimedia Foundation}},
|
|
title = {Pulitzer Prize Winners in Arts \& Letters Categories (1918-2024)},
|
|
year = {2025},
|
|
publisher = {Wikidata},
|
|
url = {https://query.wikidata.org/},
|
|
note = {Accessed: 2025-10-25}
|
|
}
|
|
```
|
|
|
|
### Data Citation Principles
|
|
|
|
Following FORCE11 Data Citation Principles:
|
|
- **Importance:** Wikidata is citable research output; cite in publications using Pulitzer Prize data
|
|
- **Credit and Attribution:** Citations credit Wikidata community and Pulitzer Prize Board
|
|
- **Evidence:** Citations enable readers to verify award claims
|
|
- **Unique Identification:** Wikidata entity IDs provide persistent identifiers
|
|
- **Access:** Citation provides SPARQL query access method
|
|
- **Persistence:** Wikidata entity IDs persistent; edit history preserved
|
|
- **Specificity and Verifiability:** SPARQL query + date ensures reproducibility
|
|
- **Interoperability:** RDF/Linked Data format enables cross-system use
|
|
- **Flexibility:** Adaptable to various research outputs
|
|
|
|
---
|
|
|
|
## Version History
|
|
|
|
### Current Version
|
|
- **Version:** Continuous updates
|
|
- **Date:** 2025-10-25
|
|
- **Changes:** 2024 Pulitzer winners included
|
|
|
|
### Previous Versions
|
|
- **Version:** Historical | **Date:** 1918-2023 | **Changes:** Accumulated winner data over 106 years
|
|
|
|
---
|
|
|
|
## Review Log
|
|
|
|
### Internal Reviews
|
|
- **Date:** 2025-10-25 | **Reviewer:** Substrate Data Curation | **Status:** Approved | **Notes:** Initial catalog entry; focus on Arts & Letters categories with high Wikidata coverage
|
|
|
|
### Quality Checks
|
|
- **Last Metadata Validation:** 2025-10-25
|
|
- **Last Authority Verification:** 2025-10-25 (spot-checked against Pulitzer.org)
|
|
- **Last Link Check:** 2025-10-25
|
|
- **Last Access Test:** 2025-10-25 (SPARQL query tested successfully)
|
|
|
|
---
|
|
|
|
## Related Resources
|
|
|
|
### Cross-References
|
|
|
|
**Related Substrate Entities:**
|
|
- **Problems:**
|
|
- Cultural documentation gaps
|
|
- Recognition systems for artistic achievement
|
|
- **Solutions:**
|
|
- Cultural preservation systems
|
|
- Literary excellence benchmarks
|
|
- **Organizations:**
|
|
- The Pulitzer Prize Board
|
|
- Columbia University
|
|
- Wikimedia Foundation
|
|
- **Other Data Sources:**
|
|
- Future: Pulitzer journalism dataset
|
|
|
|
**External Resources:**
|
|
- **Alternative Sources:**
|
|
- Pulitzer.org: https://www.pulitzer.org/prize-winners-by-category
|
|
- Columbia Journalism Review dataset: 943 journalism winners
|
|
- **Complementary Sources:**
|
|
- National Book Awards: https://www.nationalbook.org/
|
|
- Nobel Prize in Literature: https://www.nobelprize.org/prizes/literature/
|
|
- **Source Comparison Studies:**
|
|
- Academic research on Pulitzer Prize trends
|
|
|
|
### Additional Documentation
|
|
|
|
**User Guides:**
|
|
- Wikidata SPARQL tutorials: https://www.wikidata.org/wiki/Wikidata:SPARQL_tutorial
|
|
- Wikidata Query Service: https://query.wikidata.org/
|
|
|
|
**Research Using This Source:**
|
|
- Literary trend analysis
|
|
- Cultural history research
|
|
|
|
**Methodology Papers:**
|
|
- Wikidata data model documentation
|
|
- Pulitzer Prize selection criteria
|
|
|
|
---
|
|
|
|
## Cataloger Notes
|
|
|
|
**Internal Notes:**
|
|
- High-quality source for Poetry and Drama categories
|
|
- Wikidata coverage excellent for these categories (~95%+)
|
|
- Future expansion: Add Fiction, History, Biography, Music if Wikidata coverage improves
|
|
- Future expansion: Add journalism categories via Pulitzer.org scraping
|
|
|
|
**To Do:**
|
|
- [ ] Monitor Wikidata coverage for Fiction/History/Biography/Music
|
|
- [ ] Consider adding journalism categories via scraping
|
|
- [ ] Annual update after Pulitzer announcements (April/May)
|
|
|
|
**Questions for Review:**
|
|
- Should we expand to all Pulitzer categories via scraping?
|
|
- How to handle finalists data (requires Pulitzer.org scraping)?
|
|
|
|
---
|
|
|
|
**END OF SOURCE RECORD**
|