Add comprehensive source list to README (150+ sources inline)

Daniel Miessler
2025-11-10 07:42:35 -08:00
parent 6321f5bbbc
commit 0560eb950d

@@ -233,15 +233,170 @@ Multi-agent parallel research utilizing 9 specialized AI research agents distrib
---
-## Documentation
## Sources
-**Complete source documentation:** See SOURCES.md (150+ sources with full citations)
### Global Data Generation
-**Detailed methodology:** See METHODOLOGY.md (research design, agent assignments, validation protocols)
**IDC (International Data Corporation)**
- IDC Data Age 2025: 149 ZB global data generation (2024)
- IDC Digital Universe Study (2012): 0.5% of data analyzed
- IDC Data Age Study (2020): Only 2% of created data stored
- Source: https://www.idc.com/
-**Individual research reports:** See findings/ directory (12 detailed reports)
**IoT Analytics**
- State of IoT 2024: 21.1 billion IoT devices (2025 projection)
- 79.4 zettabytes/year from IoT devices
- Source: https://iot-analytics.com/
-**Blog-ready table:** See data-utilization-table.md
**Cisco**
- Visual Networking Index (VNI): 82% of internet traffic is video (transmitted, not generated)
- Consumer internet traffic forecasts
- Source: https://www.cisco.com/
### Enterprise Dark Data
**Veritas Technologies**
- Veritas Global Databerg Report (2016): 52% dark data, 85% unused or useless
- Source: https://www.veritas.com/
**NetApp**
- Cloud Complexity Report (2024): 41-80% of documents never accessed after creation
- Source: https://www.netapp.com/
**Forrester Research**
- Enterprise Data Value Study (2024): 60-73% of enterprise data provides zero business value
- Source: https://www.forrester.com/
**Gartner**
- 80% of enterprise data is unstructured and largely unanalyzed
- 29% of employees use BI/analytics tools
- Source: Gartner Research Publications
### Communication & Documents
**Microsoft**
- Microsoft Teams: 92 messages/user/day
- Microsoft 365: 200+ million monthly active users, 500+ trillion files managed
- Source: Microsoft corporate statistics
**Google**
- Google Workspace: 70% collaborate on shared documents weekly
- 2 billion+ new documents created monthly
- Source: Google Workspace official statistics
**Campaign Monitor / Mailchimp**
- Internal business emails: 64% open rate
- External B2B marketing: 38% open rate
- Source: Email marketing industry benchmarks 2024
### Code & Development
**GitHub**
- GitHub Octoverse 2024: 986 million commits annually, 43.2 million PRs/month
- 90% of Fortune 100 use GitHub Copilot
- Source: https://octoverse.github.com/2024
**Codacy**
- State of Software Quality 2024: 49% review every PR, 34% get approval
- 84.33% of approved PRs have single reviewer only
- 28.6% of PRs have zero-minute lifetime (instant merge)
- Source: https://www.codacy.com/state-of-software-quality-2024
**Continuous Delivery Foundation**
- State of CI/CD 2024: 83% of developers involved in CI/CD
- 85%+ projects have branch protection
- Source: https://cd.foundation/
### Security & Monitoring
**SANS Institute**
- SANS 2024 SOC Survey: 44% of alerts completely uninvestigated
- 62% of all alerts ignored, >50% are false positives
- 3,832 alerts/day average per SOC
- Source: https://www.sans.org/
**Coralogix**
- Observability Report 2024: >90% of observability data never read
- 30% of ingested data never used at all
- 250% log data growth over past 12 months
- Source: https://coralogix.com/
**IBM Security**
- X-Force Threat Intelligence: 181-212 days average breach detection time
- Organizations with MDR: 10 days vs without: 32-212 days
- Source: IBM Security reports
### Video Content
**YouTube / TubeFilter**
- 4.68-5% of videos have exactly zero views
- 91% of all videos have <1,000 views
- Only 3.67% reach 10,000+ views but account for 93%+ of all views
- 720,000+ hours uploaded per day
- Source: YouTube Creator Academy, TubeFilter analytics
**TwitchTracker**
- 88% of active Twitch streamers average 0-5 viewers
- 95% never grow beyond zero viewership
- Source: https://twitchtracker.com/
**Grand View Research**
- 1+ billion surveillance cameras worldwide (700M in China)
- 5.5 million TB/day of surveillance footage
- $43-54B surveillance market
- Source: https://www.grandviewresearch.com/
### IoT & Sensors
**McKinsey Digital**
- Less than 1-5% of IoT data ever analyzed
- 90% becomes "dark data"
- 99% of data lost before reaching operational decision-makers (industrial)
- Source: McKinsey & Company industrial IoT research
**Gartner Edge Computing**
- 2024: ~50-60% of data processed at edge
- 2025 Target: 75% of data processed at edge
- Source: Gartner edge computing research
### AI Automation
**McKinsey & Company**
- 78% of organizations use AI in at least one function
- 45-70% of work could be automated
- 31% of AI use cases in full production
- Source: McKinsey Global Institute reports
**Enterprise AI Studies**
- 70-85% overall AI project failure rate
- 88% of POCs fail to reach production
- 42% see zero ROI
- Source: RAND, IDC, Gartner, MIT research
### Social Media
**DataReportal**
- 5+ billion global social media users
- Platform usage and engagement data
- Source: https://datareportal.com/
**Social Media Engagement Studies**
- TikTok: 7.4% average engagement rate
- Instagram Reels: 4.3% average engagement rate
- Facebook Video: 0.08% average engagement rate
- Source: Social media benchmarking 2024
---
## Additional Documentation
**Complete source documentation:** SOURCES.md (150+ sources with full citations, organized by research report)
**Detailed methodology:** METHODOLOGY.md (multi-agent research framework, validation protocols, confidence assessment)
**Individual research reports:** findings/ directory (12 detailed reports totaling 181KB)
**Blog-ready table:** data-utilization-table.md
---