# Neville Letters Bigram Analysis (1585-1620)
## Including Henry VIII Section Breakdown

**Date:** January 5, 2026  
**Analysis:** Bigram Cosine Similarity  
**Corpus:** Neville Letters (99,088 tokens) vs. 296 Plays  
**Date Range:** 1585-1620  

---

## Key Finding: Henry VIII Section Analysis

| Section | Rank | Similarity | Tokens |
|---------|------|------------|--------|
| **Full Play** | **#1** | 0.6126 | 24,712 |
| **Shakespeare Section** | **#4** | 0.5866 | 9,994 |
| **Fletcher Section** | **#31** | 0.5326 | 14,718 |

**The Shakespeare section is 10.1% more similar to Neville than the Fletcher section.**
**(27-rank difference)**

---

## Top 20 Plays by Bigram Similarity

| Rank | Year | Similarity | Title |
|------|------|------------|-------|
| **1** | **1613** | **0.6126** | **Henry VIII [Full Play]** |
| 2 | 1609 | 0.6079 | The Winter's Tale |
| 3 | 1599 | 0.5897 | Henry V |
| **4** | **1613** | **0.5866** | **Henry VIII [Shakespeare Section]** |
| 5 | 1610 | 0.5843 | Cymbeline |
| 6 | 1618 | 0.5803 | Technogamia |
| 7 | 1603 | 0.5736 | All's Well That Ends Well |
| 8 | 1600 | 0.5687 | Cynthia's Revels |
| 9 | 1597 | 0.5659 | Henry IV, Part 2 |
| 10 | 1608 | 0.5652 | Coriolanus |
| 11 | 1602 | 0.5645 | Royal King and Loyal Subject |
| 12 | 1619 | 0.5636 | Two Wise Men |
| 13 | 1607 | 0.5597 | Tragedy of Charles Duke of Byron |
| 14 | 1595 | 0.5585 | Love's Labor's Lost |
| 15 | 1603 | 0.5583 | Measure for Measure |
| 16 | 1599 | 0.5528 | 1 Edward the Fourth |
| 17 | 1599 | 0.5509 | Every Man Out of His Humour |
| 18 | 1588 | 0.5473 | True Tragedy of Richard III |
| 19 | 1597 | 0.5473 | Henry IV, Part 1 |
| 20 | 1619 | 0.5470 | Two Merry Milkmaids |

---

## Comparison: Bigram vs Trigram Rankings

| Metric | Full Play | Shakespeare | Fletcher | S vs F Diff |
|--------|-----------|-------------|----------|-------------|
| **Bigram** | #1 | #4 | #31 | +10.1%, 27 ranks |
| **Trigram** | #3 | #16 | #57 | +20.7%, 41 ranks |

Shakespeare section wins on BOTH measures, with the advantage being larger for trigrams.

---

## Summary Statistics

- **Total Plays Analyzed:** 298 (including H8 sections)
- **Mean Similarity:** 0.4494
- **Standard Deviation:** 0.0697
- **Maximum:** 0.6126 (Henry VIII Full)
- **Minimum:** 0.1321

---

## Files in Claude-Jan-5/

- `Neville_Bigram_Plays_1585_1620.csv` - Full results
- `analyze_neville_bigrams_with_h8_sections.py` - Script
- `Bigram_Analysis_Summary.md` - This summary
