For NLP and statistical analysis, I have used:
In addition to the NLP and visualization, I used the following tools for analysis and infrastructure:
I have also used ggplot2 and other tools from the "tidyverse" in R for exploring analyses and visualizations, although these were not used in the final analyses and visualizations shown in the videos.
For more information to find out how you can use visualizations with your collection of letters, contact me!
All of the data used in these demos comes from out of copyright materials.
For the letters between Elizabeth Barrett and Robert Browning, I have used the versions originally published by their son Robert in 1898, as found on Project Gutenberg (vol 1) and the Internet Archive (vol 2). I have made available a basic annotated corpus, done with the help of students at the University of Tübingen, on my corpora resources page.
For the letters from Michelangelo, I have used the versions originally published in 1875 by Gaetano Milanesi, as found on Project Gutenberg. I have made available a basic annotated corpus, on my corpora resources page.
For the journal of Eleazar Ingalls, I have used the version originally published in 1852, as found on Project Gutenberg.