U.S-Presidential-Speeches

Analyze the text of U.S. presidents' speeches from 1790 to 2006 using a Word2Vec (w2v) model in gensim.
Note: I worked on this project in July 2014. There has been a lot of progress since then.

All speeches in their original format are in speech.txt.
all_speech.txt holds the processed speeches, one processed speech per line.
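Because all_speech.txt stores one processed speech per line, it can be read into one token list per speech, which is the input shape that word-embedding trainers such as gensim expect. The following is a minimal sketch, not the code in speech.py: the function name `read_processed_speeches` is hypothetical, and it assumes the processed tokens are separated by whitespace.

```python
def read_processed_speeches(path):
    """Return one list of tokens per non-empty line of a processed-speech file."""
    speeches = []
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            # Assumption: tokens in the processed file are whitespace-separated.
            tokens = line.split()
            if tokens:  # skip blank lines
                speeches.append(tokens)
    return speeches
```

For example, `read_processed_speeches("all_speech.txt")` would return a list whose length equals the number of speeches in the file.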
data_processed.txt is a JSON file holding the full metadata from speech.txt as a list of dictionaries. Each dictionary contains the following key-value pairs:
- 'who': president's name
- 'date': date of the speech (example: January 27, 1984)
- 'speech': full text of the speech
- 'what': kind of speech (example: 'State of the Union Address')
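The metadata file can be loaded with the standard `json` module. The sketch below is illustrative only: the helper names (`load_speech_records`, `speech_year`, `speeches_by_president`) are hypothetical, and it assumes every 'date' value follows the "January 27, 1984" pattern shown above.

```python
import json
from datetime import datetime


def load_speech_records(path):
    """Load the list of speech dictionaries from the JSON metadata file."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def speech_year(record):
    """Extract the year from a record's 'date' field."""
    # Assumption: dates are formatted like "January 27, 1984".
    return datetime.strptime(record["date"], "%B %d, %Y").year


def speeches_by_president(records):
    """Group the records by the value of their 'who' key."""
    grouped = {}
    for record in records:
        grouped.setdefault(record["who"], []).append(record)
    return grouped
```

With the repository's file, `load_speech_records("data_processed.txt")` would return the list of dictionaries described above.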
The code that processes speech.txt is in speech.py.
w2v_speech.py uses gensim to train a w2v model on the speeches.
Each speech vector is calculated by averaging the word vectors of all the words in the speech.
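The averaging step can be sketched with NumPy as follows. This is not the code from w2v_speech.py: `speech_vector` is a hypothetical helper, `word_vectors` stands for any mapping from a word to its vector, and skipping words that have no vector is an assumption about how out-of-vocabulary words are handled.

```python
import numpy as np


def speech_vector(tokens, word_vectors, dim=100):
    """Average the vectors of all tokens that have an entry in word_vectors."""
    # Assumption: words without a vector are skipped rather than zero-filled.
    known = [word_vectors[token] for token in tokens if token in word_vectors]
    if not known:
        # No known words at all: fall back to a zero vector of the model's size.
        return np.zeros(dim)
    return np.mean(known, axis=0)
```

Stacking the result for every speech gives an array with one row per speech, which is the shape a 2D projection step takes as input.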
w2v_tsne.py contains the code to plot a 2D version of the 100-dimensional speech vectors.
speech_vectors.npy is a NumPy array of the vectors for all speeches, as processed in w2v_tsne.py.

Here is the t-SNE plot of the speech vectors (the labeled version is in the repo; download the image to zoom in):

Here is the distance matrix between speeches. Zoom in to see the year of each speech vector.
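A pairwise distance matrix of this kind can be computed directly from the stacked speech vectors. The README does not say which metric was used for the plot, so the sketch below assumes cosine distance; `cosine_distance_matrix` is a hypothetical helper name.

```python
import numpy as np


def cosine_distance_matrix(vectors):
    """Return the matrix of pairwise cosine distances between row vectors."""
    vectors = np.asarray(vectors, dtype=float)
    norms = np.linalg.norm(vectors, axis=1, keepdims=True)
    norms[norms == 0] = 1.0  # leave all-zero rows unscaled to avoid dividing by zero
    unit = vectors / norms
    # Cosine distance = 1 - cosine similarity (assumed metric, see above).
    return 1.0 - unit @ unit.T
```

Assuming speech_vectors.npy holds one row per speech, `cosine_distance_matrix(np.load("speech_vectors.npy"))` would give a square matrix with one row and one column per speech.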
