Predictive Text App - Natural Language Processing

This project was prepared as a capstone project for the Coursera/John Hopkins Data Science Specialization.

Synopsis

A predictive text application was developed using a corpora of English text from blog, news, and twitter sources.

Using a 5-gram dictionary paired with a 'Stupid Backoff' model, the application predicts the next word of sentences from user input with a top prediction rate of 11.51% and top-3 rate of 21.31%.

The final application can be found here.

A short presentation describing the analysis and application can be found here.

Exploratory Data Analysis - Word Clouds

Initial Exploration

Exploration with Final Testing Results

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
PredictiveTextApp		PredictiveTextApp
data		data
0 - FUNCTIONS.R		0 - FUNCTIONS.R
1 - Predictive Text App - Getting & Cleaning the Data.R		1 - Predictive Text App - Getting & Cleaning the Data.R
2 - Predictive Text App - Exploratory Analysis.Rmd		2 - Predictive Text App - Exploratory Analysis.Rmd
3 - Predictive Text App - Model.R		3 - Predictive Text App - Model.R
4 - Predictive Text App - Testing.R		4 - Predictive Text App - Testing.R
5_-_Predictive_Text_App_-_Presentation.Rmd		5_-_Predictive_Text_App_-_Presentation.Rmd
Natural-Language-Processing-Project.Rproj		Natural-Language-Processing-Project.Rproj
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predictive Text App - Natural Language Processing

Synopsis

Exploratory Data Analysis - Word Clouds

About

Releases

Packages

Languages

Patrickdg/Predictive-Text-Application---Natural-Language-Processing

Folders and files

Latest commit

History

Repository files navigation

Predictive Text App - Natural Language Processing

Synopsis

Exploratory Data Analysis - Word Clouds

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages