This project was prepared as a capstone project for the Coursera/John Hopkins Data Science Specialization.
A predictive text application was developed using a corpora of English text from blog, news, and twitter sources.
Using a 5-gram dictionary paired with a 'Stupid Backoff' model, the application predicts the next word of sentences from user input with a top prediction rate of 11.51% and top-3 rate of 21.31%.
The final application can be found here.
A short presentation describing the analysis and application can be found here.