Tutorial Review: How to Build a Text Mining, Machine Learning Document Classification System in R!

This tutorial by Tim D’Auria on Youtube, is shorter than 30 minutes. Without pretending too much background it gives you the basic tools and knowledge to build a basic document classification system. The classifier uses a simple KNN  classification algorithm and text mining techniques to learn to distinguish the candidate who pronounced the speeches of the 2008’s US Presidential campaign, the two candidates being  Barak Obama’s and Mitt Romney.

Best of all this is an excellent example of the R tm package in use.

Links:

CRAN – Package tm

Basic Text Mining in R

A gentle introduction to text mining using R | Eight to Late

Hands-On Data Science with R – Text Mining

The Source code of the video can be found here