R Rants…

This JH Data Science capstone project is transforming into a nightmare, especially because of R and tm, which do not do what they are supposed to do. True I have changed architecture and PC in the middle, but this is not the problem.  R and R packages like tm evolve fast and sometimes too fast …

More

Tutorial Review: How to Build a Text Mining, Machine Learning Document Classification System in R!

This tutorial by Tim D’Auria on Youtube, is shorter than 30 minutes. Without pretending too much background it gives you the basic tools and knowledge to build a basic document classification system. The classifier uses a simple KNN  classification algorithm and text mining techniques to learn to distinguish the candidate who pronounced the speeches of …

More