A bit of rest…

Here it comes, summertime. Last year I was particularly active with this Blog and my quest to learn as much as possible about Data Science, Python, R etc. Many MOOCs and books later, and with only the Capstone project that separates me from achieving the Data Science Specialization, I have the need to slow down …

More

NLP – Natural Language Processing

The Coursera JH Data Science Specialization closes with a Capstone Project based on Natural Language Processing. This course is in the references and its lessons are  still available for preview but only until the 30 June clicking on the following URL: https://class.coursera.org/nlp/lecture. The lessons in PDF format are still available from Dan Jurafsky at the …

More

Happy First Birthday!

I almost did not notice, but last 15th May was my site’s become one year old! Wow! In one year, this is the basic traffic data: Number of visitors as of now : 3730 Number of visits: 26085 My free hosting profile does not allow me to know from where you visit, I choose to …

More

GoogleVis and R – Tutorial

During the first week of the “Developing Data Products” MOOC on Coursera, one of the lessons deals with GoogleVis. This is one of the ways to publish and animate your R charts. In practice GoogleVis provides an interface between R and the Google Charts Tools, allowing you to create interactive web charts from R without …

More

Free R Resources

I was looking for some references and examples in the scope of the “Regression Models” course on Coursera. Something  structured that could bridge the gaps in my R knowledge (and  they are many) and something that could provide me with a list of case studies. I have already bought a book on this, it is …

More

Shiny Application: A shaded Normal Distribution

I have played a bit with shiny. The RStudio folks provides the possibility of deploying R “Shiny” applications on their server. The final result  which will look a lot like this: https://<YourAccount>.shinyapps.io/<YourApp>. There are several plans for this type of deployment, including one that allows you to deploy up to 5 applications for free. You …

More

Reproducible Research

This MOOC is conceptually one of the most interesting ones that I have taken to date. It is based around the implementation of the concept of “Literate Programming”, introduced by Donald Knuth in his 1992 book, in which basically a system where documentation and “live” source code are presented in the same document. In the …

More