Machine Learning – Andrew Ng

I have mentioned this MOOC, from Stanford University’ professor Andrew Ng, available on Coursera in a couple of articles already. I have read about it in several forums and until now, I only took a quick look. A couple of months ago I finally decided to dive in, because I was really curious, the comments …

More

Forecasting using R

Rob J. Hyndman is Professor of Statistics in the Department of Econometrics and Business Statistics at Monash University. He, together with George Athanasopoulos, has published the freely available book “Forecasting: Principles and Practice”, that can be found here or bought in its paper version at Amazon (amazon.com, amazon.com.uk, amazon.fr) or in its electronic version at …

More

Build a Data Science skillset

I was invited to join the Coursera Data Science community a few weeks ago. I did, and this is a very interesting meeting point where people with any degrees of Data Science experience and skills meet and discuss several related topic. As I feel I am still not mature as a Data Scientist, I followed …

More

Machine Learning? Deep Learning? Mhhhh…

Well well. These days the terms “Machine Learning” and “Deep Learning” seem really to be all over the places. But what is all this hype. about 25 years ago, for my university thesis at Pisa University, I and my team mate have implemented a Functional programming language based on structures called “Hypergraphs” that specialized in …

More

Introduction to Data Science in Python

This is the first course of a series of five offered on Coursera by the University of Michigan. The specialization is called “Applied Data Science with Python”. It was the first course I took after I have completed the Data Science Specialization from the Johns Hopkins University on the same platform. The second was completely …

More

Tribute to Hans Rosling

Hans Rosling is one of the reasons why this blog exists. He defined himself as a “edutainer” and his way of explaining statistics to the public was simply mind blowing. I have been directed to the web site of his foundation, gapminder.org, by one of the courses I was following, I think it was “Analytic …

More

Working with SQLite in R

In the words of its creators, SQLite is a self-contained, high-reliability, embedded, full-featured, public-domain, SQL database engine. And apparently is the most used in the world. Libraries exist for interfacing R with SLQLite, the minimum requirement being DBI (A Common Database Interface) and RSQLite (SQLite interface for R). The keyword here is “embedded”. You do …

More

Data Science Specialization it is!

I just went through the experience of completing the Johns Hopkins University Data Science Specialization on Coursera. The last course of this specialization was the Capstone project, which consists basically in learning about a new subject, Natural Language Processing (or NLP in short) and producing a Shiny application hosted on Shinyapps.io that predicts the next word a …

More

R Rants…

This JH Data Science capstone project is transforming into a nightmare, especially because of R and tm, which do not do what they are supposed to do. True I have changed architecture and PC in the middle, but this is not the problem.  R and R packages like tm evolve fast and sometimes too fast …

More