Factfulness

This book, based on the life experience of Hans Rosling,  is about learning to have a fact-based vision of the  world. It is an eye-opener, as all the work done by Hans, his son Ola Rosling and his daughter in law Anna Rosling Rönnlund. After Hans passed away in February 2017, Ola and Anna continue …

More

Python Dash!

Let’s go to the point: Finally Python has what R used to have since a looong time ago with Shiny. This is nothing less than Python Dash, a Python framework for building  analytical applications (in the author’s own words). The library is based around Flask and Plotly, the installation is pretty easy and it is …

More

Evolution…

I have found out a couple of days ago that the evolution of the libraries in the Python ecosystem is FAST. It is not only fast, it is exceptionally fast. In my opinion some of these libraries are evolving together with the market and mainly for market reasons (one of these libraries is Pandas, part …

More

Invaluable resources…

It was a bit since last post. Recently I have been not following a very structured approach to courses or training materials, but I have rather attempted to consolidate what I have learned along these years. My daily job continues to be very far apart from Data Science and indeed it is going also far …

More

Jupyter Notebook on Android tablet

Update: The content of this article is no longer applicable for this tablet model. Termux and Jupyter Notebook still work, however Termux does no longer support (for a long time) Android 6.0 and older. The result is that crucial libraries like Matplotlib fail to install.  It is still usable but only for basic tasks.    …

More

SQLite and Python

I have already written an article about using a SQLite database from R.  I was playing recently with some data, scraped from a web site to text files, that I wanted to transform into a SQLite database. I decided to do it in python, using the sqlite3 module. After a few experiments, I noticed that …

More

Mining of Massive Datasets

The Lagunita platform offers another very interesting MOOC, Mining of Massive Datasets. The MOOC is self-paced and entitles those who manage to have answered at least 50% between the homework questions and the final exam. There are 14 homeworks, with a total of 62 multiple-choice questions. Together, they account for 50% of your grade.  The course …

More

An unusual distraction…

Now something different! Well, besides trying to learn as much as possible about Data Science, Machine Learning, Big Data etc, a few weeks ago I was brutally deviated to other subjects, by being asked the following question by my fourteen years old son: “Dad, can you teach me some programming”? At that point a number …

More

Statistical Learning

This resource is totally free, and consists in a course based on a book which is itself totally free and available. I got to know it while browsing  the  forums discussions on Coursera Data Science Discussions. Somebody in a discussion compared this course to the Machine Learning Course by Andrew Ng, and added that this …

More