This blog section contains small data science projects in form of posts to practice. The goal is to force myself to learn something new by sharing some work that help me to realize I don’t know something as well as I thought I did (Robinson and Nolis 2020). A common pattern on how these posts are structured are the tools used in a data science projects (Grolemund and Wickham 2017) (See Figure 1).
Figure 1: A model of the tools needed in a typical data science project
In addition, these posts are grouped by the sections Projects, Books and Quick fix based on their length. The motivation of choice to publish these posts is sharing what I learnt, so I aim for operative solutions rather than optimal solutions. Moreove, I heard from Marcel Reinmuth, Emily Robinson and Marinus (Marien) de Bakker that Perfection is an endless chase; The perfect is the enemy of the good ; Doing enough is the winning strategy (related to fit for business purpose see Figure 2).
Figure 2: Visualization of Fit for business purpose where doing enough is the winning strategy from Marinus (Marien) de Bakker at the MSc GIMA PM
From the technical design, it is possible to filter the posts based on the technology used or to the applied field. The main technology are programming languages such as PostgreSQL (Figure 3) and its spatial extension PostGIS, R and Python, while the applied fields are Geographic Information System (GIS), Project Management (PM) or Environmental Sciences (ENV).
(a) PostgreSQL-PostGIS
(b) R
(c) Python
(d) QGIS
Figure 3: Main technical framework shown in the filters named as categories