Hi everyone, this is the first post of the news digest. This week focuses on data detection tools.
csv-detective is a lightweight open-source tool that does type inference on CSV files. It's worth using if you want to detect fields related to the French address system.
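To give an idea of what type inference on a CSV means, here is a minimal sketch using only the Python standard library. It is not csv-detective's API or its algorithm: the function names (`infer_type`, `infer_column_types`) and the sample data are made up for illustration, and it only knows about integers, floats and ISO dates.

```python
import csv
import io
import re
from datetime import datetime

# Hypothetical sample data, only here to illustrate the idea.
SAMPLE = """city,postal_code,population,updated
Paris,75001,2.1,2021-05-01
Lyon,69001,0.5,2021-05-02
"""

INT_RE = re.compile(r"[+-]?\d+")
FLOAT_RE = re.compile(r"[+-]?\d+(\.\d+)?")


def _is_int(value):
    return INT_RE.fullmatch(value) is not None


def _is_float(value):
    return FLOAT_RE.fullmatch(value) is not None


def _is_iso_date(value):
    try:
        datetime.strptime(value, "%Y-%m-%d")
        return True
    except ValueError:
        return False


def infer_type(values):
    """Return the first type name that fits every non-empty value."""
    non_empty = [v.strip() for v in values if v.strip()]
    if not non_empty:
        return "empty"
    # Checks go from the most specific type to the most general one.
    for name, check in (("int", _is_int), ("float", _is_float), ("date", _is_iso_date)):
        if all(check(v) for v in non_empty):
            return name
    return "string"


def infer_column_types(csv_text):
    """Map each column name of a CSV string to its inferred type."""
    reader = csv.DictReader(io.StringIO(csv_text))
    columns = {name: [] for name in reader.fieldnames}
    for row in reader:
        for name in columns:
            # Short rows yield None for missing cells; treat them as empty.
            columns[name].append(row[name] or "")
    return {name: infer_type(values) for name, values in columns.items()}


if __name__ == "__main__":
    print(infer_column_types(SAMPLE))
```

Note that this naive version labels `postal_code` as an integer, which is exactly the kind of case where a domain-aware detector is more useful than generic type checks.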
I hadn't come across pandas profiling before. The project has 7k stars on GitHub, and from now on I'll use it on all my datasets. It generates an HTML UI describing everything in your dataset.
Data cleaning principles
A short talk from csv,conf,v6 in which Karl Broman explains his data cleaning framework. 20 points worth checking.
Data quality - rethink previsualisation
(Sorry, this one is in French.) Here the French administration tried to rethink how data is visualised on the French open-data platform.