Skip to content

Data News — Week 18

Christophe Blefari
Christophe Blefari
2 min read

Hi everyone, first post regarding news digest. This week is focused on data detection tools.

csv-detective

csv-detective is a lightweight open-source tool doing type inferences. Worth using it if you want to detect stuff around french address system.

etalab/csv-detective
CSV inspection. Contribute to etalab/csv-detective development by creating an account on GitHub.

pandas-profiling

I didn't know before pandas profiling. The project has 7k stars on GitHub and in the future I'll use it on all my dataset. It generates a HTML UI describing everything you have

pandas-profiling/pandas-profiling
Create HTML profiling reports from pandas DataFrame objects - pandas-profiling/pandas-profiling

Data cleaning principles

A small talk from the csv,conf,v6 where Karl Broman explains his data cleaning framework. 20 points worth checking.

Data cleaning principles

Data quality - rethink previsualisation

(Sorry for the french). Here the french administration tried to rethink how to visualise the data on the french open-data platform.

Qualité des données : repenser la prévisualisation des données - data.gouv.fr
D’avril à juin c’est le printemps de data.gouv.fr : chaque semaine nous partageons nos réflexions, des annonces concrètes ou encore des événements et quelques surprises !
Data News

Data Explorer

The hub to explore Data News links

Search and bookmark more than 2500 links

Explore

Christophe Blefari

Staff Data Engineer. I like 🚲, 🪴 and 🎮. I can do everything with data, just ask.

Comments


Related Posts

Members Public

Data News — Week 24.16

Data News #24.16 — Llama the Third, Mistral probable $5B valuation, structured Gen AI, principal engineers, big data scale to count billions and benchmarks.

Members Public

Data News — Week 24.15

Data News #24.15 — MDSFest quick recap, LLM news, Airbnb Chronon, AST, Beam YAML, WAP and more.