It's already sunset (credits)

Dear members. Once again a late Friday edition. I was travelling this week and I slept too much. But not more excuses, below your Data News edition.

Data fundraising 💰

What a crazy period we live in. Every open-source technology launch a cloud based offering of their tool expecting money to finance development. Is it really sustainable?

A bit of data engineering

I do not share a lot what I do as a data engineer outside of this newsletter. Even if this is probably for a dedicated post I think today I'll do a category about the data engineer's life. At the moment I'm working on two projects that are migrations. For the first project I migrate from a 12 years old custom made analytical application to a new one made within Apache Superset.

I also feel that a lot of the projects I've worked on as a data engineer were migrations. Sometimes small migrations like changing a data pipelines, sometime larger one like migrating a warehouse technology or an orchestration tool.

Migrations fuel data engineering work today and Ben depicts it greatly in his new post Realities of being a data engineer — Migrations. As Ben said we have different kind of migrations : operations systems, hardware, cloud, analytics or data. Every migration obviously brings a risk and that's why we do a preparatory work to mitigate risk. But even with a good experience we can't plan the unexpected and deadlines will slide.

Later in the post Ben propose a 5-steps framework every migration should follow:

👉 Read Ben's article

After all the different migrations I've done and read I think one of the advice I can give you is to invest in developing custom tools to follow and help the migration. For instance if you have to migrate 200 SQL queries from Postgres to BigQuery, develop a dashboards that gives the progression of the migration and provide automated scripts to dumbly do it. Migration application is boring, gamify it.

To illustrate this post Ronnie from Airbnb described how they upgraded their data warehouse infrastructure. Migrating from Hive to Spark3 + Iceberg.

Data migration (credits)

ML Friday 🤖

A personalized homepage (credits)

Fast News ⚡️


See you next week ❤️.