Navigating through the numbers (credits)

Halo, a lot of content has been published this week with the Coalesce and I kept a lot of articles from the last week that I needed to navigate through this quantity to produce this edition. I'm not that proud of the format but it's ok.

As a side node I'm gonna do the 30-day map challenge in November. So if you do it or if you want to do it say hi.

Women in Data — part 2 👩‍💻

Second part of the summary of the Women in Data meetup we organized 2 weeks ago. In the second round table the discussions were about the parity in the data ecosystem.

What can we collectively do to achieve parity in data ecosystems? 💪

Several answers and ideas were proposed by the speakers. Let's dive-in by topics.

That's all for this Women in Data meetup. I hope I've transcript the discussion with the right words and intention. I might have misinterpreted some chats and if it's the case I'm sorry.

My last point on this topic, let's not forget we talk about diversity, so this is not only about man and women, there is more to be diverse and inclusive.

dbt Coalesce 2022

dbt Coalesce took place this week, this is the annual 4-days conference organised by dbt Labs. While all data influencers were there to meet and chat about the next trends of the analytics industry a few announcements were made.

dbt Labs took the time to announce the Semantic Layer. While others call it the metrics layer or feature store in the data science space. We'll see a lot of buzz around this unique layer to access metrics in 2023. dbt Labs will push forward this architecture, in search for revenue growth. They will add this as a product in their cloud offering—with a Proxy SQL and a Metadata API.

If you want to see on how the semantic layer can be use Hex demoed it. You can also see this semantic rise up from the BI perspective with the Semantic BI. In this new world everyone wants to see the issues from his perspective, which is annoying for users but fun as an outsider 🙃.

I'll dedicate a full post with my highlights of the conference early next week after watching all the replays.

The metrics layer (credits)

Data contracts 👻

Even if I try not to fall in the hype stuff to give a higher view on trends when I see data contracts everywhere I have to still share it. In a nutshell data contracts are contractualized interfaces between data producers and consumers. The most common pattern seems to be an API—http, file, event, table, etc.—between software engineers and the data team with a way to enforce the contract. We call this schema for ages.

I'm convinced for a long time that data contracts is not a data problem but an IT problem. If the whole tech team is not aligned on the way data changes should be managed you'll fix only a small part of the problem. Petr greatly wrote about the way we draw lines. What belongs where?

Data contracts aligned around business areas (domains) rather than technology layers. Contracts are technology-agnostic and can live anywhere inside the Data Platform.

Andrew and Daniel respectively wrote their own way of seeing data contract implementation. Andrew at GoCardless and Daniel by himself.

Fast News ⚡️

Data engineers are superheros (credits)

Data Fundraising 💰

This week a lot a few data satellite companies raised money. When I say satellite I mean companies that are not really related to data field, but they put data at the centre of their product.


See you next week ❤️.