PyCon AU 2024

Harmeet Sokhi

I'm Harmeet, a lead data engineer at Thoughtworks who's been lucky enough to work with some incredibly smart people in building data and ML platforms & products . Over the years, I've had the opportunity to lead teams in building data and ML platforms, products, and work on organisational transformations including evolving data function operating models to scale. I’ve dabbled in various industries, including energy, accounting, airlines, retail and many more , helping teams mature their data and ML capabilities.
Beyond my professional life, I'm a co-host of the data engineering Melbourne meetup and also had the opportunity to tech review an O'Reilly book 'Effective Machine Learning Teams'.


Session

11-24
11:15
30min
Rethinking Data Catalogs: The Promise and Pitfalls
Harmeet Sokhi, Vishal Srivastava

The promise of data catalogs, a single source of truth for your organisation's data, often clashes with the reality of under-utilised features, redundancy across various data catalog solutions across teams, adoption challenges and a lack of clear strategy.

This talk will pose some critical questions concerning current approaches of choosing and implementing data catalogs:

  • Do you actually need a Data catalog?
  • Are data catalogs becoming just glorified registries without much practical use?
  • Why do organisations find themselves juggling multiple catalogs?
  • Is there a synergy between System Catalog & Data Catalog?
  • How do you identify the right fit and what are the considerations?
  • How to measure success for a data catalog?

We'll dissect the reasons behind these challenges and share our experience of implementing data catalogs across different organisations.

Main Conference
Eureka 3