Data Catalog in Practice with Mark Grover

A data catalog provides an index into the data sets and schemas of a company. Data teams are growing in size, and more companies than ever have a data team, so the market for data catalog is larger than ever.

Mark is the CEO of Stemma and the co-creator of Amundsen, a data catalog that came out of Lyft. We have previously explored the basics of Amundsen. In today’s episode, Mark Grover returns to the show to talk about the art and science of data catalogs.

Sponsorship inquiries: sponsor@softwareengineeringdaily.com

 

Transcript

Transcript provided by We Edit Podcasts. Software Engineering Daily listeners can go to weeditpodcasts.com to get 15% off the first three months of audio editing and transcription services with code: SED. Thanks to We Edit Podcasts for partnering with SE Daily. Please click here to view this show’s transcript. 


Sponsors

Data engineers struggling with unreliable data rely on Monte Carlo, the world’s first end-to-end, fully automated Data Observability Platform! Monte Carlo enables data teams with visibility into the quality and reliability of their analytical data to maximize business impact. Start trusting your data with Monte Carlo today! Visit softwareengineeringdaily.com/montecarlodata
to learn more.

At mParticle, we believe that better decisions start with better data. Cleanse, visualize, and connect your customer data from any source or system to any API. 

Better data, better decisions, better outcomes. 

Visit mparticle.com to learn how teams at Postmates, NBCUniversal, Spotify, and Airbnb use mParticle’s customer data infrastructure to accelerate their customer data strategies.

Capital One believes everyone deserves better banking. This means easier access to your money and more security. That’s why Capital One is investing in machine learning. Machine Learning allows Capital One to do things like Fight fraud with random forests. Identify how mobile app outages happen with casual models. Speed up online shopping with machine learning at the edge. The potential of machine learning is so big. See how Capital One is using machine learning to create the future of banking. Machine learning at Capital One. What’s in your wallet? Visit capitalone.com/ML

TotallyBio.io lists hundreds of jobs for top biotech companies.

Jobs in frontend and backend development, cloud computing, computer vision, machine learning, automation and more.

TotallyBio has jobs where you can help apply the power of CRISPR to read, write, and edit genomes.

Jobs where you’ll help develop AI platforms which use protein structures to discover new life-saving drugs.Or, jobs where you’ll harness massive biological data streams to cure aging.You can find the perfect job on TotallyBio.io, including remote or hybrid jobs.

If you’re hiring for your biotech company you can post ads right now for no cost on TotallyBio.io.

To learn more go to TotallyBio.io.

 

WorkOS is a developer platform to make your app enterprise-ready. With a few simple APIs, you can immediately add common enterprise features like Single Sign-On, SAML, SCIM user provisioning, and more. Developers will find beautiful docs and SDKs that make integration a breeze. WorkOS is kind of like “Stripe for enterprise features.” WorkOS powers apps like Webflow, Hopin, Vercel, and more than 100 others. The platform is rock solid, fully SOC-2 compliant, and ready for even the largest enterprise environments. So what are you waiting for? Integrate WorkOS today and make your app enterprise-ready. To learn more and get started, go to softwareengineeringdaily.com/workos

Software Daily

Software Daily

 
Subscribe to Software Daily, a curated newsletter featuring the best and newest from the software engineering community.