Data Version Control with Dmitry Petrov

Code is version controlled through git, the version control system originally built to manage the Linux codebase. For decades, software has been developed using git for version control. More recently, data engineering has become an unavoidable facet of software development. It is reasonable to ask–why are we not version controlling our data?

Dmitry Petrov is the founder of Iterative.ai, a company for collaborating and version controlling data sets. Dmitry joins the show to talk about how data version control works, and Iterative.ai, the company he is building around dataset management and collaboration.

Sponsorship inquiries: sponsor@softwareengineeringdaily.com

Transcript

Transcript provided by We Edit Podcasts. Software Engineering Daily listeners can go to weeditpodcasts.com/sed to get 20% off the first two months of audio editing and transcription services. Thanks to We Edit Podcasts for partnering with SE Daily. Please click here to view this show’s transcript.


Sponsors

Code doesn’t always behave as you expect, and your team is wasting time trying to understand why. Empower your engineers with Rookout. Get the data you need from live systems instantly, and start shipping better software, faster. Visit rookout.com/sedaily today.

strongDM lets you manage and audit access to servers, databases, and Kubernetes clusters, no matter where your employees are. With strongDM, you can easily extend your identity provider to manage infrastructure access. You can automate onboarding, offboarding, and moving people within roles. strongDM. Manage and audit remote access to infrastructure. Start your free 14 day trial today at: strongdm.com/SEDaily

Teleport is open-source, written in Go, and is a drop-in replacement for OpenSSH. Plus, it has a native support for Kubernetes. Gravitational’s  Teleport provides identity-aware access using short-lived certificates with SSO, session recording, and other features that ensure compliance and audit requirements. Go give it a try by going to try.gravitational.com/sed, where there are links to downloads, documentation, and, of course, the GitHub repository.

Love the flexibility of open source observability tools like ELK, Grafana, and Jaeger but don’t have the time to scale and maintain them? With Logz.io, you can get the best of both worlds. Get the open source you love, at the scale you need. Sign up today for a 14-day free trial at Logz.io/sedaily, for your chance to receive your free Logz.io t-shirt.

Software Weekly

Software Weekly

Subscribe to Software Weekly, a curated weekly newsletter featuring the best and newest from the software engineering community.