Aquarium: Dataset Quality Improvement with Peter Gao

Machine learning models are only as good as the datasets they’re trained on. Aquarium is a system that helps machine learning teams build better models by improving the quality of their datasets. Model improvements often come from curating high-quality datasets, and Aquarium helps make that a reality. Peter Gao works on Aquarium, and he joins the show to talk through modern machine learning and the role of Aquarium.

Sponsorship inquiries: sponsor@softwareengineeringdaily.com

Transcript

Transcript provided by We Edit Podcasts. Software Engineering Daily listeners can go to weeditpodcasts.com/sed to get 20% off the first two months of audio editing and transcription services. Thanks to We Edit Podcasts for partnering with SE Daily.


Sponsors

Since 2006, TimeXtender has been helping companies build modern data estates with their low-code/no-code data management and automation platform. TimeXtender is software that can be deployed on-premises or in the cloud to capture and organize the business logic required to build and operate the modern data estate. SE Daily customers will get unlimited access to TimeXtender’s Solution Specialists and a free two-day Proof of Concept at timextender.com/sedaily.

Couchbase is a SQL-friendly multi-cloud to edge NoSQL database architected on top of an open source foundation. Register now for Couchbase Connect.ONLINE on October 14-16. Choose from 100+ tech sessions on cloud, full-text search, analytics, NoSQL query & indexing, mobile, Kubernetes, and more. Complete challenges, win cool prizes, all free. Secure your spot at couchbase.com/SEDaily

Triplebyte is a network of 200,000+ top engineers. Triplebyte works with more than 400 tech companies including Coinbase, Zoox, Dropbox, and Facebook. Triplebyte is focused on matching high-quality engineers with great jobs. Let the right roles come to you. Want to know your strengths? Take the Triplebyte quiz and receive your personalized feedback report. Tracks offered: Generalist, Front End Mobile, Machine Learning, DevOps, Data Science, and Entry Level. Visit triplebyte.com/sedaily.

From their recent report on serverless adoption and trends, Datadog found half of their customer base using EC2s have now adopted AWS Lambda. You can easily monitor all your serverless functions in one place and generate serverless metrics straight from Datadog. Check it out yourself by signing up for a free 14-day trial and get a free t-shirt at softwareengineeringdaily.com/datadog

Software Daily

Subscribe to Software Daily, a curated newsletter featuring the best and newest from the software engineering community.