Pinecone: Vector Database with Edo Liberty

Vectors are the foundational mathematical building blocks of machine learning. ML models operate on vectors, so input data must first be transformed into a numeric representation known as a vector embedding. Since most data is not stored in vector form, an ML application must do significant work to transform data from different formats into a form that ML models can understand. This can be computationally intensive and hard to scale, especially for the high-dimensional vectors used in complex models.
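To make the idea of a vector embedding concrete, here is a minimal sketch in Python. It is an illustration only, not Pinecone's API or anything described in the episode: the `embed`, `cosine_similarity`, and `nearest` helpers are hypothetical names, and the toy bag-of-words embedding stands in for the learned, high-dimensional embeddings a real model would produce.

```python
import math
from collections import Counter


def embed(text, vocabulary):
    """Toy bag-of-words embedding: one dimension per vocabulary word.

    Assumption for illustration only; real systems use learned embeddings.
    """
    counts = Counter(text.lower().split())
    return [float(counts[word]) for word in vocabulary]


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat an all-zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def nearest(query_vector, indexed):
    """Brute-force search: return the (text, vector) pair closest to the query."""
    return max(indexed, key=lambda item: cosine_similarity(query_vector, item[1]))


documents = [
    "vector database for machine learning",
    "podcast about software engineering",
    "managed database for vector search",
]

# Build a shared vocabulary so every document maps to a vector of the same length.
vocabulary = sorted({word for doc in documents for word in doc.split()})
index = [(doc, embed(doc, vocabulary)) for doc in documents]

query = embed("vector search database", vocabulary)
best_doc, _ = nearest(query, index)
print(best_doc)
```

The brute-force scan compares the query against every stored vector, which is the part that becomes expensive as the number of vectors and their dimensionality grow.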

Pinecone is a managed database built specifically for working with vector data. Pinecone is serverless and API-driven, which means engineers and data scientists can focus on building their ML application or performing analysis without worrying about the underlying data infrastructure.

Edo Liberty is the founder and CEO of Pinecone. Prior to Pinecone, he led the creation of Amazon SageMaker at AWS. He joins the show today to talk about the fundamental importance of vectors in machine learning, how Pinecone built a vector-centric database, and why data infrastructure improvements are key to unlocking the next generation of AI applications.

Sponsorship inquiries: sponsor@softwareengineeringdaily.com

Transcript

Transcript provided by We Edit Podcasts. Software Engineering Daily listeners can go to weeditpodcasts.com/sed to get 20% off the first two months of audio editing and transcription services. Thanks to We Edit Podcasts for partnering with SE Daily.


Sponsors

strongDM lets you manage and audit access to servers, databases, and Kubernetes clusters, no matter where your employees are. With strongDM, you can easily extend your identity provider to manage infrastructure access. You can automate onboarding, offboarding, and moving people within roles. strongDM: manage and audit remote access to infrastructure. Start your free 14-day trial today at strongdm.com/SEDaily

O’Reilly is known for its animal books, which have helped tech professionals stay ahead for over forty years. Today, its online learning platform at oreilly.com takes learning tech to the next level. With live online sessions, your teams learn from the biggest brains in AI, software architecture, cloud, data, programming, and more. They can even prep for tech certification exams with official materials and interactive practice tests. It’s why 66% of all Fortune 100 companies give their teams O’Reilly online learning. Get a demo today at oreilly.com.

With Datadog Security Monitoring, engineering teams can easily detect malicious activity in real time before it affects their customers. Use out-of-the-box detection rules and detailed observability data in one unified platform to investigate security attacks. See it in action by signing up for a live security demo and receive a Datadog T-shirt by visiting https://softwareengineeringdaily.com/datadogsecurity

Modus Create is a global, totally remote consultancy, with 300 people from 5 continents and over 40 countries. There are roles open all across the SDLC, and you’ll get to work with companies and colleagues from across the world. Modus Create hires the top 1% of technical talent. Come join an incredible team: moduscreate.com/sedaily