Data with Ben Lorica
Ben Lorica is the chief data scientist at O’Reilly Media and the program director of the Strata Data Conference. In his work, Ben spends time with people across the software industry, giving him broad perspective.
In the early days of the data engineering ecosystem, the Hadoop vendor wars were starting between Cloudera and Hortonworks. Strata was a neutral ground for practitioners and open source contributors to meet and share ideas about the Hadoop ecosystem. Since then, the conference has grown to encompass topics such as data science, distributed databases, streaming frameworks, and machine learning.
There are many open questions in the data world right now. What is the best path that an enterprise can take to build out a data platform? How should a software team be arranged to efficiently build machine learning models? Which distributed streaming frameworks should I use for what purpose?
Ben joins the show to discuss modern data engineering, data science, and infrastructure.
Transcript provided by We Edit Podcasts. Software Engineering Daily listeners can go to weeditpodcasts.com/sed to get 20% off the first two months of audio editing and transcription services. Thanks to We Edit Podcasts for partnering with SE Daily. Please click here to view this show’s transcript.
G2i is a hiring platform run by engineers that matches you with React, React Native, GraphQL, and mobile engineers who you can trust. Whether you are a new company building your first product or an established company that wants additional engineering help, G2i has the talent you need to accomplish your goals. Go to softwareengineeringdaily.com/g2i
At Manning, we’ve spent over 20 years helping developers become the best that they can be. Whether it’s picking up a new programming language, or discovering a framework, there’s something for you. That’s why we’re offering an exclusive 40% off our entire catalog, including the selections shown at softwareengineeringdaily.com/manning! Just use the code softwaredaily40 when you checkout to save 40%. Or when you click the Add to Cart button from softwareengineeringdaily.com/manning we’ll enter the coupon code for you automatically.
Netlify is a modern way to build and manage fast, modern websites that run without the need for addressable web servers. Netlify is “serverless.” Automatic forms, identity management, and tools to manage and transform large images and media. Learn more about Netlify’s powerful platform at netlify.com/sedaily.
GoCD is a continuous delivery tool created by ThoughtWorks. It’s great to see the continued progress on GoCD with the new Kubernetes integrations–and you can check it out for yourself at gocd.org/sedaily.