Hadoop

Data Science at Spotify with Boxun Zhang

“I normally try to sit together or very close to a product team or engineering team. And by doing so, I get very close to the source of all kinds of challenging

Data Engineering with David Drummond and Austin Ouyang

“We want people to be able to pick up whatever tool it is and really push themselves to get something done with it in a short amount of time, because that’s ultimately what they need

Kudu with Todd Lipcon

“If you have an architecture where you’re periodically trying to dump from one system to the other and synchronize, you can simplify your life quite a bit by just putting

Netflix Genie with Tom Gianos

“Sometimes there’s a misconception that Genie is a job scheduling platform... Genie really represents our extraction layer, from what our computational resources are, to our end user

Replacing Hadoop with Joe Doliner

“There are a lot more people who have the problem that Hadoop solves than there are people using Hadoop.” Pachyderm is a containerized data analytics platform that seeks to