Cloudera
StreamSets: DataOps and Smart Pipelines with Arvind Prabhakar
![](https://i0.wp.com/softwareengineeringdaily.com/wp-content/uploads/2021/06/Streamsets.png?resize=269%2C151&ssl=1)
The company StreamSets is enabling DataOps practices in today’s enterprises. StreamSets is a data engineering platform designed to help engineers design, deploy, and operate smart data
Federated Learning with Mike Lee Williams
![](https://i0.wp.com/softwareengineeringdaily.com/wp-content/uploads/2020/10/FederatedLearning.jpg?resize=269%2C151&ssl=1)
Federated learning is machine learning without a centralized data source. Federated Learning enables mobile phones or edge servers to collaboratively learn a shared prediction model
Competition in the Open Source Ecosystem
![](https://i0.wp.com/softwareengineeringdaily.com/wp-content/uploads/2016/02/Contributions2011.png?resize=269%2C151&ssl=1)
From Eric Sammer’s answer via Quora: At Cloudera (company) we regularly work on open source code right along side our competitors. I tend to joke that the engineers at our competitors
Kudu with Todd Lipcon
![](https://i0.wp.com/softwareengineeringdaily.com/wp-content/uploads/2015/10/kudu.png?resize=269%2C151&ssl=1)
“If you have an architecture where you’re trying to periodically trying to dump from one system to the other and synchronize, you can simplify your life quite a bit by just putting
Replacing Hadoop with Joe Doliner
![](https://i0.wp.com/softwareengineeringdaily.com/wp-content/uploads/2015/10/pachyderm.jpg?resize=269%2C151&ssl=1)
“There are a lot more people who have the problem that Hadoop solves than there are people using Hadoop.”
Pachyderm is a containerized data analytics platform that seeks to