Big Data

Building a Big Data Pipeline With Airflow, Spark and Zeppelin

Featured Image: “black tunnel interior with white lights” by Jared Arango on Unsplash. This article was originally written by Mahdi Karabiben on Medium. Reposted with permission.

DataOps and the Data Platform

They say that data, if tortured enough, will confess to anything. Maybe this is the explanation behind how and why data became a buzzword by the end of the 90s when hard drive production

Data Warehouse with Christian Kleinerman

A data warehouse provides fast access to large data sets for analytics, data science, and dashboards. A data warehouse differs from a transactional database, because you often do not

Mapillary: Computer Vision Crowdsourcing with Peter Neubauer

Mapillary is a platform for gathering photos taken by smartphones and using that data to build a 3D model of the world. Mapillary’s model of the world includes labeled objects such as

Stream Processing at Uber with Danny Yuan

“Be aggressive in vision, but conservative in operation.” Uber is a transportation company with a high volume of temporal spatial data, constantly being collected from the devices of