Python
Modin: Pandas Scalability with Devin Petersohn

Pandas is a Python data analysis library, and an essential tool in data science. Pandas allows users to load large quantities of data into a data structure called a dataframe, over which
Dask: Scalable Python with Matthew Rocklin

Python is the most widely used language for data science, and there are several libraries that are commonly used by Python data scientists including Numpy, Pandas, and scikit-learn.
Talking Python with Michael Kennedy

Python is one of the most popular programming languages in the software world. After working with Python and developing a love for the language, Michael Kennedy started to wonder why
Notebooks at Netflix with Matthew Seal

Netflix has petabytes of data and thousands of workloads running across that data every day. These workloads generate movie recommendations for users, create dashboards for data analysts
Scaling Machine Learning at Uber with Michelangelo

This post was originally written by Jeremy Hermann & Mike Del Balso on Uber Engineering. Reposted with permission. In September 2017, we published an article