Apache Arrow

Columnar Data: Apache Arrow and Parquet with Julien Le Dem and Jacques Nadeau

Column-oriented data storage allows us to access all of the entries in a database column quickly and efficiently. Columnar storage formats are mostly relevant today for performing large

Apache Arrow with Uwe Korn

In a typical data analytics system, a variety of technologies interact: HDFS for storing files, Spark for distributed machine learning, pandas for data analysis in