Apache Parquet
Uber’s Big Data Platform: 100+ Petabytes with Minute Latency
This article was originally written by Reza Shiftehfar on Uber’s Engineering Blog. Reposted with permission from Uber Engineering. Uber is committed to delivering safer and more
Columnar Data: Apache Arrow and Parquet with Julien Le Dem and Jacques Nadeau
Column-oriented data storage allows us to access all of the entries in a database column quickly and efficiently. Columnar storage formats are mostly relevant today for performing large