Encodings — GeistHaus

Spartan Blog - Jerónimo Jerónimo López Jan 16, 2025

Parquet supports multiple compression algorithms. In the article we analyze and measure GZIP, LZ4, Snappy, ZSTD and LZO.

0 inbound links article en

Query Engines: Gatekeepers of the Parquet File Format

DuckDB Laurens Kuiper Jan 22, 2025

Mainstream query engines do not support reading newer Parquet encodings, forcing systems like DuckDB to default to writing older encodings, thereby sacrificing compression.

2 inbound links article en

A Deep Dive into Apache Parquet with ClickHouse - Part 2

ClickHouse Dale McDiarmid Apr 27, 2023

Read about the internals of the Parquet format and how the ClickHouse integration exploits these structures, with some recent improvements providing speed and usability improvements.

1 inbound link article en

A Deep Dive into Apache Parquet with ClickHouse - Part 1

ClickHouse Dale McDiarmid Apr 17, 2023

Learn out about how to query and write Apache Parquet files in the first post of our series on the popular data exchange format

1 inbound link article en Apache ParquetParquet

Querying Parquet with Millisecond Latency

Tustvold And Alamb Tustvold; Alamb Dec 26, 2022

Querying Parquet with Millisecond Latency Note: this article was originally published on the InfluxData Blog. We believe that querying data in Apache Parquet files directly can achieve similar or better storage efficiency and query performance than most specialized file formats. While it requires significant engineering effort, the benefits of Parquet's open format and broad ecosystem support make it the obvious choice for a wide class of data systems. In this article we explain several advanced techniques needed to query data stored in the Parquet format quickly that we implemented in the Apache Arrow Rust Parquet reader. Together these techniques make…

4 inbound links article en

Querying Parquet with Millisecond Latency | InfluxData

InfluxData Raphael Taylor-Davies; Andrew Lamb Dec 7, 2022

In this article we explain several advanced techniques needed to query data stored in the Parquet format quickly that we implemented in the Apache Arrow Rust Parquet reader.

3 inbound links article en

Efficient Filter Pushdown in Parquet – Xiangpeng’s blog

Xiangpeng’s blog Mar 12, 2025

How to implement efficient filter pushdown in Parquet readers and why it’s challenging in practice.

0 inbound links en

Efficient Filter Pushdown in Parquet – Xiangpeng’s blog

Xiangpeng’s blog Mar 12, 2025

How to implement efficient filter pushdown in Parquet readers and why it’s challenging in practice.

1 inbound link en