Skip to content

Docs: add exmaple of how to read parquet row groups in parallel#9396

Open
alamb wants to merge 2 commits intoapache:mainfrom
alamb:alamb/parallel_parquet_reading
Open

Docs: add exmaple of how to read parquet row groups in parallel#9396
alamb wants to merge 2 commits intoapache:mainfrom
alamb:alamb/parallel_parquet_reading

Conversation

@alamb
Copy link
Contributor

@alamb alamb commented Feb 11, 2026

Which issue does this PR close?

Rationale for this change

It is possible to read a parquet file in parallel today using the arrow-rs APIs (making an individual reader to read individual parts), however, it is not always obvious how to do so as @pmarks observes on #9381

What changes are included in this PR?

Add additional documentation explaining how to read files in parallel, along with a doc example

Here is an example of what it looks like rendered:

Screenshot 2026-02-11 at 5 33 02 PM

Are these changes tested?

By CI

Are there any user-facing changes?

more docs; No functional changes

@alamb alamb added the documentation Improvements or additions to documentation label Feb 11, 2026
@github-actions github-actions bot added the parquet Changes to the parquet crate label Feb 11, 2026
@alamb alamb marked this pull request as ready for review February 11, 2026 22:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation parquet Changes to the parquet crate

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Parallel Parquet Reading

1 participant