Docs: clarify when the parquet reader will read from object store when using cached metadata #10909

alamb · 2024-06-14T00:52:19Z

Which issue does this PR close?

Rationale for this change

While working on #10701, it was quite unclear to me why the parquet reader was making a second object store request even though I had passed it pre-existing ParquetMetadata.

It turns out this was because the cached ParquetMetadata didn't have the page index structures loaded, so the parquet exec loads them on demand if required.
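To make the behavior above concrete, here is a minimal toy model in Rust. Every name in it (`CountingStore`, `CachedMetadata`, `PageIndex`, `open_with_cached`) is hypothetical and invented for illustration; none of it is the DataFusion or parquet crate API. It only sketches the idea that cached metadata without a page index can trigger one extra request when the page index is needed.

```rust
// Toy model only: hypothetical types, NOT the DataFusion / parquet crate API.

/// Stand-in for the page index structures.
#[derive(Clone, Debug, PartialEq)]
struct PageIndex;

/// Stand-in for cached parquet metadata. `page_index` is `None` when only
/// the footer was loaded and cached.
#[derive(Clone, Debug)]
struct CachedMetadata {
    page_index: Option<PageIndex>,
}

/// Stand-in for an object store that counts the requests it serves.
#[derive(Default)]
struct CountingStore {
    requests: usize,
}

impl CountingStore {
    /// Pretend to read the footer metadata: one request, no page index.
    fn fetch_footer(&mut self) -> CachedMetadata {
        self.requests += 1;
        CachedMetadata { page_index: None }
    }

    /// Pretend to read the page index structures: one more request.
    fn fetch_page_index(&mut self) -> PageIndex {
        self.requests += 1;
        PageIndex
    }
}

/// Open a file using cached metadata. The store is contacted only when the
/// page index is needed and the cached metadata does not already contain it.
fn open_with_cached(
    store: &mut CountingStore,
    mut meta: CachedMetadata,
    needs_page_index: bool,
) -> CachedMetadata {
    if needs_page_index && meta.page_index.is_none() {
        meta.page_index = Some(store.fetch_page_index());
    }
    meta
}
```

In this model, passing metadata that already carries the page index, or opening a file when the page index is not needed, results in no further requests to the store.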

What changes are included in this PR?

Add documentation

Note: I documented this in arrow-rs too: apache/arrow-rs#5887

Are these changes tested?

Are there any user-facing changes?

comphead

lgtm thanks @alamb

Docs: clarify when the reader will read from object store when using …

4bf1b91

…cached metadata

alamb changed the title ~~Docs: clarify when the reader will read from object store when using cached metadata~~ Docs: clarify when the parquet reader will read from object store when using cached metadata Jun 14, 2024

comphead approved these changes Jun 14, 2024

comphead merged commit 8f76ac5 into apache:main Jun 14, 2024
23 of 25 checks passed

findepi pushed a commit to findepi/datafusion that referenced this pull request Jul 16, 2024

Docs: clarify when the reader will read from object store when using …

c1fc79e

…cached metadata (apache#10909)
