add buffered data_pages to parquet column writer total bytes estimation #6862

Merged (2 commits) on Dec 11, 2024


onursatici
Contributor

Which issue does this PR close?

Closes #6839.

Rationale for this change

Per #6839 (comment), the parquet column writer does not account for buffered data pages when reporting its estimated memory size.
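To illustrate the gap described above, here is a minimal sketch, not the parquet crate's actual code. `SketchColumnWriter`, `BufferedPage`, and every field and method name are hypothetical stand-ins. It assumes a writer that tracks three things: bytes already written out, completed pages still held in memory, and bytes held by the encoder for the page in progress. An estimate that skips the buffered pages under-reports by exactly their combined size.

```rust
use std::collections::VecDeque;

/// Hypothetical stand-in for a completed data page still held in memory.
struct BufferedPage {
    data: Vec<u8>,
}

/// Hypothetical, simplified column writer (not the parquet crate's real type).
struct SketchColumnWriter {
    /// Bytes already flushed to the underlying sink.
    bytes_written: u64,
    /// Completed pages that are buffered and not yet flushed.
    buffered_pages: VecDeque<BufferedPage>,
    /// Bytes held by the encoder for the page currently being built.
    encoder_bytes: u64,
}

impl SketchColumnWriter {
    /// Estimate that ignores buffered pages (the behaviour being reported).
    fn estimate_without_buffered_pages(&self) -> u64 {
        self.bytes_written + self.encoder_bytes
    }

    /// Estimate that also counts the bytes of every buffered page.
    fn estimated_total_bytes(&self) -> u64 {
        let buffered: u64 = self
            .buffered_pages
            .iter()
            .map(|page| page.data.len() as u64)
            .sum();
        self.bytes_written + buffered + self.encoder_bytes
    }
}
```

With two buffered pages of 40 and 60 bytes, the two estimates differ by 100 bytes, which is the amount a caller sizing row groups from the first estimate would miss.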

What changes are included in this PR?

Are there any user-facing changes?

github-actions bot added the parquet label (Changes to the parquet crate) on Dec 9, 2024
onursatici changed the title from "add buffered data_pages to parquet column writer memory size estimation" to "add buffered data_pages to parquet column writer total bytes estimation" on Dec 10, 2024
@tustvold
Contributor

Thank you, looks good to me

@tustvold tustvold merged commit 06a0157 into apache:main Dec 11, 2024
17 checks passed
@alamb
Contributor

alamb commented Dec 11, 2024

Thank you @onursatici and @tustvold 🙏

Successfully merging this pull request may close these issues.

parquet arrow writer doesn't track memory size correctly for fixed sized lists