AVRO-4070: Optimize Check Max Collection Length for New Collections #3195

Merged (1 commit) on Oct 8, 2024


belugabehr
Contributor

What is the purpose of the change

  • This pull request improves file read performance by skipping needless checks, fixing AVRO-4070.
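As a rough illustration of what "skipping needless checks" could look like for a brand-new collection, here is a minimal sketch. It is not the actual diff from this pull request: the class name, method names, exception types, and the limit value are all assumptions made for the example. The idea shown is that when no elements exist yet, the incoming block size can be compared against the limit directly, with no addition and no overflow check.

```java
// Hypothetical sketch; names and the limit value are assumptions, not Avro's actual code.
final class CollectionLimitSketch {
    // Assumed upper bound on collection size (illustrative value only).
    static final long MAX_LENGTH = Integer.MAX_VALUE - 8L;

    /** Validates adding {@code items} elements to a collection that already holds {@code existing}. */
    static int checkMaxCollectionLength(long existing, long items) {
        if (items < 0) {
            throw new IllegalArgumentException("Negative block length: " + items);
        }
        if (existing == 0) {
            // Fast path: a brand-new collection needs no addition or overflow check.
            return checkLimit(items);
        }
        long total;
        try {
            // Slow path: guard against long overflow before comparing with the limit.
            total = Math.addExact(existing, items);
        } catch (ArithmeticException e) {
            throw new IllegalStateException("Collection length overflows a long", e);
        }
        return checkLimit(total);
    }

    private static int checkLimit(long length) {
        if (length > MAX_LENGTH) {
            throw new IllegalStateException(
                "Collection length " + length + " exceeds limit " + MAX_LENGTH);
        }
        return (int) length;
    }
}
```

In this sketch, a reader decoding the first block of an array would call `checkMaxCollectionLength(0, blockSize)` and take the fast path. Only later blocks pay for the overflow-checked addition.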

Verifying this change

This change is a trivial rework / code cleanup without any test coverage.

@github-actions (bot) added the "Java" label (Pull Requests for Java binding) on Oct 2, 2024
@KalleOlaviNiemitalo
Contributor

Not objecting, but I'm curious if you have any benchmark numbers showing the effect of this change.

@belugabehr
Contributor Author

Hello,

I'm working through the project using different techniques. This code came to my attention while profiling the benchmark linked below, which is not a benchmark written specifically for this method.

Nothing crazy, but this area of the code consumed roughly 1.2% of total CPU time on my personal (read: dated) system. After the change it was roughly 1% of CPU time, so the savings were roughly 0.2 percentage points in this array-heavy workload.

Thanks.

https://softwaremill.com/data-serialization-tools-comparison-avro-vs-protobuf/

@belugabehr merged commit 9304cb7 into apache:main on Oct 8, 2024
8 checks passed
@belugabehr deleted the belugabehr/AVRO-4070 branch on October 8, 2024 23:33