-
Notifications
You must be signed in to change notification settings - Fork 3.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Change tcp_nodelay config to be true #4986
Conversation
🎉 All Contributor License Agreements have been signed. Ready to merge. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
PR Overview
This PR updates the default configuration for the TCP_NODELAY setting to improve latency when batching with linger.ms. The changes include:
- Updated the changelog to indicate that the Nagle algorithm is disabled (TCP_NODELAY enabled) by default.
- Modified the configuration documentation to change the default value of socket.nagle.disable from false to true.
Reviewed Changes
File | Description |
---|---|
CHANGELOG.md | Updated changelog entries to reflect the new default setting for TCP_NODELAY. |
CONFIGURATION.md | Changed the default value for socket.nagle.disable from false to true. |
Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.
I fully support this change. I was surprised to discover that the current default setting utilizes Nagle's algorithm. I only became aware of this while investigating some unexpected latency spikes in our benchmarks, which were resolved upon disabling Nagle's algorithm ( |
28d6fa2
to
37a9629
Compare
37a9629
to
a98f494
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @milindl ! We decided to ship it as a patch and not in a major release because it's only giving improvements if QUICKACK isn't enabled on the other side, and this last one is a flag that needs to be reapplied after every read to the socket. Even if any broker implementation was setting QUICKACK after every read and this change was causing increased acks, a small increase in network traffic would be better that an increase in latency (and it would be already be happening with a Java client).
Currently, we use Nagle's algorithm, which doesn't make much sense as we're already implementing batching behaviour with linger.ms. In cases the user wants to optimize the latency and sets linger.ms = 0, it will still be delayed, which is undesirable/buggy behaviour.