Bug 5056: assertion failed: Read.cc:61: Comm::IsConnOpen(conn) #253

eduard-bagdasaryan · 2024-05-20T15:00:39Z

FATAL: assertion failed: Read.cc:61: "Comm::IsConnOpen(conn)"

comm_read() was called on a closed connection because it was initiated
by tunnelDelayedServerRead() event triggered after the connection
closure.
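The failure mode described above can be sketched in isolation. The snippet below is a minimal model, not Squid code: `SketchConn`, `SketchEventQueue`, and `delayedRead()` are hypothetical stand-ins for a connection, the event queue, and the delayed-read callback. It only illustrates that an event scheduled while the connection is open may fire after closure, so the callback has to re-check the connection state before starting a read.

```cpp
#include <cassert>
#include <functional>
#include <queue>
#include <utility>

// Hypothetical stand-in for a connection; not Squid's Comm::Connection.
struct SketchConn {
    bool open = true;
};

// Hypothetical stand-in for a queue of delayed events.
struct SketchEventQueue {
    std::queue<std::function<void()>> pending;

    void schedule(std::function<void()> cb) { pending.push(std::move(cb)); }

    // Fire all scheduled events in FIFO order.
    void runAll() {
        while (!pending.empty()) {
            auto cb = std::move(pending.front());
            pending.pop();
            cb();
        }
    }
};

// Returns true if a read was started, false if the event fired too late.
// Without the open check, the read would start on a closed connection,
// which is the situation the Read.cc:61 assertion rejects.
inline bool delayedRead(const SketchConn &conn, int &readsStarted) {
    if (!conn.open)
        return false;
    ++readsStarted;
    return true;
}
```

In this model, scheduling `delayedRead()`, closing the connection, and only then running the queue results in no read being started.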

eduard-bagdasaryan · 2024-05-21T14:00:51Z

src/tunnel.cc

@@ -898,6 +898,17 @@ void
 TunnelStateData::copyRead(Connection &from, IOCB *completion)
 {
    assert(from.len == 0);


I checked that this assertion is valid: copyRead() is called via writeServerDone() -> copyClientBytes(), and client.dataSent() in writeServerDone() resets this field. Also for the initial case (when copyRead() is not called via writeServerDone()), copyClientBytes() takes care of the 'pre read data' in a different execution path, and from.len is not checked in this case. So I think the code below can simply close the 'other' connection, we don't need finishWritingAndDelete() with its "waiting to finish writing" logic there.

Zero from.len implies that we are not currently writing to toConn.

We can infer that: from.len becomes non-zero when we have read some new data and are about to write it (Connection::bytesIn() in TunnelStateData::readClient()). It becomes zero again when we have written all available data (Connection::dataSent() asserts that) and are about to read again (our TunnelStateData::copyRead()). So len==0 here means that we have written all available data (if any) and do not have new data to write yet. When we observe at this point that the client is gone (no new data), we simply close the opposite connection.
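The read/write cycle described in this reply can be modeled with a small sketch. `SketchDirection` is a hypothetical, simplified stand-in for one tunnel direction, and its `bytesIn()` and `dataSent()` only mimic the behavior described above (the real methods may do more). The point is just that `len` is non-zero between a read and the matching write, and zero again by the time the next read is requested.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical model of one tunnel direction; not Squid's
// TunnelStateData::Connection.
struct SketchDirection {
    std::size_t len = 0; // bytes read from this side, not yet written out

    // Called after a successful read of n bytes.
    void bytesIn(std::size_t n) { len += n; }

    // Called after a successful write; assumes (as described in the
    // comment above) that everything buffered was written at once.
    void dataSent(std::size_t n) {
        assert(n == len);
        len = 0;
    }

    // The condition that copyRead() asserts before reading again.
    bool readyToRead() const { return len == 0; }
};
```

Under this model, a direction with `readyToRead() == true` has nothing pending to write, which is the premise behind closing the opposite connection directly.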

rousskov · 2024-05-21T15:05:12Z

src/tunnel.cc

@@ -898,6 +898,17 @@ void
 TunnelStateData::copyRead(Connection &from, IOCB *completion)
 {
    assert(from.len == 0);


I checked that this assertion is valid: copyRead() is called via ...

AFAICT, you have checked that this assertion is not going to fail in current code. I see several red flags in the description of those checks, but let's assume that it does not fail indeed. That is great, but it does not help me understand how new code (placed below this assertion) relies on the asserted condition...

... [complex reasoning about from.len value and possibly other conditions] ... So I think the code below can simply close the 'other' connection, we don't need finishWritingAndDelete() with its "waiting to finish writing" logic there.

I am having trouble connecting the dots here. Would the following statement be an accurate representation of proposed/new code logic?

Zero from.len implies that we are not currently writing to toConn. Thus, finishWritingAndDelete(toConn) is unnecessary and toConn->close() is sufficient to correctly end transaction and destroy TunnelStateData.

N.B. Official comm_read() calling code below the assertion relies on the asserted condition because it uses a pointer to the beginning of from.buf buffer. The number of currently buffered bytes must be zero for comm_read() to safely fill that area of the buffer. From that code's point of view, the assertion is correct (i.e. it checks the correct invariant that the code below the assertion relies on) even if it fails (due to Squid bugs elsewhere)!
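The N.B. above can be illustrated with a toy buffer. This is a sketch under stated assumptions: `SketchBuffer` and `readInto()` are hypothetical and only mimic a read that always fills the buffer from its first byte. In such a design, any still-buffered bytes would be overwritten, so the read path has to require `len == 0`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

// Hypothetical buffer model; not Squid's tunnel buffer.
struct SketchBuffer {
    char buf[16] = {};
    std::size_t len = 0; // unwritten bytes stored at the start of buf

    // Mimics a read that always fills from buf[0].
    void readInto(const char *src, std::size_t n) {
        assert(len == 0);          // otherwise pending bytes would be clobbered
        assert(n <= sizeof(buf));  // keep the toy example in bounds
        std::memcpy(buf, src, n);
        len = n;
    }

    // Mimics a completed write of everything buffered.
    void consumeAll() { len = 0; }
};
```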

rousskov · 2024-05-21T15:12:42Z

src/tunnel.cc

@@ -898,6 +898,17 @@ void
 TunnelStateData::copyRead(Connection &from, IOCB *completion)
 {
    assert(from.len == 0);
+
+    // TODO: remove code duplication, creating a helper method
+    if (!Comm::IsConnOpen(from.conn)) {


If from.conn is not open anymore, then why wasn't our connection closure callback called? And if it was called, why do we need to take extra steps here/now to end the transaction and destroy TunnelStateData?

Perhaps we can rely on our connection closure callback and simply return here. However, it looks like the current tunnel code is inconsistent in this respect. For example, TunnelStateData::writeServerDone() does not rely on the callback (under the "If the other end has closed, so should we" comment) but TunnelStateData::readClient() does (under the "close handlers will tidy up for us" comment). I could not find whether/where we remove close callbacks in tunnel.cc. So it looks like "relying on close handlers" is the correct approach.
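The "rely on close handlers" approach discussed above can be sketched as follows. This is a minimal model, not tunnel.cc: `HandlerConn`, `HandlerTunnel`, and the `destroyed` flag are hypothetical, and the flag merely stands in for ending the transaction and deleting the tunnel state. The sketch assumes close handlers are never removed, so a read path that finds its connection closed can simply return.

```cpp
#include <cassert>
#include <functional>
#include <utility>
#include <vector>

// Hypothetical connection with close callbacks; not Squid's Comm API.
struct HandlerConn {
    bool open = true;
    std::vector<std::function<void()>> closeHandlers;

    void addCloseHandler(std::function<void()> h) {
        closeHandlers.push_back(std::move(h));
    }

    void close() {
        if (!open)
            return; // already closed; handlers ran (or are running)
        open = false;
        for (auto &h : closeHandlers)
            h();
        closeHandlers.clear();
    }
};

// Hypothetical tunnel that lets close handlers do all the cleanup.
struct HandlerTunnel {
    HandlerConn client, server;
    bool destroyed = false; // stands in for destroying the tunnel state

    HandlerTunnel() {
        client.addCloseHandler([this] { server.close(); maybeFinish(); });
        server.addCloseHandler([this] { client.close(); maybeFinish(); });
    }
    HandlerTunnel(const HandlerTunnel &) = delete; // handlers capture this

    void maybeFinish() {
        if (!client.open && !server.open)
            destroyed = true;
    }

    // Returns true if a read would be started on `from`.
    bool copyRead(const HandlerConn &from) {
        if (!from.open)
            return false; // close handlers tidy up; nothing more to do here
        return true;
    }
};
```

In this model, closing either side closes the other and finishes the tunnel, and a late `copyRead()` on the closed side is a no-op.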

eduard-bagdasaryan added 2 commits May 20, 2024 17:59

Bug 5056: assertion failed: Read.cc:61: "Comm::IsConnOpen(conn)

350a691

FATAL: assertion failed: Read.cc:61: "Comm::IsConnOpen(conn)" comm_read() was called on a closed connection because it was initiated by tunnelDelayedServerRead() event triggered after the connection closure.

TunnelStateData::copyRead() should honor the always empty 'from' buffer

52fdb46

eduard-bagdasaryan commented May 21, 2024

rousskov requested changes May 21, 2024

eduard-bagdasaryan added 2 commits August 9, 2024 15:42

Merged from master

2f5ea85

Autoformatted

7eb7c51
