Ask Your Question
0

Many duplicate ack's without retransmission

asked 2025-04-03 20:47:24 +0000

I'm trying to debug an issue where an SMB mount keeps complaining about send timeouts.

I took a capture with wireshark, and it looks like it's getting duplicate ACKs for one of the packets. My understanding is that this indicates that the packet after the sequence ID that's getting ACK'd was dropped, and after the third duplicate ACK, my computer should retransmit that packet. But I don't see a single retransmit in the capture. Instead, the computer keeps sending more packets, and each one is responded to with another duplicate ACK. Eventually, cifs gives up, complains about a send timeout, closes the connection and starts a new one.

Does anyone know what might be causing this / how to debug this further?

image description image description

edit retag flag offensive close merge delete

Comments

In my experience Selective acknowledgements are intended to allow the stream to continue while the sender can send the missing packet again. If this happen frequently then you need to sort out why the packets are dropped. But in general this is an optimized way to handle packet loss and as such isn't an issue.

hugo.vanderkooij gravatar imagehugo.vanderkooij ( 2025-04-04 13:07:16 +0000 )edit

1 Answer

Sort by ยป oldest newest most voted
0

answered 2025-04-03 22:56:34 +0000

johnthacker gravatar image

updated 2025-04-03 23:00:53 +0000

It looks like an issue with TCP Segmentation Offload / Large Send Offload. Those TCP segments are much larger than what actually goes on the wire - presumably you captured on the sending machine.

If you look at the ACKs, you can see that they ACK 425428307 and they also have selective ACKs, with the SACK Left Edge (SLE) being 425444235. If you look at the sending packets, those are the sequence numbers of frames 72201 and 72203, and the length of the segment in 72201 is 15928. As 425428307 + 15928 = 425444235, naively, that looks like no missing bytes! Both segments are ACKnowledged, just one in the normal ACK and one in the SACK.

In practice, what must have happened is that the 15928 octet segment starting at 425428307 was actually split into much smaller segments smaller than the MSS, the first one was received but the second one was lost. The TSO/LSO sender apparently can't understand that the receiver is missing all the smaller segments other than the first, so it doesn't try to resend.

If you look at some of the earlier ACKs, you will see that they ACK sequence numbers that don't appear to be the start of segments sent - they are sequence numbers that appear in the real segments after divided up.

If this keeps happening to you, you should disable TSO or LSO (depending on whether the sending host is Linux or Windows) as a workaround.

edit flag offensive delete link more

Your Answer

Please start posting anonymously - your entry will be published after you log in or create a new account.

Add Answer

Question Tools

1 follower

Stats

Asked: 2025-04-03 20:47:24 +0000

Seen: 18 times

Last updated: 20 hours ago