TCP Retransmissions b/c of Checksum Incorrect

Tags:
Bandwidth
Cabling
Cisco
Ethernet
Fault isolation
Hubs
IPv4
NetBIOS
Network protocols
Network testing
Networking
Protocol analysis
Routers
Switches
TCP
TCP/IP
We have a problem where some users complain of slow application response time (on the LAN) from our ERP server, while others are fine. After many hours of troubleshooting I've discovered the following:
- Packet sniffs of affected users show TCP Retransmissions because the Checksum is Incorrect from the server.
- Replacing the NIC on the workstation temporarily fixes the errors. The problem may then start happening on the new NIC, so I swap back to the old NIC and the problem goes away for a while.
- Sometimes the problem clears itself up for certain people after a couple of months.
- There is no similarity between the affected NICs, drivers, versions, laptops, desktops, or wireless.
- Affected users have been tested on different segments of the LAN with no success.
ASKED: July 19, 2007  3:55 PM
UPDATED: February 4, 2009  9:55 PM

Answer Wiki


You said “the Checksum is Incorrect from the server”. Did you mean server or client? Just to be absolutely clear!!

If you really meant “server”, then you should be looking at the server’s configuration & hardware.

If not, can you identify any other common characteristics of the users?
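
For anyone unsure what an "incorrect checksum" in a capture actually refers to, here is a minimal sketch of the standard Internet checksum (RFC 1071) as TCP applies it over an IPv4 pseudo-header plus the segment. The function names and the sample bytes are my own, for illustration only; this is not taken from any particular analyzer or from the poster's capture.

```python
def internet_checksum(data: bytes) -> int:
    """Return the 16-bit ones' complement of the ones' complement sum."""
    if len(data) % 2:
        data += b"\x00"  # pad odd-length input with one zero byte
    total = 0
    for i in range(0, len(data), 2):
        total += (data[i] << 8) | data[i + 1]  # big-endian 16-bit words
    while total >> 16:
        total = (total & 0xFFFF) + (total >> 16)  # fold carries back in
    return ~total & 0xFFFF


def tcp_checksum(src_ip: bytes, dst_ip: bytes, segment: bytes) -> int:
    """Checksum over the IPv4 pseudo-header followed by the TCP segment.

    src_ip and dst_ip are 4-byte addresses. To compute the value to send,
    the checksum field (segment bytes 16-17) must be zero. To verify a
    received segment, pass it unchanged: a result of 0 means it checks out.
    """
    pseudo = src_ip + dst_ip + bytes([0, 6]) + len(segment).to_bytes(2, "big")
    return internet_checksum(pseudo + segment)


# Worked example bytes from RFC 1071
print(hex(internet_checksum(bytes.fromhex("0001f203f4f5f6f7"))))  # 0x220d
```

A receiver (or a packet analyzer) runs the same calculation on what it captured. A single flipped bit anywhere in the segment changes the result, so which side is flagging the checksum tells you which direction of the path to look at.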

Discuss This Question: 4  Replies

 
  • Tbitner
    I accidentally hit reply before I had finished composing my last reply. The retransmission requests are coming from the server, stating "Checksum: 0x493a (incorrect, should be 0x4939)". The server is HP-UX 11.11 but will be migrating to 11.23 in the next couple of months. From my testing it seems to be something wrong with the server, but we're currently DBA-less, so I'm reluctant to make any changes!
  • Jtt555
    If the OS is Microsoft you may find this useful: article - 224829. A possible reason for the incorrect checksum is that your network cards are capable of performing TCP Checksum Offload. Broadcom and Intel gigabit cards are among those that can offload TCP checksum calculation. Linux enables TCP Checksum Offload automatically when it is available. With TCP Checksum Offload, outgoing packets are captured before the card calculates the checksum, so the checksums shown in the capture may not be correct. The checksum actually transmitted on the wire and received by the destination host will be correct. On Linux, it is possible to disable TCP checksum offload. Of course, it could also be due to any number of other conditions, such as hardware failure, a corrupted IP datagram, a faulty router, or congestion. Make sure your NIC drivers (server and workstation) are up to date. You may need to configure an NLB setup to make a fatter pipe for your ERP server. Good luck!
  • Tbitner
    jtt555, I don't think the client OS is generating these checksums, since I'm seeing retransmission requests from the server. Based on my tests, I'm starting to lean towards the server as the culprit. Can a bad cable cause incorrect checksums, or would it be the server's NIC? Thanks
  • Snapper70
    You might want to verify the duplex setting between the HP and the switch it's connected to. If the HP is set to 100 full and the switch is set to autonegotiate, then you may have a mismatch, and as load increases you WILL get a lot of runts and retransmissions. The OTHER thing is that some older HPs didn't seem to run full duplex even if configured that way, although recent models don't have that issue. What we HAVE done is to FTP to/from the HP from a high-end workstation and verify the transmission rate. If you're on a 100 Meg connection, you should get at LEAST 30 Meg between a high-end workstation and the HP via FTP (use a large file of 50 Meg or more). If your rate is only 5 Meg or so, you've probably got a duplex mismatch. If you use autonegotiate on the switches, you should also use autonegotiate on the server; if you hardcode speed/duplex on one side, make sure you set it the same on the other. A duplex mismatch will severely impact performance, but may appear normal under low-volume traffic.
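
If you try the FTP test suggested above, a small sketch like this turns a file size and transfer time into a rate you can compare against those rule-of-thumb numbers. Note the assumptions: I'm reading the "30 Meg" and "5 Meg" figures as megabits per second, which the reply doesn't state, and the function names and the "inconclusive" middle band are my own additions.

```python
def throughput_mbit_per_s(size_bytes: int, seconds: float) -> float:
    """Average transfer rate in megabits per second (1 Mbit = 10**6 bits)."""
    if seconds <= 0:
        raise ValueError("transfer time must be positive")
    return size_bytes * 8 / seconds / 1_000_000


def assess_rate(rate: float, healthy_min: float = 30.0,
                suspect_max: float = 5.0) -> str:
    """Compare a measured rate with the thread's rule-of-thumb thresholds.

    Assumption: both thresholds are in Mbit/s on a 100 Mbit/s link.
    """
    if rate >= healthy_min:
        return "looks healthy"
    if rate <= suspect_max:
        return "possible duplex mismatch"
    return "inconclusive"


# Example: a 50 MB (50,000,000 byte) file that took 80 seconds to transfer
rate = throughput_mbit_per_s(50_000_000, 80)
print(rate, assess_rate(rate))
```

Run the transfer in both directions and at a busy time of day if you can, since a mismatch may only show up under load.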
