Re: [lwip-users] netconn

lwip-users

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [lwip-users] netconn_write blocking

From:	Frédéric BERNON
Subject:	Re: [lwip-users] netconn_write blocking
Date:	Tue, 9 Oct 2007 22:44:09 +0200

Hi,

When you said "lots of messages about the queue being full", can you tell mewhat exact messages you got?

I think if you wait a long long time (several minutes), you should got aerror. In fact, there is a kind of "timeout" in lwIP for tcp "write", butdefault values in opt.h and tcp.h are too big (to my point of view). I thinkmainly to TCP_SYNMAXRTX that you could reduce. This is the number ofretransmissions you have to wait before lwIP abort a TCP connection whenit's segments are not acknowledged: when you unplug your "peer", your servercontinue to send packets until it fill the "tcp send buffer". Even in thiscase, your "write" doesn't return if the segment is not "enqueued" (it retryeach time tcp_sent callback is invoked, with do_writemore). Since the cableis unplugged, the tcp segments you send are never acknowledged. So, the"slow" tcp timer try to resend them (tcp considers that these segments canbe lost in the network, so, this is a normal tcp retransmission). It try toresend them TCP_SYNMAXRTX times (but not in a "linear" way, but in a"exponential" way). After that, it abort the connection. If you can do awireshark capture, I suppose you can see these retransmissions (that what Idid, see below). So, the "solution" is to reach TCP_SYNMAXRTX faster. To dothat, you can:


- Reduce TCP_SYNMAXRTX in your lwipopts.h (you can try 4)
- Reduce TCP_TMR_INTERVAL in your lwipopts.h (you can try 100)

We have talk with Kieran about lwIP retransmission implementation in thisemails (this is not exactly the same case, but the cause is, but, becarefull, I talk about a dirty hack, don't use it, it was just forexperience):


http://lists.nongnu.org/archive/html/lwip-devel/2007-09/msg00061.html
http://lists.nongnu.org/archive/html/lwip-devel/2007-09/msg00062.html
http://lists.nongnu.org/archive/html/lwip-devel/2007-09/msg00063.html

I attach some captures I did during these tests, but I can remember theTCP_TMR_INTERVAL value I used. What you can see in "TCP_MAXRTX=12.cap", isthere is until 412 seconds until the connection is abort (we can seeanything in the capture, lwIP dosen't send any RST packet when it abort theconnection). You can also see the delay between each retransmission isincreased (doubled in a first time, until it reach a max value). It use thetcp_backoff table in can found in tcp.c:


const u8_t tcp_backoff[13] ={ 1, 2, 3, 4, 5, 6, 7, 7, 7, 7, 7, 7, 7};

In "TCP_MAXRTX=6.cap", you can see the abort is reach faster.

I hope it can help you...

----- Original Message -----From: <address@hidden>

To: "Mailing list for lwIP users" <address@hidden>
Sent: Tuesday, October 09, 2007 9:28 PM
Subject: Re: [lwip-users] netconn_write blocking

Hi,
Hi!
I have an application that is sending out TCP data to several clientusing the sequential API. When a client disconnects gracefully,netconn_write returns a negative value, and I can close the connection.However, if any of the clients locks up (i'm using embedded clients), ora cable gets unplugged, etc. netconn_write keeps queue packets until itfills up the buffer, and then blocks. I've been playing around withdebugging, and so far all I get is lots of messages about the queue beingfull.
It complains about the queue being full?? That would be a misconfigurationand maybe an error in your port! The queues should never be full! That'swhy sys_arch_mbox_post has no return value, and the port should assert tocheck that a queue is never full. Misconfiguration could lead to this: toobig TCP windows vs. too small queues...
But to be sure about this, could you post an excerpt of your debug outputso that I know which function / file complains?
My question is what is the proper way to deal with ungracefuldisconnections using the sequential API? Am I doing something wrong,should netconn_write return an error for ungraceful disconnections, or isthere any other way to check if for connection timeouts?
Unfortunately, there is only RX timeout currently. TX timeout is planned,I think...
Simon


_______________________________________________
lwip-users mailing list
address@hidden
http://lists.nongnu.org/mailman/listinfo/lwip-users

TCP_MAXRTX=12.cap
Description: Binary data

TCP_MAXRTX=6.cap
Description: Binary data

[Prev in Thread]

Current Thread

[Next in Thread]

[lwip-users] netconn_write blocking, Lukefahr, Andrew Robert (UMC-Student), 2007/10/08
- Re: [lwip-users] netconn_write blocking, address@hidden, 2007/10/09
  - Re: [lwip-users] netconn_write blocking, Frédéric BERNON <=
- Re: [lwip-users] netconn_write blocking, Andrew Lukefahr, 2007/10/09
  - RE: [lwip-users] netconn_write blocking, Goldschmidt Simon, 2007/10/10
    - RE: [lwip-users] netconn_write blocking, Kieran Mansley, 2007/10/10

Prev by Date: Re: [lwip-users] netconn_write blocking
Next by Date: Re: [lwip-users] netconn_write blocking
Previous by thread: Re: [lwip-users] netconn_write blocking
Next by thread: Re: [lwip-users] netconn_write blocking
Index(es):
- Date
- Thread