Re: [lwip-devel] [mqtt] Disconnection caused by a keep-alive timeout


From:	Giuseppe Modugno
Subject:	Re: [lwip-devel] [mqtt] Disconnection caused by a keep-alive timeout
Date:	Thu, 3 Dec 2020 11:37:33 +0100
User-agent:	Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.5.0

On 03/12/2020 10:54, Benjamin Kalytta wrote:

I don't know if my problem is related, since I relied on the Netconn API.

In that case, my embedded device was running an HTTP web server. Clients
connected to my device through Wi-Fi. AJAX requests were used heavily, i.e. many
connections were opened and closed very often. As soon as the client (running on
Microsoft Windows) lost the Wi-Fi connection, Windows automatically shut down
all active TCP/IP connections, since the network interface was no longer
available (this is presumably not conformant with the TCP/IP specification!).
However, the embedded device did not notice that the connections had already
been closed on the opposite side, so no pool memory was ever freed. When the
Wi-Fi connection was reestablished and the client tried to reconnect, no more
memory was available to handle that client's requests.

Yes, I think it's related. The TCP/IP stack tries to close all connections in the best way it can, even if this means keeping data in memory for a very long time. During this time your system has less free memory to use for new connections. Because embedded systems often don't have much memory, this can be a critical issue.

My solution was to enable TCP/IP keep-alive per connection and set a
reasonably short timeout. The following configuration has to be set for that to
work reliably:

#define LWIP_NETCONN_FULLDUPLEX                 1
#define LWIP_NETCONN_SEM_PER_THREAD             0
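
As a rough illustration of "keep-alive per connection with a short timeout", here is a minimal sketch using the BSD-style socket options as they exist on Linux. It is not the configuration from the quoted message, and the helper name enable_keepalive() and its parameters are made up for this example. As far as I know, the lwIP socket layer offers options with the same names when LWIP_TCP_KEEPALIVE is set to 1, but please check that against your lwIP version before relying on it.

```c
#include <netinet/in.h>
#include <netinet/tcp.h>
#include <sys/socket.h>

/* Hypothetical helper: enable TCP keep-alive on one socket.
 *   idle_s  - seconds of silence before the first probe is sent
 *   intvl_s - seconds between probes that get no answer
 *   count   - unanswered probes before the stack drops the connection
 * Returns 0 on success, -1 if any setsockopt() call fails. */
int enable_keepalive(int fd, int idle_s, int intvl_s, int count)
{
    int on = 1;

    /* Switch keep-alive probing on for this socket only. */
    if (setsockopt(fd, SOL_SOCKET, SO_KEEPALIVE, &on, sizeof(on)) < 0)
        return -1;
    /* Tune the timing; these option names are the Linux ones. */
    if (setsockopt(fd, IPPROTO_TCP, TCP_KEEPIDLE, &idle_s, sizeof(idle_s)) < 0)
        return -1;
    if (setsockopt(fd, IPPROTO_TCP, TCP_KEEPINTVL, &intvl_s, sizeof(intvl_s)) < 0)
        return -1;
    if (setsockopt(fd, IPPROTO_TCP, TCP_KEEPCNT, &count, sizeof(count)) < 0)
        return -1;
    return 0;
}
```

With enable_keepalive(fd, 10, 5, 3), a silently vanished peer should be detected after roughly 10 + 5 * 3 = 25 seconds, at which point the stack can release the memory held for that connection.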

I didn't know about the keep-alive feature of the TCP stack. In my case I already have an application protocol (MQTT) that can be configured to use keep-alives (at the application layer). If the application detects a faulty connection with the peer (no reply to the keep-alive request), it closes the TCP socket. In this situation, I think it is safe (maybe better) to use tcp_abort(), which frees everything related to the connection, instead of tcp_close(), which will try to send unacked data.
