Re: [lwip-users] tcp_write with zero-copy

From:	Jonathan Larmour
Subject:	Re: [lwip-users] tcp_write with zero-copy
Date:	Sun, 17 Feb 2008 01:03:17 +0000
User-agent:	Mozilla Thunderbird 1.0.8-1.1.fc4 (X11/20060501)

Timmy Brolin wrote:

Hi,
Yes, the rx pool may have to be slightly bigger, but the tx pool could be set to almost zero instead.

Only in a limited subset of applications, I would have thought. Very few protocols have responses which you only slightly modify, and send back, keeping the same packet size; fewer still TCP-based ones (rather than UDP) - I can't think of any. After all, TCP is stream-based so you have no idea how many pieces your message will arrive in at the far end. Or if the protocol isn't entirely synchronous or multiple packets of this protocol can be sent at once, then there may be bits of subsequent packets within the same pbufs. It seems a little like you're trying to make a quite specific scenario more efficient based on guarantees that the underlying protocol does not make.

Determining the optimum balance between rx and tx pool sizes is not very easy as it is now. With true zero copy there would be no such balance. Simply put all available memory into the pbuf pool.

But then you run the risk of running out of configured space for receiving data, because it's all used up with data for transmission. RX data has to take priority, especially since it includes TCP ACKs.

Yes, the system may become more "memory efficient" in the sense that more of the available memory is used at any time; but this is at the expense of deterministic behaviour. It is more deterministic to have the general principle of having a set of pbufs that are reserved only for rx data.
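The difference between one shared pool and a reserved rx pool can be sketched with a small stand-alone model. This is only an illustration under simplifying assumptions: the `pool` struct and every function name below are hypothetical, it counts whole buffers rather than bytes, and it does not use lwIP's pbuf code at all.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical buffer pool model; NOT lwIP's pbuf implementation. */
struct pool {
    size_t total; /* buffers in the pool */
    size_t used;  /* buffers currently allocated */
};

/* Take one buffer from the pool; false if the pool is exhausted. */
static bool pool_alloc(struct pool *p)
{
    if (p->used >= p->total)
        return false;
    p->used++;
    return true;
}

/* Queue up to `wanted` tx buffers; returns how many were obtained. */
static size_t queue_tx(struct pool *tx, size_t wanted)
{
    size_t n = 0;
    while (n < wanted && pool_alloc(tx))
        n++;
    return n;
}

/* One shared pool: can an incoming ACK still get a buffer after the
 * application has tried to queue `wanted` buffers for transmission? */
bool ack_receivable_shared(size_t total, size_t wanted)
{
    struct pool shared = { total, 0 };
    queue_tx(&shared, wanted);
    return pool_alloc(&shared);
}

/* Split pools: `rx_reserved` buffers are never available to tx. */
bool ack_receivable_split(size_t total, size_t rx_reserved, size_t wanted)
{
    assert(rx_reserved <= total);
    struct pool rx = { rx_reserved, 0 };
    struct pool tx = { total - rx_reserved, 0 };
    queue_tx(&tx, wanted);
    return pool_alloc(&rx);
}
```

With 16 buffers in total, `ack_receivable_shared(16, 100)` is false because a busy sender has taken every buffer, whereas `ack_receivable_split(16, 4, 100)` is true because the four reserved rx buffers are untouched however much the application tries to send.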

Today the application has to allocate a buffer for tx data before it can free the rx buf, so momentarily there is twice the amount of memory used, and when the application sends the data, lwip will do a second tx buffer allocation and memcpy which means yet again there is momentarily double the memory use.

In practice, there may not be any particular problem with having a tcp_write_pbuf() variant - that's pretty much just moving existing code around a little so hopefully wouldn't have any real repercussions for normal users. But I wouldn't be happy about consolidating the pbuf memory into a single pool in general.

There are ways of avoiding this second allocation and memcpy by using tcp_sent, but it is not a very practical method since it requires the application to keep track of exactly which data has been sent and acked.

I am afraid that I don't quite understand how using pbufs for both rx and tx would use more memory than the separate rx/tx pools use today.
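The bookkeeping that the tcp_sent approach requires can be sketched as follows. When tcp_write() is called without the copy option, the stack references the application's memory, so the application must keep each buffer alive until the byte count reported to the tcp_sent() callback has covered it. The tracker below is a stand-alone sketch: `tx_tracker`, `tx_track()`, `tx_acked()` and `MAX_INFLIGHT` are hypothetical names, not part of lwIP, and the assumption is that `tx_acked()` would be called from the sent callback with its `len` argument.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_INFLIGHT 8 /* hypothetical limit on unacked buffers */

struct inflight_buf {
    void  *data; /* memory handed to the stack without copying */
    size_t len;  /* bytes in this buffer */
};

struct tx_tracker {
    struct inflight_buf q[MAX_INFLIGHT]; /* ring buffer, oldest first */
    size_t head;                         /* index of the oldest buffer */
    size_t count;                        /* buffers not yet fully acked */
    size_t acked_in_head;                /* acked bytes of the oldest buffer */
    void (*release)(void *data);         /* optional: give a buffer back */
};

/* Record a buffer right after a successful no-copy write.
 * Returns 0 on success, -1 if the tracker is full or len is 0. */
int tx_track(struct tx_tracker *t, void *data, size_t len)
{
    if (t->count == MAX_INFLIGHT || len == 0)
        return -1;
    size_t tail = (t->head + t->count) % MAX_INFLIGHT;
    t->q[tail].data = data;
    t->q[tail].len = len;
    t->count++;
    return 0;
}

/* Account for `acked` newly acknowledged bytes. Releases every buffer
 * that is now fully acknowledged and returns how many were released. */
size_t tx_acked(struct tx_tracker *t, size_t acked)
{
    size_t released = 0;
    while (acked > 0) {
        assert(t->count > 0); /* more bytes acked than were tracked */
        struct inflight_buf *b = &t->q[t->head];
        size_t remaining = b->len - t->acked_in_head;
        if (acked < remaining) {
            t->acked_in_head += acked; /* partially acked: keep it */
            break;
        }
        acked -= remaining;
        if (t->release)
            t->release(b->data);
        t->head = (t->head + 1) % MAX_INFLIGHT;
        t->count--;
        t->acked_in_head = 0;
        released++;
    }
    return released;
}
```

An ACK can cover part of a buffer or span several, which is why the tracker carries `acked_in_head` between calls; that per-connection state is the extra burden the paragraph above describes.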

Consider a more general TCP stream than you are using for your protocol. There are few constraints on how much data can be enqueued, principally TCP_SNDBUF and TCP_SNDQUEUELEN. So an application that has a lot of data to send will be able to fill each tcp connection's send buffer entirely to those limits. That would be done at the expense of rx buffers in your scenario. That greatly risks deadlock.

So you might think then "well, why not just make sure TCP_SNDBUF and TCP_SNDQUEUELEN are set to prevent that", in which case you may as well have used a separate tx buffer space, since you're again effectively dividing up buffer space.
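For reference, a sketch of how those limits might look in an lwipopts.h fragment. In lwIP's opt.h the options are spelled TCP_SND_BUF and TCP_SND_QUEUELEN. The numbers are illustrative assumptions, not recommendations, and `WORST_CASE_TX_BYTES` is a hypothetical helper macro, not an lwIP option.

```c
/* lwipopts.h fragment: illustrative values only */
#define TCP_MSS           536                 /* maximum segment size, bytes */
#define TCP_SND_BUF       (2 * TCP_MSS)       /* send buffer per connection */
#define TCP_SND_QUEUELEN  (4 * (TCP_SND_BUF) / (TCP_MSS)) /* queued pbufs per connection */
#define MEMP_NUM_TCP_PCB  4                   /* simultaneous TCP connections */
#define PBUF_POOL_SIZE    16                  /* pbufs in the pbuf pool */

/* Hypothetical helper: the most payload that can be enqueued for
 * transmission if every connection fills its send buffer. */
#define WORST_CASE_TX_BYTES (MEMP_NUM_TCP_PCB * TCP_SND_BUF)
```

With these values each connection can enqueue at most 1072 bytes in at most 8 pbufs, so the worst-case tx demand is 4288 bytes. Capping tx this way is the same thing as setting aside a tx budget, which is the point made above.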

Anyway, I think if you can make a tcp_write_pbuf() implementation that would not increase the footprint for those who don't use it, then feel free to submit it to the patches page on savannah. If it doesn't increase footprint, I'm sure that would be ok to accept (after 1.3.0). But it does seem a little to me like the protocol you are implementing really should be datagram-based, not stream-based.


Jifl
--
eCosCentric Limited      http://www.eCosCentric.com/     The eCos experts
Barnwell House, Barnwell Drive, Cambridge, UK.       Tel: +44 1223 245571
Registered in England and Wales: Reg No 4422071.
------["The best things in life aren't things."]------      Opinions==mine
