Re: RE : [lwip-devel] brain storming about "socket2"

In my opinion socket api are very useful: i like the power of select to manage synchronized IO, and i like to have (specially using tcp) to have a separation between application and tcpip stack.
(socket function send and recv, do a copy, so puts a "red line" between application code and stack code, consuming ram and loosing efficiency)
I know that socket reduces performance, but i think we should have a trade-off between performance and the ability to offer an easy way to write application code using the most powerful api like socket. We should have a "full" tcpip stack in each our application, which start from a very efficient emac driver, integrated in better way in lwip stack (see the discussion about zero-copy driver in lwip-dev mailing list), to socket interface, changing how socket are build above other layers (i.e. without netconn)

i think that netconn api could be useful for particular applications, which needs high performance. Socket are more general purpose, there is a lot of documentation about them, a lot of code based on them, offers supports for synchronization of many simultaneous connections. Netconn are oriented to "event on network", to have low response latency using callback

enhancements... absolutely!

Probably my misunderstanding, then. I was reading about the memmp issues
needing a stack restart. However, that is a different thread, and probably
no relevant to this discussion!

yes... is related to a strange problems i sw stressing my code... no relevant for this discussion

Before starting with lwip, i saw interniche stack (the light version, available from microcontroller vendor.
I saw a very difficult to debug code, no documentation and support are available until you decide to pay for commercial support, and it doesn't have real bsd socket as interface: in lwip you have a very well done code, a community of developers/users behind, and you CAN CHOICE between performance (netconn api or raw api) and BSD SOCKET

in my opinion we can accept deviations from the standard posix, but some features must be compliance.

i tried to study lwip code...
first, during my application performance analysis, i checked that some time is spent in recv function. (note: i'm using freertos as RTOS with lwip 130)
My understanding:
for each call to recv, a call to netconn_recv is performed.
this function calls:

[ ------ in context of application thread --------]
- sys_arch_mbox_fetch(conn->recvmbox, (void *)&p, 0);    ----> a call to rtos kernel - wait msg on queue (if a checked socket with 'select()' before, a msg is already present)
- TCPIP_APIMSG(&msg);    ----> send a msg to tcpip task, calling:
                  - sys_mbox_post(mbox, &msg);                                             ----> a call to rtos kernel - send msq to queue (send requested operation to tcpip task, for safe execution)
                  - sys_arch_sem_wait(apimsg->msg.conn->op_completed, 0);   ----> a call to rtos kernel - wait on semaphore (wait that tcpip task executes operation)
                  [ --------- RTOS context switch -------- ]                                      ----> rtos kernel context switch - code execution continues in tcpip thread
[ ------ in context of tcpip thread --------]
- sys_mbox_fetch(mbox, (void *)&msg);     ----> a call to rtos kernel - wait msg on queue, requested operation is received in queue
- exec requested function                                                                         ----> CORE OF REQUESTED OPERATION (i.e. recv). At the end calls:
                  - TCPIP_APIMSG_ACK(msg);                                                ----> send msg to application task, calling:
                          - sys_sem_signal(m->conn->op_completed)                    ----> a call to rtos kernel - signal a semaphore (advise application thread: operation executed)
                  [ --------- RTOS context switch -------- ]                                      ----> rtos kernel context switch - code execution continues in application thread
[ ------ in context of application thread --------]

So, for recv operation, will be necessary:
- 5 calls to RTOS kernel
- 2 RTOS context switch

I saw that some piece of code in LWIP use LWIP_TCPIP_CORE_LOCKING, to protect access to lwip core, but only for send operation and INSIDE netconn api.
I suppose that the idea for socket2, build above raw api, could be to write socket code implementation which calls this low_level functions locking/unlocking core, using a general semaphore, without netconn api. In thsi case, a recv function should use:
- 2 calls to RTOS kernel (the first try to get semaphore before low_level call, the second release semaphore)

If i'm not in wrong, we should have better performance for socket api, and the tcpip thread should have less work to do.
Is it this the idea for new socket2 api?

mmmm. socket without copy? it's not posix compliance, but could be very useful!!

Can we explain how we can cooperate for this? starting with discussion and AFTER trying to change lwip code?

Bye
Piero

From:	Piero 74
Subject:	Re: RE : [lwip-devel] brain storming about "socket2"
Date:	Mon, 19 Jan 2009 16:11:12 +0100