Optimization, general questions #86

jwt27 · 2023-07-11T18:30:52Z

Hi,

While working on the recent PRs I noticed there could be some room for optimization. I'm looking into that now but I had some questions:

Callbacks from the native to BSD layer have to iterate over the socket list and find the appropriate BSD socket. Seems it would be better to store a pointer from the native socket back to the BSD socket that owns it. But maybe there is some reason why you hadn't done that?

I do see for the ICMP callback:

Watt-32/src/socket.c

Line 1845 in 70d84f2

* \note a single ICMP message may apply to several UDP sockets.

Is that still true? I don't see how it's possible. One BSD socket can only "own" one native socket, no?

I did see you had the idea once to store the Socket*s in a hash table. I finished the implementation for it now, but as long as there is so much iterating going on, it wouldn't be any faster.

Also I was looking into optimizing select_s()/poll(). I see select_s() has code to check DOS FDs too, but currently it will skip those and set errno = ENOTSOCK. So which is it? Should select_s() be able to work with DOS FDs? It would simplify things a bit if it didn't.

Edit: didn't see the check for SK_FIRST. So stdin/stdout/stderr should be allowed.

The text was updated successfully, but these errors were encountered:

gvanem · 2023-07-12T16:57:56Z

But maybe there is some reason why you hadn't done that?

Perhaps. But like I wrote earlier, there is some years since I wrote that.

But I remember experimenting and porting various to Watt-32 programs (like ncat, netperf), and others involving
PF_PACKET / SOCK_PACKET / SOCK_RAW sockets, I did these tweaks. Besides, there was not many sockets to iterate over.
But perhaps you're right if Watt-32 was used for a Web-server with 5000 clients. 😮‍💨

BTW, currently, MAX_SOCKETS == 5000 for djgpp/Win-targets. Perhaps there should be some
sysconf(OPEN_MAX) / getrlimit() function for this?

store a pointer from the native socket back to the BSD socket that owns it.

A native socket cannot store more than 1 pointer. The problem is that a native socket can be associated with
several BSD-sockets:

Watt-32/src/ip4_in.c

Lines 64 to 75 in 70d84f2

    
           #if defined(USE_BSD_API) 
        
             /* 
        
              * Match 'ip' against all SOCK_RAW sockets before doing normal 
        
              * protocol multiplexing below. 
        
              * 
        
              * Note: _bsd_socket_hook does nothing unless we have allocated at least 
        
              *       one SOCK_RAW socket. 
        
              * 
        
              * Should we return if the hook consumed packet (returns non-NULL) ? 
        
              */ 
        
             if (_bsd_socket_hook) 
        
               (*_bsd_socket_hook) (BSO_IP4_RAW, ip);

The upper layer BSO_IP4_RAW (BSD-Socket Operation) handler, should tell all these SOCK_RAW sockets
about an event.

The flow for letting SOCK_RAW socket(s) peek at the packet is something like:

_ip4_handler(), via (*_bsd_socket_hook()) calls socket_op_demux().
Then socket_op_demux(), via (*ip4_raw_hook) calls sock_raw4_recv().

Finally sock_raw4_recv() should do it's job for all SOCK_RAW sockets:

Watt-32/src/socket.c

Lines 931 to 940 in 70d84f2

    
           for (sock = sk_list; sock; sock = sock->next) 
        
           { 
        
             if (sock->so_type != SOCK_RAW) 
        
                continue; 
        
             if (sock->so_proto == IPPROTO_IP) 
        
                goto enqueue;   /* socket matches every IP-protocol, enqueue */ 
        
             if (ip->proto != sock->so_proto)  /* isn't what we want */ 
        
                continue;

It's rather messy, I know. Even more so when it comes to select_s() and SOCK_RAW / SOCK_PACKET sockets.
But things could improve.

And BTW, it took my brain almost an hour to dig up all these details after these years.

So stdin/stdout/stderr should be allowed.

Correct. Try it with:

cd tests
make djgpp.mak select.exe
py -3 -c "print('1'*100)" | select.exe -s 5000 -w 1 -d

Works kinda if you replace getch() with getchar(). But what if it created 5000 sockets first?

jwt27 · 2023-07-12T22:51:24Z

But maybe there is some reason why you hadn't done that?

Perhaps. But like I wrote earlier, there is some years since I wrote that.

But I remember experimenting and porting various to Watt-32 programs (like ncat, netperf), and others involving PF_PACKET / SOCK_PACKET / SOCK_RAW sockets, I did these tweaks. Besides, there was not many sockets to iterate over. But perhaps you're right if Watt-32 was used for a Web-server with 5000 clients.

BTW, currently, MAX_SOCKETS == 5000 for djgpp/Win-targets. Perhaps there should be some sysconf(OPEN_MAX) / getrlimit() function for this?

O(1) is still N times faster than O(N). But you're right, it does depend on scale.

store a pointer from the native socket back to the BSD socket that owns it.

A native socket cannot store more than 1 pointer. The problem is that a native socket can be associated with several BSD-sockets:

Watt-32/src/ip4_in.c

Lines 64 to 75 in 70d84f2

#if defined(USE_BSD_API)

/*

* Match 'ip' against all SOCK_RAW sockets before doing normal

* protocol multiplexing below.

*

* Note: _bsd_socket_hook does nothing unless we have allocated at least

* one SOCK_RAW socket.

*

* Should we return if the hook consumed packet (returns non-NULL) ?

*/

if (_bsd_socket_hook)

(*_bsd_socket_hook) (BSO_IP4_RAW, ip);

The upper layer BSO_IP4_RAW (BSD-Socket Operation) handler, should tell all these SOCK_RAW sockets about an event.

The flow for letting SOCK_RAW socket(s) peek at the packet is something like:

_ip4_handler(), via (*_bsd_socket_hook()) calls socket_op_demux().

Then socket_op_demux(), via (*ip4_raw_hook) calls sock_raw4_recv().

Finally sock_raw4_recv() should do it's job for all SOCK_RAW sockets:

Watt-32/src/socket.c

Lines 931 to 940 in 70d84f2

for (sock = sk_list; sock; sock = sock->next)

{

if (sock->so_type != SOCK_RAW)

continue;

if (sock->so_proto == IPPROTO_IP)

goto enqueue; /* socket matches every IP-protocol, enqueue */

if (ip->proto != sock->so_proto) /* isn't what we want */

continue;

It's rather messy, I know. Even more so when it comes to select_s() and SOCK_RAW / SOCK_PACKET sockets. But things could improve.

Somehow I was under the impression that it was the other way around (multiple native raw sockets per BSD socket). Don't know where I got that idea now. So there is one native raw socket that broadcasts to all raw BSD sockets? And for the more common case (TCP/UDP), there is always a 1:1 mapping from native to BSD, right?

Maybe ideally you'd want a hash table for TCP/UDP sockets and linked list for raw sockets. Now that gets messy.

gvanem · 2023-07-13T00:07:54Z

And for the more common case (TCP/UDP), there is always a 1:1 mapping from native to BSD, right?

AFAICR, for TCP yes. For UDP, I'm not so sure.

jwt27 · 2024-01-17T16:55:02Z

Was wondering, why does the BSD interface call tcp_Retransmitter (TRUE) everywhere? Is it just to poll for ARP lookups?

gvanem · 2024-01-18T08:48:51Z

Is it just to poll for ARP lookups?

It does a lot of things. First, when s->state == tcp_StateRESOLVE, the socket is in a pseudo state before SYN is sent via tcp_Retransmitter(). So if _arp_lookup() succeedes:

Watt-32/src/pctcp.c

Lines 1239 to 1245 in 5363dff

    
           if (_arp_lookup(s->hisaddr, &s->his_ethaddr))   /* Success */ 
        
           { 
        
             UINT rtt, MTU; 
        
             s->state   = tcp_StateSYNSENT; 
        
             s->timeout = set_timeout (tcp_LONGTIMEOUT); 
        
             TCP_SEND (s);  /* send opening SYN */

the opening SYN gets sent.

Second, It also drains s->state == tcp_StateCLOSED sockets to free up memory and local port.

Finally, it handles window update, slow start, Karn Count, and retransmission stuff.

It's rather messy....

jwt27 · 2024-01-18T16:44:23Z

I see, there's a lot going on there. But normally that is on a timer (tcp_RETRAN_TIME), called from tcp_tick(). Why does the BSD interface need to bypass that timer?

I thought, if it is only to speed up ARP resolves, it could be more efficient to add a callback from pcarp.c back to pctcp.c.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimization, general questions #86

Optimization, general questions #86

jwt27 commented Jul 11, 2023 •

edited

Loading

gvanem commented Jul 12, 2023

jwt27 commented Jul 12, 2023

gvanem commented Jul 13, 2023

jwt27 commented Jan 17, 2024

gvanem commented Jan 18, 2024

jwt27 commented Jan 18, 2024

Optimization, general questions #86

Optimization, general questions #86

Comments

jwt27 commented Jul 11, 2023 • edited Loading

gvanem commented Jul 12, 2023

jwt27 commented Jul 12, 2023

gvanem commented Jul 13, 2023

jwt27 commented Jan 17, 2024

gvanem commented Jan 18, 2024

jwt27 commented Jan 18, 2024

jwt27 commented Jul 11, 2023 •

edited

Loading