[packetbeat] Expire source port mappings. #41581

haesbaert · 2024-11-11T09:56:32Z

port->pid mappings were only overwritten, never expired, the overwriting mechanism has some issues:

It only overwrites if it manages to find the new pid, so it misses short lived processes.
It only refreshes the mapping of said port, if a packet arriving on another port misses the lookup (otherwise the original port is found and returned). Meaning, once all ports are used at least once, the cache is filled and never mutated again.

The observable effect is that the user will see wrong process correlations to older/long lived processes, imagine the follwing:

Long lived process makes short lived TCP connection from src_port S.
Years later, a short lived process makes a TCP connection to somewhere else, but from the same src_port S. It hits the cache, since it had a mapping for S, so packetbeat incorrectly correlates the new short-lived process connection, with the old long lived process.

Related to a very long SDH, where a more in depth explanation of the bug can be found here, with a program to reproduce it.

The solution is to discard mappings that are "old enough", with a hardcoded window of 10 seconds, so as long as the port is not re-used in this window, we are fine.

This also makes sure the cache never becomes "immutable", since mappings will invariably get old, forcing a refresh.

It's a very conservative approach as I don't want to introduce other bugs by redesigning it, work is on the way to change how the cache works in linux anyway.

While here, I've noticed the locking was also wrong, we were doing the lookup unlocked, and also having to relock in case we have to update the mapping, so change this to grab the lock once and only once, interleaving is baad.

Proposed commit message

Checklist

My code follows the style guidelines of this project
I have commented my code, particularly in hard-to-understand areas
~~- [ ] I have made corresponding changes to the documentation~~
~~- [ ] I have made corresponding change to the default configuration files~~
~~- [ ] I have added tests that prove my fix is effective or that my feature works~~
I have added an entry in CHANGELOG.next.asciidoc or CHANGELOG-developer.next.asciidoc.

Test

The following program can be used to demonstrate the bug:

#define _GNU_SOURCE
#include <err.h>
#include <errno.h>
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <strings.h>
#include <unistd.h>

#include <sys/socket.h>
#include <netinet/in.h>
#include <arpa/inet.h>

#define M		"all your mappings are belong to us"
#define msleep(_x)	usleep((uint64_t)_x * 1000ULL)
#define nitems(_a)	(sizeof((_a)) / sizeof((_a)[0]))

static void
usage(void)
{
	fprintf(stderr, "usage: %s host port\n", program_invocation_short_name);

	exit(1);
}

int
do_connect(int bport, struct sockaddr_in *sin)
{
	int			fd;
	struct sockaddr_in	bsin;

	if ((fd = socket(AF_INET, SOCK_STREAM, 0)) == -1)
		err(1, "socket");
	bzero(&bsin, sizeof(bsin));
	bsin.sin_family = AF_INET;
	bsin.sin_addr.s_addr = INADDR_ANY;
	bsin.sin_port = htons(bport);
	if (bind(fd, (struct sockaddr *)&bsin, sizeof(bsin)) == -1) {
		warn("bind: %d", bport);
		close(fd);
		return (-1);
	}
	if (connect(fd, sin, sizeof(*sin)) == -1) {
		warn("connect: %d", bport);
		close(fd);
		return (-1);
	}
	if (write(fd, M, strlen(M)) != strlen(M)) {
		warn("write: %d", bport);
		close(fd);
		return (-1);
	}

	return (fd);
}

int
main(int argc, char *argv[])
{
	struct sockaddr_in	 sin;
	int			 i, bport, fds[5000];

	if (argc != 3)
		usage();

	bzero(&sin, sizeof(sin));
	sin.sin_family = AF_INET;
	if (inet_aton(argv[1], &sin.sin_addr) != 1)
		err(1, "inet_aton: %s", argv[1]);
	/* no strtonum in leen0x */
	if (atoi(argv[2]) < 1024 || atoi(argv[2]) >= 65536)
		errx(1, "bad port %s", argv[2]);
	sin.sin_port = htons(atoi(argv[2]));

	bport = 1024;
	while (bport < 65536) {
		for (i = 0; i < (int)nitems(fds) && bport < 65536; i++) {
			fds[i] = do_connect(bport, &sin);
			if (fds[i] != -1) {
				printf("%d\n", bport);
				msleep(1);
			}
			bport++;
		}
		sleep(10);
		for (i = 0; i < (int)nitems(fds); i++) {
			if (fds[i] != -1) {
				close(fds[i]);
				fds[i] = -1;
			}
		}
	}
	
	return (0);
}

Build and run with:

$ cc -o all_your_mappings_are_belong_to_us all_your_mappings_are_belong_to_us.c  -Wall && ./all_your_mappings_are_belong_to_us 192.168.1.50 12345

It will do one connection per source port to the specified address (192.168.1.50:12345) and send some bytes, to make it easier, 192.168.1.50 should be in another machine than packetbeat, you can then run tcpbench, by yours truly, or any other service that will accept tcp connections and eat some bytes:

$ git clone https://github.com/bluhm/tcpbench-portable && cd tcpbench-portable && make && ./tcpbench -s4

After running all_your_mappings_are_belong_to_us, if you do a tcp connection to any other port, packetbeat will incorrectly assume it belongs to all_your_mappings_are_belong_to_us, see screenshots

Tested on 8.14.3 and main.

Screenshots

The circled in red thing is a wget to google.com, yet it things it's from all_your_mappings_are_belong_to_us.

After the fix, the mappings correctly show wget

mergify · 2024-11-11T09:57:08Z

This pull request does not have a backport label.
If this is a bug or security fix, could you label this PR @haesbaert? 🙏.
For such, you'll need to label your PR with:

The upcoming major version of the Elastic Stack
The upcoming minor version of the Elastic Stack (if you're not pushing a breaking change)

To fixup this pull request, you need to add the backport labels for the needed
branches, such as:

backport-8./d is the label to automatically backport to the 8./d branch. /d is the digit

mergify · 2024-11-11T09:57:08Z

backport-8.x has been added to help with the transition to the new branch 8.x.
If you don't need it please use backport-skip label and remove the backport-8.x label.

elasticmachine · 2024-11-11T10:03:11Z

Pinging @elastic/sec-linux-platform (Team:Security-Linux Platform)

haesbaert · 2024-11-11T10:03:50Z

cc @marc-gr @Tacklebox

haesbaert · 2024-11-11T10:16:17Z

The fix naturally results in more /proc parsing, basically at least once per 10 seconds for every connection, before it was basically "at every new connection that doesn't have a map already". The other conservative approach would be to do an O(n) expiration of every map every X seconds, while this would reduce the amount of /proc parsing on the positive case, it wouldn't change anything for the negative (if it misses the cache it tries to rebuild it anyway).

nicholasberlin

Some nitpicks, but LGTM

nicholasberlin · 2024-11-11T16:24:50Z

packetbeat/procs/procs.go

@@ -253,9 +276,8 @@ func (proc *ProcessesWatcher) expireProcessCache() {
 	}
 }

+// proc.mu must be locked


Should add this comment above line 250, also.

nicholasberlin · 2024-11-11T16:27:59Z

packetbeat/procs/procs.go

+	// the whole map.
+	//
+	// We take a conservative approach by discarding the entry if
+	// it's old enough. When we the first time here, our caller


Suggested change

// it's old enough. When we the first time here, our caller

// it's old enough. The first time here, our caller

port->pid mappings were only overwritten, never expired, the overwriting mechanism has a bunch of issues: - It only overwrites if it manages to find the new pid, so it misses short lived processes. - It only refreshes the mapping of said port, if a packet arriving on _another_ port misses the lookup (otherwise the original port is found and returned). Meaning, once all ports are used at least once, the cache is filled and never mutated again. The observable effect is that the user will see wrong process correlations _to_ older/long lived processes, imagine the follwing: - Long lived process makes _short_ lived TCP connection from src_port S. - Years later, a _short_ lived process makes a TCP connection to somewhere else, but from the same src_port S. It hits the cache, since it had a mapping for S, so packetbeat incorrectly correlates the new short-lived process connection, with the old long lived process. Related to a very long SDH, where a more in depth explanation of the bug can be found here, with a program to reproduce it. - elastic/sdh-beats#4604 (comment) - elastic/sdh-beats#4604 (comment) The solution is to discard mappings that are "old enough", with a hardcoded window of 10 seconds, so as long as the port is not re-used in this window, we are fine. This also makes sure the cache never becomes "immutable", since mappings will invariably get old, forcing a refresh. It's a very conservative approach as I don't want to introduce other bugs by redesigning it, work is on the way to change how the cache works in linux anyway. While here, I've noticed the locking was also wrong, we were doing the lookup unlocked, and also having to relock in case we have to update the mapping, so change this to grab the lock once and only once, interleaving is baad.

port->pid mappings were only overwritten, never expired, the overwriting mechanism has a bunch of issues: - It only overwrites if it manages to find the new pid, so it misses short lived processes. - It only refreshes the mapping of said port, if a packet arriving on _another_ port misses the lookup (otherwise the original port is found and returned). Meaning, once all ports are used at least once, the cache is filled and never mutated again. The observable effect is that the user will see wrong process correlations _to_ older/long lived processes, imagine the follwing: - Long lived process makes _short_ lived TCP connection from src_port S. - Years later, a _short_ lived process makes a TCP connection to somewhere else, but from the same src_port S. It hits the cache, since it had a mapping for S, so packetbeat incorrectly correlates the new short-lived process connection, with the old long lived process. Related to a very long SDH, where a more in depth explanation of the bug can be found here, with a program to reproduce it. - https://github.com/elastic/sdh-beats/issues/4604#issuecomment-2459969325 - https://github.com/elastic/sdh-beats/issues/4604#issuecomment-2460829030 The solution is to discard mappings that are "old enough", with a hardcoded window of 10 seconds, so as long as the port is not re-used in this window, we are fine. This also makes sure the cache never becomes "immutable", since mappings will invariably get old, forcing a refresh. It's a very conservative approach as I don't want to introduce other bugs by redesigning it, work is on the way to change how the cache works in linux anyway. While here, I've noticed the locking was also wrong, we were doing the lookup unlocked, and also having to relock in case we have to update the mapping, so change this to grab the lock once and only once, interleaving is baad. (cherry picked from commit 587dc60)

port->pid mappings were only overwritten, never expired, the overwriting mechanism has a bunch of issues: - It only overwrites if it manages to find the new pid, so it misses short lived processes. - It only refreshes the mapping of said port, if a packet arriving on _another_ port misses the lookup (otherwise the original port is found and returned). Meaning, once all ports are used at least once, the cache is filled and never mutated again. The observable effect is that the user will see wrong process correlations _to_ older/long lived processes, imagine the follwing: - Long lived process makes _short_ lived TCP connection from src_port S. - Years later, a _short_ lived process makes a TCP connection to somewhere else, but from the same src_port S. It hits the cache, since it had a mapping for S, so packetbeat incorrectly correlates the new short-lived process connection, with the old long lived process. Related to a very long SDH, where a more in depth explanation of the bug can be found here, with a program to reproduce it. - https://github.com/elastic/sdh-beats/issues/4604#issuecomment-2459969325 - https://github.com/elastic/sdh-beats/issues/4604#issuecomment-2460829030 The solution is to discard mappings that are "old enough", with a hardcoded window of 10 seconds, so as long as the port is not re-used in this window, we are fine. This also makes sure the cache never becomes "immutable", since mappings will invariably get old, forcing a refresh. It's a very conservative approach as I don't want to introduce other bugs by redesigning it, work is on the way to change how the cache works in linux anyway. While here, I've noticed the locking was also wrong, we were doing the lookup unlocked, and also having to relock in case we have to update the mapping, so change this to grab the lock once and only once, interleaving is baad. (cherry picked from commit 587dc60) Co-authored-by: Christiano Haesbaert <[email protected]>

botelastic bot added the needs_team Indicates that the issue/PR needs a Team:* label label Nov 11, 2024

mergify bot assigned haesbaert Nov 11, 2024

mergify bot added the backport-8.x Automated backport to the 8.x branch with mergify label Nov 11, 2024

haesbaert added the Team:Security-Linux Platform Linux Platform Team in Security Solution label Nov 11, 2024

botelastic bot removed the needs_team Indicates that the issue/PR needs a Team:* label label Nov 11, 2024

haesbaert marked this pull request as ready for review November 11, 2024 10:03

haesbaert requested a review from a team as a code owner November 11, 2024 10:03

nfritts approved these changes Nov 11, 2024

View reviewed changes

nicholasberlin approved these changes Nov 11, 2024

View reviewed changes

haesbaert added the bugfix label Nov 12, 2024

haesbaert force-pushed the expire-maps branch from a0d2ab8 to 4ed9319 Compare November 12, 2024 06:57

marc-gr approved these changes Nov 12, 2024

View reviewed changes

Merge branch 'main' into expire-maps

60e6a5f

haesbaert merged commit 587dc60 into main Nov 12, 2024
30 checks passed

haesbaert deleted the expire-maps branch November 12, 2024 08:14

mergify bot mentioned this pull request Nov 12, 2024

[8.x](backport #41581) [packetbeat] Expire source port mappings. #41601

Merged

3 tasks

marc-gr added backport-8.15 Automated backport to the 8.15 branch with mergify backport-8.16 Automated backport with mergify labels Nov 12, 2024

This was referenced Nov 12, 2024

[8.15](backport #41581) [packetbeat] Expire source port mappings. #41602

Merged

[8.16](backport #41581) [packetbeat] Expire source port mappings. #41603

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[packetbeat] Expire source port mappings. #41581

[packetbeat] Expire source port mappings. #41581

haesbaert commented Nov 11, 2024 •

edited

Loading

mergify bot commented Nov 11, 2024

mergify bot commented Nov 11, 2024

elasticmachine commented Nov 11, 2024

haesbaert commented Nov 11, 2024

haesbaert commented Nov 11, 2024 •

edited

Loading

nicholasberlin left a comment

nicholasberlin Nov 11, 2024

nicholasberlin Nov 11, 2024

	// it's old enough. When we the first time here, our caller
	// it's old enough. The first time here, our caller

[packetbeat] Expire source port mappings. #41581

[packetbeat] Expire source port mappings. #41581

Conversation

haesbaert commented Nov 11, 2024 • edited Loading

Proposed commit message

Checklist

Test

Screenshots

mergify bot commented Nov 11, 2024

mergify bot commented Nov 11, 2024

elasticmachine commented Nov 11, 2024

haesbaert commented Nov 11, 2024

haesbaert commented Nov 11, 2024 • edited Loading

nicholasberlin left a comment

Choose a reason for hiding this comment

nicholasberlin Nov 11, 2024

Choose a reason for hiding this comment

nicholasberlin Nov 11, 2024

Choose a reason for hiding this comment

haesbaert commented Nov 11, 2024 •

edited

Loading

haesbaert commented Nov 11, 2024 •

edited

Loading