
feat(perf): continuously measure on single conn (iperf-style) #276

Merged: 32 commits merged from perf-exit-slow-start into master on Oct 25, 2023

Conversation

@mxinden (Member) commented Aug 24, 2023:

Our current throughput tests open a connection, open a stream, up- or download 100 MB, and close the connection. 100 MB is not enough on the given path (60 ms, ~5 Gbit/s) to exit the congestion controller's slow start. See #261 for details.

Instead of downloading 100 MB multiple times, each on a new connection, establish a single connection and continuously measure the throughput for a fixed duration (20 s).

Closes #261

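For illustration, here is a minimal sketch of the iperf-style approach described above: establish one connection and one stream, keep writing for a fixed duration, and derive throughput from the bytes moved in that window. The function names, buffer size, and the use of io.Discard as a stand-in for the stream are assumptions for this sketch, not the PR's actual code.

```go
package main

import (
	"fmt"
	"io"
	"time"
)

// uploadFor writes to a single stream for the given duration and returns the
// total number of bytes written. Throughput is then bytes/duration, measured
// on one long-lived connection instead of many short 100 MB transfers.
func uploadFor(w io.Writer, d time.Duration) (uint64, error) {
	buf := make([]byte, 64*1024)
	var total uint64
	start := time.Now()
	for time.Since(start) < d {
		n, err := w.Write(buf)
		if err != nil {
			return total, err
		}
		total += uint64(n)
	}
	return total, nil
}

func main() {
	// In the real benchmark this would be a stream on a freshly established
	// libp2p / QUIC / TCP+TLS connection; io.Discard stands in here.
	d := 20 * time.Second
	total, err := uploadFor(io.Discard, d)
	if err != nil {
		fmt.Println("upload failed:", err)
		return
	}
	fmt.Printf("uploaded %d bytes in %s (%.2f Gbit/s)\n",
		total, d, float64(total)*8/d.Seconds()/1e9)
}
```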
@marten-seemann (Contributor) left a comment:


Should we do 3 iterations at 20 seconds? Slow start won't take more than 1-2 s, so this should give us plenty of time to converge, and it would show situations where the congestion controller runs into a state that it takes a long time to recover from (sudden cross-traffic).

Comment on lines 63 to 75
    // TODO
    jsonB, err := json.Marshal(Result{
        TimeSeconds: time.Since(r.LastReportTime).Seconds(),
        UploadBytes: uint(r.lastReportRead),
        Type:        "intermediary",
    })
    if err != nil {
        log.Fatalf("failed to marshal perf result: %s", err)
    }
    fmt.Println(string(jsonB))

    r.LastReportTime = time.Now()
    r.lastReportRead = 0

Only do a single call to time.Now(), so we don't lose any bytes sent between the two calls:

Suggested change, before:

    // TODO
    jsonB, err := json.Marshal(Result{
        TimeSeconds: time.Since(r.LastReportTime).Seconds(),
        UploadBytes: uint(r.lastReportRead),
        Type:        "intermediary",
    })
    if err != nil {
        log.Fatalf("failed to marshal perf result: %s", err)
    }
    fmt.Println(string(jsonB))
    r.LastReportTime = time.Now()
    r.lastReportRead = 0

After:

    now := time.Now()
    // TODO
    jsonB, err := json.Marshal(Result{
        TimeSeconds: now.Sub(r.LastReportTime).Seconds(),
        UploadBytes: uint(r.lastReportRead),
        Type:        "intermediary",
    })
    if err != nil {
        log.Fatalf("failed to marshal perf result: %s", err)
    }
    fmt.Println(string(jsonB))
    r.LastReportTime = now
    r.lastReportRead = 0
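For context, here is a minimal, self-contained sketch of a reporter that applies the single time.Now() pattern from the suggestion; the reporter struct, its field names, the JSON tags, and the driver in main are illustrative assumptions, not the PR's exact code.

```go
package main

import (
	"encoding/json"
	"fmt"
	"log"
	"time"
)

// Result mirrors the per-interval JSON report from the snippet above
// (JSON tags are assumed here).
type Result struct {
	TimeSeconds float64 `json:"timeSeconds"`
	UploadBytes uint    `json:"uploadBytes"`
	Type        string  `json:"type"`
}

// reporter tracks bytes transferred since the last intermediary report.
type reporter struct {
	LastReportTime time.Time
	lastReportRead uint
}

// report emits one intermediary result. time.Now() is read exactly once, so
// bytes written between measuring the elapsed time and resetting the counter
// cannot be attributed to the wrong interval.
func (r *reporter) report() {
	now := time.Now()
	jsonB, err := json.Marshal(Result{
		TimeSeconds: now.Sub(r.LastReportTime).Seconds(),
		UploadBytes: r.lastReportRead,
		Type:        "intermediary",
	})
	if err != nil {
		log.Fatalf("failed to marshal perf result: %s", err)
	}
	fmt.Println(string(jsonB))
	r.LastReportTime = now
	r.lastReportRead = 0
}

func main() {
	r := &reporter{LastReportTime: time.Now()}
	for i := 0; i < 3; i++ {
		time.Sleep(time.Second)
		r.lastReportRead += 1 << 20 // pretend 1 MiB was transferred
		r.report()
	}
}
```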

@marten-seemann (Contributor) commented:

> Preliminary results show fruitful for https but not rust-libp2p/quic.

Are these results available anywhere?

@mxinden (Member, Author) commented Aug 25, 2023:

> > Preliminary results show fruitful for https but not rust-libp2p/quic.
>
> Are these results available anywhere?

Not yet. Still in a very work-in-progress state.

@mxinden (Member, Author) commented Aug 25, 2023:

Reaching 4.5 Gbit/s with https and >5 Gbit/s with rust-libp2p. Still testing, though this is looking promising.

@mxinden (Member, Author) commented Sep 4, 2023:

The iperf throughput mismatch was due to Nagle's algorithm. It is now disabled via -N; see the previous commit. (The https implementation already has this behavior by default, see golang/go#57530.)

[attached plot]

https://observablehq.com/d/682dcea9fe2505c4?branch=perf-exit-slow-start#branch

I still need to investigate more for the other measurements (*-libp2p and quic-go). Will try with a fixed MTU of 1500 next.

//CC @marten-seemann
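For reference, disabling Nagle's algorithm on a TCP connection in Go looks like the sketch below (the equivalent of iperf3's -N flag); note that Go's net package already enables TCP_NODELAY on new TCP connections by default, which is why the https implementation needed no change. The dialed address is a placeholder.

```go
package main

import (
	"fmt"
	"net"
)

func main() {
	conn, err := net.Dial("tcp", "example.com:443") // placeholder address
	if err != nil {
		panic(err)
	}
	defer conn.Close()

	// Disable Nagle's algorithm (set TCP_NODELAY), the equivalent of iperf3 -N.
	// Go already defaults to no-delay, so this is a no-op unless it was
	// explicitly turned off earlier.
	if tcpConn, ok := conn.(*net.TCPConn); ok {
		if err := tcpConn.SetNoDelay(true); err != nil {
			panic(err)
		}
	}
	fmt.Println("Nagle's algorithm disabled on connection to", conn.RemoteAddr())
}
```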

@marten-seemann (Contributor) commented Sep 4, 2023:

> iperf throughput mismatch was due to Nagle's algorithm

Interesting! It's great to see iperf and HTTPS achieving roughly similar results (at least in the limit). This means that our setup is getting more trustworthy!

Looking at the graphs, why are some measurements drawn as boxes and some as points? Why do some have error bars and others don't? The spread seems pretty high, do we need more iterations?

I wouldn't be surprised if quic-go maxed out somewhere around 2 Gbps. At some point, your transfer becomes CPU-limited, depending on the number of kernel offloads that your QUIC stack uses (and that's not the thing we want to benchmark here). That said, I just updated quic-go/perf to quic-go v0.38.1 (quic-go/perf#16, I'll merge the PR once GHA is not broken anymore...), which uses GSO by default. Might be worth rebasing your branch to see if this changes anything.

In go-libp2p, Yamux uses a 16 MB receive window, which should limit us to roughly 2 Gbps (minus some muxer overhead). It's interesting to see that we're achieving roughly half of that. Could be a coincidence, or point to a bug in our flow control autotuning. I'd be happy to debug this using the current setup (assuming I can still run it manually as I could with the version on master), please let me know.

QUIC uses a 10 MB window, which limits the bandwidth to 1.25 Gbps. That means we're not quite at the optimum, but pretty close. Would it be helpful for you if we prioritize resolving libp2p/go-libp2p#2290? Alternatively, we could also just have a go-libp2p branch that bumps that value, so we can see if that's actually the root of the problem.
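As a sanity check on these window numbers: a flow-control-limited transfer can move at most one receive window per round trip, so the ceiling is simply window / RTT. A small sketch of that arithmetic, using the 65 ms RTT and the window sizes mentioned above:

```go
package main

import "fmt"

// windowLimit returns the maximum throughput in Gbit/s when the sender is
// limited to one receive window per round trip.
func windowLimit(windowBytes, rttSeconds float64) float64 {
	return windowBytes * 8 / rttSeconds / 1e9
}

func main() {
	const rtt = 0.065 // ~65 ms path RTT
	fmt.Printf("Yamux, 16 MB window: %.2f Gbit/s\n", windowLimit(16*1024*1024, rtt)) // roughly 2.1
	fmt.Printf("QUIC,  10 MB window: %.2f Gbit/s\n", windowLimit(10*1024*1024, rtt)) // roughly 1.3
}
```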

> Will try with a fixed MTU of 1500 next.

Does AWS allow larger MTUs on their backbone? That would indeed give TCP an unfair advantage over at least quic-go. Have you verified that using tcpdump / Wireshark?

@marten-seemann (Contributor) commented:

Here are some interesting results from running the HTTPS test and analyzing the tcpdump. The congestion controller used for this test is Cubic. First interesting result: importing and processing an 8 GB pcap into Wireshark takes a pretty long time, O(30 min) ;)

Here's the RTT distribution:

[screenshot: RTT distribution]

There are definitely some queues building up in the network, but it's not too terrible.

Here's the sequence plot (ignore the wrapping of the packet number, obviously), showing the time when packet loss occurred:
[screenshot: sequence plot showing loss events]

And here's the throughput:
[screenshot: throughput over time]

Obviously, we're very far from reaching a steady state.

Here's some back-of-the-envelope math to calculate the recovery time (i.e. the time it takes to ramp the congestion window back up to its original size after a loss), assuming a BDP of roughly 40 MB (at 5 Gbit/s and 65 ms RTT); see the sketch after this list:

  • On Reno, a packet loss halves the cwnd, and every round trip without a loss event increases it by one MSS. Thus the recovery time is 20 MB / 1400 bytes ≈ 14,000 RTTs, which is about 15 minutes (!).
  • On Cubic, it's harder to estimate. The L4S Prague paper claims that at 100 Mbit/s the recovery time is 250 RTTs, and that it doubles for every 8x increase in bandwidth. That's roughly 1,000 RTTs, which is about 1 minute.

In the sequence plot above, we see packet loss happening 10x as frequently as this calculation suggests. This might be due to the shallower buffer, but I don't know precisely how the recovery time scales with the buffer size.
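To make the back-of-the-envelope numbers above easy to reproduce, here is a small sketch of the same arithmetic (65 ms RTT, ~1400 byte MSS, and the ~40 MB BDP from above; the Cubic figure just applies the quoted "doubles per 8x bandwidth" rule of thumb, so treat both as rough estimates):

```go
package main

import "fmt"

func main() {
	const (
		rtt = 0.065              // seconds, path RTT
		mss = 1400.0             // bytes, approximate MSS
		bdp = 40 * 1024 * 1024.0 // ~40 MB BDP at 5 Gbit/s and 65 ms
	)

	// Reno: a loss halves the cwnd, which then grows by one MSS per RTT, so
	// recovering the lost half of the BDP takes (bdp/2)/mss round trips.
	renoRTTs := (bdp / 2) / mss
	fmt.Printf("Reno:  %.0f RTTs ≈ %.1f minutes\n", renoRTTs, renoRTTs*rtt/60)

	// Cubic, rule of thumb quoted above: ~250 RTTs at 100 Mbit/s, doubling for
	// every 8x increase in bandwidth. 100 Mbit/s -> 5 Gbit/s is a 50x increase,
	// i.e. roughly two 8x steps, so roughly 250 * 4 RTTs.
	cubicRTTs := 250.0 * 4
	fmt.Printf("Cubic: %.0f RTTs ≈ %.1f minutes\n", cubicRTTs, cubicRTTs*rtt/60)
}
```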


What does this mean for our perf setup? At the BDP that we chose for our test, we're running into limitations imposed by the congestion controllers:

  • With a recovery time of 15 minutes, Reno has no chance against Cubic whatsoever, yet this is what RFC 9002 recommends for QUIC implementations and what major players have deployed in their QUIC stacks.
  • Even Cubic's recovery time is pretty long. At a sampling frequency of 1 s, we will pick up the saw-tooth pattern inherent to the congestion controller, and we will inevitably see a wide spread of measurement results.

@mxinden (Member, Author) commented Sep 15, 2023:

> > Will try with a fixed MTU of 1500 next.
>
> Does AWS allow larger MTUs on their backbone? That would indeed give TCP an unfair advantage over at least quic-go. Have you verified that using tcpdump / Wireshark?

Turns out, it does not:

[ec2-user@ip-]$ ping -M do -s 1472 -c 4 x.x.x.x
PING  () 1472(1500) bytes of data.
1480 bytes from : icmp_seq=1 ttl=109 time=63.4 ms
1480 bytes from : icmp_seq=2 ttl=109 time=63.4 ms
1480 bytes from : icmp_seq=3 ttl=109 time=63.4 ms
1480 bytes from : icmp_seq=4 ttl=109 time=63.4 ms

---  ping statistics ---
4 packets transmitted, 4 received, 0% packet loss, time 3004ms
rtt min/avg/max/mdev = 63.384/63.408/63.441/0.024 ms
[ec2-user@ip-]$ ping -M do -s 1500 -c 4 x.x.x.x
PING  () 1500(1528) bytes of data.
From  icmp_seq=1 Frag needed and DF set (mtu = 1500)
ping: local error: message too long, mtu=1500
ping: local error: message too long, mtu=1500
ping: local error: message too long, mtu=1500

---  ping statistics ---
4 packets transmitted, 0 received, +4 errors, 100% packet loss, time 3086ms

perf/README.md (review thread resolved)
@mxinden (Member, Author) commented Sep 20, 2023:

> Looking at the graphs, why are some measurements drawn as boxes and some as points?

The box visualizes Q1 to Q3, with Q2 (the median) drawn as a line within the box. The lines extending from the box are the whiskers, representing Q0 (minimum) and Q4 (maximum). The dots represent outliers.

https://en.wikipedia.org/wiki/Box_plot has a good explanation for each of these.

> Why do some have error bars and others don't?

I am not sure what you are referring to with "error bars", @marten-seemann.

> The spread seems pretty high, do we need more iterations?

I decreased each measurement duration to 20 seconds and increased the iterations per implementation and transport to 10. I triggered a new CI run to update our benchmark-results.json.

https://github.com/libp2p/test-plans/actions/runs/6250828757/job/16970551159

@mxinden (Member, Author) commented Sep 20, 2023:

> I wouldn't be surprised if quic-go maxed out somewhere around 2 Gbps. At some point, your transfer becomes CPU-limited, depending on the number of kernel offloads that your QUIC stack uses (and that's not the thing we want to benchmark here). That said, I just updated quic-go/perf to quic-go v0.38.1 (quic-go/perf#16, I'll merge the PR once GHA is not broken anymore...), which uses GSO by default. Might be worth rebasing your branch to see if this changes anything.

👍 Note that I merged current master into quic-go/perf#17 and updated the reference here. Thus with the next update to benchmark-results.json we will see the impact of GSO.

@mxinden (Member, Author) commented Sep 20, 2023:

> In go-libp2p, Yamux uses a 16 MB receive window, which should limit us to roughly 2 Gbps (minus some muxer overhead). It's interesting to see that we're achieving roughly half of that. Could be a coincidence, or point to a bug in our flow control autotuning. I'd be happy to debug this using the current setup (assuming I can still run it manually as I could with the version on master), please let me know.

Indeed surprising. You can still run it manually. Please go ahead. Thank you @marten-seemann.

@github-actions (bot) commented Sep 20, 2023.

@mxinden (Member, Author) commented Sep 21, 2023:

I have updated the forked dashboard to the latest data format:

https://observablehq.com/d/682dcea9fe2505c4?branch=27d07a6f47c2bc1a9c9d9a9f6626b536248284f5

@mxinden (Member, Author) left a comment:


@marten-seemann @sukunrt can either of you give the perf/impl/https and perf/impl/go-libp2p changes a review?

Comment on lines 38 to 32

Before:

    {
      id: "v0.46",
      implementation: "js-libp2p",
      transportStacks: ["tcp"]
    }

After:

    // {
    //   id: "v0.46",
    //   implementation: "js-libp2p",
    //   transportStacks: ["tcp"]
    // }
@mxinden (Member, Author):

Addressed in libp2p/js-libp2p#2067.

perf/runner/src/versions.ts (outdated review thread, resolved)
@marten-seemann (Contributor) left a comment:


I reviewed the go-libp2p implementation.

Outdated review threads (resolved):
  • perf/impl/go-libp2p/v0.29/main.go
  • perf/impl/go-libp2p/v0.29/perf.go (3 threads)
@mxinden marked this pull request as ready for review on October 19, 2023 at 09:38.
@github-actions (bot) commented.

@mxinden (Member, Author) commented Oct 23, 2023:

Unless there are any objections, I plan to merge here once libp2p/rust-libp2p#4382 is merged.

@mxinden (Member, Author) commented Oct 25, 2023:

The last commit removes js-libp2p. Once libp2p/js-libp2p#2067 is merged, we can re-introduce it here.

@mxinden merged commit 0a8dbab into master on Oct 25, 2023, and deleted the perf-exit-slow-start branch on October 25, 2023 at 11:24.
Successfully merging this pull request may close these issues.

perf: throughput test (TCP, QUIC, libp2p, but not iperf) never exits slow start