Finding the maximum performance #646

Merged
hatoo merged 26 commits into master from local-future on Jan 9, 2025
Conversation

hatoo (Owner) commented Jan 4, 2025

#617 (comment)

This applies only to runs with a fixed number of requests (-n).

❯ TOKIO_WORKER_THREADS=16 oha -n 4000000 -c 1000 --no-tui http://localhost:3000
Summary:
  Success rate: 100.00%
  Total:        6.2924 secs
  Slowest:      0.0923 secs
  Fastest:      0.0000 secs
  Average:      0.0016 secs
  Requests/sec: 635684.8490

  Total data:   49.59 MiB
  Size/request: 13 B
  Size/sec:     7.88 MiB
❯ TOKIO_WORKER_THREADS=16 cargo run --release -- -n 4000000 -c 1000 --no-tui http://localhost:3000
Summary:
  Success rate: 100.00%
  Total:        5.7571 secs
  Slowest:      0.1873 secs
  Fastest:      0.0000 secs
  Average:      0.0014 secs
  Requests/sec: 694790.7464

  Total data:   49.59 MiB
  Size/request: 13 B
  Size/sec:     8.61 MiB
❯ wrk -t 16 -c 1000  http://localhost:3000
Running 10s test @ http://localhost:3000
  16 threads and 1000 connections
  Thread Stats   Avg      Stdev     Max   +/- Stdev
    Latency     1.23ms    1.19ms  21.26ms   90.28%
    Req/Sec    56.31k     9.45k  160.94k    78.17%
  9026938 requests in 10.05s, 1.09GB read
Requests/sec: 898204.31
Transfer/sec:    111.36MB

Note: setting TOKIO_WORKER_THREADS to the number of actual physical CPUs (no hyper-threading) helps performance. That is a separate issue.
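
For reference, a minimal sketch (not oha's code) of pinning the Tokio worker-thread count to the physical core count programmatically rather than via TOKIO_WORKER_THREADS; the num_cpus crate is an assumption here:

// Sketch only: fix the worker-thread count to physical cores (no hyper-threads).
// Assumes the `num_cpus` crate; oha itself honors the TOKIO_WORKER_THREADS env var.
use tokio::runtime::Builder;

fn main() {
    let physical = num_cpus::get_physical();

    let rt = Builder::new_multi_thread()
        .worker_threads(physical)
        .enable_all()
        .build()
        .expect("failed to build Tokio runtime");

    rt.block_on(async {
        // ... drive the load-test futures here ...
    });
}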

@hatoo hatoo force-pushed the local-future branch 2 times, most recently from eb082f7 to 2604c2e on January 5, 2025 09:36
@hatoo hatoo changed the title from "Utilize LocalRuntime" to "Optimizing by using thread local futures" on Jan 5, 2025
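
To illustrate the thread-local-futures approach named in the title, here is a minimal sketch using stable Tokio APIs (one current-thread runtime plus a LocalSet per OS thread); it only approximates the idea and is not the code in this PR. The num_cpus crate is again an assumption:

// Sketch: one single-threaded runtime per OS thread, so per-connection state
// can be !Send (Rc/Cell) and never crosses threads. Not oha's actual code.
use std::cell::Cell;
use std::rc::Rc;
use tokio::runtime::Builder;
use tokio::task::LocalSet;

fn main() {
    let handles: Vec<_> = (0..num_cpus::get_physical())
        .map(|_| {
            std::thread::spawn(|| {
                let rt = Builder::new_current_thread()
                    .enable_all()
                    .build()
                    .unwrap();
                let local = LocalSet::new();
                rt.block_on(local.run_until(async {
                    // !Send counter shared only within this thread.
                    let done = Rc::new(Cell::new(0u64));
                    let mut tasks = Vec::new();
                    for _ in 0..4 {
                        let done = Rc::clone(&done);
                        tasks.push(tokio::task::spawn_local(async move {
                            // ... issue requests on this thread's connections ...
                            done.set(done.get() + 1);
                        }));
                    }
                    for t in tasks {
                        t.await.unwrap();
                    }
                    done.get()
                }))
            })
        })
        .collect();

    let total: u64 = handles.into_iter().map(|h| h.join().unwrap()).sum();
    println!("completed tasks: {total}");
}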
hatoo (Owner, Author) commented Jan 6, 2025

Not sending to the channel on every request greatly improves performance (a minimal sketch of this batching idea follows the benchmark output below).

Note: --profile pgo doesn't mean PGO is actually used; it's just the Cargo profile intended for PGO builds (it enables LTO, etc.).

❯ cargo run --profile pgo -- -n 6000000 -c 1000 --no-tui http://localhost:3000
Summary:
  Success rate: 100.00%
  Total:        6.9893 secs
  Slowest:      0.2570 secs
  Fastest:      0.0000 secs
  Average:      0.0011 secs
  Requests/sec: 858459.4633

  Total data:   74.39 MiB
  Size/request: 13 B
  Size/sec:     10.64 MiB
❯ wrk -t16 -c 1000 http://localhost:3000
Running 10s test @ http://localhost:3000
  16 threads and 1000 connections
  Thread Stats   Avg      Stdev     Max   +/- Stdev
    Latency     1.19ms    1.15ms  19.68ms   90.30%
    Req/Sec    58.90k     8.17k  177.76k    78.10%
  9435690 requests in 10.06s, 1.14GB read
Requests/sec: 938274.13
Transfer/sec:    116.33MB
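
As referenced above, a minimal sketch of the batching idea, assuming a worker that buffers results locally and flushes them in chunks; the names RequestResult, BATCH, and worker are hypothetical and not oha's API:

// Sketch of batching result sends: one channel send per BATCH completed
// requests instead of one per request. Names are hypothetical, not oha's API.
use std::time::Duration;
use tokio::sync::mpsc;

struct RequestResult {
    status: u16,
    duration: Duration,
}

const BATCH: usize = 256;

async fn worker(tx: mpsc::UnboundedSender<Vec<RequestResult>>, n: usize) {
    let mut buf = Vec::with_capacity(BATCH);
    for _ in 0..n {
        // ... perform one request; a fake result stands in here ...
        buf.push(RequestResult { status: 200, duration: Duration::from_micros(100) });
        if buf.len() == BATCH {
            // Flush a whole batch with a single send.
            tx.send(std::mem::replace(&mut buf, Vec::with_capacity(BATCH))).unwrap();
        }
    }
    if !buf.is_empty() {
        tx.send(buf).unwrap();
    }
}

#[tokio::main]
async fn main() {
    let (tx, mut rx) = mpsc::unbounded_channel();
    let job = tokio::spawn(worker(tx, 1_000));
    let mut total = 0usize;
    // The receiver sees far fewer messages than requests.
    while let Some(batch) = rx.recv().await {
        total += batch.len();
    }
    job.await.unwrap();
    assert_eq!(total, 1_000);
}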

@hatoo hatoo changed the title from "Optimizing by using thread local futures" to "Finding the maximum performance" on Jan 6, 2025
@hatoo hatoo marked this pull request as ready for review January 9, 2025 07:39
@hatoo hatoo merged commit 55849fe into master Jan 9, 2025
11 checks passed
@hatoo hatoo mentioned this pull request Jan 11, 2025
@hatoo hatoo deleted the local-future branch January 16, 2025 10:57