Skip to content

Releases: ROCm/TransferBench

TransferBench v1.41

30 Nov 07:38
f5e9cf3
Compare
Choose a tag to compare

Additions

  • Adding schmoo preset config benchmarks local/remote reads/writes/copies
    • Usage: ./TransferBench schmoo <numBytes=64M> <localIdx=0> <remoteIdx=1> <maxNumCUs=32>

Fixes

  • Fixing some misreported timings when running with non-fixed number of iterations

TransferBench v1.40

30 Nov 02:38
437b6e7
Compare
Choose a tag to compare

Fixes

  • Fixing XCC defaulting to 0 instead of random for preset configs, ignoring XCC_PREF_TABLE

TransferBench v1.39

29 Nov 22:26
7a1dbd6
Compare
Choose a tag to compare

v1.39

Additions

  • (Experimental) Adding support for Executor sub-index

Fixes

  • Remove deprecated gcnArch code. ROCm version must include support for hipDeviceMallocUncached

TransferBench v1.38

30 Nov 02:18
c519772
Compare
Choose a tag to compare

Fixes

  • Adding missing threadfence which could cause non-fine-grained Transfers to report higher speeds

TransferBench v1.37

24 Nov 13:52
9e3a04c
Compare
Choose a tag to compare

Changes

  • USE_SINGLE_STREAM is enabled by default now. (Disable via USE_SINGLE_STREAM=0)

Fixes

  • Fix unrecognized token error when XCC_PREF_TABLE is unspecified

TransferBench v1.35

22 Nov 23:38
e047656
Compare
Choose a tag to compare

Additions

  • USE_FINE_GRAIN also applies to a2a preset

TransferBench v1.34

07 Nov 23:37
004710f
Compare
Choose a tag to compare

Added

  • Set GPU_KERNEL=3 to default for gfx942

TransferBench v1.33

30 Oct 17:42
1d34a19
Compare
Choose a tag to compare

Adding ALWAYS_VALIDATE env var to allow for validation after every iteration instead of just once at end of all iterations

TransferBench v1.32

19 Oct 22:20
9c2ecae
Compare
Choose a tag to compare

Modified

  • Increased line limit from 2048 to 32768

TransferBench v1.31

17 Oct 19:40
79a3a00
Compare
Choose a tag to compare

Modified

  • SHOW_ITERATIONS now show XCC:CU instead of just CU ID
  • SHOW_ITERATIONS also printed when USE_SINGLE_STREAM=1