Skip to content

TransferBench v1.45

Compare
Choose a tag to compare
@gilbertlee-amd gilbertlee-amd released this 05 Dec 06:41
f33c7fd

Additions

  • Adding A2A_MODE to a2a preset (0 = copy, 1 = read-only, 2 = write-only)
  • Adding GFX_UNROLL to modify GFX kernel's unroll factor
  • Adding GFX_WAVE_ORDER to modify order in which wavefronts process data

Modifications

  • Rewrote the GFX reduction kernel to support new wave ordering