You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Unfortunately this is wrong both rocblas configurations have compute_type: "f32_r" and the benchmark problem now proudly proclames ComputeDataType: h, ofc this means that whatever we benchmark next wont use MFMA and will never be chosen, Ignoreing that we are now benchmarking the wrong thing, we continue with runTensileTuning-all.sh which succeeds.
Issue 3
We next run: ./scripts/provision_verification.sh tune tune/tensile/Tensile arcturus
Unfortionatly this script is broken as
This ofc requires Tensile to be installed and happen be a comptatble version with the version that provision_tuning.sh installed, which ofc it is not so this fails.
We work around this using this patch:
diff --git a/tuning/scripts/provision_verification.sh b/tuning/scripts/provision_verification.sh
index a0fe5a05..82fe55d6 100755
--- a/tuning/scripts/provision_verification.sh
+++ b/tuning/scripts/provision_verification.sh
@@ -193,6 +193,7 @@ if [ "${LIBRARY}" == arcturus ]; then
fi
fi
+export PYTHONPATH=${TENSILE_PATH}
MERGE_SCRIPT=${TENSILE_PATH}/Tensile/Utilities/merge.py
MASSAGE_SCRIPT=${REFERENCE_LIBRARY_ARCHIVE}/massage.py
Issue 5
Unfortionatly merge.py remains broken, tune/logs/merge.log is empty and no merged logic files are created.
At this point i gave up. Tensile/tuning is broken to a degree that would be funny if it weren't sad.
Operating System
Ubuntu 24.04
CPU
Epyc 7552
GPU
MI100
ROCm Version
ROCm 6.3.0
ROCm Component
rocBLAS, Tensile
Steps to Reproduce
No response
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
No response
Additional Information
No response
The text was updated successfully, but these errors were encountered:
Problem Description
There appear to be multiple issues in the training scripts:
First we run a test using ROCBLAS_LEVEL=4 on a simple test application from this we gain a list of executed rocblas calls to tune on:
Issue 1
We then try to provision a tuning session like so:
./scripts/provision_tuning.sh tune rocblas.log test.yaml arcturus -m
The above dose not work as the configurations contain
flags: none
and this is converted to '--flags none' byConvertToRocBlasBenchCall
at:Tensile/tuning/automation/ExtractSizes.py
Line 789 in 3c91b64
and then passed to
rocblasParser
which dose not accept 'none' as it defines --flags as to be an argument of int type:Tensile/tuning/automation/ExtractSizes.py
Line 70 in 3c91b64
Working around this issue using this patch:
Issue 2
We imminently encounter the next issue:
From the above rocblas configurations Tensile has generated the following benchmark configuration:
Unfortunately this is wrong both rocblas configurations have
compute_type: "f32_r"
and the benchmark problem now proudly proclamesComputeDataType: h
, ofc this means that whatever we benchmark next wont use MFMA and will never be chosen, Ignoreing that we are now benchmarking the wrong thing, we continue withrunTensileTuning-all.sh
which succeeds.Issue 3
We next run:
./scripts/provision_verification.sh tune tune/tensile/Tensile arcturus
Unfortionatly this script is broken as
Tensile/tuning/scripts/provision_verification.sh
Line 175 in 3c91b64
Tries to copy the content of the path defined here:
Tensile/tuning/scripts/provision_verification.sh
Line 166 in 3c91b64
using cp. Unfortionatly ${ROCBLAS_PATH}/library/src/blas3/Tensile/Logic/asm_full contains only directories, which ofc cp will not copy without -r so this operation fails.
Issue 4
Ignoring issue 3 we encounter the next problem:
provision_verification.sh
executes${TENSILE_PATH}/Tensile/Utilities/merge.py
Which contains:
Tensile/Tensile/Utilities/merge.py
Line 25 in 3c91b64
This ofc requires Tensile to be installed and happen be a comptatble version with the version that provision_tuning.sh installed, which ofc it is not so this fails.
We work around this using this patch:
Issue 5
Unfortionatly merge.py remains broken,
tune/logs/merge.log
is empty and no merged logic files are created.At this point i gave up. Tensile/tuning is broken to a degree that would be funny if it weren't sad.
Operating System
Ubuntu 24.04
CPU
Epyc 7552
GPU
MI100
ROCm Version
ROCm 6.3.0
ROCm Component
rocBLAS, Tensile
Steps to Reproduce
No response
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
No response
Additional Information
No response
The text was updated successfully, but these errors were encountered: