-
Notifications
You must be signed in to change notification settings - Fork 0
Sync meeting on EESSI test suite (2023 11 08)
Kenneth Hoste edited this page Nov 8, 2023
·
1 revision
- every 2 weeks on Thursday at 14:00 CE(S)T
- next meetings:
- Wed 22 Nov'23 10:00 CET: OK for all
- Wed 6 Dec'23 15:30 CET: OK for all
- Wed 20 Dec'23 14:00 CET: OK for those who want to be there
- Wed 3 Jan'24 15:30 CET: unclear for some, to be confirmed
- Thu 18 Jan'24 14:00 CET
- OSU test (PR #54)
- Sam reviewed it, Satish still needs to take review comments into account
- Biggest blocker is still the memory-issue. We agreed last time that we'll go for the option of using job options as described here. We will ask for
--mem
, i.e. total memory, because that is supported on any job scheduler
- Updated CI driving scripts (PR #93)
- Todo: Caspar updates
REFRAME_VERSION
in all theci-config.sh
files to 4.3.3
- Todo: Caspar updates
- CPU autodetect failing due to failing "
pip install reframe-hpc==4.3.3
" (ReFrame issue #3023)- will be fixed in upcoming ReFrame release (it's currently on the 4.5 milestone)
- add scales
1_cpn_2_nodes
and1_cpn_4_nodes
(PR #94)- someone should test this and make sure it works => Satish
- job script that is generated by ReFrame can be checked via dry run
- how can we collect/provide/dynamically determine performance reference numbers so tests can also be used for performance regression?
- step-by-step
- come up with a structure for storing/retrieving reference performance numbers (+ upper/lower bound thresholds) for a particular system
- incl. relevant metadata of the system (CPU, storage, network, ...)
- just use ReFrame perf logging for this, configured to store the perf log data like we want it to
- provide an automated way to harvest initial reference perf numbers from recent runs of test suite
- Create a function that produces a perf_ref + upper + lower over all entries for this unique combination of test hash, system, and
$EESSI_TESTSUITE_PERF_DATA_LABEL
(based on some statistics, average, SD, etc)
- Create a function that produces a perf_ref + upper + lower over all entries for this unique combination of test hash, system, and
- check if this could become a feature in ReFrame itself
- nice to have: automatically collect initial perf refs if none are available
- based on similarity of current system with systems for which data is available
- that's likely quite difficult to do...
- come up with a structure for storing/retrieving reference performance numbers (+ upper/lower bound thresholds) for a particular system
- step-by-step
export EESSI_TESTSUITE_PERF_DATA_LABEL='eessi-2023.06-nov2023'
eessi/testsuite/tests/apps/tensorflow
eessi/testsuite/perf_data/apps/tensorflow/
README.txt
hashes.txt => mapping of hashes to test parameters
hortense/
rome/
eessi-2023.06-nov2023.csv
test_hash,perf_var,perf_value,perf_lower_tresh,perf_upper_thres
/deadb33f,ns_day,100,95,105
milan/
eessi-2023.06-nov2023.csv
vega/
hydra/
snellius/
- Should we try and set up a meeting with the ReFrame developers on this perf data logging/harvesting idea?
- Kenneth can contact Vasileios on this via ReFrame Slack
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-10-19)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-10-04)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-09-20)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-09-06)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-08-25)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-08-09)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-software-testing-(27%E2%80%9007%E2%80%902023)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-06-28)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-06-15)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-05-31)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-05-17)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-04-20)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-03-30)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-03-10) (incl. 2023-02-23)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-02-09)