We used Gaudi2 systems on the Intel Developer Cloud.

- Pull and start the Habana PyTorch container (set `OS` to your base OS, e.g. `ubuntu22.04`):

```shell
docker pull vault.habana.ai/gaudi-docker/1.17.1/${OS}/habanalabs/pytorch-installer-2.3.1:latest
docker run -it -v /home/sdp:/home/sdp --runtime=habana -e HABANA_VISIBLE_DEVICES=all -e OMPI_MCA_btl_vader_single_copy_mechanism=none --cap-add=sys_nice --net=host --ipc=host vault.habana.ai/gaudi-docker/1.17.1/${OS}/habanalabs/pytorch-installer-2.3.1:latest
```
- Clone the optimum-habana repo and change to the text-generation example directory:

```shell
git clone https://github.com/huggingface/optimum-habana.git
cd optimum-habana/examples/text-generation
```
- Replace the `run_generation.py` file with the one in this directory.
- Copy `run_generation_power.py` and `habana_power.py` from this directory to the text-generation directory.
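The copy step above can be scripted. This is a minimal sketch; the `copy_benchmarks` helper name is ours, and the paths in the example invocation are assumptions to adjust to your layout:

```shell
# Hypothetical helper: copy the patched benchmark files into the
# optimum-habana text-generation example directory.
# $1 = directory holding the patched files
# $2 = path to optimum-habana/examples/text-generation
copy_benchmarks() {
  cp "$1"/run_generation.py "$1"/run_generation_power.py "$1"/habana_power.py "$2"/
}

# Example (assumed layout):
# copy_benchmarks . optimum-habana/examples/text-generation
```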
- Use the provided shell script `run-throughput-bench.sh` in this directory to run `run_generation.py` for various combinations of input length, output length, and batch size:

```shell
source run-throughput-bench.sh
```
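To preview which configurations such a sweep covers without launching any jobs, a dry-run loop can be sketched as below. The flag names (`--batch_size`, `--max_input_tokens`, `--max_new_tokens`) follow the optimum-habana text-generation example but are assumptions here; check them against the actual script:

```shell
# Hypothetical dry-run sweep: print the run_generation.py command for each
# (batch size, input length, output length) combination instead of running it.
sweep() {
  for bs in 1 8 32; do
    for in_len in 128 1024; do
      for out_len in 128 1024; do
        echo "python run_generation.py --batch_size ${bs}" \
             "--max_input_tokens ${in_len} --max_new_tokens ${out_len}"
      done
    done
  done
}
sweep
```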
- Use the provided shell script `run-power-bench.sh` in this directory to run `run_generation_power.py` for various combinations of input length, output length, and batch size:

```shell
source run-power-bench.sh
```
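Post-processing the sampled power into an average and a total energy figure can be done with a short `awk` one-liner. This sketch assumes a log of `timestamp_s power_w` pairs per line; that format is our assumption, not necessarily what `habana_power.py` emits:

```shell
# Hypothetical post-processing: average power (W) and total energy (J)
# from a "timestamp_s power_w" sample log (log format is an assumption).
power_stats() {
  awk 'NR==1 { t0 = $1 }
       { sum += $2; n++; t = $1 }
       END { printf "avg_w=%.1f energy_j=%.1f\n", sum/n, (t - t0) * sum/n }' "$1"
}
```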