Skip to content

Commit

Permalink
fix mfu
Browse files Browse the repository at this point in the history
  • Loading branch information
samsja committed Sep 22, 2024
1 parent 293dfbd commit fc603ef
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/zeroband/train.py
Original file line number Diff line number Diff line change
Expand Up @@ -207,7 +207,7 @@ def train(config: Config):
time_taken = time.time() - beginning_step_time
tokens_per_second = config.data.seq_length * config.optim.batch_size / time_taken

mfu = 100 * num_flop_per_token * tokens_per_second / gpu_peak_flops
mfu = 100 * num_flop_per_token * tokens_per_second / gpu_peak_flops / world_info.local_world_size

metrics = {
"Loss": loss_batch.item(),
Expand Down

0 comments on commit fc603ef

Please sign in to comment.