Skip to content

Move the sharding axes from dimensions that need replication to batch dimensions, such that we replace an all-gather with an all-to-all.#21735

Merged
copybara-service[bot] merged 1 commit intomainfrom test_718546009Jan 23, 2025