forked from pytorch/FBGEMM
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Make zero_start_index_M optional for dynamic BF16 Grouped Gemm (pytor…
…ch#3553) Summary: Pull Request resolved: pytorch#3553 X-link: facebookresearch/FBGEMM#639 There is some value in being able to invoke a grouped gemm and directly return a single unified tensor rather than an array of tensors, especially if the goal is to concatenate the groups immediately after returning, as concat adds a copy. This diff makes zero_start_index_M optional for the dynamic version of bf16 grouped gemm. When not provided, we allocate a single coalesced tensor for all outputs. While dynamic is a bit of a misnomer in this case, its the most convenient way to allow returning a tensor rather than a list. Reviewed By: jasonjk-park, jianyuh Differential Revision: D67884826 fbshipit-source-id: d2c0f83f7c55b2dc41000bb1f4dc27fbdf82515d
- Loading branch information
1 parent
e4880a0
commit 8b748c6
Showing
4 changed files
with
100 additions
and
59 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters