-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Epilogue with RELU #86
Epilogue with RELU #86
Conversation
3c08963
to
31434ee
Compare
fusion::LinearCombination<ElementOutput_, ElementCompute_, ElementSource_, ElementScalar_, RoundStyle_>, | ||
CtaTileShapeMNK_, | ||
EpilogueTile_ | ||
> : Sm90LinearCombination<typename cutlass::detail::get_unpacked_element_type<ElementOutput_>::type, ElementCompute_, ElementSource_, ElementScalar_, RoundStyle_> { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why is this Sm90?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That's the name of the fusion operator for linear combination in Cutlass 3. Unfortunately, that's the naming convention used for almost all the visitor fusions in Cutlass 3, even though most of them are generic and can be used on other GPUs.
If, for whatever reason, we need a linear combination that is specific for Intel PVC, we can just replace Sm90LinearCombination
with the specific implementation (same apply for all the visitor fusions).
Could be useful to post the compare link for these incremental PRs. In this case: muhammad-tanvir-1211/cutlass-fork@intel-epilogue...aacostadiaz:cutlass-fork:intel-relu |
1a47f2d
to
c361ed1
Compare
039211b
to
8f3bd67
Compare
This PR adds the EVT for relu.