Skip to content

GemmaMLP uses 'tanh` approximation for GeLU activation (#1004) #1

GemmaMLP uses 'tanh` approximation for GeLU activation (#1004)

GemmaMLP uses 'tanh` approximation for GeLU activation (#1004) #1

Annotations

1 warning

cpu-tests (ubuntu-20.04, 3.8)

succeeded Mar 9, 2024 in 7m 55s