Support microscaling (mx) type #14293

Closed
jinevening opened this issue Nov 4, 2024 · 2 comments

What

Let's introduce the mx (microscaling) type to circle.

Since we don't have any backend that uses the mx type yet, let's focus on adding the new types to the circle schema rather than implementing mx type kernels.
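
For context, here is a minimal C++ sketch of how an MX-encoded block is commonly described in the OCP Microscaling (MX) spec: a block of 32 elements sharing one 8-bit power-of-two (E8M0) scale. The struct, names, and the simplified decode below are illustrative assumptions only, not part of the circle schema or any kernel.

```cpp
// Illustration only (not the circle schema): conceptual layout of one
// MX-quantized block. Per the OCP Microscaling (MX) spec, a block of k
// elements (k = 32 by default) shares a single 8-bit E8M0 scale.
#include <array>
#include <cmath>
#include <cstdint>

constexpr int kBlockSize = 32; // default block size in the MX spec

struct MXINT8Block
{
  std::uint8_t shared_exponent;                 // E8M0 scale shared by all elements
  std::array<std::int8_t, kBlockSize> elements; // per-element 8-bit values

  // Simplified decode: element * 2^(shared_exponent - bias). The element
  // type's own fixed-point scaling (as defined in the MX spec) is omitted
  // here for brevity.
  float decode(int i) const
  {
    constexpr int kExponentBias = 127; // E8M0 bias
    return std::ldexp(static_cast<float>(elements[i]),
                      static_cast<int>(shared_exponent) - kExponentBias);
  }
};
```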

This issue will be closed after the following goals are achieved.

  1. An MX type circle can be created from a recipe
  2. An MX type circle can be imported as a luci graph
  3. A luci graph with the MX type can be exported as a circle
  4. An MX type circle can be visualized by circledump

Deadline: 11/30


jinevening commented Nov 4, 2024

#14294 addresses the work needed to achieve the goals in the main comment, but there are some remaining issues.

  1. MX type axis
  • Even if tensors have the same MX type, their axes can differ (the axis is determined by the consumer operator). For example, the axes of a BMM Op's inputs are their reduction dimensions: for A @ B, A's axis would be -1 and B's axis would be -2.
  • We may need a way to describe the axis, e.g., MXQuantizationParameter (see the first sketch after this list).
  2. luci-interpreter
  • If we'd like to do value checks in luci-interpreter, we may need to extend the luci-interpreter::Tensor class (and the memory manager in charge of alloc/dealloc of Tensor, see the second sketch after this list), because it has no data structure to store the shared exponent.
  3. Circle type inference
  • Circle's type inference rule for BMM enforces input dtype == output dtype. This constraint may become a problem when we need to support mixed-precision operators.
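
As a rough illustration of the axis idea in item 1, here is a hedged C++ sketch. The fields of MXQuantizationParameter and the BMM helpers are assumptions for illustration, not an existing circle/luci API.

```cpp
// Hypothetical sketch of an MXQuantizationParameter that records the block
// axis; struct layout and helper functions are assumptions, not real APIs.
#include <cstdint>

struct MXQuantizationParameter
{
  std::int32_t axis; // dimension along which elements share a scale
                     // (negative values count from the last dimension)
};

// For A @ B (BatchMatMul), blocks are formed along the reduction dimension:
// A is scaled along its last dimension, B along its second-to-last dimension.
inline MXQuantizationParameter mx_param_for_bmm_lhs() { return {-1}; }
inline MXQuantizationParameter mx_param_for_bmm_rhs() { return {-2}; }
```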
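And a hedged sketch of the kind of storage extension item 2 hints at: a second buffer holding one shared exponent per block. The class name and members are hypothetical; the real luci_interpreter::Tensor and its memory manager would need equivalent alloc/dealloc hooks.

```cpp
// Hypothetical storage an MX-aware interpreter tensor would need: element
// data plus one E8M0 scale per block. Names and layout are illustrative only.
#include <cstddef>
#include <cstdint>
#include <vector>

class MXTensorStorage
{
public:
  MXTensorStorage(std::size_t num_elements, std::size_t block_size)
    : _data(num_elements),
      _shared_exponents((num_elements + block_size - 1) / block_size)
  {
  }

  std::int8_t *data() { return _data.data(); }
  std::uint8_t *shared_exponents() { return _shared_exponents.data(); }

private:
  std::vector<std::int8_t> _data;              // element values (e.g., MXINT8)
  std::vector<std::uint8_t> _shared_exponents; // one E8M0 scale per block
};
```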

@jinevening

The goals described in the main comment have all been achieved. Let's close this issue.

The remaining issues listed in #14293 (comment) (and there may be more) will be addressed when necessary.
