Skip to content

Commit

Permalink
[onert] Introduce block quantization data type (Samsung#13770)
Browse files Browse the repository at this point in the history
This commit introduces block quantization data type.
This commit is not including parameter info.

ONE-DCO-1.0-Signed-off-by: Hyeongseok Oh <[email protected]>
  • Loading branch information
hseok-oh authored Aug 29, 2024
1 parent cb5f98d commit 881437c
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions runtime/onert/core/include/ir/DataType.h
Original file line number Diff line number Diff line change
Expand Up @@ -39,6 +39,8 @@ enum class DataType
QUANT_INT16_ASYMM = 10,
QUANT_INT8_SYMM_PER_CHANNEL = 11,
QUANT_INT16_SYMM = 12,
QUANT_GGML_Q4_0 = 13, // 4bit quantization, 32 block, 16bit delta
QUANT_GGML_Q8_0 = 14 // 8bit quantization, 32 block, 16bit delta
};

size_t sizeOfDataType(DataType data_type);
Expand Down

0 comments on commit 881437c

Please sign in to comment.