[Tensor] Int4QTensor with quantized 4-bit integer data type
This pull request adds the Int4QTensor class, which stores quantized 4-bit integer data compactly.
Two 4-bit integers are packed into each 8-bit byte, so no storage is wasted: the first four bits hold the first 4-bit value and the last four bits hold the second.
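
The packing scheme described above can be sketched as follows. This is a minimal illustration, not the code from this commit: the helper names (pack_int4, unpack_first, unpack_second, sign_extend4) are hypothetical, and it assumes that "the first four bits" means the high nibble and that values are signed two's-complement in the range [-8, 7].

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helpers for illustration only; these are not nntrainer APIs.
// Assumption: the first value goes in the high nibble, the second in the low nibble.
inline uint8_t pack_int4(int8_t first, int8_t second) {
  // A signed 4-bit integer can only represent values in [-8, 7].
  assert(first >= -8 && first <= 7);
  assert(second >= -8 && second <= 7);
  return static_cast<uint8_t>(((first & 0x0F) << 4) | (second & 0x0F));
}

// Sign-extend a 4-bit two's-complement nibble to a full int8_t.
inline int8_t sign_extend4(uint8_t nibble) {
  return static_cast<int8_t>(((nibble & 0x0F) ^ 0x08) - 0x08);
}

// Recover the value stored in the high nibble.
inline int8_t unpack_first(uint8_t packed) {
  return sign_extend4(static_cast<uint8_t>(packed >> 4));
}

// Recover the value stored in the low nibble.
inline int8_t unpack_second(uint8_t packed) {
  return sign_extend4(static_cast<uint8_t>(packed & 0x0F));
}
```

With this layout, a tensor of N 4-bit elements needs (N + 1) / 2 bytes, rounding up when N is odd.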

1. Build test: [X]Passed [ ]Failed [ ]Skipped
2. Run test:   [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Donghyeon Jeong <[email protected]>
djeong20 committed Jan 23, 2025
1 parent 091c496 commit d60db7a
Showing 7 changed files with 1,050 additions and 134 deletions.
1 change: 1 addition & 0 deletions debian/nntrainer-dev.install
@@ -10,6 +10,7 @@
 /usr/include/nntrainer/memory_data.h
 /usr/include/nntrainer/tensor.h
 /usr/include/nntrainer/tensor_base.h
+/usr/include/nntrainer/int4_tensor.h
 /usr/include/nntrainer/char_tensor.h
 /usr/include/nntrainer/short_tensor.h
 /usr/include/nntrainer/uint_tensor.h