-
Notifications
You must be signed in to change notification settings - Fork 81
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Enhanced Quantizer with QINT16 Support #2874
Conversation
ce54643
to
bc3f6af
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
bc3f6af
to
368a72b
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I really like the way you left some briefs about new test cases. Really helpful for reviewing!
This PR enhances the quantizer by leveraging the output tensor along with scale factors for more accurate quantization. Furthermore, it introduces support for the QINT16 data type, expanding our capabilities and improving performance. **Self-evaluation:** 1. Build test: [X]Passed [ ]Failed [ ]Skipped 2. Run test: [X]Passed [ ]Failed [ ]Skipped Signed-off-by: Donghyeon Jeong <[email protected]>
368a72b
to
1dc0047
Compare
This PR enhances the quantizer by leveraging the output tensor along with scale factors for more accurate quantization.
Furthermore, it introduces support for the QINT16 data type, expanding our capabilities and improving performance.
Self-evaluation: