From 1c0fabbc67fd806d999ddb9186a1100fc950ce17 Mon Sep 17 00:00:00 2001
From: "chen, suyue"
Date: Thu, 2 Jan 2025 11:03:56 +0800
Subject: [PATCH] update publication_list.md (#2105)

Signed-off-by: chensuyue
---
 docs/source/publication_list.md | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/docs/source/publication_list.md b/docs/source/publication_list.md
index 7cafe1b1521..fec7a4cf004 100644
--- a/docs/source/publication_list.md
+++ b/docs/source/publication_list.md
@@ -1,6 +1,7 @@
-Full Publications/Events (85)
+Full Publications/Events (86)
 ==========
-## 2024 (6)
+## 2024 (7)
+* Blog by Microsoft: [Phi-4 quantization and inference speedup](https://techcommunity.microsoft.com/blog/machinelearningblog/phi-4-quantization-and-inference-speedup/4360047) (Dec 2024)
 * EMNLP'2024: [Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs](https://arxiv.org/abs/2309.05516) (Sep 2024)
 * Blog on Medium: [Quantization on Intel Gaudi Series AI Accelerators](https://medium.com/intel-analytics-software/intel-neural-compressor-v3-0-a-quantization-tool-across-intel-hardware-9856adee6f11) (Aug 2024)
 * Blog on Medium: [Accelerating Qwen2 Models with Intel Extension for Transformers](https://medium.com/intel-analytics-software/accelerating-qwen2-models-with-intel-extension-for-transformers-99403de82f68) (June 2024)