[MODULE] A module on quantization #169

michaelshekasta · 2025-01-12T15:03:12Z

I’d like to propose a new module aimed at optimizing language models for efficient CPU-based inference, reducing reliance on GPUs. The module covers three key areas: quantization techniques, the GGUF model format, and utilizing Intel and MLX accelerators for optimized inference.

What are you thinking?

burtenshaw · 2025-01-15T13:08:16Z

Hi @michaelshekasta . Sorry to go quiet on this. I've been wrapped up on an agents course for HF learn this week. I will review it tomorrow.

michaelshekasta · 2025-01-16T07:51:18Z

@burtenshaw a gentle reminder

burtenshaw · 2025-01-16T10:22:06Z

@michaelshekasta This is a great start. I've implemented a more typical structure. I would suggest that you now follow on with next stage:

find references for each section of the module.
add them to the references section of the markdown pages.
add bullet point note to each section of the page with key topics.
highlight sections that you don't understand or need help on

Once you're ready, I'll review and complete the module's prose.

typo

michaelshekasta added 3 commits January 12, 2025 16:41

Create 8 - quantization

2f9cacc

Delete 8 - quantization

43c6272

Quantization draft

dfd7840

michaelshekasta marked this pull request as draft January 12, 2025 15:03

burtenshaw changed the title ~~Draft!! Quantization~~ [MODULE] A module on quantization Jan 16, 2025

burtenshaw added 4 commits January 16, 2025 10:42

make directory naming consistent

7a84cc1

update structure in readme

00c027f

update readme with structure

59f3045

update sub pages with structure

12fc3fc

michaelshekasta added 6 commits January 16, 2025 13:07

Merge branch 'huggingface:main' into main

b2a75f2

Update fundamentals.md

7951134

Update fundamentals.md

d9723ea

Update fundamentals.md

b6793fa

typo

Update fundamentals.md

1b94c08

Update fundamentals.md

021bbf0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MODULE] A module on quantization #169

[MODULE] A module on quantization #169

michaelshekasta commented Jan 12, 2025 •

edited

Loading

burtenshaw commented Jan 15, 2025

michaelshekasta commented Jan 16, 2025

burtenshaw commented Jan 16, 2025

[MODULE] A module on quantization #169

Are you sure you want to change the base?

[MODULE] A module on quantization #169

Conversation

michaelshekasta commented Jan 12, 2025 • edited Loading

burtenshaw commented Jan 15, 2025

michaelshekasta commented Jan 16, 2025

burtenshaw commented Jan 16, 2025

michaelshekasta commented Jan 12, 2025 •

edited

Loading