Update chapters/en/Unit 3 - Vision Transformers/KnowledgeDistillation.mdx

Co-authored-by: Merve Noyan <[email protected]>
asusevski and merveenoyan authored Feb 10, 2024
1 parent 801776e commit 73316e1
Showing 1 changed file with 1 addition and 1 deletion.
@@ -6,7 +6,7 @@ Presumably, we've all had teachers who "teach" by simply providing us the correc…
of machine learning models where we provide a labelled dataset to train on. Instead of having a model train on labels, however,
we can pursue [Knowledge Distillation](https://arxiv.org/abs/1503.02531) as an alternative to arrive at a much smaller model that can perform comparably to the larger model, and much faster to boot.

- ## For some intuition,
+ ## Intuition Behind Knowledge Distillation

Imagine you were given this multiple choice question:

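For reference, a minimal sketch of the distillation objective from the Hinton et al. paper linked in the diff above: the student is trained against the teacher's temperature-softened output distribution as well as the hard labels. The function name and the `temperature`/`alpha` hyperparameters here are illustrative assumptions, not values taken from the chapter.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, temperature=2.0, alpha=0.5):
    """Illustrative sketch: blend soft-target (teacher) and hard-label losses."""
    # Soften both distributions with the temperature, then measure how far
    # the student's distribution is from the teacher's (KL divergence).
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)
    # Scale by T^2 so the soft-target gradients stay comparable in magnitude
    # to the hard-label term, as suggested in the original paper.
    soft_loss = F.kl_div(log_student, soft_targets, reduction="batchmean") * temperature ** 2
    # Standard cross-entropy against the ground-truth labels.
    hard_loss = F.cross_entropy(student_logits, labels)
    return alpha * soft_loss + (1.0 - alpha) * hard_loss
```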
