Commit

Update README.md
burtenshaw authored Jan 20, 2025
1 parent ddcdc53 commit 3c9a583
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions 7_inference/README.md
@@ -16,7 +16,7 @@ Learn how to use the Hugging Face Transformers pipeline for basic inference. We'

### 2. [Production Inference with TGI](./tgi_inference.md)

- Learn how to deploy models for production using Text Generation Inference. We'll explore optimized serving techniques, batching strategies, and monitoring solutions. TGI provides production-ready features like health checks, metrics, and Docker deployment options. [Start learning](./tgi_inference.md).
+ Learn how to deploy models for production using Text Generation Inference. We'll explore optimized serving techniques, batching strategies, and monitoring solutions. TGI provides production-ready features like health checks, metrics, and Docker deployment options. [Start learning](./text_generation_inference.md).

### Exercise Notebooks

@@ -33,4 +33,4 @@ Learn how to deploy models for production using Text Generation Inference. We'll
- [TGI GitHub Repository](https://github.com/huggingface/text-generation-inference)
- [Hugging Face Model Deployment Documentation](https://huggingface.co/docs/inference-endpoints/index)
- [vLLM: High-throughput LLM Serving](https://github.com/vllm-project/vllm)
- - [Optimizing Transformer Inference](https://huggingface.co/blog/optimize-transformer-inference)
+ - [Optimizing Transformer Inference](https://huggingface.co/blog/optimize-transformer-inference)
