The User Guide describes how to use Triton as an inference solution, including information on how to configure Triton, how to organize and configure your models, how to use the C++ and Python clients, etc.
- QuickStart
- Model Repository
- Model Configuration
- Model Pipeline
- Model Management
- Metrics
- Framework Custom Operations
- Client Libraries and Examples
- C++ HTTP/GRPC Libraries
- Python HTTP/GRPC Libraries
- Java HTTP Library
- GRPC Generated Libraries
- Performance Analysis
- Jetson and JetPack
The Developer Guide describes how to build and test Triton and also how Triton can be extended with new functionality.