Scaling Llms with Nvidia Triton and Tensorrt-LLM
The Complete Guide to Production Inference, Kubernetes Deployment, and Multi-Node GPU Optimization
(Author) Jacob Quinlan
Format:
Paperback
£26.21
Price: £26.21
(0% off)
Generally dispatched in 1 to 2 days
Information
Publisher:
Independently Published
Format:
Paperback
Number of pages:
None
Language:
en
ISBN:
9798277387214
Publish year:
2025
Publish date:
Dec. 4, 2025