Scaling Llms with Nvidia Triton and Tensorrt-LLM

The Complete Guide to Production Inference, Kubernetes Deployment, and Multi-Node GPU Optimization

(Author) Jacob Quinlan
Format: Paperback
£26.21 Price: £26.21 (0% off)
Generally dispatched in 1 to 2 days
Information
Publisher:
Independently Published
Format:
Paperback
Number of pages:
None
Language:
en
ISBN:
9798277387214
Publish year:
2025
Publish date:
Dec. 4, 2025

Jacob Quinlan

Reviews

Leave a review

Please login to leave a review.

Be the first to review this product

Other related