TensorRT-LLM Triton Backend Support #33
Original issue (from @shixianc):

When can NAV support creating Triton Repo for this new backend? Is it on your roadmap?
https://github.com/triton-inference-server/tensorrtllm_backend

Comments
@shixianc thanks for the feature request. We are going to review the backend options and add the support in the next release. If there are any specific requirements you see, let us know. Thanks!
Hi team! Was this ever added? I'm looking through the release notes but cannot find support for TRT-LLM.
Hi @ishandhanani. Apologies, not yet. Let us prioritize this feature and provide an ETA.
@ishandhanani a few questions to clarify the expected behavior. Do you see this feature as generating the model store for the tensorrtllm backend only (example), or would you expect the whole deployment with pre/post processing and BLS to be created (similar to this example)?
I think a good first step would be to have it generate the model repo for the trtllm backend only. In the future it would be great if we could generate the entire pre/post processing model repo, @jkosek.
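For context, here is a minimal sketch of the kind of single-model repository the tensorrtllm backend expects, written out by hand rather than generated by Model Navigator. The engine path is hypothetical, the config is abbreviated (real deployments also declare input/output tensors and batching options), and the parameter names follow the tensorrtllm_backend example repo, so they should be checked against the backend docs for the release in use.

```python
# Hypothetical sketch: lay out a single-model Triton repository for the
# tensorrtllm backend, pointing at a prebuilt TensorRT-LLM engine directory.
# Parameter names follow the tensorrtllm_backend example and are abbreviated;
# verify them against the backend documentation before deploying.
from pathlib import Path

ENGINE_DIR = "/workspace/engines/llama-7b"        # hypothetical engine location
REPO_DIR = Path("model_repository/tensorrt_llm")  # target model store entry

config_pbtxt = f"""
name: "tensorrt_llm"
backend: "tensorrtllm"
max_batch_size: 8
parameters: {{
  key: "gpt_model_type"
  value: {{ string_value: "inflight_fused_batching" }}
}}
parameters: {{
  key: "gpt_model_path"
  value: {{ string_value: "{ENGINE_DIR}" }}
}}
"""

(REPO_DIR / "1").mkdir(parents=True, exist_ok=True)   # version directory
(REPO_DIR / "config.pbtxt").write_text(config_pbtxt.strip() + "\n")
```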
@ishandhanani you may want to review the newly added TensorRTLLMModelConfig class that specifies the TensorRT-LLM backend configuration: https://triton-inference-server.github.io/model_navigator/0.11.0/inference_deployment/triton/api/specialized_configs/#model_navigator.triton.TensorRTLLMModelConfig
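As a rough sketch of how repository generation might look with that API: the TensorRTLLMModelConfig class comes from the linked docs, but the add_model helper, its keyword names, and the config's constructor arguments are assumptions here and should be verified against the 0.11.0 API reference.

```python
# Hedged sketch -- the add_model call and the keyword names below are
# assumptions based on the model_navigator.triton docs; check the linked
# API reference before relying on this.
from model_navigator.triton import TensorRTLLMModelConfig, model_repository

model_repository.add_model(
    model_repository_path="model_repository",   # target Triton model store
    model_name="tensorrt_llm",
    model_version=1,
    model_path="/workspace/engines/llama-7b",    # hypothetical prebuilt engine dir
    config=TensorRTLLMModelConfig(),             # TRT-LLM-specific options go here
)
```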