-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to run performance analyzer on my model - Request for unknown model: '/models::ensemble' is not found #68
Comments
@matthewkotila or @ganeshku1 any ideas? @vijetha35 when you launched the server, did the sagemaker wrapper change the endpoints? |
Which endpoint are you talking about?
…On Fri, Aug 30, 2024, 4:07 PM Elias Bermudez ***@***.***> wrote:
@matthewkotila <https://github.com/matthewkotila> or @ganeshku1
<https://github.com/ganeshku1> any ideas?
It looks like the model endpoint is different than what we would expect.
@vijetha35 <https://github.com/vijetha35> when you launched the server,
did the sagemaker wrapper change the endpoints?
—
Reply to this email directly, view it on GitHub
<#68 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABBFIUFB37ZEFZLOKDDYW73ZUD3JJAVCNFSM6AAAAABNNIJX4OVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGMRSGU2TKOBTGY>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Any of the endpoints that triton uses by default. |
@vijetha35 few questions
|
|
All the endpoints are up:
|
@vijetha35 I see that you are using a bit outdated triton version. Does the error persist when you use one of the latest ones? (I'm assuming you are using 23.08 for both server and sdk container) Also, to help us verify that this is indeed a bug in the codebase, could you provide a small reproducer ensemble model? |
Correct, I am using the same version for both server and sdk container. |
Unable to run performance analyzer on my model
I am using a sagemaker wrapper image of triton server and am able to serve the model with requests and even validate that it is up, all ports for grpc, http and metrics are up
But when I am run the performance analyzer, it is throwing me an error
perf_analyzer -m ensemble --input-data data.json --measurement-interval=60000 -v -v
Request for unknown model: '/models::ensemble' is not found
I see that it is doing GET /v2/models/stats
which fails for me as well
Where as the curl on ensemble model succeeds.
What could be wrong?
The text was updated successfully, but these errors were encountered: