Replies: 2 comments
I add the requests to the engine.
This way, I can only get the stats from the beginning of the run. How can I get all of the stats? Any answers would be appreciated!
I have a similar use case. @tohneecao Were you able to get this working?
How can I get metrics like {gpu_cache_usage, cpu_cache_usage, time_to_first_tokens, time_per_output_tokens} when using offline inference?
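For reference, the two latency metrics named above are usually defined from per-request timestamps. Below is a minimal sketch of those definitions, assuming you can record when a request arrived, when its first token was produced, and when it finished. `RequestTiming` and both helper functions are hypothetical names made up for illustration; they are not part of any library's API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RequestTiming:
    """Hypothetical per-request timestamps, in seconds (e.g. from time.perf_counter())."""
    arrival_time: float       # when the request was added to the engine
    first_token_time: float   # when the first output token was produced
    finished_time: float      # when the last output token was produced
    num_output_tokens: int    # total number of generated tokens


def time_to_first_token(t: RequestTiming) -> float:
    """Latency between submitting the request and seeing its first token."""
    return t.first_token_time - t.arrival_time


def time_per_output_token(t: RequestTiming) -> Optional[float]:
    """Mean gap between consecutive output tokens after the first one.

    Returns None when fewer than two tokens were generated, since there is
    no inter-token interval to average in that case.
    """
    if t.num_output_tokens < 2:
        return None
    return (t.finished_time - t.first_token_time) / (t.num_output_tokens - 1)


if __name__ == "__main__":
    timing = RequestTiming(arrival_time=10.0, first_token_time=10.5,
                           finished_time=12.5, num_output_tokens=5)
    print(time_to_first_token(timing))
    print(time_per_output_token(timing))
```

Cache usage cannot be derived this way; it would have to come from whatever stats the engine itself exposes at each step.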