Replies: 1 comment 3 replies
-
https://github.com/vllm-project/vllm/tree/main/examples/production_monitoring Here's an exampel dashboard put together by @robertgshaw2-neuralmagic |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
vLLM Dashboard
What are you using to monitor your vLLM installation?
I wanted a simple dashboard to help trend vLLM and GPU metrics and performance. The Prometheus metrics that vLLM provides gives some good insight into the health and use of the system with data like tokens per second, running requests, waiting requests and more. What indicators do you find most useful?
Grafana for vLLM and GPU Metrics
I put together a Grafana dashbaord using InfluxDB as the datastore, Telegraf to pull the metrics from vLLM and a custom script to grab nvidia-smi data. I uploaded the details here and would be happy to create an example PR if there is interest: https://github.com/jasonacox/TinyLLM/tree/main/monitoring
Beta Was this translation helpful? Give feedback.
All reactions