Skip to content

Issues with DCGM Nvidia Dashboard #30

@gaby

Description

@gaby

@chaitanya-sistla Just ran into the new blog showcases how to run DCGM to get metrics from Nvidia. Blog https://openobserve.ai/blog/how-to-monitor-nvidia-gpu/

Importing this same dashboard results in unusable metrics. The DCGM Exporter exports series in upper case while the dashboard is somehow using lowercase.

Changing the case makes visualizations appear and show data, but trying to edit them results in Y-Axis errors (could be related to v0.16.0).

The graphs are not very friendly for multi-GPU systems. For example, my test host has 8 GPUs.

Note:

  • I'm using DCGM Exporter Container and Prometheus Container. I'm not using the Otel collector.

Related openobserve/openobserve#9008

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions