Skip to content

optimize the GPU-related monitoring && upgrade dashboards CRDs

Calvin Yu请求将github/fork/zhu733756/master合并到master

Created by: zhu733756

Signed-off-by: zhu733756 talonzhu@yunify.com

Features:

  • add enable a pluggable GPU-related monitoring metrics.
  • upgrade dashboards CRDs to v1alpha2.

For enabling GPU-related monitoring metrics, you need to change configuration as follows:

common:
    monitoring:
      # type: external   # Whether to specify the external prometheus stack, and need to modify the endpoint at the next line.
      endpoint: http://prometheus-operated.kubesphere-monitoring-system.svc:9090 # Prometheus endpoint to get metrics data.
      GPUMonitoring:     # Enable or disable the GPU-related metrics. If you enable this switch but have no GPU resources, Kubesphere will set it to zero. 
        enabled: true    
---

After finishing the installation, you will get:

apiVersion: v1
data:
  kubesphere.yaml: |
      monitoring:
           enableGPUMonitoring: true
           endpoint: http://prometheus-operated.kubesphere-monitoring-system.svc:9090
kind: ConfigMap
metadata:
     name: kubesphere-config
     namespace: kubesphere-system

合并请求报告