You can collect data about your environment, monitor the health of your cluster and virtual machines (VMs), and troubleshoot OKD Virtualization resources with the following tools.
The OKD web console displays resource usage, alerts, events, and trends for your cluster and for OKD Virtualization components and resources.
Page | Description |
---|---|
Overview page | Cluster details, status, alerts, inventory, and resource usage |
Virtualization → Overview tab | OKD Virtualization resources, usage, alerts, and status |
Virtualization → Top consumers tab | Top consumers of CPU, memory, and storage |
Virtualization → Migrations tab | Progress of live migrations |
VirtualMachines → VirtualMachine → VirtualMachine details → Metrics tab | VM resource usage, storage, network, and migration |
VirtualMachines → VirtualMachine → VirtualMachine details → Events tab | List of VM events |
VirtualMachines → VirtualMachine → VirtualMachine details → Diagnostics tab | VM status conditions and volume snapshot status |
When you submit a support case to Red Hat Support, it is helpful to provide debugging information. You can gather debugging information by performing the following steps:
Configure Prometheus and Alertmanager and collect must-gather data for OKD and OKD Virtualization.
Collect must-gather data and memory dumps from VMs.
Configure and use the must-gather tool for OKD Virtualization.
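As a rough illustration of the must-gather collection step, the following Python sketch assembles an `oc adm must-gather` command line without running it. The helper name `must_gather_command`, the output directory, and the `<must-gather-image>` placeholder are assumptions made for this example; substitute the image that the must-gather documentation for your OKD Virtualization version specifies.

```python
import shlex


def must_gather_command(dest_dir, image=None):
    """Build an `oc adm must-gather` invocation as an argument list.

    dest_dir: local directory where the collected data is written.
    image:    optional must-gather image; when omitted, `oc` uses its default.
    """
    args = ["oc", "adm", "must-gather", f"--dest-dir={dest_dir}"]
    if image is not None:
        args.append(f"--image={image}")
    return args


if __name__ == "__main__":
    # Placeholder values (assumptions): replace them before running the command.
    cmd = must_gather_command("./must-gather-output", image="<must-gather-image>")
    # Print the shell-quoted command so it can be reviewed before it is run.
    print(shlex.join(cmd))
```

Building the command as a list keeps quoting explicit. After review, the same list can be passed to `subprocess.run` on a machine where `oc` is logged in to the cluster.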
You can monitor the health of your cluster and VMs. For details about monitoring tools, see the Monitoring overview.
Troubleshoot OKD Virtualization components and VMs and resolve issues that trigger alerts in the web console.
View important life-cycle information for VMs, namespaces, and resources.
View and configure logs for OKD Virtualization components and VMs.
Diagnose and resolve issues that trigger OKD Virtualization alerts in the web console.
Troubleshoot data volumes by analyzing conditions and events.
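To make "analyzing conditions" more concrete, the sketch below summarizes the `status.conditions` list of a data volume that has been exported as JSON, for example with `oc get dv <name> -o json`. The function name `summarize_conditions` and the sample document are hypothetical and only illustrate the general shape of Kubernetes-style conditions; they are not output from a real cluster.

```python
import json


def summarize_conditions(data_volume):
    """Return one 'Type=Status (Reason)' string per condition in the object."""
    conditions = data_volume.get("status", {}).get("conditions", [])
    summary = []
    for cond in conditions:
        line = f"{cond.get('type', '?')}={cond.get('status', '?')}"
        if cond.get("reason"):
            line += f" ({cond['reason']})"
        summary.append(line)
    return summary


if __name__ == "__main__":
    # Illustrative input only (assumption): a made-up data volume status.
    sample = json.loads("""
    {
      "status": {
        "conditions": [
          {"type": "Bound", "status": "True", "reason": "Bound"},
          {"type": "Ready", "status": "False", "reason": "ExampleReason"}
        ]
      }
    }
    """)
    for line in summarize_conditions(sample):
        print(line)
```

A condition whose status is not `True` is a reasonable starting point for reading the related events, although which conditions indicate a problem depends on the phase of the data volume.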