< img height="1" width="1" style="display:none;" alt="" src="https://px.ads.linkedin.com/collect/?pid=3131724&fmt=gif" />

View etcd Database Status

This section introduces how to view etcd database status.

Note

Viewing etcd monitoring information requires enabling etcd monitoring in advance. For more information, see the details page of the WhizardTelemetry Monitoring extension in the Extensions Center.

Prerequisites

  • You should join a cluster and have the Monitoring Viewing permission within the cluster. For more information, refer to "Cluster Members" and "Cluster Roles".

  • WhizardTelemetry Monitoring should have been installed and enabled.

Steps

  1. Log in to the KubeSphere web console with a user who has the Monitoring Viewing permission, and access your cluster.

  2. Click Monitoring & Alerting > Cluster Status in the left navigation pane.

  3. On the Cluster Status page, click the etcd Monitoring tab to view the running status of the etcd database.

    • The etcd Monitoring tab provides the following information:

      ParameterDescription

      Service Status

      Displays the Leader node of the etcd cluster, the IP address of each node, and the number of Leader changes in the last hour.

      DB Size

      The size of the etcd database over the specified time range.

      Client Traffic

      Displays the data traffic sent to and received from gRPC clients.

      gRPC Stream Message

      Displays the number of gRPC streaming messages received and sent per second by the server.

      WAL Fsync

      Displays the latency of WAL calling fsync. Before applying log entries, etcd calls wal_fsync when persisting log entries to disk.

      DB Fsync

      Displays the distribution of backend commit latency. When etcd commits its latest delta snapshot to disk, it calls backend_commit. High disk operation latency (long WAL log sync time or database sync time) usually indicates disk issues, which can cause high request latency or cluster instability.

      Raft Proposal

      Displays the current etcd Raft proposals per second. Hover over the line chart to view proposal data at a specific time point.

      • Committed: The commit rate of consensus proposals. This gauge should increase over time if the cluster is healthy. Several healthy members of an etcd cluster may have different total committed proposals at once. This discrepancy may be due to recovering from peers after starting, lagging behind the leader, or being the leader and therefore having the most commits. It is important to monitor this metric across all the members in the cluster; a consistently large lag between a single member and its leader indicates that member is slow or unhealthy.

      • Applied: The total apply rate of consensus proposals. The etcd server applies every committed proposal asynchronously. The difference between Committed and Applied should usually be small (within a few thousands even under high load). If the difference between them continues to rise, it indicates that the etcd server is overloaded. This might happen when applying expensive queries like heavy range queries or large txn operations.

      • Failed: The total failure rate of proposals. This value is affected by two factors: temporary failures related to a leader election or longer downtime caused by a loss of quorum in the cluster.

      • Pending: It indicates how many proposals are queued to commit. Rising pending proposals suggests there is a high client load or the member cannot commit proposals.

      For more information about etcd database parameters, see the etcd documentation.

    • Click timed-task in the upper right corner to set the time range.

    • Click start/pause in the upper right corner to enable/disable real-time data refresh.

    • Click refresh in the upper right corner to refresh data manually.

Receive the latest news, articles and updates from KubeSphere


Thanks for the feedback. If you have a specific question about how to use KubeSphere, ask it on Slack. Open an issue in the GitHub repo if you want to report a problem or suggest an improvement.

What’s on this Page