update sizing guide with user event metrics info

Closes #35640 Signed-off-by: Kamesh Akella <kamesh.asp@gmail.com> Signed-off-by: Kamesh Akella <kakella@redhat.com> Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Alexander Schwartz <alexander.schwartz@gmx.net> Co-authored-by: Alexander Schwartz <aschwart@redhat.com>
2025-12-21 06:20:05 -06:00 · 2024-12-09 12:27:03 -05:00
parent 80890737d4
commit 047b28a63e
2 changed files with 19 additions and 0 deletions
--- a/docs/guides/high-availability/concepts-memory-and-cpu-sizing.adoc
+++ b/docs/guides/high-availability/concepts-memory-and-cpu-sizing.adoc
@@ -61,6 +61,24 @@ For every 100 login/logout/refresh requests per second:

 The vCPU requirement is given as a range, as with an increased CPU saturation on the database host the CPU usage per request decreases while the response times increase. A lower CPU quota on the database can lead to slower response times during peak loads. Choose a larger CPU quota if fast response times during peak loads are critical. See below for an example.

+=== Measuring the activity of a running {project_name} instance
+
+Sizing of a {project_name} instance depends on the actual and forecasted numbers for password-based user logins, refresh token requests, and client credential grants as described in the previous section.
+
+To retrieve the actual numbers of a running {project_name} instance for these three key inputs, use the metrics {project_name} provides:
+
+* The user event metric `keycloak_user_events_total` for event type `login` includes both password-based logins and cookie-based logins, still it can serve as a first approximate input for this sizing guide.
+If you are using a standard username and password form and want to retrieve only the number of password-based logins, use the metric `http_server_requests_seconds_count` with method `POST` and URI `+/realms/{realm}/login-actions/authenticate+` instead.
+* Use the user event metric `keycloak_user_events_total` for the event types `refresh_token` and `client_login` for refresh token requests and client credential grants respectively.
+
+See the <@links.observability id="event-metrics"/> and <@links.observability id="metrics-for-troubleshooting-http"/> {sections} for more information.
+
+These metrics are crucial for tracking daily and weekly fluctuations in user activity loads,
+identifying emerging trends that may indicate the need to resize the system and
+validating sizing calculations.
+By systematically measuring and evaluating these user event metrics,
+you can ensure your system remains appropriately scaled and responsive to changes in user behavior and demand.
+
 === Calculation example (single site)

 Target size:
--- a/docs/guides/observability/metrics-for-troubleshooting-http.adoc
+++ b/docs/guides/observability/metrics-for-troubleshooting-http.adoc
@@ -21,6 +21,7 @@ Spikes or increases in the processing time may be an early sign that some node i
 ====
 *Tags*

+`method`:: HTTP method.
 `outcome`:: A more general outcome tag.
 `status`:: The HTTP status code.
 `uri`:: The requested URI.