Este documento ha sido traducido utilizando tecnología de traducción automática. Si bien nos esforzamos por proporcionar traducciones precisas, no ofrecemos garantías sobre la integridad, precisión o confiabilidad del contenido traducido. En caso de discrepancia, la versión original en inglés prevalecerá y constituirá el texto autorizado.

Referencia de expresiones PromQL

Las expresiones PromQL en este documento se pueden utilizar para configurar alertas.

Para más información sobre cómo consultar la base de datos de series temporales de Prometheus, consulta la documentación oficial de Prometheus.

Métricas del clúster

Utilización de CPU del clúster

productos Expresión

Información

1 - (avg(irate(node_cpu_seconds_total{mode="idle"}[5m])) by (instance))

Resumen

1 - (avg(irate(node_cpu_seconds_total{mode="idle"}[5m])))

Promedio de carga del clúster

productos Expresión

Información

<table><tr><td>load1</td><td>sum(node_load1) by (instance) / count(node_cpu_seconds_total{mode="system"}) by (instance)</td></tr><tr><td>load5</td><td>sum(node_load5) by (instance) / count(node_cpu_seconds_total{mode="system"}) by (instance)</td></tr><tr><td>load15</td><td>sum(node_load15) by (instance) / count(node_cpu_seconds_total{mode="system"}) by (instance)</td></tr></table>

Resumen

<table><tr><td>load1</td><td>sum(node_load1) by (instance) / count(node_cpu_seconds_total{mode="system"})</td></tr><tr><td>load5</td><td>sum(node_load5) by (instance) / count(node_cpu_seconds_total{mode="system"})</td></tr><tr><td>load15</td><td>sum(node_load15) by (instance) / count(node_cpu_seconds_total{mode="system"})</td></tr></table>

Utilización de memoria del clúster

productos Expresión

Información

1 - sum(node_memory_MemAvailable_bytes) by (instance) / sum(node_memory_MemTotal_bytes) by (instance)

Resumen

1 - sum(node_memory_MemAvailable_bytes) / sum(node_memory_MemTotal_bytes)

Utilización del disco del clúster

productos Expresión

Información

(sum(node_filesystem_size_bytes{device!="rootfs"}) by (instance) - sum(node_filesystem_free_bytes{device!="rootfs"}) by (instance)) / sum(node_filesystem_size_bytes{device!="rootfs"}) by (instance)

Resumen

(sum(node_filesystem_size_bytes{device!="rootfs"}) - sum(node_filesystem_free_bytes{device!="rootfs"})) / sum(node_filesystem_size_bytes{device!="rootfs"})

E/S de disco del clúster

productos Expresión

Información

<table><tr><td>lectura</td><td>sum(rate(node_disk_read_bytes_total[5m])) por (instancia)</td></tr><tr><td>escrito</td><td>sum(rate(node_disk_written_bytes_total[5m])) por (instancia)</td></tr></table>

Resumen

<table><tr><td>lectura</td><td>sum(rate(node_disk_read_bytes_total[5m]))</td></tr><tr><td>escrito</td><td>sum(rate(node_disk_written_bytes_total[5m]))</td></tr></table>

Paquetes de red del clúster

productos Expresión

Información

<table><tr><td>paquetes recibidos descartados</td><td>sum(rate(node_network_receive_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) por (instancia)</td></tr><tr><td>errores de recepción</td><td>sum(rate(node_network_receive_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) por (instancia)</td></tr><tr><td>paquetes recibidos</td><td>sum(rate(node_network_receive_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) por (instancia)</td></tr><tr><td>paquetes transmitidos descartados</td><td>sum(rate(node_network_transmit_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) por (instancia)</td></tr><tr><td>errores de transmisión</td><td>sum(rate(node_network_transmit_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) por (instancia)</td></tr><tr><td>paquetes transmitidos</td><td>sum(rate(node_network_transmit_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) por (instancia)</td></tr></table>

Resumen

<table><tr><td>paquetes recibidos descartados</td><td>sum(rate(node_network_receive_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr><tr><td>errores de recepción</td><td>sum(rate(node_network_receive_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr><tr><td>paquetes recibidos</td><td>sum(rate(node_network_receive_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr><tr><td>paquetes transmitidos descartados</td><td>sum(rate(node_network_transmit_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr><tr><td>errores de transmisión</td><td>sum(rate(node_network_transmit_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr><tr><td>paquetes transmitidos</td><td>sum(rate(node_network_transmit_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr></table>

E/S de red del clúster

productos Expresión

Información

<table><tr><td>recibir</td><td>sum(rate(node_network_receive_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) por (instancia)</td></tr><tr><td>transmitir</td><td>sum(rate(node_network_transmit_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) por (instancia)</td></tr></table>

Resumen

<table><tr><td>recibir</td><td>sum(rate(node_network_receive_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr><tr><td>transmitir</td><td>sum(rate(node_network_transmit_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr></table>

Métricas del nodo

Utilización de la CPU del nodo

productos Expresión

Información

avg(irate(node_cpu_seconds_total{mode!="idle", instance=~"$instance"}[5m])) by (mode)

Resumen

1 - (avg(irate(node_cpu_seconds_total{mode="idle", instance=~"$instance"}[5m])))

Promedio de carga del nodo

productos Expresión

Información

<table><tr><td>load1</td><td>sum(node_load1{instance=~"$instance"}) / count(node_cpu_seconds_total{mode="system",instance="$instance"})</td></tr><tr><td>load5</td><td>sum(node_load5{instance="$instance"}) / count(node_cpu_seconds_total{mode="system",instance="$instance"})</td></tr><tr><td>load15</td><td>sum(node_load15{instance="$instance"}) / count(node_cpu_seconds_total{mode="system",instance=~"$instance"})</td></tr></table>

Resumen

<table><tr><td>load1</td><td>sum(node_load1{instance=~"$instance"}) / count(node_cpu_seconds_total{mode="system",instance="$instance"})</td></tr><tr><td>load5</td><td>sum(node_load5{instance="$instance"}) / count(node_cpu_seconds_total{mode="system",instance="$instance"})</td></tr><tr><td>load15</td><td>sum(node_load15{instance="$instance"}) / count(node_cpu_seconds_total{mode="system",instance=~"$instance"})</td></tr></table>

Utilización de la memoria del nodo

productos Expresión

Información

1 - sum(node_memory_MemAvailable_bytes{instance=~"$instance"}) / sum(node_memory_MemTotal_bytes{instance=~"$instance"})

Resumen

`1 - sum(node_memory_MemAvailable_bytes{instance=~"$instance"}) / sum(node_memory_MemTotal_bytes{instance=~"$instance"}) `

Utilización del disco del nodo

productos Expresión

Información

(sum(node_filesystem_size_bytes{device!="rootfs",instance=~"$instance"}) by (device) - sum(node_filesystem_free_bytes{device!="rootfs",instance=~"$instance"}) by (device)) / sum(node_filesystem_size_bytes{device!="rootfs",instance=~"$instance"}) by (device)

Resumen

(sum(node_filesystem_size_bytes{device!="rootfs",instance=~"$instance"}) - sum(node_filesystem_free_bytes{device!="rootfs",instance=~"$instance"})) / sum(node_filesystem_size_bytes{device!="rootfs",instance=~"$instance"})

E/S del disco del nodo

productos Expresión

Información

<table><tr><td>leído</td><td>sum(rate(node_disk_read_bytes_total{instance="$instance"}[5m]))</td></tr><tr><td>escrito</td><td>sum(rate(node_disk_written_bytes_total{instance="$instance"}[5m]))</td></tr></table>

Resumen

<table><tr><td>leído</td><td>sum(rate(node_disk_read_bytes_total{instance="$instance"}[5m]))</td></tr><tr><td>escrito</td><td>sum(rate(node_disk_written_bytes_total{instance="$instance"}[5m]))</td></tr></table>

Paquetes de red del nodo

productos Expresión

Información

<table><tr><td>paquetes recibidos descartados</td><td>sum(rate(node_network_receive_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr><tr><td>errores de recepción</td><td>sum(rate(node_network_receive_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr><tr><td>paquetes recibidos</td><td>sum(rate(node_network_receive_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr><tr><td>paquetes transmitidos descartados</td><td>sum(rate(node_network_transmit_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr><tr><td>errores de transmisión</td><td>sum(rate(node_network_transmit_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr><tr><td>paquetes transmitidos</td><td>sum(rate(node_network_transmit_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr></table>

Resumen

<table><tr><td>paquetes recibidos descartados</td><td>sum(rate(node_network_receive_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr><tr><td>errores de recepción</td><td>sum(rate(node_network_receive_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr><tr><td>paquetes recibidos</td><td>sum(rate(node_network_receive_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr><tr><td>paquetes transmitidos descartados</td><td>sum(rate(node_network_transmit_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr><tr><td>errores de transmisión</td><td>sum(rate(node_network_transmit_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr><tr><td>paquetes transmitidos</td><td>sum(rate(node_network_transmit_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr></table>

E/S de red del nodo

productos Expresión

Información

<table><tr><td>recibir</td><td>sum(rate(node_network_receive_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr><tr><td>transmitir</td><td>sum(rate(node_network_transmit_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr></table>

Resumen

<table><tr><td>recibir</td><td>sum(rate(node_network_receive_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance="$instance"}[5m]))</td></tr><tr><td>transmitir</td><td>sum(rate(node_network_transmit_bytes_total{device!"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr></table>

Métricas de Etcd

Etcd tiene un líder

max(etcd_server_has_leader)

Número de veces que cambia el líder

max(etcd_server_leader_changes_seen_total)

Número de propuestas fallidas

sum(etcd_server_proposals_failed_total)

Tráfico del Cliente GRPC

productos Expresión

Información

<table><tr><td>entrada</td><td>sum(rate(etcd_network_client_grpc_received_bytes_total[5m])) by (instance)</td></tr><tr><td>salida</td><td>sum(rate(etcd_network_client_grpc_sent_bytes_total[5m])) by (instance)</td></tr></table>

Resumen

<table><tr><td>entrada</td><td>sum(rate(etcd_network_client_grpc_received_bytes_total[5m]))</td></tr><tr><td>salida</td><td>sum(rate(etcd_network_client_grpc_sent_bytes_total[5m]))</td></tr></table>

Tráfico de Pares

productos Expresión

Información

<table><tr><td>entrada</td><td>sum(rate(etcd_network_peer_received_bytes_total[5m])) by (instance)</td></tr><tr><td>salida</td><td>sum(rate(etcd_network_peer_sent_bytes_total[5m])) by (instance)</td></tr></table>

Resumen

<table><tr><td>entrada</td><td>sum(rate(etcd_network_peer_received_bytes_total[5m]))</td></tr><tr><td>salida</td><td>sum(rate(etcd_network_peer_sent_bytes_total[5m]))</td></tr></table>

Tamaño de la DB

productos Expresión

Información

sum(etcd_debugging_mvcc_db_total_size_in_bytes) by (instance)

Resumen

sum(etcd_debugging_mvcc_db_total_size_in_bytes)

Flujos Activos

productos Expresión

Información

<table><tr><td>vigilancia de arrendamiento</td><td>sum(grpc_server_started_total{grpc_service="etcdserverpb.Lease",grpc_type="bidi_stream"}) por (instancia) - sum(grpc_server_handled_total{grpc_service="etcdserverpb.Lease",grpc_type="bidi_stream"}) por (instancia)</td></tr><tr><td>vigilancia</td><td>sum(grpc_server_started_total{grpc_service="etcdserverpb.Watch",grpc_type="bidi_stream"}) por (instancia) - sum(grpc_server_handled_total{grpc_service="etcdserverpb.Watch",grpc_type="bidi_stream"}) por (instancia)</td></tr></table>

Resumen

<table><tr><td>lease-watch</td><td>sum(grpc_server_started_total{grpc_service="etcdserverpb.Lease",grpc_type="bidi_stream"}) - sum(grpc_server_handled_total{grpc_service="etcdserverpb.Lease",grpc_type="bidi_stream"})</td></tr><tr><td>watch</td><td>sum(grpc_server_started_total{grpc_service="etcdserverpb.Watch",grpc_type="bidi_stream"}) - sum(grpc_server_handled_total{grpc_service="etcdserverpb.Watch",grpc_type="bidi_stream"})</td></tr></table>

Propuestas de Raft

productos Expresión

Información

<table><tr><td>aplicadas</td><td>sum(increase(etcd_server_proposals_applied_total[5m])) por (instancia)</td></tr><tr><td>comprometidas</td><td>sum(increase(etcd_server_proposals_committed_total[5m])) por (instancia)</td></tr><tr><td>pendientes</td><td>sum(increase(etcd_server_proposals_pending[5m])) por (instancia)</td></tr><tr><td>fallidas</td><td>sum(increase(etcd_server_proposals_failed_total[5m])) por (instancia)</td></tr></table>

Resumen

<table><tr><td>aplicadas</td><td>sum(increase(etcd_server_proposals_applied_total[5m]))</td></tr><tr><td>comprometidas</td><td>sum(increase(etcd_server_proposals_committed_total[5m]))</td></tr><tr><td>pendientes</td><td>sum(increase(etcd_server_proposals_pending[5m]))</td></tr><tr><td>fallidas</td><td>sum(increase(etcd_server_proposals_failed_total[5m]))</td></tr></table>

Tasa de RPC

productos Expresión

Información

<table><tr><td>total</td><td>sum(rate(grpc_server_started_total{grpc_type="unary"}[5m])) por (instancia)</td></tr><tr><td>fallo</td><td>sum(rate(grpc_server_handled_total{grpc_type="unary",grpc_code!="OK"}[5m])) por (instancia)</td></tr></table>

Resumen

<table><tr><td>total</td><td>sum(rate(grpc_server_started_total{grpc_type="unary"}[5m]))</td></tr><tr><td>fallo</td><td>sum(rate(grpc_server_handled_total{grpc_type="unary",grpc_code!="OK"}[5m]))</td></tr></table>

Operaciones de Disco

productos Expresión

Información

<table><tr><td>commit-llamado-por-backend</td><td>sum(rate(etcd_disk_backend_commit_duration_seconds_sum[1m])) por (instancia)</td></tr><tr><td>fsync-llamado-por-wal</td><td>sum(rate(etcd_disk_wal_fsync_duration_seconds_sum[1m])) por (instancia)</td></tr></table>

Resumen

<table><tr><td>commit-llamado-por-backend</td><td>sum(rate(etcd_disk_backend_commit_duration_seconds_sum[1m]))</td></tr><tr><td>fsync-llamado-por-wal</td><td>sum(rate(etcd_disk_wal_fsync_duration_seconds_sum[1m]))</td></tr></table>

Duración de Sincronización de Disco

productos Expresión

Información

<table><tr><td>wal</td><td>histogram_quantile(0.99, sum(rate(etcd_disk_wal_fsync_duration_seconds_bucket[5m])) by (instance, le))</td></tr><tr><td>db</td><td>histogram_quantile(0.99, sum(rate(etcd_disk_backend_commit_duration_seconds_bucket[5m])) by (instance, le))</td></tr></table>

Resumen

<table><tr><td>wal</td><td>sum(histogram_quantile(0.99, sum(rate(etcd_disk_wal_fsync_duration_seconds_bucket[5m])) by (instance, le)))</td></tr><tr><td>db</td><td>sum(histogram_quantile(0.99, sum(rate(etcd_disk_backend_commit_duration_seconds_bucket[5m])) by (instance, le)))</td></tr></table>

Métricas de Componentes de Kubernetes

Latencia de Solicitudes del Servidor API

productos Expresión

Información

avg(apiserver_request_latencies_sum / apiserver_request_latencies_count) by (instance, verb) /1e+06

Resumen

avg(apiserver_request_latencies_sum / apiserver_request_latencies_count) by (instance) /1e+06

Tasa de Solicitudes del Servidor API

productos Expresión

Información

sum(rate(apiserver_request_count[5m])) by (instance, code)

Resumen

sum(rate(apiserver_request_count[5m])) by (instance)

Programación de Pods Fallidos

productos Expresión

Información

sum(kube_pod_status_scheduled{condition="false"})

Resumen

sum(kube_pod_status_scheduled{condition="false"})

Profundidad de Cola del Administrador de Controladores

productos Expresión

Información

<table><tr><td>volúmenes</td><td>sum(volumes_depth) by instance</td></tr><tr><td>ampliación</td><td>sum(deployment_depth) by instance</td></tr><tr><td>replicaset</td><td>sum(replicaset_depth) by instance</td></tr><tr><td>servicio</td><td>sum(service_depth) by instance</td></tr><tr><td>cuenta de servicio</td><td>sum(serviceaccount_depth) by instance</td></tr><tr><td>punto final</td><td>sum(endpoint_depth) by instance</td></tr><tr><td>daemonset</td><td>sum(daemonset_depth) by instance</td></tr><tr><td>statefulset</td><td>sum(statefulset_depth) by instance</td></tr><tr><td>replicationmanager</td><td>sum(replicationmanager_depth) by instance</td></tr></table>

Resumen

<table><tr><td>volúmenes</td><td>sum(volumes_depth)</td></tr><tr><td>ampliación</td><td>sum(deployment_depth)</td></tr><tr><td>replicaset</td><td>sum(replicaset_depth)</td></tr><tr><td>servicio</td><td>sum(service_depth)</td></tr><tr><td>cuenta de servicio</td><td>sum(serviceaccount_depth)</td></tr><tr><td>punto final</td><td>sum(endpoint_depth)</td></tr><tr><td>daemonset</td><td>sum(daemonset_depth)</td></tr><tr><td>statefulset</td><td>sum(statefulset_depth)</td></tr><tr><td>replicationmanager</td><td>sum(replicationmanager_depth)</td></tr></table>

Latencia de programación E2E del planificador

productos Expresión

Información

histogram_quantile(0.99, sum(scheduler_e2e_scheduling_latency_microseconds_bucket) by (le, instance)) / 1e+06

Resumen

sum(histogram_quantile(0.99, sum(scheduler_e2e_scheduling_latency_microseconds_bucket) by (le, instance)) / 1e+06)

Intentos de preempción del planificador

productos Expresión

Información

sum(rate(scheduler_total_preemption_attempts[5m])) by (instance)

Resumen

sum(rate(scheduler_total_preemption_attempts[5m]))

Conexiones del Controlador de Ingress

productos Expresión

Información

<table><tr><td>leyendo</td><td>sum(nginx_ingress_controller_nginx_process_connections{state="reading"}) by (instance)</td></tr><tr><td>esperando</td><td>sum(nginx_ingress_controller_nginx_process_connections{state="waiting"}) by (instance)</td></tr><tr><td>escribiendo</td><td>sum(nginx_ingress_controller_nginx_process_connections{state="writing"}) by (instance)</td></tr><tr><td>aceptado</td><td>sum(ceil(increase(nginx_ingress_controller_nginx_process_connections_total{state="accepted"}[5m]))) by (instance)</td></tr><tr><td>activo</td><td>sum(ceil(increase(nginx_ingress_controller_nginx_process_connections_total{state="active"}[5m]))) by (instance)</td></tr><tr><td>manejado</td><td>sum(ceil(increase(nginx_ingress_controller_nginx_process_connections_total{state="handled"}[5m]))) by (instance)</td></tr></table>

Resumen

<table><tr><td>leyendo</td><td>sum(nginx_ingress_controller_nginx_process_connections{state="reading"})</td></tr><tr><td>esperando</td><td>sum(nginx_ingress_controller_nginx_process_connections{state="waiting"})</td></tr><tr><td>escribiendo</td><td>sum(nginx_ingress_controller_nginx_process_connections{state="writing"})</td></tr><tr><td>aceptado</td><td>sum(ceil(increase(nginx_ingress_controller_nginx_process_connections_total{state="accepted"}[5m])))</td></tr><tr><td>activo</td><td>sum(ceil(increase(nginx_ingress_controller_nginx_process_connections_total{state="active"}[5m])))</td></tr><tr><td>manejado</td><td>sum(ceil(increase(nginx_ingress_controller_nginx_process_connections_total{state="handled"}[5m])))</td></tr></table>

Tiempo de proceso de solicitud del controlador de Ingress

productos Expresión

Información

topk(10, histogram_quantile(0.95,sum by (le, host, path)(rate(nginx_ingress_controller_request_duration_seconds_bucket{host!="_"}[5m]))))

Resumen

topk(10, histogram_quantile(0.95,sum by (le, host)(rate(nginx_ingress_controller_request_duration_seconds_bucket{host!="_"}[5m]))))

Métricas de Registro de Rancher

Tasa de la cola del búfer de Fluentd

productos Expresión

Información

sum(rate(fluentd_output_status_buffer_queue_length[5m])) by (instance)

Resumen

sum(rate(fluentd_output_status_buffer_queue_length[5m]))

Tasa de Entrada de Fluentd

productos Expresión

Información

sum(rate(fluentd_input_status_num_records_total[5m])) by (instance)

Resumen

sum(rate(fluentd_input_status_num_records_total[5m]))

Tasa de Errores de Salida de Fluentd

productos Expresión

Información

sum(rate(fluentd_output_status_num_errors[5m])) by (type)

Resumen

sum(rate(fluentd_output_status_num_errors[5m]))

Tasa de Salida de Fluentd

productos Expresión

Información

sum(rate(fluentd_output_status_num_records_total[5m])) by (instance)

Resumen

sum(rate(fluentd_output_status_num_records_total[5m]))

Métricas de Carga de Trabajo

Utilización de CPU de Carga de Trabajo

productos Expresión

Información

<table><tr><td>segundos de limitación cfs</td><td>sum(rate(container_cpu_cfs_throttled_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>segundos de usuario</td><td>sum(rate(container_cpu_user_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>segundos de sistema</td><td>sum(rate(container_cpu_system_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>segundos de uso</td><td>sum(rate(container_cpu_usage_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr></table>

Resumen

<table><tr><td>segundos de limitación cfs</td><td>sum(rate(container_cpu_cfs_throttled_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>segundos de usuario</td><td>sum(rate(container_cpu_user_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>segundos de sistema</td><td>sum(rate(container_cpu_system_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>segundos de uso</td><td>sum(rate(container_cpu_usage_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr></table>

Utilización de memoria de carga de trabajo

productos Expresión

Información

sum(container_memory_working_set_bytes{namespace="$namespace",pod_name=~"$podName", container_name!=""}) by (pod_name)

Resumen

sum(container_memory_working_set_bytes{namespace="$namespace",pod_name=~"$podName", container_name!=""})

Paquetes de red de carga de trabajo

productos Expresión

Información

<table><tr><td>paquetes recibidos</td><td>sum(rate(container_network_receive_packets_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>paquetes descartados en recepción</td><td>sum(rate(container_network_receive_packets_dropped_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>errores de recepción</td><td>sum(rate(container_network_receive_errors_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>paquetes transmitidos</td><td>sum(rate(container_network_transmit_packets_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>paquetes descartados en transmisión</td><td>sum(rate(container_network_transmit_packets_dropped_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>errores de transmisión</td><td>sum(rate(container_network_transmit_errors_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr></table>

Resumen

<table><tr><td>paquetes recibidos</td><td>sum(rate(container_network_receive_packets_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>paquetes descartados en recepción</td><td>sum(rate(container_network_receive_packets_dropped_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>errores de recepción</td><td>sum(rate(container_network_receive_errors_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>paquetes transmitidos</td><td>sum(rate(container_network_transmit_packets_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>paquetes descartados en transmisión</td><td>sum(rate(container_network_transmit_packets_dropped_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>errores de transmisión</td><td>sum(rate(container_network_transmit_errors_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr></table>

E/S de red de carga de trabajo

productos Expresión

Información

<table><tr><td>recibir</td><td>sum(rate(container_network_receive_bytes_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>transmitir</td><td>sum(rate(container_network_transmit_bytes_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr></table>

Resumen

<table><tr><td>recibir</td><td>sum(rate(container_network_receive_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>transmitir</td><td>sum(rate(container_network_transmit_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

E/S de disco de carga de trabajo

productos Expresión

Información

<table><tr><td>leer</td><td>sum(rate(container_fs_reads_bytes_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>escribir</td><td>sum(rate(container_fs_writes_bytes_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr></table>

Resumen

<table><tr><td>leer</td><td>sum(rate(container_fs_reads_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>escribir</td><td>sum(rate(container_fs_writes_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

Métricas de pod

Utilización de CPU de pod

productos Expresión

Información

<table><tr><td>segundos de limitación cfs</td><td>sum(rate(container_cpu_cfs_throttled_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m])) by (container_name)</td></tr><tr><td>segundos de uso</td><td>sum(rate(container_cpu_usage_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m])) by (container_name)</td></tr><tr><td>segundos de sistema</td><td>sum(rate(container_cpu_system_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m])) by (container_name)</td></tr><tr><td>segundos de usuario</td><td>sum(rate(container_cpu_user_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m])) by (container_name)</td></tr></table>

Resumen

<table><tr><td>segundos de limitación cfs</td><td>sum(rate(container_cpu_cfs_throttled_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m]))</td></tr><tr><td>segundos de uso</td><td>sum(rate(container_cpu_usage_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m]))</td></tr><tr><td>segundos de sistema</td><td>sum(rate(container_cpu_system_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m]))</td></tr><tr><td>segundos de usuario</td><td>sum(rate(container_cpu_user_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m]))</td></tr></table>

Utilización de memoria de pod

productos Expresión

Información

sum(container_memory_working_set_bytes{container_name!="POD",namespace="$namespace",pod_name="$podName",container_name!=""}) by (container_name)

Resumen

sum(container_memory_working_set_bytes{container_name!="POD",namespace="$namespace",pod_name="$podName",container_name!=""})

Paquetes de red de pod

productos Expresión

Información

<table><tr><td>paquetes recibidos</td><td>sum(rate(container_network_receive_packets_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>paquetes descartados en recepción</td><td>sum(rate(container_network_receive_packets_dropped_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>errores de recepción</td><td>sum(rate(container_network_receive_errors_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>paquetes transmitidos</td><td>sum(rate(container_network_transmit_packets_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>paquetes descartados en transmisión</td><td>sum(rate(container_network_transmit_packets_dropped_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>errores de transmisión</td><td>sum(rate(container_network_transmit_errors_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

Resumen

<table><tr><td>paquetes recibidos</td><td>sum(rate(container_network_receive_packets_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>paquetes descartados en recepción</td><td>sum(rate(container_network_receive_packets_dropped_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>errores de recepción</td><td>sum(rate(container_network_receive_errors_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>paquetes transmitidos</td><td>sum(rate(container_network_transmit_packets_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>paquetes descartados en transmisión</td><td>sum(rate(container_network_transmit_packets_dropped_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>errores de transmisión</td><td>sum(rate(container_network_transmit_errors_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

E/S de red de pod

productos Expresión

Información

<table><tr><td>recibir</td><td>sum(rate(contenedor_network_receive_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>transmitir</td><td>sum(rate(contenedor_network_transmit_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

Resumen

<table><tr><td>recibir</td><td>sum(rate(contenedor_network_receive_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>transmitir</td><td>sum(rate(contenedor_network_transmit_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

E/S de disco de pod

productos Expresión

Información

<table><tr><td>leer</td><td>sum(rate(contenedor_fs_reads_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m])) por (container_name)</td></tr><tr><td>escribir</td><td>sum(rate(contenedor_fs_writes_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m])) por (container_name)</td></tr></table>

Resumen

<table><tr><td>leer</td><td>sum(rate(contenedor_fs_reads_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>escribir</td><td>sum(rate(contenedor_fs_writes_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

Métricas del contenedor

Utilización de CPU del contenedor

productos Expresión

segundos limitados por cfs

sum(rate(container_cpu_cfs_throttled_seconds_total{namespace="$namespace",pod_name="$podName",container_name="$containerName"}[5m]))

segundos de uso

sum(rate(container_cpu_usage_seconds_total{namespace="$namespace",pod_name="$podName",container_name="$containerName"}[5m]))

segundos del sistema

sum(rate(container_cpu_system_seconds_total{namespace="$namespace",pod_name="$podName",container_name="$containerName"}[5m]))

segundos de usuario

sum(rate(container_cpu_user_seconds_total{namespace="$namespace",pod_name="$podName",container_name="$containerName"}[5m]))

Utilización de memoria del contenedor

sum(container_memory_working_set_bytes{namespace="$namespace",pod_name="$podName",container_name="$containerName"})

E/S de disco del contenedor

productos Expresión

leer

sum(rate(container_fs_reads_bytes_total{namespace="$namespace",pod_name="$podName",container_name="$containerName"}[5m]))

write

sum(rate(container_fs_writes_bytes_total{namespace="$namespace",pod_name="$podName",container_name="$containerName"}[5m]))