PromQL Expression Reference

The PromQL expressions in this document can be used to configure alerts.

For more information about querying the Prometheus time series database, see the official Prometheus documentation.
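As a rough illustration of how an expression from this reference could be sent to Prometheus, the sketch below builds an instant-query URL for the Prometheus HTTP API endpoint `/api/v1/query`. The server address `http://localhost:9090` and the helper name `build_query_url` are assumptions made for this example, not part of this reference.

```python
from urllib.parse import urlencode

# Hypothetical Prometheus address; replace it with your own server.
PROMETHEUS_URL = "http://localhost:9090"


def build_query_url(expr, base_url=PROMETHEUS_URL):
    """Return an instant-query URL that evaluates the given PromQL expression."""
    # urlencode percent-escapes braces, quotes, and spaces in the expression.
    return f"{base_url}/api/v1/query?{urlencode({'query': expr})}"


# Cluster CPU utilization summary expression from this reference.
url = build_query_url('1 - (avg(irate(node_cpu_seconds_total{mode="idle"}[5m])))')
print(url)
```

The resulting URL can be fetched with any HTTP client; the response format is described in the Prometheus HTTP API documentation.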

Cluster Metrics

Cluster CPU Utilization

Catalog Expression

Detail

1 - (avg(irate(node_cpu_seconds_total{mode="idle"}[5m])) by (instance))

Summary

1 - (avg(irate(node_cpu_seconds_total{mode="idle"}[5m])))
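To make the arithmetic behind this expression concrete, here is a minimal sketch of what it computes: the per-second increase of each CPU's idle counter, averaged and subtracted from 1. The function names and sample values are hypothetical, and the simplified `irate` below ignores counter resets, which the real PromQL function handles.

```python
def irate(samples):
    """Per-second rate from the last two (timestamp, value) samples.

    Simplification: counter resets are not handled here.
    """
    (t1, v1), (t2, v2) = samples[-2], samples[-1]
    return (v2 - v1) / (t2 - t1)


def cpu_utilization(idle_series):
    """1 minus the average idle rate across all CPU series."""
    rates = [irate(series) for series in idle_series]
    return 1 - sum(rates) / len(rates)


# Two hypothetical CPU cores whose idle-seconds counters were sampled 15 s apart.
idle_series = [
    [(0, 1000.0), (15, 1012.0)],  # core 0 was idle for 12 of 15 seconds
    [(0, 2000.0), (15, 2009.0)],  # core 1 was idle for 9 of 15 seconds
]
utilization = cpu_utilization(idle_series)  # roughly 0.3, i.e. about 30% busy
```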

Cluster Load Average

Catalog Expression

Detail

<table><tr><td>load1</td><td>sum(node_load1) by (instance) / count(node_cpu_seconds_total{mode="system"}) by (instance)</td></tr><tr><td>load5</td><td>sum(node_load5) by (instance) / count(node_cpu_seconds_total{mode="system"}) by (instance)</td></tr><tr><td>load15</td><td>sum(node_load15) by (instance) / count(node_cpu_seconds_total{mode="system"}) by (instance)</td></tr></table>

Summary

<table><tr><td>load1</td><td>sum(node_load1) / count(node_cpu_seconds_total{mode="system"})</td></tr><tr><td>load5</td><td>sum(node_load5) / count(node_cpu_seconds_total{mode="system"})</td></tr><tr><td>load15</td><td>sum(node_load15) / count(node_cpu_seconds_total{mode="system"})</td></tr></table>
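The load expressions divide the load average by the number of `node_cpu_seconds_total{mode="system"}` series. node_exporter exposes one such series per logical CPU, so the count acts as a CPU count and the result is load per CPU. A small sketch of that normalization, with a hypothetical helper name and made-up values:

```python
def load_per_cpu(load_average, cpu_count):
    """Normalize a load average by the number of logical CPUs."""
    if cpu_count <= 0:
        raise ValueError("cpu_count must be positive")
    return load_average / cpu_count


# Hypothetical node: 1-minute load average of 6.0 on 4 logical CPUs.
ratio = load_per_cpu(6.0, 4)  # 1.5
```

A value above 1 suggests that, on average, more tasks were runnable than there were CPUs to run them.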

Cluster Memory Utilization

Catalog Expression

Detail

1 - sum(node_memory_MemAvailable_bytes) by (instance) / sum(node_memory_MemTotal_bytes) by (instance)

Summary

1 - sum(node_memory_MemAvailable_bytes) / sum(node_memory_MemTotal_bytes)

Cluster Disk Utilization

Catalog Expression

Detail

(sum(node_filesystem_size_bytes{device!="rootfs"}) by (instance) - sum(node_filesystem_free_bytes{device!="rootfs"}) by (instance)) / sum(node_filesystem_size_bytes{device!="rootfs"}) by (instance)

Summary

(sum(node_filesystem_size_bytes{device!="rootfs"}) - sum(node_filesystem_free_bytes{device!="rootfs"})) / sum(node_filesystem_size_bytes{device!="rootfs"})

Cluster Disk I/O

Catalog Expression

Detail

<table><tr><td>read</td><td>sum(rate(node_disk_read_bytes_total[5m])) by (instance)</td></tr><tr><td>written</td><td>sum(rate(node_disk_written_bytes_total[5m])) by (instance)</td></tr></table>

Summary

<table><tr><td>read</td><td>sum(rate(node_disk_read_bytes_total[5m]))</td></tr><tr><td>written</td><td>sum(rate(node_disk_written_bytes_total[5m]))</td></tr></table>

Cluster Network Packets

Catalog Expression

Detail

<table><tr><td>receive-dropped</td><td>sum(rate(node_network_receive_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) by (instance)</td></tr><tr><td>receive-errs</td><td>sum(rate(node_network_receive_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) by (instance)</td></tr><tr><td>receive-packets</td><td>sum(rate(node_network_receive_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) by (instance)</td></tr><tr><td>transmit-dropped</td><td>sum(rate(node_network_transmit_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) by (instance)</td></tr><tr><td>transmit-errs</td><td>sum(rate(node_network_transmit_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) by (instance)</td></tr><tr><td>transmit-packets</td><td>sum(rate(node_network_transmit_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) by (instance)</td></tr></table>

Summary

<table><tr><td>receive-dropped</td><td>sum(rate(node_network_receive_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr><tr><td>receive-errs</td><td>sum(rate(node_network_receive_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr><tr><td>receive-packets</td><td>sum(rate(node_network_receive_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr><tr><td>transmit-dropped</td><td>sum(rate(node_network_transmit_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr><tr><td>transmit-errs</td><td>sum(rate(node_network_transmit_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr><tr><td>transmit-packets</td><td>sum(rate(node_network_transmit_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr></table>

Cluster Network I/O

Catalog Expression

Detail

<table><tr><td>receive</td><td>sum(rate(node_network_receive_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) by (instance)</td></tr><tr><td>transmit</td><td>sum(rate(node_network_transmit_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m])) by (instance)</td></tr></table>

Summary

<table><tr><td>receive</td><td>sum(rate(node_network_receive_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr><tr><td>transmit</td><td>sum(rate(node_network_transmit_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*"}[5m]))</td></tr></table>

Node Metrics

Node CPU Utilization

Catalog Expression

Detail

avg(irate(node_cpu_seconds_total{mode!="idle", instance=~"$instance"}[5m])) by (mode)

Summary

1 - (avg(irate(node_cpu_seconds_total{mode="idle", instance=~"$instance"}[5m])))
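The `$instance` token in the node expressions appears to be a template variable that must be replaced with a concrete value before the query is run. A minimal sketch of that substitution follows, using Python's `string.Template`, which happens to share the `$name` syntax. The instance address `10.0.0.5:9100` is a made-up example value.

```python
from string import Template

# Node CPU utilization summary expression from this reference.
NODE_CPU_SUMMARY = Template(
    '1 - (avg(irate(node_cpu_seconds_total{mode="idle", instance=~"$instance"}[5m])))'
)

# Hypothetical node_exporter target; replace it with a real instance label value.
query = NODE_CPU_SUMMARY.substitute(instance="10.0.0.5:9100")
print(query)
```

Because the expression uses the regex matcher `=~`, the substituted value is interpreted as a regular expression, so several instances could be selected with a pattern such as `10.0.0.5:9100|10.0.0.6:9100`.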

Node Load Average

Catalog Expression

Detail

<table><tr><td>load1</td><td>sum(node_load1{instance=~"$instance"}) / count(node_cpu_seconds_total{mode="system",instance="$instance"})</td></tr><tr><td>load5</td><td>sum(node_load5{instance="$instance"}) / count(node_cpu_seconds_total{mode="system",instance="$instance"})</td></tr><tr><td>load15</td><td>sum(node_load15{instance="$instance"}) / count(node_cpu_seconds_total{mode="system",instance=~"$instance"})</td></tr></table>

Summary

<table><tr><td>load1</td><td>sum(node_load1{instance=~"$instance"}) / count(node_cpu_seconds_total{mode="system",instance="$instance"})</td></tr><tr><td>load5</td><td>sum(node_load5{instance="$instance"}) / count(node_cpu_seconds_total{mode="system",instance="$instance"})</td></tr><tr><td>load15</td><td>sum(node_load15{instance="$instance"}) / count(node_cpu_seconds_total{mode="system",instance=~"$instance"})</td></tr></table>

Node Memory Utilization

Catalog Expression

Detail

1 - sum(node_memory_MemAvailable_bytes{instance=~"$instance"}) / sum(node_memory_MemTotal_bytes{instance=~"$instance"})

Summary

1 - sum(node_memory_MemAvailable_bytes{instance=~"$instance"}) / sum(node_memory_MemTotal_bytes{instance=~"$instance"})

Node Disk Utilization

Catalog Expression

Detail

(sum(node_filesystem_size_bytes{device!="rootfs",instance=~"$instance"}) by (device) - sum(node_filesystem_free_bytes{device!="rootfs",instance=~"$instance"}) by (device)) / sum(node_filesystem_size_bytes{device!="rootfs",instance=~"$instance"}) by (device)

Summary

(sum(node_filesystem_size_bytes{device!="rootfs",instance=~"$instance"}) - sum(node_filesystem_free_bytes{device!="rootfs",instance=~"$instance"})) / sum(node_filesystem_size_bytes{device!="rootfs",instance=~"$instance"})

Node Disk I/O

Catalog Expression

Detail

<table><tr><td>read</td><td>sum(rate(node_disk_read_bytes_total{instance="$instance"}[5m]))</td></tr><tr><td>written</td><td>sum(rate(node_disk_written_bytes_total{instance="$instance"}[5m]))</td></tr></table>

Summary

<table><tr><td>read</td><td>sum(rate(node_disk_read_bytes_total{instance="$instance"}[5m]))</td></tr><tr><td>written</td><td>sum(rate(node_disk_written_bytes_total{instance="$instance"}[5m]))</td></tr></table>

Node Network Packets

Catalog Expression

Detail

<table><tr><td>receive-dropped</td><td>sum(rate(node_network_receive_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr><tr><td>receive-errs</td><td>sum(rate(node_network_receive_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr><tr><td>receive-packets</td><td>sum(rate(node_network_receive_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr><tr><td>transmit-dropped</td><td>sum(rate(node_network_transmit_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr><tr><td>transmit-errs</td><td>sum(rate(node_network_transmit_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr><tr><td>transmit-packets</td><td>sum(rate(node_network_transmit_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr></table>

Summary

<table><tr><td>receive-dropped</td><td>sum(rate(node_network_receive_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr><tr><td>receive-errs</td><td>sum(rate(node_network_receive_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr><tr><td>receive-packets</td><td>sum(rate(node_network_receive_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr><tr><td>transmit-dropped</td><td>sum(rate(node_network_transmit_drop_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr><tr><td>transmit-errs</td><td>sum(rate(node_network_transmit_errs_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr><tr><td>transmit-packets</td><td>sum(rate(node_network_transmit_packets_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr></table>

Node Network I/O

Catalog Expression

Detail

<table><tr><td>receive</td><td>sum(rate(node_network_receive_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr><tr><td>transmit</td><td>sum(rate(node_network_transmit_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m])) by (device)</td></tr></table>

Summary

<table><tr><td>receive</td><td>sum(rate(node_network_receive_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance="$instance"}[5m]))</td></tr><tr><td>transmit</td><td>sum(rate(node_network_transmit_bytes_total{device!~"lo | veth.* | docker.* | flannel.* | cali.* | cbr.*",instance=~"$instance"}[5m]))</td></tr></table>

Etcd Metrics

Etcd Has a Leader

max(etcd_server_has_leader)

Number of Times the Leader Changes

max(etcd_server_leader_changes_seen_total)

Number of Failed Proposals

sum(etcd_server_proposals_failed_total)

GRPC Client Traffic

Catalog Expression

Detail

<table><tr><td>in</td><td>sum(rate(etcd_network_client_grpc_received_bytes_total[5m])) by (instance)</td></tr><tr><td>out</td><td>sum(rate(etcd_network_client_grpc_sent_bytes_total[5m])) by (instance)</td></tr></table>

Summary

<table><tr><td>in</td><td>sum(rate(etcd_network_client_grpc_received_bytes_total[5m]))</td></tr><tr><td>out</td><td>sum(rate(etcd_network_client_grpc_sent_bytes_total[5m]))</td></tr></table>

Peer Traffic

Catalog Expression

Detail

<table><tr><td>in</td><td>sum(rate(etcd_network_peer_received_bytes_total[5m])) by (instance)</td></tr><tr><td>out</td><td>sum(rate(etcd_network_peer_sent_bytes_total[5m])) by (instance)</td></tr></table>

Summary

<table><tr><td>in</td><td>sum(rate(etcd_network_peer_received_bytes_total[5m]))</td></tr><tr><td>out</td><td>sum(rate(etcd_network_peer_sent_bytes_total[5m]))</td></tr></table>

DB Size

Catalog Expression

Detail

sum(etcd_debugging_mvcc_db_total_size_in_bytes) by (instance)

Summary

sum(etcd_debugging_mvcc_db_total_size_in_bytes)

Active Streams

Catalog Expression

Detail

<table><tr><td>lease-watch</td><td>sum(grpc_server_started_total{grpc_service="etcdserverpb.Lease",grpc_type="bidi_stream"}) by (instance) - sum(grpc_server_handled_total{grpc_service="etcdserverpb.Lease",grpc_type="bidi_stream"}) by (instance)</td></tr><tr><td>watch</td><td>sum(grpc_server_started_total{grpc_service="etcdserverpb.Watch",grpc_type="bidi_stream"}) by (instance) - sum(grpc_server_handled_total{grpc_service="etcdserverpb.Watch",grpc_type="bidi_stream"}) by (instance)</td></tr></table>

Summary

<table><tr><td>lease-watch</td><td>sum(grpc_server_started_total{grpc_service="etcdserverpb.Lease",grpc_type="bidi_stream"}) - sum(grpc_server_handled_total{grpc_service="etcdserverpb.Lease",grpc_type="bidi_stream"})</td></tr><tr><td>watch</td><td>sum(grpc_server_started_total{grpc_service="etcdserverpb.Watch",grpc_type="bidi_stream"}) - sum(grpc_server_handled_total{grpc_service="etcdserverpb.Watch",grpc_type="bidi_stream"})</td></tr></table>

Raft Proposals

Catalog Expression

Detail

<table><tr><td>applied</td><td>sum(increase(etcd_server_proposals_applied_total[5m])) by (instance)</td></tr><tr><td>committed</td><td>sum(increase(etcd_server_proposals_committed_total[5m])) by (instance)</td></tr><tr><td>pending</td><td>sum(increase(etcd_server_proposals_pending[5m])) by (instance)</td></tr><tr><td>failed</td><td>sum(increase(etcd_server_proposals_failed_total[5m])) by (instance)</td></tr></table>

Summary

<table><tr><td>applied</td><td>sum(increase(etcd_server_proposals_applied_total[5m]))</td></tr><tr><td>committed</td><td>sum(increase(etcd_server_proposals_committed_total[5m]))</td></tr><tr><td>pending</td><td>sum(increase(etcd_server_proposals_pending[5m]))</td></tr><tr><td>failed</td><td>sum(increase(etcd_server_proposals_failed_total[5m]))</td></tr></table>

RPC Rate

Catalog Expression

Detail

<table><tr><td>total</td><td>sum(rate(grpc_server_started_total{grpc_type="unary"}[5m])) by (instance)</td></tr><tr><td>fail</td><td>sum(rate(grpc_server_handled_total{grpc_type="unary",grpc_code!="OK"}[5m])) by (instance)</td></tr></table>

Summary

<table><tr><td>total</td><td>sum(rate(grpc_server_started_total{grpc_type="unary"}[5m]))</td></tr><tr><td>fail</td><td>sum(rate(grpc_server_handled_total{grpc_type="unary",grpc_code!="OK"}[5m]))</td></tr></table>

Disk Operations

Catalog Expression

Detail

<table><tr><td>commit-called-by-backend</td><td>sum(rate(etcd_disk_backend_commit_duration_seconds_sum[1m])) by (instance)</td></tr><tr><td>fsync-called-by-wal</td><td>sum(rate(etcd_disk_wal_fsync_duration_seconds_sum[1m])) by (instance)</td></tr></table>

Summary

<table><tr><td>commit-called-by-backend</td><td>sum(rate(etcd_disk_backend_commit_duration_seconds_sum[1m]))</td></tr><tr><td>fsync-called-by-wal</td><td>sum(rate(etcd_disk_wal_fsync_duration_seconds_sum[1m]))</td></tr></table>

Disk Sync Duration

Catalog Expression

Detail

<table><tr><td>wal</td><td>histogram_quantile(0.99, sum(rate(etcd_disk_wal_fsync_duration_seconds_bucket[5m])) by (instance, le))</td></tr><tr><td>db</td><td>histogram_quantile(0.99, sum(rate(etcd_disk_backend_commit_duration_seconds_bucket[5m])) by (instance, le))</td></tr></table>

Summary

<table><tr><td>wal</td><td>sum(histogram_quantile(0.99, sum(rate(etcd_disk_wal_fsync_duration_seconds_bucket[5m])) by (instance, le)))</td></tr><tr><td>db</td><td>sum(histogram_quantile(0.99, sum(rate(etcd_disk_backend_commit_duration_seconds_bucket[5m])) by (instance, le)))</td></tr></table>
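The `histogram_quantile(0.99, ...)` calls estimate the 99th percentile from cumulative `le` buckets by interpolating linearly inside the bucket that contains the target rank. The following is a simplified sketch of that idea rather than Prometheus's actual implementation; the bucket boundaries and counts are invented for illustration.

```python
import math


def histogram_quantile(q, buckets):
    """Estimate the q-quantile (0 < q <= 1) from cumulative histogram buckets.

    buckets: list of (upper_bound, cumulative_count) sorted by upper bound
    and ending with the +Inf bucket, mirroring the `le` label.
    """
    if not 0 < q <= 1:
        raise ValueError("q must be in (0, 1]")
    total = buckets[-1][1]
    if total == 0:
        return math.nan
    rank = q * total
    prev_bound, prev_count = 0.0, 0.0
    for bound, count in buckets:
        if count >= rank:
            if math.isinf(bound):
                # Rank falls in the +Inf bucket: report the highest finite bound.
                return prev_bound
            # Linear interpolation inside the bucket that contains the rank.
            fraction = (rank - prev_count) / (count - prev_count)
            return prev_bound + (bound - prev_bound) * fraction
        prev_bound, prev_count = bound, count
    return math.nan


# Hypothetical WAL fsync buckets: (upper bound in seconds, cumulative count).
fsync_buckets = [
    (0.001, 0.0),
    (0.002, 50.0),
    (0.004, 90.0),
    (0.008, 100.0),
    (math.inf, 100.0),
]
p99 = histogram_quantile(0.99, fsync_buckets)  # roughly 0.0076 seconds
```

In the expressions above, the bucket values are per-second rates summed by `instance` and `le` rather than raw counts, but the interpolation step works the same way.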

Kubernetes Components Metrics

API Server Request Latency

Catalog Expression

Detail

avg(apiserver_request_latencies_sum / apiserver_request_latencies_count) by (instance, verb) /1e+06

Summary

avg(apiserver_request_latencies_sum / apiserver_request_latencies_count) by (instance) /1e+06
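Dividing the `_sum` series by the `_count` series gives the mean latency per request, and the final division by `1e+06` suggests the metric is recorded in microseconds and converted to seconds. A small sketch of that calculation under that assumption, with a hypothetical helper name and made-up numbers:

```python
def average_latency_seconds(latency_sum_us, request_count):
    """Mean request latency in seconds from a microsecond sum and a request count."""
    if request_count == 0:
        return float("nan")
    return latency_sum_us / request_count / 1e6


# Hypothetical values: 30 requests that took 4,500,000 microseconds in total.
avg_latency = average_latency_seconds(4_500_000.0, 30)  # about 0.15 seconds
```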

API Server Request Rate

Catalog Expression

Detail

sum(rate(apiserver_request_count[5m])) by (instance, code)

Summary

sum(rate(apiserver_request_count[5m])) by (instance)

Scheduling Failed Pods

Catalog Expression

Detail

sum(kube_pod_status_scheduled{condition="false"})

Summary

sum(kube_pod_status_scheduled{condition="false"})

Controller Manager Queue Depth

Catalog Expression

Detail

<table><tr><td>volumes</td><td>sum(volumes_depth) by (instance)</td></tr><tr><td>deployment</td><td>sum(deployment_depth) by (instance)</td></tr><tr><td>replicaset</td><td>sum(replicaset_depth) by (instance)</td></tr><tr><td>service</td><td>sum(service_depth) by (instance)</td></tr><tr><td>serviceaccount</td><td>sum(serviceaccount_depth) by (instance)</td></tr><tr><td>endpoint</td><td>sum(endpoint_depth) by (instance)</td></tr><tr><td>daemonset</td><td>sum(daemonset_depth) by (instance)</td></tr><tr><td>statefulset</td><td>sum(statefulset_depth) by (instance)</td></tr><tr><td>replicationmanager</td><td>sum(replicationmanager_depth) by (instance)</td></tr></table>

Summary

<table><tr><td>volumes</td><td>sum(volumes_depth)</td></tr><tr><td>deployment</td><td>sum(deployment_depth)</td></tr><tr><td>replicaset</td><td>sum(replicaset_depth)</td></tr><tr><td>service</td><td>sum(service_depth)</td></tr><tr><td>serviceaccount</td><td>sum(serviceaccount_depth)</td></tr><tr><td>endpoint</td><td>sum(endpoint_depth)</td></tr><tr><td>daemonset</td><td>sum(daemonset_depth)</td></tr><tr><td>statefulset</td><td>sum(statefulset_depth)</td></tr><tr><td>replicationmanager</td><td>sum(replicationmanager_depth)</td></tr></table>

Scheduler E2E Scheduling Latency

Catalog Expression

Detail

histogram_quantile(0.99, sum(scheduler_e2e_scheduling_latency_microseconds_bucket) by (le, instance)) / 1e+06

Summary

sum(histogram_quantile(0.99, sum(scheduler_e2e_scheduling_latency_microseconds_bucket) by (le, instance)) / 1e+06)

Scheduler Preemption Attempts

Catalog Expression

Detail

sum(rate(scheduler_total_preemption_attempts[5m])) by (instance)

Summary

sum(rate(scheduler_total_preemption_attempts[5m]))

Ingress Controller Connections

Catalog Expression

Detail

<table><tr><td>reading</td><td>sum(nginx_ingress_controller_nginx_process_connections{state="reading"}) by (instance)</td></tr><tr><td>waiting</td><td>sum(nginx_ingress_controller_nginx_process_connections{state="waiting"}) by (instance)</td></tr><tr><td>writing</td><td>sum(nginx_ingress_controller_nginx_process_connections{state="writing"}) by (instance)</td></tr><tr><td>accepted</td><td>sum(ceil(increase(nginx_ingress_controller_nginx_process_connections_total{state="accepted"}[5m]))) by (instance)</td></tr><tr><td>active</td><td>sum(ceil(increase(nginx_ingress_controller_nginx_process_connections_total{state="active"}[5m]))) by (instance)</td></tr><tr><td>handled</td><td>sum(ceil(increase(nginx_ingress_controller_nginx_process_connections_total{state="handled"}[5m]))) by (instance)</td></tr></table>

Summary

<table><tr><td>reading</td><td>sum(nginx_ingress_controller_nginx_process_connections{state="reading"})</td></tr><tr><td>waiting</td><td>sum(nginx_ingress_controller_nginx_process_connections{state="waiting"})</td></tr><tr><td>writing</td><td>sum(nginx_ingress_controller_nginx_process_connections{state="writing"})</td></tr><tr><td>accepted</td><td>sum(ceil(increase(nginx_ingress_controller_nginx_process_connections_total{state="accepted"}[5m])))</td></tr><tr><td>active</td><td>sum(ceil(increase(nginx_ingress_controller_nginx_process_connections_total{state="active"}[5m])))</td></tr><tr><td>handled</td><td>sum(ceil(increase(nginx_ingress_controller_nginx_process_connections_total{state="handled"}[5m])))</td></tr></table>

Ingress Controller Request Process Time

Catalog Expression

Detail

topk(10, histogram_quantile(0.95,sum by (le, host, path)(rate(nginx_ingress_controller_request_duration_seconds_bucket{host!="_"}[5m]))))

Summary

topk(10, histogram_quantile(0.95,sum by (le, host)(rate(nginx_ingress_controller_request_duration_seconds_bucket{host!="_"}[5m]))))

Rancher Logging Metrics

Fluentd Buffer Queue Rate

Catalog Expression

Detail

sum(rate(fluentd_output_status_buffer_queue_length[5m])) by (instance)

Summary

sum(rate(fluentd_output_status_buffer_queue_length[5m]))

Fluentd Input Rate

Catalog Expression

Detail

sum(rate(fluentd_input_status_num_records_total[5m])) by (instance)

Summary

sum(rate(fluentd_input_status_num_records_total[5m]))

Fluentd Output Errors Rate

Catalog Expression

Detail

sum(rate(fluentd_output_status_num_errors[5m])) by (type)

Summary

sum(rate(fluentd_output_status_num_errors[5m]))

Fluentd Output Rate

Catalog Expression

Detail

sum(rate(fluentd_output_status_num_records_total[5m])) by (instance)

Summary

sum(rate(fluentd_output_status_num_records_total[5m]))

Workload Metrics

Workload CPU Utilization

Catalog Expression

Detail

<table><tr><td>cfs throttled seconds</td><td>sum(rate(container_cpu_cfs_throttled_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>user seconds</td><td>sum(rate(container_cpu_user_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>system seconds</td><td>sum(rate(container_cpu_system_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>usage seconds</td><td>sum(rate(container_cpu_usage_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr></table>

Summary

<table><tr><td>cfs throttled seconds</td><td>sum(rate(container_cpu_cfs_throttled_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>user seconds</td><td>sum(rate(container_cpu_user_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>system seconds</td><td>sum(rate(container_cpu_system_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>usage seconds</td><td>sum(rate(container_cpu_usage_seconds_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr></table>

Workload Memory Utilization

Catalog Expression

Detail

sum(container_memory_working_set_bytes{namespace="$namespace",pod_name=~"$podName", container_name!=""}) by (pod_name)

Summary

sum(container_memory_working_set_bytes{namespace="$namespace",pod_name=~"$podName", container_name!=""})

Workload Network Packets

Catalog Expression

Detail

<table><tr><td>receive-packets</td><td>sum(rate(container_network_receive_packets_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>receive-dropped</td><td>sum(rate(container_network_receive_packets_dropped_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>receive-errors</td><td>sum(rate(container_network_receive_errors_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>transmit-packets</td><td>sum(rate(container_network_transmit_packets_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>transmit-dropped</td><td>sum(rate(container_network_transmit_packets_dropped_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>transmit-errors</td><td>sum(rate(container_network_transmit_errors_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr></table>

Summary

<table><tr><td>receive-packets</td><td>sum(rate(container_network_receive_packets_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>receive-dropped</td><td>sum(rate(container_network_receive_packets_dropped_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>receive-errors</td><td>sum(rate(container_network_receive_errors_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>transmit-packets</td><td>sum(rate(container_network_transmit_packets_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>transmit-dropped</td><td>sum(rate(container_network_transmit_packets_dropped_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr><tr><td>transmit-errors</td><td>sum(rate(container_network_transmit_errors_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m]))</td></tr></table>

Workload Network I/O

Catalog Expression

Detail

<table><tr><td>receive</td><td>sum(rate(container_network_receive_bytes_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>transmit</td><td>sum(rate(container_network_transmit_bytes_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr></table>

Summary

<table><tr><td>receive</td><td>sum(rate(container_network_receive_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>transmit</td><td>sum(rate(container_network_transmit_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

Workload Disk I/O

Catalog Expression

Detail

<table><tr><td>read</td><td>sum(rate(container_fs_reads_bytes_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr><tr><td>write</td><td>sum(rate(container_fs_writes_bytes_total{namespace="$namespace",pod_name=~"$podName",container_name!=""}[5m])) by (pod_name)</td></tr></table>

Summary

<table><tr><td>read</td><td>sum(rate(container_fs_reads_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>write</td><td>sum(rate(container_fs_writes_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

Pod Metrics

Pod CPU Utilization

Catalog Expression

Detail

<table><tr><td>cfs throttled seconds</td><td>sum(rate(container_cpu_cfs_throttled_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m])) by (container_name)</td></tr><tr><td>usage seconds</td><td>sum(rate(container_cpu_usage_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m])) by (container_name)</td></tr><tr><td>system seconds</td><td>sum(rate(container_cpu_system_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m])) by (container_name)</td></tr><tr><td>user seconds</td><td>sum(rate(container_cpu_user_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m])) by (container_name)</td></tr></table>

Summary

<table><tr><td>cfs throttled seconds</td><td>sum(rate(container_cpu_cfs_throttled_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m]))</td></tr><tr><td>usage seconds</td><td>sum(rate(container_cpu_usage_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m]))</td></tr><tr><td>system seconds</td><td>sum(rate(container_cpu_system_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m]))</td></tr><tr><td>user seconds</td><td>sum(rate(container_cpu_user_seconds_total{container_name!="POD",namespace="$namespace",pod_name="$podName", container_name!=""}[5m]))</td></tr></table>

Pod Memory Utilization

Catalog Expression

Detail

sum(container_memory_working_set_bytes{container_name!="POD",namespace="$namespace",pod_name="$podName",container_name!=""}) by (container_name)

Summary

sum(container_memory_working_set_bytes{container_name!="POD",namespace="$namespace",pod_name="$podName",container_name!=""})

Pod Network Packets

Catalog Expression

Detail

<table><tr><td>receive-packets</td><td>sum(rate(container_network_receive_packets_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>receive-dropped</td><td>sum(rate(container_network_receive_packets_dropped_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>receive-errors</td><td>sum(rate(container_network_receive_errors_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>transmit-packets</td><td>sum(rate(container_network_transmit_packets_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>transmit-dropped</td><td>sum(rate(container_network_transmit_packets_dropped_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>transmit-errors</td><td>sum(rate(container_network_transmit_errors_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

Summary

<table><tr><td>receive-packets</td><td>sum(rate(container_network_receive_packets_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>receive-dropped</td><td>sum(rate(container_network_receive_packets_dropped_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>receive-errors</td><td>sum(rate(container_network_receive_errors_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>transmit-packets</td><td>sum(rate(container_network_transmit_packets_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>transmit-dropped</td><td>sum(rate(container_network_transmit_packets_dropped_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>transmit-errors</td><td>sum(rate(container_network_transmit_errors_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

Pod Network I/O

Catalog Expression

Detail

<table><tr><td>receive</td><td>sum(rate(container_network_receive_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>transmit</td><td>sum(rate(container_network_transmit_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

Summary

<table><tr><td>receive</td><td>sum(rate(container_network_receive_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>transmit</td><td>sum(rate(container_network_transmit_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

Pod Disk I/O

Catalog Expression

Detail

<table><tr><td>read</td><td>sum(rate(container_fs_reads_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m])) by (container_name)</td></tr><tr><td>write</td><td>sum(rate(container_fs_writes_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m])) by (container_name)</td></tr></table>

Summary

<table><tr><td>read</td><td>sum(rate(container_fs_reads_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr><tr><td>write</td><td>sum(rate(container_fs_writes_bytes_total{namespace="$namespace",pod_name="$podName",container_name!=""}[5m]))</td></tr></table>

Container Metrics

Container CPU Utilization

Catalog Expression

cfs throttled seconds

sum(rate(container_cpu_cfs_throttled_seconds_total{namespace="$namespace",pod_name="$podName",container_name="$containerName"}[5m]))

usage seconds

sum(rate(container_cpu_usage_seconds_total{namespace="$namespace",pod_name="$podName",container_name="$containerName"}[5m]))

system seconds

sum(rate(container_cpu_system_seconds_total{namespace="$namespace",pod_name="$podName",container_name="$containerName"}[5m]))

user seconds

sum(rate(container_cpu_user_seconds_total{namespace="$namespace",pod_name="$podName",container_name="$containerName"}[5m]))

Container Memory Utilization

sum(container_memory_working_set_bytes{namespace="$namespace",pod_name="$podName",container_name="$containerName"})

Container Disk I/O

Catalog Expression

read

sum(rate(container_fs_reads_bytes_total{namespace="$namespace",pod_name="$podName",container_name="$containerName"}[5m]))

write

sum(rate(container_fs_writes_bytes_total{namespace="$namespace",pod_name="$podName",container_name="$containerName"}[5m]))