其它服务页面,目前提供RabbitMQ和Etcd服务监控数据。
RabbitMQ
功能介绍
RabbitMQ页面接口调用说明。
URI
GET /apis/monitoring/v1/ecms/rabbitmq
可选Metrics
名称 | 说明 | 单位 |
---|---|---|
rabbitmq_cluster_status | RabbitMQ 集群整体状态 RabbitMQ Cluster |
|
rabbitmq_cluster_queues | 队列数 Queues | |
rabbitmq_cluster_connections | 连接数 Connections | |
rabbitmq_cluster_channels | 通道数 Channels | |
rabbitmq_cluster_exchanges | 消息交换机 Exchanges | |
rabbitmq_cluster_consumers | 消费者数 Consumers | |
rabbitmq_node_status | RabbitMQ 节点状态 RabbitMQ Node Status |
|
rabbitmq_top10_queues_ack | 消息数前10名队列 Top 10 Queues (Sorted by Messages) |
|
rabbitmq_top10_queues_unack | 未应答消息数前10名队列 Top 10 Queues (Sorted by unack Messages) |
|
rabbitmq_messages_ready | 已就绪消息数 Ready Messages | |
rabbitmq_messages_published | 已发布消息 Published Messages | |
rabbitmq_messages_delivered | 已交付消息 Delivered Messages | |
rabbitmq_messages_ack | 已确认消息 Ack Messages | |
rabbitmq_messages_unack | 未确认消息 Unack Messages | |
rabbitmq_nodes_disk_free_bytes | 节点磁盘 - Free Nodes Disk - Free |
Byte |
rabbitmq_nodes_disk_limit_bytes | 节点磁盘 - Limit Nodes Disk - Limit |
Byte |
rabbitmq_nodes_mem_used_bytes | 节点内存 - Used Nodes Memory - Used |
Byte |
rabbitmq_nodes_mem_limit_bytes | 节点内存 - Limit Nodes Memory - Limit |
Byte |
rabbitmq_file_descriptors_total | 文件描述符总数 File Descriptors Total |
|
rabbitmq_file_descriptors_used | 文件描述符已使用数 File Descriptors Used |
|
rabbitmq_sockets_total | Socket连接总数 Sockets Total |
|
rabbitmq_sockets_used | Socket连接已使用数 Sockets Used |
示例
示例一:查询RabbitMQ集群整体状态
请求示例:
curl -H 'X-Auth-Token: gAAAAABf_Qb8yyZ2zM-uhK7q5c8D6ikGc7A5o4FenPkzYlpCI2PhJXtnDOyl9EeCQ-Gj9lg0SV37PIZda3nlUCTrBawfsbnSM1pw-9QJWoEo3NDxSvM5fyhQIadjI17H0jV40NHx9tc3EkIxrAScXhxZsldOt9uVjRO6cq9pFTKs6qhxyu1GBHk' 'http://emla.openstack.svc.cluster.local/apis/monitoring/v1/ecms/rabbitmq?metrics_filter=rabbitmq_cluster_status'
响应示例:
{
"results": [
{
"metric_name": "rabbitmq_cluster_status",
"data": {
"resultType": "vector",
"result": [
{
"value": [
1609315972.046,
"1"
]
}
]
}
}
]
}
RabbitMQ 集群整体状态 rabbitmq_cluster_status 有以下四种状态:
- 健康(Healthy)
- 警告(Warning)
- 故障(Error)
- 无数据(No data)
状态–Value:
Metric 名称 | 说明 | Healthy | Warning | Error | No data |
---|---|---|---|---|---|
rabbitmq_cluster_status | RabbitMQ 集群整体状态 | [1, 1) | [0.001, 0.999) | [0, 0) | [null, null) |
“1”在[1, 1)范围内,所以此时RabbitMQ 集群整体状态为健康。
示例二:查询消息数前10队列
请求示例:
curl -H 'X-Auth-Token: gAAAAABf_Qb8yyZ2zM-uhK7q5c8D6ikGc7A5o4FenPkzYlpCI2PhJXtnDOyl9EeCQ-Gj9lg0SV37PIZda3nlUCTrBawfsbnSM1pw-9QJWoEo3NDxSvM5fyhQIadjI17H0jV40NHx9tc3EkIxrAScXhxZsldOt9uVjRO6cq9pFTKs6qhxyu1GBHk' 'http://emla.openstack.svc.cluster.local/apis/monitoring/v1/ecms/rabbitmq?metrics_filter=rabbitmq_top10_queues_ack'
响应示例:
{
"results": [
{
"metric_name": "rabbitmq_top10_queues_ack",
"data": {
"resultType": "vector",
"result": [
{
"metric": {
"__name__": "top10_rabbitmq_queue_messages",
"queue": "mistral_engine",
},
"value": [
1609318241.539,
"1"
]
},
{
"metric": {
"__name__": "top10_rabbitmq_queue_messages",
"queue": "notifications.info",
},
"value": [
1609318241.539,
"100"
]
},
...
{
"metric": {
"__name__": "top10_rabbitmq_queue_messages",
"queue": "notifications.sample",
},
"value": [
1609318241.539,
"168"
]
}
]
}
}
]
}
metric中“queue”的值为队列名称
消息数前10队列:
队列 | 消息数 | |
---|---|---|
1 | mistral_engine | 1 |
2 | notifications.info | 100 |
… | ||
10 | notifications.sample | 168 |
Etcd
功能介绍
Etcd页面接口调用说明。
URI
GET /apis/monitoring/v1/ecms/etcd
可选Metrics
名称 | 说明 | 单位 |
---|---|---|
etcd_cluster_status | Etcd 集群整体状态 Etcd Cluster |
|
etcd_node_status | Etcd 节点状态 Etcd On Node-x Status |
|
etcd_leader_info | Etcd Leader 信息 Etcd Leader Info |
|
etcd_total_leader_elections_per_day | 近一天的Leader选举数 Total Leader Elections Per Day |
|
etcd_rpc_rate | RPC 启动总数 RPC Rate |
|
etcd_rpc_failed_rate | RPC 启动失败数 RPC Failed Rate |
|
etcd_raft_proposals_failed_rate | Raft 提案失败数 Raft Proposals Failed Rate |
|
etcd_raft_proposals_pending_total | Raft 提案排队数 Raft Proposals Pending Total |
|
etcd_raft_proposals_committed_rate | Raft 提案提交数 Raft Proposals Committed Rate |
|
etcd_raft_proposals_applied_rate | Raft 提案应用数 Raft Proposals Applied Rate |
|
etcd_db_size_total_bytes | DB 总量 DB Size Total |
Byte |
etcd_db_size_in_use_bytes | DB 使用量 DB Size Total |
Byte |
etcd_cpu_usage | Etcd CPU 使用量 Etcd CPU Usage |
|
etcd_resident_memory_bytes | 常驻内存 Resident Memory |
Byte |
etcd_wal_fsync_latency_ms | Etcd WAL 同步延迟 Etcd WAL Fsync Latency |
ms |
etcd_backend_commit_latency_ms | Etcd 后端提交延迟 Etcd Backend Commit Latency |
ms |
etcd_client_traffic_received | Etcd 客户端接收流量 Etcd Client Traffic Received |
Bps |
etcd_client_traffic_received_avg | Etcd 客户端接收流量平均值 Etcd Client Traffic Received Average |
Bps |
etcd_client_traffic_sent | Etcd 客户端发送流量 Etcd Client Traffic Sent |
Bps |
etcd_client_traffic_sent_avg | Etcd 客户端发送流量平均值 Etcd Client Traffic Sent Average |
Bps |
示例
示例一:查询Etcd集群整体状态
请求示例:
curl -H 'X-Auth-Token: gAAAAABf_Qb8yyZ2zM-uhK7q5c8D6ikGc7A5o4FenPkzYlpCI2PhJXtnDOyl9EeCQ-Gj9lg0SV37PIZda3nlUCTrBawfsbnSM1pw-9QJWoEo3NDxSvM5fyhQIadjI17H0jV40NHx9tc3EkIxrAScXhxZsldOt9uVjRO6cq9pFTKs6qhxyu1GBHk' 'http://emla.openstack.svc.cluster.local/apis/monitoring/v1/ecms/etcd?metrics_filter=etcd_cluster_status'
响应示例:
{
"results": [
{
"metric_name": "etcd_cluster_status",
"data": {
"resultType": "vector",
"result": [
{
"value": [
1650800110.54,
"0.1"
]
}
]
}
}
]
}
Etcd 集群整体状态 etcd_cluster_status 有以下四种状态:
- 健康(Healthy)
- 警告(Warning)
- 故障(Error)
- 无数据(No data)
状态–Value:
Metric 名称 | 说明 | Healthy | Warning | Error | No data |
---|---|---|---|---|---|
etcd_cluster_status | Etcd 集群整体状态 | [1, 1) | [0.001, 0.999) | [0, 0) | [null, null) |
“0.1”在[0.001, 0.999)范围内,所以此时Etcd集群整体状态为警告。
示例二:查询Etcd Leader信息
请求示例:
curl -H 'X-Auth-Token: gAAAAABf_Qb8yyZ2zM-uhK7q5c8D6ikGc7A5o4FenPkzYlpCI2PhJXtnDOyl9EeCQ-Gj9lg0SV37PIZda3nlUCTrBawfsbnSM1pw-9QJWoEo3NDxSvM5fyhQIadjI17H0jV40NHx9tc3EkIxrAScXhxZsldOt9uVjRO6cq9pFTKs6qhxyu1GBHk' 'http://emla.openstack.svc.cluster.local/apis/monitoring/v1/ecms/etcd?metrics_filter=etcd_leader_info'
响应示例:
{
"results": [
{
"metric_name": "etcd_leader_info",
"data": {
"resultType": "vector",
"result": [
{
"metric": {
"__name__": "ecms_etcd_server_is_leader",
"endpoint": "metrics",
"host_ip": "10.10.1.4",
"instance": "10.10.1.4:2379",
"job": "etcd",
"namespace": "kube-system",
"node_name": "node-1",
"service": "etcd"
},
"value": [
1650800987.238,
"0"
]
},
{
"metric": {
"__name__": "ecms_etcd_server_is_leader",
"endpoint": "metrics",
"host_ip": "10.10.1.5",
"instance": "10.10.1.5:2379",
"job": "etcd",
"namespace": "kube-system",
"node_name": "node-2",
"service": "etcd"
},
"value": [
1650800987.238,
"1"
]
},
{
"metric": {
"__name__": "ecms_etcd_server_is_leader",
"endpoint": "metrics",
"host_ip": "10.10.1.6",
"instance": "10.10.1.6:2379",
"job": "etcd",
"namespace": "kube-system",
"node_name": "node-3",
"service": "etcd"
},
"value": [
1650800987.238,
"0"
]
}
]
}
}
]
}
Etcd Leader 信息:
节点名称 | 节点IP | 是否是Leader |
---|---|---|
node-1 | 10.10.1.4 | 0:否 |
node-2 | 10.10.1.5 | 1:是 |
node-3 | 10.10.1.6 | 0:否 |