Flink实战（七十）：监控（二）搭建flink可视化监控 Pushgateway+ Prometheus + Grafana （windows ）

1 Flink 的配置:

在flink配置⽂件flink-conf.yaml中添加：

metrics.reporter.promgateway.class:
org.apache.flink.metrics.prometheus.PrometheusPushGatewayReporter
metrics.reporter.promgateway.host: localhost # promgateway 主要是Pushgateway所在机器的ip地址
metrics.reporter.promgateway.port: 9091
metrics.reporter.promgateway.jobName: zhisheng // 随意起名
metrics.reporter.promgateway.randomJobNameSuffix: true
metrics.reporter.promgateway.deleteOnShutdown: false

将flink包中opt文件下的flink-metrics-prometheus-xxxxx.jar包复制到lib文件夹中

2 pushgateway 配置

2.1 下载prometheus到window上

Prometheus 的下载链接为：
https://prometheus.io/download/

下载 pushgateway-1.3.0.windows-amd64.tar.gz 后解压

3 Prometheus配置

3.1 下载prometheus到window上

Prometheus 的下载链接为：
https://prometheus.io/download/

下载 prometheus-2.22.0.windows-amd64.tar.gz 后解压

这里所需Prometheus的组件为：

prometheus
pushgateway(Flink推送监控数据到此)

将这些组件分别解压到任意目录。

3.2 配置Prometheus

修改Prometheus根目录prometheus.yml文件的scrape_config，如下图所示：

# my global config
global:
  scrape_interval:     15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
  evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.
  # scrape_timeout is set to the global default (10s).

# Alertmanager configuration
alerting:
  alertmanagers:
  - static_configs:
    - targets:
      # - alertmanager:9093

# Load rules once and periodically evaluate them according to the global 'evaluation_interval'.
rule_files:
  # - "first_rules.yml"
  # - "second_rules.yml"

# A scrape configuration containing exactly one endpoint to scrape:
# Here it's Prometheus itself.
scrape_configs:
  # The job name is added as a label `job=<job_name>` to any timeseries scraped from this config.
  - job_name: 'prometheus'
    honor_labels: true

    # metrics_path defaults to '/metrics'
    # scheme defaults to 'http'.

    static_configs:
    - targets: ['localhost:9091']
      labels: 
         instance: pushgateway