安装
wget https://static.imply.io/release/imply-2.5.15.tar.gz tar zxvf imply-2.5.15.tar.gz -C /usr/local/ cd imply-2.5.15 nohup bin/supervise -c conf/supervise/quickstart.conf > quickstart.log &
查看启动日志
[root@localhost imply-2.5.15]# tail -f quickstart.log [Sat Jun 9 11:32:22 2018] Running command[zk], logging to[/usr/local/imply-2.5.15/var/sv/zk.log]: bin/run-zk conf-quickstart [Sat Jun 9 11:32:22 2018] Running command[coordinator], logging to[/usr/local/imply-2.5.15/var/sv/coordinator.log]: bin/run-druid coordinator conf-quickstart [Sat Jun 9 11:32:22 2018] Running command[broker], logging to[/usr/local/imply-2.5.15/var/sv/broker.log]: bin/run-druid broker conf-quickstart [Sat Jun 9 11:32:22 2018] Running command[historical], logging to[/usr/local/imply-2.5.15/var/sv/historical.log]: bin/run-druid historical conf-quickstart [Sat Jun 9 11:32:22 2018] Running command[overlord], logging to[/usr/local/imply-2.5.15/var/sv/overlord.log]: bin/run-druid overlord conf-quickstart [Sat Jun 9 11:32:22 2018] Running command[middleManager], logging to[/usr/local/imply-2.5.15/var/sv/middleManager.log]: bin/run-druid middleManager conf-quickstart [Sat Jun 9 11:32:22 2018] Running command[imply-ui], logging to[/usr/local/imply-2.5.15/var/sv/imply-ui.log]: bin/run-imply-ui-quickstart conf-quickstart
可见启动了如下服务
服务 | 简介 | 访问地址 |
zookeeper | 分布式协调服务 | —— |
coordinator | 协调节点,管理集群状态 | http://localhost:8081/#/ |
broker | 查询节点,处理查询请求 | http://localhost:8082/druid/v2 |
historical | 历史节点,管理历史数据 | http://localhost:8083/druid/v2 |
overlord | 统治节点,管理数据写入任务 | http://localhost:8090/console.html |
middleManager | 中间管理者,负责写数据处理 | —— |
imply-ui | 数据查询Web UI | http://localhost:9095/datasets/ |
数据导入与查询
导入离线数据与查询
数据导入与查询
导入离线数据与查询
-
quickstart/wikipedia-2016-06-27-sampled.json文件包含了维基百科的一些日志数据(每行是一个JSON字符串)
-
quickstart/wikipedia-index.json文件为离线写入任务的描述文件,其用JSON格式描述了任务的数据源、时间信息、维度信息、指标信息等
将这些数据导入到Druid中
bin/post-index-task --file quickstart/wikipedia-index.json
可能会报“ImportError: No module named argparse”的错误,执行pip install argparse安装即可解决
bin/post-index-task --file quickstart/wikipedia-index.json Beginning indexing data for wikipedia Task started: index_wikipedia_2018-06-09T04:03:26.580Z Task log: http://localhost:8090/druid/indexer/v1/task/index_wikipedia_2018-06-09T04:03:26.580Z/log Task status: http://localhost:8090/druid/indexer/v1/task/index_wikipedia_2018-06-09T04:03:26.580Z/status Task index_wikipedia_2018-06-09T04:03:26.580Z still running... Task index_wikipedia_2018-06-09T04:03:26.580Z still running... Task finished with status: SUCCESS Completed indexing data for wikipedia. Now loading indexed data onto the cluster... wikipedia is 0.0% finished loading... wikipedia loading complete! You may now query your data
查询数据
访问http://localhost:9095,可以查看当前数据集的概况和字段信息等
查询编辑次数TOP5的页面
方式一切到SQL标签,输入以下SQL
SELECT page, COUNT(*) AS Edits FROM wikipedia WHERE "__time" BETWEEN TIMESTAMP '2016-06-27 00:00:00' AND TIMESTAMP '2016-06-28 00:00:00' GROUP BY page ORDER BY Edits DESC LIMIT 5
方式二直接通过CURL查询
quickstart/wikipedia-top-pages.json中定了查询的JSON文件
curl -X 'POST' -H 'Content-Type:application/json' -d @quickstart/wikipedia-top-pages.json http://localhost:8082/druid/v2?pretty
返回
[ { "timestamp" : "2016-06-27T00:00:11.080Z", "result" : [ { "count" : 29, "page" : "Copa América Centenario" }, { "count" : 16, "page" : "User:Cyde/List of candidates for speedy deletion/Subpage" }, { "count" : 16, "page" : "Wikipedia:Administrators' noticeboard/Incidents" }, { "count" : 15, "page" : "2016 Wimbledon Championships – Men's Singles" }, { "count" : 15, "page" : "Wikipedia:Administrator intervention against vandalism" } ] } ]
实时导入与查询
发送数据
bin/generate-example-metrics可以产生一些随机的样例metric,运行下面的语句
[root@localhost imply]# bin/generate-example-metrics | curl -XPOST -H'Content-Type: application/json' --data-binary @- http://localhost:8200/v1/post/tutorial-tranquility-server
返回结果如下
{"result":{"received":25,"sent":25}}