Hive

一、Hive安装(略)

二、数据导入

(1)数据的分隔符在创建表的时候指定:

create table if not exists test(name string,age int) row format delimited fields terminated by '	';

指定之后在导入数据时就会以 也就是tab分隔字串:

ngsan   30

lisi    50

wangwu  60

peiqi   6

(2)创建Hive外部表

CREATE external TABLE tt (

stime STRING,

assid STRING,

ssid STRING,

other STRING,

mac STRING,

servicecode STRING,

host STRING,

apname STRING,

termtype STRING

) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '
' LOCATION '/logstash';

此时会自动创建元数据,无需load

(3)创建Hive内部表

CREATE external TABLE tt (

stime STRING,

assid STRING,

ssid STRING,

other STRING,

mac STRING,

servicecode STRING,

host STRING,

apname STRING,

termtype STRING

) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '
';

LOAD DATA INPATH 'hdfs://cdh01:8020/logstash/test.txt' overwrite INTO TABLE bigdata_odl.mac;

创建内部表时需要load数据才会创建元数据,而且删除了表则数据也删除了;注意: 'hdfs://cdh01:8020/logstash/test.txt' cdh连接,非cdh则为:'hdfs://logstash/test.txt'

原文地址:https://www.cnblogs.com/lzj123/p/13474122.html