hive建表后缀

最终版

  • 数据存储格式推荐使用orc,压缩方式推荐使用snappy(ORC文件的默认压缩方式是ZLIB)
COMMENT '表注释'
PARTITIONED BY (dt string COMMENT '日分区')
ROW FORMAT DELIMITED
    FIELDS TERMINATED BY '\007'
STORED AS ORC 
TBLPROPERTIES ('orc.compression'='snappy');
  • 分桶(分区常用;分桶少用)
COMMENT '表注释'
PARTITIONED BY (dt string COMMENT '日分区')
clustered by(c_id) into 4 buckets
ROW FORMAT DELIMITED
    FIELDS TERMINATED BY '\007'
STORED AS ORC 
TBLPROPERTIES ('orc.compression'='snappy');

建表书写

  • 分区 \007分隔符 textfile
comment '表注释'
partitioned by (dt string)
row format delimited
fields terminated by '\007'
stored as textfile;
  • 分区 \007分隔符 orc
comment '表注释'
partitioned by (dt string)
row format delimited
fields terminated by '\007'
stored as orc;
  • row format delimited

ROW FORMAT DELIMITED NULL DEFINED AS ‘’
功能:将Hive的这张表中的null设置为空
注意:Hive中的NULL值是假NULL,Hive底层的数据是文件,Hive中的NULL值实际是\n
为什么特殊指定?
如果将Hive中的NUll导出到MySQL中,就不能成功

show create table 显示

  • 分区 \t分隔符,要当心 textfile

PARTITIONED BY (dt string)
ROW FORMAT SERDE
‘org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe’
WITH SERDEPROPERTIES (
‘field.delim’=‘\t’,
‘serialization.format’=‘\t’)
STORED AS INPUTFORMAT
‘org.apache.hadoop.mapred.TextInputFormat’
OUTPUTFORMAT
‘org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat’

  • 分区 \t分隔符 orc

partitioned by (dt date)
ROW FORMAT SERDE
‘org.apache.hadoop.hive.ql.io.orc.OrcSerde’
WITH SERDEPROPERTIES (
‘field.delim’=‘\t’,
‘serialization.format’=‘\t’)
STORED AS INPUTFORMAT
‘org.apache.hadoop.hive.ql.io.orc.OrcInputFormat’
OUTPUTFORMAT
‘org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat’

  • 分区 默认分隔符 orc

partitioned by (dt date)
ROW FORMAT SERDE
‘org.apache.hadoop.hive.ql.io.orc.OrcSerde’
STORED AS INPUTFORMAT
‘org.apache.hadoop.hive.ql.io.orc.OrcInputFormat’
OUTPUTFORMAT
‘org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat’

  • 分区 未知分隔符,要当心 textfile

PARTITIONED BY (dt string)
ROW FORMAT SERDE
‘org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe’
WITH SERDEPROPERTIES (
‘field.delim’=‘’,
‘serialization.format’=‘’)
STORED AS INPUTFORMAT
‘org.apache.hadoop.mapred.TextInputFormat’
OUTPUTFORMAT
‘org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat’

  • 1
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值