03准备工作

1.创建表

1.1 所需表
	gulivideo_ori,gulivideo_user_ori,
	gulivideo_orc,gulivideo_user_orc。
1.2 代码
	create table gulivideo_ori(
		videoId string, 
		uploader string, 
		age int, 
		category array<string>, 
		length int, 
		views int, 
		rate float, 
		ratings int, 
		comments int,
		relatedId array<string>)
	row format delimited 
	fields terminated by "\t"
	collection items terminated by "&"
	stored as textfile;

	create table gulivideo_user_ori(
		uploader string,
		videos int,
		friends int)
	row format delimited 
	fields terminated by "\t" 
	stored as textfile;
	
	create table gulivideo_orc(
		videoId string, 
		uploader string, 
		age int, 
		category array<string>, 
		length int, 
		views int, 
		rate float, 
		ratings int, 
		comments int,
		relatedId array<string>)
	clustered by (uploader) into 8 buckets 
	row format delimited fields terminated by "\t" 
	collection items terminated by "&" 
	stored as orc;

	create table gulivideo_user_orc(
		uploader string,
		videos int,
		friends int)
	row format delimited 
	fields terminated by "\t" 
	stored as orc;
	
1.3 将原始数据插入到orc中

2.导入ETL后的数据

load data inpath "/gulivideo/output/video/2008/0222" into table gulivideo_ori;
load data inpath "/gulivideo/user/2008/0903" 		 into table gulivideo_user_ori;

3.向ORC表插入数据

insert into table gulivideo_orc 		select * from gulivideo_ori;
insert into table gulivideo_user_orc 	select * from gulivideo_user_ori;		
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

hao难懂

你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值