1. Create a student table
create table student(sno int, sname string, sex string, sage int, sdept string) row format delimited fields terminated by ',' stored as textfile;
2. Load data from HDFS into the table
load data inpath '/hivedata/student.txt' overwrite into table student;
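Worth knowing when repeating this step: load data inpath moves the HDFS file into the table's directory, whereas load data local inpath copies a file from the local file system. A minimal sketch of the local variant, assuming a hypothetical local file /tmp/student.txt with the same comma-separated layout:

```sql
-- Assumption: /tmp/student.txt is a hypothetical local file whose lines
-- follow the table layout sno,sname,sex,sage,sdept.
load data local inpath '/tmp/student.txt' overwrite into table student;

-- Quick check that the rows were parsed into the expected columns.
select * from student limit 5;
```

If the columns come back as NULL, the field delimiter in the file most likely does not match the ',' declared in the table definition.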
3. Use order by (select * from student order by sex, sage;)
order by starts only one reducer to perform a global sort, so every row passes through that single reducer.
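As an illustration of the global sort, the sketch below adds explicit sort directions and a limit. When Hive runs in strict mode (hive.mapred.mode=strict on older releases), an order by without a limit is rejected, so the second form is the one that works there. The value 10 is an arbitrary choice for the example.

```sql
-- Sort by sex ascending, then by age descending within each sex.
select * from student order by sex asc, sage desc;

-- Same kind of global sort with a bounded result; strict mode requires the limit.
select * from student order by sage desc limit 10;
```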
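4. Use sort by (select * from student sort by sage;)
sort by performs a local sort on the data within each partition (that is, within each reducer), so the overall output is globally ordered only when there is a single reducer.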
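The local nature of sort by only becomes visible with more than one reducer. A minimal sketch, assuming the MapReduce execution engine and a hypothetical scratch directory /tmp/student_sortby on the machine running the Hive client:

```sql
-- Ask for three reducers so that the per-reducer sort is visible.
-- (Older Hive releases use the property name mapred.reduce.tasks.)
set mapreduce.job.reduces=3;

-- Assumption: /tmp/student_sortby is a hypothetical local directory;
-- its previous contents are overwritten.
insert overwrite local directory '/tmp/student_sortby'
select * from student sort by sage;
```

Each reducer writes its own output file into the directory. Every file is ordered by sage on its own, but concatenating the files does not in general give a globally ordered result.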
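5. distribute by
distribute by partitions the rows across the reducers: rows with the same value in the distribute by column are sent to the same reducer.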
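A sketch of distribute by used alone, here on the sdept column of the student table. The reducer count and the output directory /tmp/student_distribute are assumptions made for the example.

```sql
-- Three reducers, so the rows are spread over up to three output files.
set mapreduce.job.reduces=3;

-- Assumption: /tmp/student_distribute is a hypothetical local directory.
-- All rows of one department end up in the same file, but the rows
-- inside a file are not sorted.
insert overwrite local directory '/tmp/student_distribute'
select * from student distribute by sdept;
```

Because rows are assigned by hashing the column value, one reducer may receive several departments; the guarantee is only that a single department is never split across reducers.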
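6. Use distribute by together with sort by (select * from student distribute by sage sort by sage;)
7. Use cluster by (select * from student cluster by sage;)
The SQL statements in steps 6 and 7 are equivalent: cluster by sage is shorthand for distribute by sage sort by sage.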
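The equivalence holds only when the distribution column and the sort column are the same and the order is ascending. When they differ, the two-clause form is needed. A sketch using the sdept and sage columns of the student table:

```sql
-- Send each department to one reducer, then order that reducer's rows
-- by department and, within a department, by age descending.
select * from student distribute by sdept sort by sdept, sage desc;

-- cluster by sdept corresponds only to:
--   distribute by sdept sort by sdept
select * from student cluster by sdept;
```

The first query cannot be written with cluster by, since cluster by takes no sort direction and always sorts on the same columns it distributes by.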