Hive练习

本文通过一系列HQL(Hive SQL)查询实例,展示了如何在Hive中进行数据加载、部门与员工信息的查询,包括部门信息、员工薪资比较、上下级关系、入职日期比较等复杂操作,涵盖了子查询、联接、聚合函数等多种HQL技巧。
摘要由CSDN通过智能技术生成

一、将下列数据加载hive表 :

员工信息表emp:
字段:员工id,员工名字,工作岗位,部门经理,受雇日期,薪水,奖金,部门编号
英文名:EMPNO,ENAME,JOB,MGR,HIREDATE,SAL,BONUS,DEPTNO

部门信息表dept:
字段:部门编号,部门名称,部门地点
英文名:DEPTNO,DEPTNAME,DEPTADDR

建表emp: 

create table emp(
    EMPNO int
    ,ENAME string
    ,JOB string
    ,MGR int
    ,HIREDATE string
    ,SAL int
    ,BONUS int
    ,DEPTNO int

row format delimited
fields terminated by ',';

数据:

7369,SMITH,CLERK,7902,1980-12-17,800,null,20
7499,ALLEN,SALESMAN,7698,1981-02-20,1600,300,30
7521,WARD,SALESMAN,7698,1981-02-22,1250,500,30
7566,JONES,MANAGER,7839,1981-04-02,2975,null,20,
7654,MARTIN,SALESMAN,7698,1981-09-28,1250,1400,30
7698,BLAKE,MANAGER,7839,1981-05-01,2850,null,30
7782,CLARK,MANAGER,7839,1981-06-09,2450,null,10
7788,SCOTT,ANALYST,7566,1987-04-19,3000,null,20
7839,KING,PRESIDENT,null,1981-11-17,5000,null,10
7844,TURNER,SALESMAN,7698,1981-09-08,1500,0,30
7876,ADAMS,CLERK,7788,1987-05-23,1100,null,20
7900,JAMES,CLERK,7698,1981-12-03,950,null,30
7902,FORD,ANALYST,7566,1981-12-03,3000,null,20
7934,MILLER,CLERK,7782,1982-01-23,1300,null,10

建表dept:

create table dept(
    DEPTNO int
    ,DEPTNAME string
    ,DEPTADDR string

row format delimited
fields terminated by ',';

数据:

10,ACCOUNTING,NEW YORK
10,ACCOUNTING,shanghai
20,RESEARCH,DALLAS
30,SALES,CHICAGO
40,OPERATIONS,BOSTON

二、 使用HQL完成下面需求:

1. 列出至少有一个员工的所有部门。 
 select * from dept where DEPTNO in (select DEPTNO from emp group by DEPTNO);


2. 列出薪金比“SMITH”多的所有员工。
 select t1.ename,t1.sal from (select *,1 as cid from emp) t1 join (select sal,1 as cid from emp where ename='SMITH') t2 on t1.cid=t2.cid where t1.sal>t2.sal;


3. 列出所有员工的姓名及其直接上级的姓名。 
select t1.ename,t2.ename from emp t1 left join emp t2 on t1.mgr=t2.empno;


4. 列出受雇日期早于其直接上级的所有员工。 
select t1.ename,t1.hiredate,t2.ename,t2.hiredate from emp t1 left join emp t2 on t1.mgr=t2.empno where t1.hiredate<t2.hiredate;


5. 列出部门名称和这些部门的员工信息,同时列出那些没有员工的部门。 
 select t1.deptname,t2.* from dept t1 left join emp t2 on t1.deptno = t2.deptno;


6. 列出所有“CLERK”(办事员)的姓名及其部门名称。
select t1.ename,t2.deptname from (select * from emp where job='CLERK') t1 left join dept t2 on t1.deptno=t2.deptno;


7. 列出最低薪金大于1500的各种工作。
select job from emp where sal > 1500 group by job;


8. 列出在部门“SALES”(销售部)工作的员工的姓名,假定不知道销售部的部门编号
select t1.ename,t2.deptname from emp t1 right join (select * from dept where deptname='SALES') t2 on t1.deptno=t2.deptno;


9. 列出薪金高于公司平均薪金的所有员工。 
round(column_name,2)                     四舍五入截取
cast(column_name as decimal(10,2)) 10表示最多可以有10位数字,2表示小数部分的位数为2  cast函数截取(推荐使用)

select t1.ename,t1.sal,t2.avg_sal from (select ename,sal,1 as cid from emp) t1 left join (select cast(avg(sal) as decimal(10,2))  as avg_sal,1 as cid from emp) t2 on t1.cid=t2.cid where t1.sal>t2.avg_sal;


10.列出与“SCOTT”从事相同工作的所有员工。
 select t1.ename,t2.job from (select ename,job,1 as cid from emp) t1 join (select job,1 as cid from emp where ename='SCOTT') t2 on t1.cid=t2.cid where t1.job=t2.job and t1.ename<>'SCOTT';


11.列出薪金等于部门30中员工的薪金的所有员工的姓名和薪金。
select ename,sal from emp t1 where t1.sal in (select sal from emp where deptno=30);


12.列出薪金高于在部门30工作的所有员工的薪金的员工姓名和薪金。 
select t1.ename,t1.sal from (select ename,sal,1 as cid from emp) t1 join (select max(sal) as max_sal,1 as cid from emp where deptno=30) t2 on t1.cid=t2.cid where t1.sal>t2.max_sal;


13.列出在每个部门工作的员工数量、平均工资和平均服务期限。

日期相减:

DATEDIFF(date1,date2)

当前日期:current_date
select deptno,count(*),avg(sal),avg(datediff(current_date,hiredate)) from emp group by deptno;


14.列出所有员工的姓名、部门名称和工资。 
select t1.ename,t2.deptname,t1.sal from emp t1 left join dept t2 on t1.deptno=t2.deptno;


15.列出所有部门的详细信息和部门人数。 
 select t2.*,t1.count from (select deptno,count(*) as count from emp group by deptno) t1 right join dept t2 on t1.deptno=t2.deptno;


16.列出各种工作的最低工资。 
select job,min(sal) from emp group by job;


17.列出各个部门的MANAGER(经理)的最低薪金。
 select deptno,min(sal) from emp where job='MANAGER' group by deptno;


18.列出所有员工的年工资,按年薪从低到高排序。
字段含有null值,使用nvl函数
eg:select id,nvl(x,0) from vels; //x为null就是0

 select ename,sal*12+nvl(bonus,0) as sum from emp order by sum desc;


19. 列出每个部门薪水前两名最高的人员名称以及薪水。 

row_number() over (partition by ... order by ...) 

select ename,sal
from
(select ename,sal,deptno,row_number() over(partition by deptno order by sal desc) as rk
from emp) as t
where rk<=2

20. 列出每个员工从受雇开始到2018-12-12 为止共受雇了多少天。
select ename,datediff('2018-12-12',hiredate) from emp;

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值