hive中case when then else end 函数和dense_rank()over()函数的用法
数据
Employee 表包含所有员工信息,每个员工有其对应的工号 Id,姓名 Name,工资 Salary 和部门编号 DepartmentId 。
+----+-------+--------+--------------+
| Id | Name | Salary | DepartmentId |
+----+-------+--------+--------------+
| 1 | Joe | 85000 | 1 |
| 2 | Henry | 80000 | 2 |
| 3 | Sam | 60000 | 2 |
| 4 | Max | 90000 | 1 |
| 5 | Janet | 69000 | 1 |
| 6 | Randy | 85000 | 1 |
| 7 | Will | 70000 | 1 |
+----+-------+--------+--------------+
Department 表包含公司所有部门的信息。
+----+----------+
| Id | Name |
+----+----------+
| 1 | IT |
| 2 | Sales |
+----+----------+
需求
编写一个 SQL 查询,找出每个部门获得前三高工资的所有员工。例如,根据上述给定的表,查询结果应返回:
+------------+----------+--------+
| Department | Employee | Salary |
+------------+----------+--------+
| IT | Max | 90000 |
| IT | Randy | 85000 |
| IT | Joe | 85000 |
| IT | Will | 70000 |
| Sales | Henry | 80000 |
| Sales | Sam | 60000 |
+------------+----------+--------+
答案
select
case DepartmentId when '1' then 'IT' else 'Sales' end as Department ,
name,
Salary
from
(
select
*
from
(
select
name,
salary,
DepartmentId,
dense_rank()over(partition by DepartmentId order by Salary desc) rank
from
Employee
)t
where rank<4
)t2
;
+-------------+--------+---------+
| department | name | salary |
+-------------+--------+---------+
| IT | Max | 90000 |
| IT | Joe | 85000 |
| IT | Randy | 85000 |
| IT | Will | 70000 |
| Sales | Henry | 80000 |
| Sales | Sam | 60000 |
+-------------+--------+---------+
解析:
--判断函数case when 的两种用法 1.
case DepartmentId when '1' then 'IT' else 'Sales' end
--case when 的两种用法 2.
case when DepartmentId= '1' then 'IT' else 'Sales' end
--稠密排序函数的用法类似于正常排序
row_number()over(partition by DepartmentId order by Salary desc)
dense_rank()over(partition by DepartmentId order by Salary desc)