Hive都有那些相关的基本语句操作

最新推荐文章于 2024-07-16 13:43:40 发布

自律也自由

最新推荐文章于 2024-07-16 13:43:40 发布

阅读量213

点赞数

文章标签： hive

本文链接：https://blog.csdn.net/qq_43243579/article/details/109749752

版权

在这里插入图片描述

确保mysql已开启
确保hadoop集群正确开启

首次开启需要初始化schematool -initSchema -dbType mysql
随后在后台启动metastore服务./hive --service metastore &
在这里插入图片描述

在配置好hive环境变量的状态下，任意路径输入hive
进入hive交互界面
在这里插入图片描述

创建指定文件夹，将数据上传至hdfs指定目录/college/下

hadoop fs -mkdir -p /college
hadoop fs -put /root/college/loan.csv /college/
hadoop fs -ls /college/

删除文件和目录：

hadoop fs -rm /user/hive_remote/warehouse/t_big24/stu.info  
hadoop fs -rm -r -f /college/

创建table表时用此语句来设置分隔符：

row format delimited fields terminated by ',';

例如：

CREATE TABLE loan(ProsperScore int,Occupation string,LoanStatus string)
row format delimited fields terminated by ',';

load data local inpath '/root/college/loan.csv' into table t_loan;

load data inpath '/college/loan.csv' into table t_loan;

分组聚合

select shop,avg(price)
from t_jddata 
group by shop

select shop,avg(price) as prices
from t_jddata where price < 5000
group by shop

排序取前十

select shop,price
from t_jddata
order by price desc limit 10;

统计表数据，结果写入文件中
写入本地

insert overwrite local directory '/usr/hive/data'
row format delimited fields terminated by ','
select * from t_text;

写入hdfs

insert overwrite directory '/college'
row format delimited fields terminated by ','
select * from t_text;

注意该写入方法会将指定路径下的所有文件和目录全部覆盖写，因此要写到空的目录下，防止其他信息丢失。

1.若发现hive启动不成功，查看原因可能是由于端口被占用了。把端口关闭即可：

2.若显示jar包冲突，Root Issue; slf4j在两处找到了jar包。分别是在Hadoop和hive的安装目录。删除一个就好。

3.暂时将hive设置为本地模式，mapreduce的运行速度会较远程模式快一些 set hive.exec.mode.local.auto=true;

4.hive日志路径设置

 cp hive-log4j2.properties.template hive-log4j2.properties

关注