在hive上处理数据的过程中,不免要导出数据,以下是我在查看相关资料,自己试验成功的方法:
1.用insert,写到hdfs目录下,但是目录好像要由hive用户创建才可以,否则会报错
INSERT OVERWRITE DIRECTORY '/tmp/test1029_tmp' ROW FORMAT DELIMITED FIELDS TERMINATED by ',' select * from bank_info;
+----------+-------+-----------+--------+------------+----------+
| acc_num | name | password | email | cellphone | balance |
+----------+-------+-----------+--------+------------+----------+
+----------+-------+-----------+--------+------------+----------+
No rows selected (0.947 seconds)
2.beeline客户端查询后写到本地
beeline -u jdbc:hive2://localhost:10000/ana --silent=true --outputformat=csv --showHeader=false -e "select a.* from last_30day limit 10" > out.csv