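About Hive
How does Hive compare with a relational database (RDBMS)?
What is the difference between Hive managed (internal) tables and external tables? When should a managed table be used?
What is the difference between sort by, order by, distribute by, and cluster by in Hive?
Which window functions does Hive provide?
How are user-defined functions implemented in Hive (UDF, UDTF, UDAF)?
Which database does Hive use by default for its metastore? Which database is used in production? How is high availability achieved?
Which file storage formats does Hive support? How do you choose a suitable one? Hint: row-oriented storage, column-oriented storage
How can Hive be optimized? Hint: partitioning and bucketing, MapReduce tuning
How many ways are there to import data into Hive? Hint: load, insert, create table ... as select, location, import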
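The five import routes named in the last hint can be sketched as follows. This is a minimal illustration, not a recommended pipeline; every table name, column, and path below is hypothetical, and the existing tables (orders, orders_backup) are assumed to have compatible schemas.

```sql
-- All table names, columns, and paths here are hypothetical.

-- 1. LOAD: place a data file into the table's storage directory.
--    LOCAL reads from the client's file system instead of HDFS.
LOAD DATA LOCAL INPATH '/tmp/example/orders.txt' OVERWRITE INTO TABLE orders;

-- 2. INSERT: write the result of a query into an existing table.
INSERT INTO TABLE orders_backup SELECT * FROM orders;

-- 3. CREATE TABLE ... AS SELECT: create and populate a table in one step.
CREATE TABLE orders_2024 AS
SELECT * FROM orders WHERE order_year = 2024;

-- 4. LOCATION: point a table at files that already exist on HDFS.
CREATE EXTERNAL TABLE orders_ext (
  order_id   BIGINT,
  amount     DOUBLE,
  order_year INT
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'
LOCATION '/data/example/orders';

-- 5. IMPORT: restore a table from a directory written by EXPORT TABLE.
IMPORT TABLE orders_copy FROM '/tmp/example/export/orders';
```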
Hive CREATE TABLE statement
CREATE [EXTERNAL] TABLE [IF NOT EXISTS] table_name
[(col_name data_type [COMMENT col_comment], ...)]
[COMMENT table_comment]
[PARTITIONED BY (col_name data_type [COMMENT col_comment], ...)]
[CLUSTERED BY (col_name, col_name, ...)
[SORTED BY (col_name [ASC|DESC], ...)] INTO num_buckets BUCKETS]
[ROW FORMAT row_format]
[STORED AS file_format]
[LOCATION hdfs_path]
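As a concrete instance of the template above, the following sketch fills in each optional clause once. The table, its columns, the bucket count, and the HDFS path are all made up for illustration.

```sql
-- Hypothetical table; names, bucket count, and path are illustrative only.
CREATE EXTERNAL TABLE IF NOT EXISTS page_views (
  user_id   BIGINT    COMMENT 'ID of the visiting user',
  url       STRING    COMMENT 'Requested URL',
  view_time TIMESTAMP COMMENT 'Time of the request'
)
COMMENT 'Raw page view log'
-- Partition columns are declared here, not in the column list above.
PARTITIONED BY (dt STRING COMMENT 'Date partition')
-- Rows are hashed on user_id into 16 buckets and sorted within each bucket.
CLUSTERED BY (user_id) SORTED BY (view_time ASC) INTO 16 BUCKETS
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'
STORED AS TEXTFILE
LOCATION '/warehouse/example/page_views';
```

Because the table is declared EXTERNAL, dropping it removes only the metadata and leaves the files under the LOCATION path in place.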