hive官方文档中这样描述将数据从一个表中插入到另一个表中
hive> FROM invites a INSERT OVERWRITE TABLE events SELECT a.bar, count(*) WHERE a.foo > 0 GROUP BY a.bar;
hive> INSERT OVERWRITE TABLE events SELECT a.bar, count(*) FROM invites a WHERE a.foo > 0 GROUP BY a.bar;
The keyword 'overwrite' signifies that existing data in the table is deleted.
If the 'overwrite' keyword is omitted, data files are appended to existing data sets.
但若省略overwrite,则会报如下错:
hive> INSERT TABLE events SELECT a.bar, count(*) FROM invites a WHERE a.foo > 0 GROUP BY a.bar;
FAILED: ParseException line 1:0 cannot recognize input near 'insert' 'table' 'events' in insert clause
省略overwirite的正确写法是:
hive> INSERT INTO TABLE events SELECT a.bar, count(*) FROM invites a WHERE a.foo > 0 GROUP BY a.bar;
就这么简单,官方文档中有这样的写法,可能不会引起大家的注意:
hive> LOAD DATA LOCAL INPATH './examples/files/kv2.txt' OVERWRITE INTO TABLE invites PARTITION (ds='2008-08-15');
hive> LOAD DATA LOCAL INPATH './examples/files/kv3.txt' OVERWRITE INTO TABLE invites PARTITION (ds='2008-08-08');
这是带了into的,但将insert将在行首,没有加into的写法,所以一开始我也很迷糊。