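Preface
P.S. There is a lot of practical content here…
As we all know, Hive tables basically come in two types: internal (managed) tables and external tables. Interviews often include questions like these:
1. What is the difference between Hive internal tables and external tables?
2. When should you use an internal table, and when an external table?
The definition from the official documentation: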
Managed tables
A managed table is stored under the hive.metastore.warehouse.dir path property, by default in a folder path similar to /user/hive/warehouse/databasename.db/tablename/. The default location can be overridden by the location property during table creation. If a managed table or partition is dropped, the data and metadata associated with that table or partition are deleted. If the PURGE option is not specified, the data is moved to a trash folder for a defined duration.
Use managed tables when Hive should manage the lifecycle of the table, or when generating temporary tables.
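To make the managed-table rules above concrete, here is a small toy model in Python. It is only a sketch and does not talk to Hive: the function names, the `DropResult` type, the `sales`/`orders` names, and the `/.Trash` prefix are all made up for illustration. It shows just the two points from the definition: how the default folder path is put together, and what happens to the data on drop with and without PURGE.

```python
from dataclasses import dataclass
from typing import Optional


def default_table_path(warehouse_dir: str, database: str, table: str) -> str:
    """Build a default managed-table folder path of the form
    <warehouse_dir>/<database>.db/<table>/ (hypothetical helper)."""
    return f"{warehouse_dir.rstrip('/')}/{database}.db/{table}/"


@dataclass
class DropResult:
    metadata_deleted: bool
    data_location: Optional[str]  # None means the data was deleted outright


def drop_managed_table(path: str, purge: bool = False) -> DropResult:
    """Toy model of dropping a managed table.

    Metadata is always removed. With purge=True the data is deleted
    immediately; otherwise it is moved to a trash folder.
    """
    if purge:
        return DropResult(metadata_deleted=True, data_location=None)
    # "/.Trash" is a placeholder prefix for illustration only,
    # not the real trash layout of any file system.
    return DropResult(metadata_deleted=True, data_location="/.Trash" + path)


path = default_table_path("/user/hive/warehouse", "sales", "orders")
print(path)
print(drop_managed_table(path))
print(drop_managed_table(path, purge=True))
```

The takeaway is that for a managed table a drop always affects the data as well as the metadata; PURGE only decides whether the data passes through the trash first.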
External tables
An external table describes the metadata / schema on external files. External table files can be accessed and managed by processes outside of Hive. External tables can