Array
Array<int>:[1,2,3] 文件格式: 1,2,3
Array<string>: ["a","b","c"] 文件格式: a,b,c
Map
Map<string,int>:{"k1":1,"k2":2,"k3",3} 文件格式: k1:1,k2:2,k3:3
Map<string,string>:{"k1":"value1","k2":"value2","k3","value3"} 文件格式: k1:value1,k2:value2,k3:value3
Map<string,array<string>>:{"k1":["value1","value2"],"k2":["value1","value"],"k3",["value1","value"]} 文件格式: k1:value1:value2,k2:value1:value,k3:value1:value
Struct
STRUCT<a:INT,b:INT>:{"a":100,"b":101} 文件格式 : 100,101
组合类型
如:array<array<a string>>, array<struct<a:INT,b:INT>>, map<String,array<string>>
Bip中创建表结构的模式定义如:
create table xx(xx xx, xx xx, ...)
row format delimited fields terminated by '\t'
collection items terminated by ','
MAP KEYS TERMINATED BY ':'
collection items terminated by ','
MAP KEYS TERMINATED BY ':'