在SparkSQL上对SDB内的两个集合进行关联(JOIN)查询。要求如下:
1.为两个集合在PG内创建两张表;
2.查看创建的两张表的表结构;
3.JOIN关联查询需要走索引;
创建student外表:
create table student using com.sequoiadb.spark options (host 'ysx:11810,ysx1:11810,ysx2:11810', collectionspace 'pratice4', collection student', username 'sdbadmin', password '1234');
创建class外表:
create table class using com.sequoiadb.spark options (host 'ysx:11810,ysx1:11810,ysx2:11810', collectionspace 'company', collection 'class', username 'sdbadmin', password '1234');
JOIN关联查询:
select s.sid,s.sname,c.avgsChinese,c.avgsMath from student s INNER JOIN class c ON s.cid=c.cid order by s.sid ASC limit 5;