Hive中已有表records:
hive> desc records;
OK
year string
temperature int
quality int
hive> select * from records;
OK
2013 15 18
2014 23 32
2015 19 91
把records表中temperature中!=15的筛选出来,另建立一张新表存入筛选后的数据。代码如下:
from pyspark import SparkContext
from pyspark.sql import HiveContext
def inside(row):
<span style="colo