1. Python: reference assignment, shallow copy, deep copy, and the difference between is and ==
https://www.cnblogs.com/xiaxiaoxu/p/9742452.html
https://www.cnblogs.com/xueli/p/4952063.html
https://www.cnblogs.com/yifanrensheng/p/11865041.html
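A minimal sketch of topic 1, not taken from the linked posts. The variable names (`original`, `alias`, `shallow`, `deep`, `a`, `b`) are made up for illustration. It shows how the three ways of "copying" a nested list react to later mutation, and how `is` (identity) differs from `==` (equality).

```python
import copy

original = [1, [2, 3]]

alias = original                # reference assignment: a second name for the same object
shallow = copy.copy(original)   # new outer list, but the nested list is shared
deep = copy.deepcopy(original)  # new outer list and a new nested list

original[1].append(4)           # mutate the nested list in place
original.append(5)              # mutate the outer list in place

print(alias)    # [1, [2, 3, 4], 5]  sees every change
print(shallow)  # [1, [2, 3, 4]]     sees only the nested change
print(deep)     # [1, [2, 3]]        sees no change

# `==` compares values, `is` compares object identity.
a = [1, 2]
b = [1, 2]
print(a == b)   # True: same contents
print(a is b)   # False: two distinct list objects
```

The shallow copy shares only the nested objects, which is why appending to `original[1]` shows up in `shallow` while appending to `original` itself does not.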
2. "*" and "**" in Python functions
https://www.cnblogs.com/beiluowuzheng/p/8461518.html
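A short sketch of topic 2, assuming two hypothetical functions `describe` and `add` that are not from the linked post. In a function definition, `*` collects extra positional arguments into a tuple and `**` collects extra keyword arguments into a dict. At a call site they do the reverse and unpack a sequence or a mapping into arguments.

```python
def describe(*args, **kwargs):
    # args is a tuple of extra positional arguments,
    # kwargs is a dict of extra keyword arguments.
    return args, kwargs


def add(a, b, c=0):
    return a + b + c


# Packing: the call's arguments are gathered into args and kwargs.
packed = describe(1, 2, x=3)
print(packed)   # ((1, 2), {'x': 3})

# Unpacking: the list fills a and b, the dict fills c.
total = add(*[1, 2], **{"c": 3})
print(total)    # 6
```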
3. Differences between partitioned tables and bucketed tables in Hive
https://blog.csdn.net/jenrey/article/details/80588493
https://www.jianshu.com/p/192005d0f925
4. Difference between clustered by and sorted by
5. Spark: what to do when the key_list in a "key in key_list" check does not fit in memory
https://www.cnblogs.com/showing/p/8716191.html (see the third point)
6. Handling out-of-memory errors in Spark
https://www.cnblogs.com/wcgstudy/p/11407607.html
7. Spark questions
https://www.jianshu.com/p/79509eccc611
8. Hive's left semi join explained
https://blog.csdn.net/happyrocking/article/details/79885071
9. Database design: smallint(5) vs. varchar(5)
https://blog.csdn.net/Daybreak1209/article/details/80419546