一、Spark实现方案
1. 创建Hive外部表, 数据保存格式为parquet
create external table `bigdatalearnshare.test_excel_file`(
`id` string,
`name` string,
`age` string)
stored as parquet
location '/bigdatalearnshare/test/test_excel_file';
2. spark代码demo
spark版本: 2.4.3
scala版本: 2.11.8
maven pom文件中除了spark等相关依赖包外,需有支持的Excel插件:
<dependency>
<groupId>com.crealytics</groupId>
<artifactId>spark-excel_2.11</artifactId>
<version>0.11.1</version>
<!-- 按需确定是否需要排除一些依赖包 -->
<exclusions>
<exclusion>
<groupId>com.fasterxml.jackson.core</groupId>
<artifactId>jackson-core</artifactId>
</exclusion>
<exclusion>
<groupId&