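Big Data Cluster Setup: Installing and Deploying MySQL, SQL Server, ZooKeeper, Hadoop, Spark, Flink, Kafka, Kettle, Airflow, and Flume Clusters

| Category | Article |
| --- | --- |
| Big data cluster server environment | Big data cluster environment setup: Hadoop, Spark, and Flink distributed cluster environment |
| MySQL 5.7 | MySQL 5.7.32 offline installation tutorial for CentOS 7 |
| MySQL 8 | MySQL 8.0.19 offline installation tutorial for CentOS 7 |
| MySQL 5.7 | Implementing a MySQL 5.7 high-availability cluster with master-slave replication, dual-machine hot standby, and read/write splitting |
| SQL Server 2019 | SQL Server 2019 installation tutorial for Linux |
| SQL Server 2019 | SQL Server 2019 installation tutorial for Windows |
| ZooKeeper | Installing a ZooKeeper 3.6.2 cluster on CentOS 7 servers |
| Hadoop3 | Setting up a Hadoop3 high-availability (HA) distributed cluster |
| Hadoop3 | Reformatting the NameNode in Hadoop3 |
| Spark3 | Setting up a Spark3 distributed cluster in Standalone mode |
| Spark3 | Setting up a Spark3 high-availability distributed cluster in Standalone mode (HA mode) |
| Spark3 | Installing and deploying a Spark3 on YARN distributed cluster (YARN mode) |
| Hadoop3, Spark3 | Configuring log aggregation for Hadoop3 and Spark3 so that client machines can follow links on the YARN page to view Hadoop and Spark job history logs |
| Hadoop3, Spark3 | Installing Google Chrome on the CentOS 7 servers of the Hadoop and Spark cluster to work around client environments where the YARN page cannot display Hadoop and Spark logs |
| Python3, Scala, Spark, pySpark | Integrating Python3, Scala, Spark, and pySpark kernels into Jupyter Notebook |
| pyspark | Installing Jupyter Notebook on CentOS 7 and connecting to a Spark cluster with pyspark |
| Flink | Installing and deploying Flink in local mode, Standalone mode, and Standalone high-availability (HA) mode |
| Flink | Installing and deploying a Flink on YARN high-availability cluster |
| Hive | Installing and deploying Hive-3.1.2 |
| Kafka | Setting up a Kafka 2.7 distributed cluster on CentOS 7 |
| Kafka | Installing and deploying kafka-eagle-2.0.3 |
| Kafka | Installing kafka-manager on CentOS 7 |
| Airflow | Installing Apache Airflow in a Python3 virtual environment |
| Kettle | Installing and deploying a Kettle distributed cluster |
| Flume | Flume principles and a detailed guide to Flume configuration file parameters |
| Flume | Automating deployment of a Flume cluster with Ansible to consume Kafka data into HDFS |
| Zeppelin | Installing zeppelin-0.9.0 |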