DolphinScheduler测试环境说明
角色 | 主机名 | 备注 |
---|---|---|
masters | hadoop01,hadoop02 | |
workers | hadoop01,hadoop02,hadoop03,hadoop04,hadoop05 | |
alertServer | hadoop04 | |
apiServers | hadoop05 | 172.30.96.41 |
pythonGatewayServers | hadoop05 | |
数据库MySQL8.0.25 | hadoop05 | 用户名:dolphinscheduler |
Apache DolphinScheduler集群部署
部署(官方文档)档:https://dolphinscheduler.apache.org/zh-cn/docs/2.0.5/user_doc/guide/installation/cluster.html
架构设计(官方文档):https://dolphinscheduler.apache.org/zh-cn/docs/2.0.5/user_doc/architecture/design.html
分布式部署 DolphinScheduler 需要有外部软件的支持(官网)
- JDK:下载JDK (1.8+),并将 JAVA_HOME 配置到以及 PATH 变量中。如果你的环境中已存在,可以跳过这步。
- 二进制包:在下载页面下载 DolphinScheduler 二进制包
- 数据库:PostgreSQL (8.2.15+) 或者 MySQL (5.7+),两者任选其一即可,如 MySQL 则需要 JDBC Driver 8.0.16(建议使用MySQL8.0.16+,JDBC Driver 版本与其保持一致)
- 注册中心:ZooKeeper (3.4.6+),下载地址
- 进程树分析
- macOS安装pstree
- Fedora/Red/Hat/CentOS/Ubuntu/Debian安装psmisc
注意: DolphinScheduler 本身不依赖 Hadoop、Hive、Spark,但如果你运行的任务需要依赖他们,就需要有对应的环境支持
安装MySQL8.0.25
见博客 https://www.cnblogs.com/happy-king/p/16408631.html
连上数据库,初始化表
source ${dolphinscheduler_deploy_dir}/sql/dolphinscheduler_mysql.sql
配置修改
目录结构和配置文件(官方文档):https://dolphinscheduler.apache.org/zh-cn/docs/2.0.5/user_doc/architecture/configuration.html
**部署程序目录:**dolphinscheduler_deploy_dir=/app/downloads/dolphinscheduler-bin
修改安装配置文件 ${dolphinscheduler_deploy_dir}/conf/config/install_config.conf
${dolphinscheduler_deploy_dir}/install.sh/conf/config/install_config.conf
#
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# [http://www.apache.org/licenses/LICENSE-2.0](http://www.apache.org/licenses/LICENSE-2.0)
#
#