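For anyone new to Kylin, the first task is getting it installed. This article walks through installing Kylin 2.1.0 and is based on the official Kylin documentation.
Because Kylin supports Spark as of version 2.0, the Spark configuration is covered in detail as well.
Our Hadoop, Hive and HBase all come from CDH 5.8, so we download the CDH build of Kylin (the cdh57 package, which targets CDH 5.7 and later).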
1. Download Kylin 2.1.0
Download link: http://www.apache.org/dyn/closer.cgi/kylin/apache-kylin-2.1.0/apache-kylin-2.1.0-bin-cdh57.tar.gz
2. Extract the archive
mkdir -p /usr/share/apache-kylin
tar zxvf apache-kylin-2.1.0-bin-cdh57.tar.gz -C /usr/share/apache-kylin
cd /usr/share/apache-kylin/apache-kylin-2.1.0-bin-cdh57
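Before unpacking, it can help to confirm which top-level directory the archive will create, since the cd command above depends on that name. The following is a minimal sketch; archive_top_dir is a hypothetical helper name, not something shipped with Kylin.

```shell
# List the distinct top-level entries inside a gzip-compressed tarball,
# without extracting anything.
archive_top_dir() {
    # $1 = path to the .tar.gz file
    tar tzf "$1" | cut -d/ -f1 | sort -u
}
```

For example, archive_top_dir apache-kylin-2.1.0-bin-cdh57.tar.gz lists what will appear in the target directory once the archive is extracted.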
3. Add a dedicated kylin user for running Kylin
groupadd kylin
useradd -g kylin kylin
su kylin
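4. Configuration files
Create a hadoop_conf directory inside the Kylin installation directory and copy the Hadoop, YARN, Hive and HBase *-site.xml files into it. If hive-site.xml is not already in one of the source directories used below, copy it from your Hive configuration directory as well.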
mkdir hadoop_conf
cp /usr/share/hadoop/conf/*-site.xml /usr/share/apache-kylin/apache-kylin-2.1.0-bin-cdh57/hadoop_conf/
cp /usr/share/hbase/conf/hbase-site.xml /usr/share/apache-kylin/apache-kylin-2.1.0-bin-cdh57/hadoop_conf/
export KYLIN_HOME=/usr/share/apache-kylin/apache-kylin-2.1.0-bin-cdh57
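Once the files are copied, a quick check that nothing is missing can save a failed start later. This is only a sketch: missing_site_files is a hypothetical helper, and the list of five file names is an assumption based on the components named above, so adjust it to your cluster.

```shell
# Print the name of every expected *-site.xml file that is missing
# from the given directory; return non-zero if any is missing.
# The file list is an assumption; adjust it to your cluster.
missing_site_files() {
    # $1 = directory to check, e.g. "$KYLIN_HOME/hadoop_conf"
    missing=0
    for f in core-site.xml hdfs-site.xml yarn-site.xml hive-site.xml hbase-site.xml; do
        if [ ! -f "$1/$f" ]; then
            echo "$f"
            missing=1
        fi
    done
    return "$missing"
}
```

Calling missing_site_files "$KYLIN_HOME/hadoop_conf" prints nothing and returns zero when all five files are present.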
Edit conf/kylin.properties:
kylin.env.hadoop-conf-dir=/usr/share/apache-kylin/apache-kylin-2.1.0-bin-cdh57/hadoop_conf/
kylin.engine.spark-conf.spark.master=yarn
kylin.engine.spark-conf.spark.submit.deployMode=cluster
kylin.engine.spark-conf.spark.yarn.queue=default
kylin.engine.spark-conf.spark.executor.memory=4G
kylin.engine.spark-conf.spark.executor.cores=2
kylin.engine.spark-conf.spark.executor.instances=1
kylin.engine.spark-conf.spark.eventLog.enabled=true
kylin.engine.spark-conf.spark.eventLog.dir=hdfs\:///kylin/spark-history
kylin.engine.spark-conf.spark.history.fs.logDirectory=hdfs\:///kylin/spark-history
kylin.engine.spark-conf.spark.hadoop.yarn.timeline-service.enabled=false
kylin.engine.spark-conf.spark.yarn.jar=hdfs://localhost:8020/kylin/spark/spark-assembly-1.6.3-hadoop2.6.0-cdh5.8.2.jar
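The properties above point at HDFS locations (/kylin/spark-history and /kylin/spark) that have to exist before a Spark build can use them, and the assembly jar has to be uploaded under the name given in spark.yarn.jar. The sketch below shows one way to prepare them with the standard hadoop fs commands. The function name and the HADOOP_CMD override are hypothetical and exist only so the steps can be dry-run; the local jar path is a placeholder that depends on your installation.

```shell
# Create the HDFS directories referenced in kylin.properties and upload
# the Spark assembly jar. HADOOP_CMD is a hypothetical override used only
# to make the function easy to dry-run; it defaults to the `hadoop` CLI.
prepare_hdfs_for_spark() {
    # $1 = local path to the Spark assembly jar (placeholder, depends on your install)
    hadoop_cmd="${HADOOP_CMD:-hadoop}"
    # spark.eventLog.dir and spark.history.fs.logDirectory
    "$hadoop_cmd" fs -mkdir -p /kylin/spark-history || return 1
    # directory that holds the jar named in spark.yarn.jar
    "$hadoop_cmd" fs -mkdir -p /kylin/spark || return 1
    # -put keeps the local file name, so it must match the name in spark.yarn.jar
    "$hadoop_cmd" fs -put "$1" /kylin/spark/
}
```

Setting HADOOP_CMD=echo before calling the function prints the three commands instead of running them, which is a cheap way to review them first.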
5. Start Kylin
Once the steps above are complete, first run the check-env.sh script to verify that the environment is set up correctly:
bin/check-env.sh    # check the environment
bin/kylin.sh start  # start Kylin
bin/kylin.sh stop   # stop Kylin
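The check and the start can be chained so that Kylin is only started when the check passes. This is a minimal sketch with a hypothetical function name; it assumes KYLIN_HOME is set as in step 4 and that check-env.sh exits with a non-zero status when it finds a problem.

```shell
# Start Kylin only if the environment check succeeds.
# Assumes KYLIN_HOME is set and that check-env.sh exits non-zero on failure.
start_kylin_checked() {
    if "$KYLIN_HOME/bin/check-env.sh"; then
        "$KYLIN_HOME/bin/kylin.sh" start
    else
        echo "check-env.sh reported a problem; not starting Kylin" >&2
        return 1
    fi
}
```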