2018年05月_ukakasu

12月 11月 10月 09月 08月 07月 06月 05月 04月

原创元数据管理工具atlas初探

元数据管理工具atlas初探安装：Ambari添加服务（略）Hive配置：将atlas主节点/usr/hdp/2.6.3.0-235/atlas/hook拷贝到其他节点。自定义hive-env，HIVE_AUX_JARS_PATH =/usr/hdp/2.6.3.0-235/atlas/hook /hive。/usr/hdp/2.6.3.0-235/atlas/con...

2018-05-31 16:49:42 13420 5

原创 hdf组件简介

NiFi离线数据、实时数据的分布式ETL工具。支持本地文件、ftp、hdfs、数据库、hbase、es、hive、kafka等数据的in/out。Streaming Analytics ManagerStorm实时数据处理。从kafka中消费avro数据，此数据可通过nifi接入，storm处理后写入druid、hbase、hdfs等。Storm的processor包括agg...

2018-05-24 08:54:53 573

原创 hdf安装

HDF3.0.2安装https://docs.hortonworks.com/HDPDocuments/HDF3/HDF-3.0.2/bk_release-notes/content/ch_hdf_relnotes.htmlhttps://docs.hortonworks.com/HDPDocuments/HDF3/HDF-3.0.0/bk_installing-hdf-on-hdp/co...

2018-05-23 13:17:39 2726

原创 Amabri2.6.0、hdp2.6.1安装

Amabri2.6.0、hdp2.6.1在centos7下安装一、环境准备1、修改各个节点主机名vi /etc/hostname2、配置主节点hostsvi /etc/hosts2、配置免密（1）手动配置主节点执行：ssh-keygen -t rsassh-copy-id $host（第1步中的各个节点名称）（2）脚本配置3、同步hosts主节点...

2018-05-23 11:08:59 800

原创 pandas中小数作为index精度问题

pandas中用小数作为index进行join，结果发现数据条数变少，怀疑是精度问题所致。解决方法：将小数作为index之前先转换为str，再作为index。df = df.round({0: 3})df[0] = df[0].astype(str)df = df.set_index(0) ...

2018-05-08 16:35:28 1071

binutils-2.23.52.0.1-12.el7.x86_64 compat-libcap1-1.10-3.el7.x86_64 compat-libstdc++-33-3.2.3-71.el7.i686 compat-libstdc++-33-3.2.3-71.el7.x86_64 gcc-4.8.2-3.el7.x86_64 gcc-c++-4.8.2-3.el7.x86_64 glibc-2.17-36.el7.i686 glibc-2.17-36.el7.x86_64 glibc-devel-2.17-36.el7.i686 glibc-devel-2.17-36.el7.x86_64 ksh

2018-06-21

空空如也

TA创建的收藏夹 TA关注的收藏夹

TA关注的人

ukakasu的博客

原创元数据管理工具atlas初探

原创 hdf组件简介

原创 hdf安装

原创 Amabri2.6.0、hdp2.6.1安装

原创 pandas中小数作为index精度问题

oracle11g-el7依赖

python连接oracle包

gcc升级依赖包

gcc安装依赖包

空空如也

原创 元数据管理工具atlas初探

原创 hdf组件简介

原创 hdf安装

原创 Amabri2.6.0、hdp2.6.1安装

原创 pandas中小数作为index精度问题

oracle11g-el7依赖

python连接oracle包

gcc升级依赖包

gcc安装依赖包

空空如也

原创元数据管理工具atlas初探