Hadoop
gongbi917
Collectively responsible for enterprise big data platform, for building out a data analytics platform that different user easily and efficiently run large calculations over the big dataset, and ultimately for turning that data into something actionable that helps the business.
展开
-
Hive插件开发详解
Hive UDF UDAF UDTF 插件开发 扩展原创 2016-08-29 07:30:56 · 629 阅读 · 0 评论 -
Apache Spark - 交互式数据分析
Spark ShellSpark SubmitSpark Job ServerJupiterApache ZeppelinCloudera Livy + Hue原创 2016-08-30 13:33:30 · 854 阅读 · 0 评论 -
MongoDB + Hadoop解决方案
MongoDB概述Hadoop概述MongoDB + Hadoop解决方案对比实时导入MongoDB数据到Hadoop的解决方案解决方案思路:实时读取MongoDB Replica Set Oplog到Apache HBase原创 2016-08-29 07:57:37 · 899 阅读 · 0 评论 -
Python Access Secured Hadoop Cluster Through Thrift API
Python Access Secured Hadoop Cluster Through Thrift APIApache Thrift Python Kerberos SupportTypical way to connect kerberos secured thrift serverExample - HiveExample - HBaseApache Thrift Python K原创 2016-10-12 10:55:08 · 1704 阅读 · 0 评论 -
Hadoop数据采集方案
数据源RDBMS OracleMySQLNOSQL MongoDB文件 日志文件JSONXML数据存储HDFSHBase工具SqoopFlumeStreamsetsOracle GoldenGate for Big DataMySQL Applier for Hadoopmongo-hadoop原创 2016-09-30 11:11:14 · 5184 阅读 · 0 评论