java hiveconnector_Datastage 11.5新功能介绍----Hive Connector

IBM Information Server 11.5.0.1引入了Hive Connector,改进了对Hive的支持。相较于ODBC/JDBC connector,Hive Connector提供对多种Hive版本的支持,包括Cloudera、HortonWorks和BigInsights,并且支持HiveQL运行时生成、分区表读写等Hive特定功能。此外,它还支持源、目标和请求上下文,简化了Hive与ETL作业的集成。然而,Hive Connector存在一些限制,如向分区表的插入操作未实现批量模式。
摘要由CSDN通过智能技术生成

Question

Datastage是否有专门的组件支持Hive呢?

Answer

在最新的IBM Information Server 11.5.0.1中,新增了一些组件和功能,其中比较重要的就是Hive Connector。在之前的版本中,Datastage产品对Hive的支持主要通过ODBC connector或者JDBC connector来实现,使用ODBC / JDBC connector连接Hive存在一些局限性

qConnector的如下的选项不受支持:

oGenerate SQL at Runtime

oIsolation levels

oAuto-commit

oCreate functionality with different file formats

qHive中的partition table不受支持

q在通用的Connector里处理一些Hive特定的功能比较难

q新的Hive Connector较之使用通用的ODBC / JDBC connector有如下优势

q支持多种Hive

Cloudera Hive

Cloudera Impala

HortonWorks

BigInsights

q提供了对Hive一些特定功能的支持

Generation of HiveQL at runtime

Generation of the table DDL specific to Hive

DML generation as per the syntax of HiveQL

Hive specific table formats [AVRO, Parquet, ORC etc]

Partitioned tables

q用户可以使用Generate SQL选项,不必使用 HQL/SQL 语句

q支持partitioned table的读写

q处理Hive特定的功能比较容易

Hive Connector 配置

q底层使用 JDBC 协议,配置相对简单

q使用Datadirect JDBC driver for Hive – 在Information Server安装包默认提供

q创建或修改配置文件isjdbc.config (IS_HOME/Server/DSEngine)

q配置文件中的如下内容用于指定 class path 和driver Java classes:

CLASSPATH=

CLASS_NAMES=

qisjdbc.config示例

CLASSPATH=/opt/IBM/InformationServer/ASBNode/lib/java/IShive.jar;

CLASS_NAMES=com.ibm.isf.jdbc.hive.HiveDriver;

Note : Details about the Kerberos configuration would be provided in the subsequent slides

Hive Connector - Repository View and Palette

3bde294c2a596c77dd67736d1d25fdaf.png

5fffcb051f36ad2b830b05385d44fe5b.png

Hive Connector 支持Source context / read mode,Target context / write mode,Request context / Lookup mode,下面分别进行介绍

Source context or Read mode

1c53515bb2f6a698caa3049550ba2dac.png

Hive Connector 可以配置成Source context (或 read mode) 用于从Hive中读取数据.

q可以生成 Select 语句

q支持Partitioned Reads

q提供对额外的 Hive 选项的支持

q支持Before or After SQL

q支持 number of rows returned by the stage限制

q支持从文件中读取SQL

Target context or write mode

bb748ac6769e76a4a242cfc6f79a220f.png

qHive Connector 可以配置成Target context (或 write mode) 用于向Hive中insert数据. 可以使Hive和ETL job无缝集成

q支持写入partitioned table.

q支持多种 Table action modes [ 包括 Create, Append, Truncate and Replace ]

q目前不支持Update

q在生成Create table语句的时候,支持生成 HiveQL 或HQL 格式的DDL

q支持Insert操作的 Generate SQL

q支持User – defined SQL 选项,可以从文件中读取SQL语句

q支持Before / After SQL statements

q支持额外的 Hive 选项

6f72cba18c234d3cf6968037f187d353.png

Request context or Lookup mode

q支持Normal以及 Sparse模式

894557b624c253571950ed9145eef75c.png

Hive Connector的使用限制

q目前向 partitioned table中的insert操作是逐条执行的,没有批量模式

q 在 Big Integrate 环境中, Keytab 文件的localization目前不被支持

q 不支持向Hive表中Load数据

workaround:

可以使用 File Connector向Hive表中load数据. 用File connector将数据load到文件,然后在创建一个Hive表关联这个文件

q 只支持Insert的写模式

Hive Connector Troubleshooting

q确保 isjdbc.config配置正确

q在读取partitioned table时,确保 placeholder ([[part-value]]) 正确设置

qHive connector的debugging与其他的connector类似,使用CC_MSG_LEVEL参数可以获取更详细的Hive connector日志ex : CC_MSG_LEVEL=1 / 2

q 其他troubleshooting技巧和问题,请参考

http://www.ibm.com/support/knowledgecenter/SSZJPZ_11.5.0/com.ibm.swg.im.iis.conn.hive.usage.doc/topics/hivecc_troubleshooting.html

其他关于Hive connector的配置,使用介绍,请参考Information Server KnowledgeCenter相关章节

http://www.ibm.com/support/knowledgecenter/SSZJPZ_11.5.0/com.ibm.swg.im.iis.conn.hive.usage.doc/topics/hive_connector_top_of_nav.html

了解更多Information Server11.5的新功能,请参考Information Server 11.5.0.1 Release Notes

http://www-01.ibm.com/support/docview.wss?uid=swg21996106

[{"Product":{"code":"SSZJPZ","label":"InfoSphere Information Server"},"Business Unit":{"code":"BU053","label":"Cloud & Data Platform"},"Component":"Not Applicable","Platform":[{"code":"PF002","label":"AIX"},{"code":"PF016","label":"Linux"},{"code":"PF033","label":"Windows"}],"Version":"11.5.0.1;11.5","Edition":"","Line of Business":{"code":"LOB10","label":"Data and AI"}}]

IBM may not offer the products, services, or features discussed in this document in other countries. Consult your local IBM representative for information on the products and services currently available in your area. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any IBM intellectual property right may be used instead. However, it is the user's responsibility to evaluate and verify the operation of any non-IBM product, program, or service. IBM may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not give you any license to these patents. You can send license inquiries, in writing, to: IBM Director of Licensing, IBM Corporation, North Castle Drive, Armonk, NY 10504-1785 U.S.A. The following paragraph does not apply to the United Kingdom or any other country where such provisions are inconsistent with local law: INTERNATIONAL BUSINESS MACHINES CORPORATION PROVIDES THIS PUBLICATION "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. IBM may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Any references in this information to non-IBM Web sites are provided for convenience only and do not in any manner serve as an endorsement of those Web sites. The materials at those Web sites are not part of the materials for this IBM product and use of those Web sites is at your own risk. IBM may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you. Information concerning non-IBM products was obtained from the suppliers of those products, their published announcements or other publicly available sources. IBM has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to non-IBM products. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products. This information contains examples of data and reports used in daily business operations. To illustrate them as completely as possible, the examples include the names of individuals, companies, brands, and products. All of these names are fictitious and any similarity to the names and addresses used by an actual business enterprise is entirely coincidental. COPYRIGHT LICENSE: This information contains sample application programs in source language, which illustrate
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值