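Compiling
DataX Git repository: https://github.com/alibaba/DataX (DataX is the open-source version of Alibaba Cloud DataWorks Data Integration).
For compiling the DataX source in IntelliJ IDEA, see the blog post "DataX教程(02)- IDEA运行DataX完整流程(填完所有的坑)" (DataX Tutorial (02): the complete process of running DataX in IDEA, with all the pitfalls covered) on the Alibaba Cloud Developer Community.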
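Uploading to the server
Upload the compiled DataX package to the server in the usual way; no special steps are needed.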
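Viewing a job template
In the datax/bin directory, run the template query command, replacing xxxxreader and xxxxxwriter with the names of the reader and writer plugins you need: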
[root@node2 bin]# python datax.py -r xxxxreader -w xxxxxwriter
The usage template is then printed. For example, with oraclereader and doriswriter:
DataX (DATAX-OPENSOURCE-3.0), From Alibaba !
Copyright (C) 2010-2017, Alibaba Group. All Rights Reserved.
Please refer to the oraclereader document:
https://github.com/alibaba/DataX/blob/master/oraclereader/doc/oraclereader.md
Please refer to the doriswriter document:
https://github.com/alibaba/DataX/blob/master/doriswriter/doc/doriswriter.md
Please save the following configuration as a json file and use
python {DATAX_HOME}/bin/datax.py {JSON_FILE_NAME}.json
to run the job.
{
    "job": {
        "content": [
            {
                "reader": {
                    "name": "oraclereader",
                    "parameter": {
                        "column": [],
                        "connection": [
                            {
                                "jdbcUrl": [],
                                "table": []
                            }
                        ],
                        "password": "",
                        "username": ""
                    }
                },
                "writer": {
                    "name": "doriswriter",
                    "parameter": {
                        "beLoadUrl": [],
                        "column": [],
                        "connection": [
                            {
                                "jdbcUrl": "",
                                "selectedDatabase": "",
                                "table": []
                            }
                        ],
                        "loadProps": {},
                        "loadUrl": [],
                        "password": "",
                        "postSql": [],
                        "preSql": [],
                        "username": ""
                    }
                }
            }
        ],
        "setting": {
            "speed": {
                "channel": ""
            }
        }
    }
}
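The template is printed with every value left blank, so it has to be filled in before it can be saved as a job file. The following is a minimal Python sketch of that step, not part of DataX itself: `build_job` and `empty_fields` are hypothetical helper names, and every host, port, user, table and database value is an invented placeholder. Only the key names come from the template above. The `channel` value of 1 is also an assumption; check the plugin documents linked in the output for the values your job needs.

```python
import json


def build_job(reader_param, writer_param, channel=1):
    """Assemble a job dict with the same layout as the printed template."""
    return {
        "job": {
            "content": [
                {
                    "reader": {"name": "oraclereader", "parameter": reader_param},
                    "writer": {"name": "doriswriter", "parameter": writer_param},
                }
            ],
            "setting": {"speed": {"channel": channel}},
        }
    }


def empty_fields(node, path="job"):
    """Return the paths of values that are still "", [] or {}."""
    if isinstance(node, dict) and node:
        found = []
        for key, value in node.items():
            found += empty_fields(value, f"{path}.{key}")
        return found
    if isinstance(node, list) and node:
        found = []
        for index, value in enumerate(node):
            found += empty_fields(value, f"{path}[{index}]")
        return found
    return [path] if node in ("", [], {}) else []


# All values below are hypothetical placeholders, not real endpoints.
reader = {
    "column": ["ID", "NAME"],
    "connection": [
        {"jdbcUrl": ["jdbc:oracle:thin:@//oracle-host:1521/ORCL"], "table": ["SRC_TABLE"]}
    ],
    "password": "",
    "username": "src_user",
}
writer = {
    "beLoadUrl": [],
    "column": ["id", "name"],
    "connection": [
        {"jdbcUrl": "jdbc:mysql://doris-fe:9030/", "selectedDatabase": "demo_db", "table": ["dst_table"]}
    ],
    "loadProps": {},
    "loadUrl": ["doris-fe:8030"],
    "password": "",
    "postSql": [],
    "preSql": [],
    "username": "dst_user",
}

job = build_job(reader, writer)
print(json.dumps(job, indent=4))   # text to save as {JSON_FILE_NAME}.json
print(empty_fields(job["job"]))    # fields that are still blank
```

The list printed by `empty_fields` is only a reminder of what has not been filled in. Some of those fields may legitimately stay empty for a given job, so treat it as a checklist rather than as validation.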
Running the sync job
python {DATAX_HOME}/bin/datax.py {DATAX_HOME}/job/{JSON_FILE_NAME}.json
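When the job is launched from a script instead of by hand, the same command line can be assembled programmatically. This is a minimal sketch under stated assumptions: `build_run_command` and `run_job` are hypothetical helper names, `/opt/datax` and `oracle_to_doris` are invented values for {DATAX_HOME} and {JSON_FILE_NAME}, and the interpreter name `python` is taken from the command above.

```python
import posixpath
import subprocess


def build_run_command(datax_home, job_name, python_bin="python"):
    """Build the argument list for: python {DATAX_HOME}/bin/datax.py {DATAX_HOME}/job/{JSON_FILE_NAME}.json"""
    launcher = posixpath.join(datax_home, "bin", "datax.py")
    job_file = posixpath.join(datax_home, "job", job_name + ".json")
    return [python_bin, launcher, job_file]


def run_job(datax_home, job_name):
    """Start the job, wait for it to finish, and return the process exit code."""
    completed = subprocess.run(build_run_command(datax_home, job_name))
    return completed.returncode


if __name__ == "__main__":
    # Hypothetical install directory and job name; this only prints the command.
    print(" ".join(build_run_command("/opt/datax", "oracle_to_doris")))
```

Passing the arguments as a list avoids shell quoting problems if the install path or job name contains spaces.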