一、背景
日志文件部分内容
2020-12-25 00:02:58.049 INFO [job-0] com.alibaba.datax.core.statistics.container.communicator.job.StandAloneJobContainerCommunicator:50 - Total 1043200 records, 218743048 bytes | Speed 0B/
s, 0 records/s | Error 0 records, 0 bytes | All Task WaitWriterTime 9.835s | All Task WaitReaderTime 1,240,595.000s | Transfermor Success 1043201 records | Transformer Error 0 records | T
ransformer Filter 0 records | Transformer usedTime 0.338s | Percentage 0.00%
2020-12-25 00:03:02.571 INFO [-ZB_CITY_IOTGROUPCODE_4G_2ORACLE-0-0-0-reader] com.alibaba.datax.plugin.reader.kafkareader.KafkaReaderHelper:410 - ######{"jobID":"","target":"","action":"inp
ut","media":"-","module":"datax","uuid":"","fileName":"","allLineNum":30,"succLineNum":30,"errLineNum":0,"checkTime":"2020-12-24 16:03:02","recTime":"0","endTime":"2020-12-24 16:03:02"}
2020-12-25 00:03:02.572 INFO [-ZB_CITY_IOTGROUPCODE_4G_2ORACLE-0-0-0-reader] com.alibaba.datax.plugin.reader.kafkareader.KafkaReaderHelper:410 - ######{"jobID":"","target":"","action":"inp
ut","media":"-","module":"datax","uuid":"","fileName":"","allLineNum":29,"succLineNum":29,"errLineNum":0,"checkTime":"2020-12-24 16:03:02","recTime":"0","endTime":"2020-12-24 16:03:02"}
2020-12-25 00:03:02.635 INFO [-ZB_CITY_IOTGROUPCODE_4G_2ORACLE-0-0-0-reader] com.alibaba.datax.plugin.reader.kafkareader.Kafka

本文介绍如何在Linux环境中,通过Python和shell脚本将日志文件内容解析并导入到Oracle数据库。首先阐述了需求,即从日志中提取特定数据,并创建了Oracle数据库对应的表结构。接着,详细展示了使用cx_Oracle模块的Python实现过程,包括所需库的安装。最后,提到了使用shell工具`splldr`的简化方案,包括ctl配置文件的编写,以及shell脚本的执行与日志处理。相较于Python,shell方式更便捷且无环境限制。
最低0.47元/天 解锁文章
5838

被折叠的 条评论
为什么被折叠?



