python3读取配置文件内容前面加b_Python B 文件里的每行数据匹配 A 文件里的内容逻辑问题...

最新推荐文章于 2021-07-14 23:12:54 发布

weixin_39646107

最新推荐文章于 2021-07-14 23:12:54 发布

阅读量154

点赞数

文章标签： python3读取配置文件内容前面加b

A 文件是个 json 文件：

a.txt

{

"_id": "113.254.82.124",

"_index": "fofapro_subdomain",

"header": "HTTP/1.1 401 Unauthorized\r\nConnection: close\r\nContent-Length: 195\r\nCache-Control: no-cache\r\nContent-Type: text/html\r\nDate: Sat, 20 Oct 2018 15:59:44 GMT\r\nEtag: \"0-29d-b90\"\r\nServer: Embedthis-Appweb/3.3.1\r\nWww-Authenticate: Basic realm=\"DCS-2530L\"\r\nX-Frame-Options: SAMEORIGIN\r\n",

}

{

"_id": "http://10.254.82.12",

"_index": "fofapro_subdomain",

"header": "HTTP/1.1 401 Unauthorized\r\nConnection: close\r\nContent-Length: 195\r\nCache-Control: no-cache\r\nContent-Type: text/html\r\nDate: Sat, 20 Oct 2018 15:59:44 GMT\r\nEtag: \"0-29d-b90\"\r\nServer: Embedthis-Appweb/3.3.1\r\nWww-Authenticate: Basic realm=\"DCS-2530L\"\r\nX-Frame-Options: SAMEORIGIN\r\n",

}

{

"_id": "https://192.168.1.10:9090",

"_index": "fofapro_subdomain",

"header": "HTTP/1.1 401 Unauthorized\r\nConnection: close\r\nContent-Length: 195\r\nCache-Control: no-cache\r\nContent-Type: text/html\r\nDate: Sat, 20 Oct 2018 15:59:44 GMT\r\nEtag: \"0-29d-b90\"\r\nServer: Embedthis-Appweb/3.3.1\r\nWww-Authenticate: Basic realm=\"DCS-2530L\"\r\nX-Frame-Options: SAMEORIGIN\r\n",

}

{

"_id": "127.0.0.1:8343",

"_index": "fofapro_subdomain",

"header": "HTTP/1.1 401 Unauthorized\r\nConnection: close\r\nContent-Length: 195\r\nCache-Control: no-cache\r\nContent-Type: text/html\r\nDate: Sat, 20 Oct 2018 15:59:44 GMT\r\nEtag: \"0-29d-b90\"\r\nServer: Embedthis-Appweb/3.3.1\r\nWww-Authenticate: Basic realm=\"DCS-2530L\"\r\nX-Frame-Options: SAMEORIGIN\r\n",

}

B 文件：

b.txt

127.0.01

192.168.1.10

192.168.88.88

代码

import re

import json

def filesJson(filepath,dstpaths):

datas = set()

#正则匹配

rule = re.compile('^[a-zA-z]{1}.*$')

with open(filepath, 'r', encoding='UTF-8') as a, open(dstpaths, 'r', encoding='UTF-8') as b:

b.seek(0)

for realine_a in a:

json_datas = json.loads(realine_a)

ips = json_datas['_id']

if rule.findall(ips):

ips = ips.strip("http[s]?://")

ips = ips.split(":")[0]

datas.add(ips)

for realine_b in b:

if realine_b in datas:

print(realine_b)

else:

break

if __name__ == '__main__':

file_paths = "a.txt"

dstpaths = 'b.txt'

filesJson(file_paths, dstpaths)

我的想法是把 A 文件里的 IP，去除协议和端口，只保留 IP 写入到一个集合中，然后在通过 B 文件的数据去匹配这个集合，有没有这个 IP，如果有这个 IP，把 A 文件这行数据写入到 C 文件中，现在问题是 B 文件无法匹配 A 文件的数据，而且如果 A 文件内容是几百万行数据，B 文件内容是几万行数据，这种逻辑是不是有很大的问题。

weixin_39646107

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python3读取配置文件内容前面加b_Python B 文件里的每行数据匹配 A 文件里的内容逻辑问题...

A 文件是个 json 文件：a.txt{"_id": "113.254.82.124","_index": "fofapro_subdomain","header": "HTTP/1.1 401 Unauthorized\r\nConnection: close\r\nContent-Length: 195\r\nCache-Control: no-cache\r\nContent-Type: ...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。