0x00 参考
使用python中的scapy库抓取并解析pcap包(五元组信息)
0x01 前言
由于云厂商的链路监控做得实在是不咋地,所以我们决定自己实现一套网络链路监控系统。
目的效果:客户端安装agent,实时采集流量数据并且进行上报给服务器端,不管是HTTP、HTTPS、MySQL,以及底层的TCP、UDP,甚至是链路层的,都需要实时采集并上报。
由于我们团队使用Python的居多,所以第一版以Python为开发语言进行开发。一番调研之下,客户端的agent决定使用Scapy来进行流量的获取
0x02 概述
Scapy是一个强大的,用Python编写的交互式数据包处理程序,它能让用户发送、嗅探、解析,以及伪造网络报文,从而用来侦测、扫描和向网络发动攻击。Scapy可以轻松地处理扫描(scanning)、路由跟踪(tracerouting)、探测(probing)、单元测试(unit tests)、攻击(attacks)和发现网络(network discorvery)之类的传统任务。
它可以代替hping,arpspoof,arp-sk,arping,p0f 甚至是部分的Nmap,tcpdump和tshark 的功能。
支持很多协议,比如ARP,BOOTP,Dot1Q,DHCP,DNS,GRE,HSRP,ICMP,IP, NTP,RIP,SNMP,STP,PPPoE,TCP,TFTP,UDP等,可以通过ls()查看支持的协议
简单的说,Scapy主要做两件事:发送报文和接收回应。
0x03 安装
由于Scapy是Python的模块,所以可以使用pip进行安装
pip install scapy
注意:由于Scapy需要root权限才能发送数据包,所以建议使用root用户进行安装
安装完成可以直接在命令行中输入scapy测试是否安装成功:
>scapy
INFO: Can't import matplotlib. Won't be able to plot.
INFO: Can't import PyX. Won't be able to use psdump() or pdfdump().
WARNING: No libpcap provider available ! pcap won't be used
WARNING: No default IPv4 routes found. Your Windows release may no be supported and you have to enter your routes manually
INFO: No IPv6 support in kernel
INFO: Can't import python-cryptography v1.7+. Disabled WEP decryption/encryption. (Dot11)
INFO: Can't import python-cryptography v1.7+. Disabled IPsec encryption/authentication.
WARNING: IPython not available. Using standard Python shell instead.
AutoCompletion, History are disabled.
WARNING: On Windows, colors are also disabled
aSPY//YASa
apyyyyCY//YCa |
sY//YSpcs scpCY//Pp | Welcome to Scapy
ayp ayyyyyyySCP//Pp syY//C | Version 2.4.4
AYAsAYYYYYYYY///Ps cY//S |
pCCCCY//p cSSps y//Y | https://github.com/secdev/scapy
SPPPP///a pP///AC//Y |
A//A cyPC | Have fun!
p///Ac sC///a |
PYCpc A//A | To craft a packet, you have to be a
scccccp///pSP///p p//Y | packet, and learn how to swim in
sY/y caa S//P | the wires and in the waves.
cayCyayP//Ya pY/Ya | -- Jean-Claude Van Damme
sY/PsYYCc aC//Yp |
sc sccaCY//PCypaapyCP//YSs
spCPY//YPSps
ccaacs
注意:如果没有安装所有的可选包,Scapy会显示有些功能不能用,如:
INFO: Can't import matplotlib. Won't be able to plot.
INFO: Can't import PyX. Won't be able to use psdump() or pdfdump().
WARNING: No libpcap provider available ! pcap won't be used
WARNING: No default IPv4 routes found. Your Windows release may no be supported and you have to enter your routes manually
INFO: No IPv6 support in kernel
INFO: Can't import python-cryptography v1.7+. Disabled WEP decryption/encryption. (Dot11)
INFO: Can't import python-cryptography v1.7+. Disabled IPsec encryption/authentication.
具体可参考:https://scapy.readthedocs.io/en/latest/installation.html#optional-dependencies
注意:Windows平台需要安装Winpcap才能进行嗅探sniff
0x04 使用
1. ls()
进入scapy后,可用ls()来查看scapy支持的网络协议(由于输出内容太长,只截取部分以供参考)
可以看到耳熟能详的ARP,BOOTP,Dot1Q,DHCP,DNS,GRE,HSRP,ICMP,IP,NTP,RIP,SNMP,STP,PPPoE,TCP,TFTP,UDP等等都支持
注意:这里可以带参考,比如ls(IP)来查看IP包的各种默认参数
2. lsc()
进入scapy后,可以用lsc()来查看scapy的函数集。
比较常用的函数有:
- arpcachepoison(用于arp毒化攻击,也叫arp欺骗攻击)
- arping(用于构造一个ARP的who-has包)
- send (用于发3层报文)
- sendp(用于发2层报文)
- sniff(用于网络嗅探,类似Wireshark和tcpdump)
- sr(发送+接收3层报文)
- srp(发送+接收2层报文)
比如:(由于输出内容太长,只截取部分以供参考)
3. 嗅探流量
嗅探流量的数据包有两种方式:
- sniff实时抓包
- 读取pcap文件
sniff函数如下:
def sniff(count=0, store=1, offline=None, prn=None,filter=None, L2socket=None, timeout=None, opened_socket=None, stop_filter=None, iface=None,*args,**kargs)
参数含义:
- count:抓包的数量,0表示无限制,如无限制会一直抓取
- store:保存抓取的数据包到内存或者丢弃,1:保存,0:丢弃
- offline:从 pcap 文件读取数据包,而不进行嗅探,默认为None
- prn:为每一个数据包定义一个回调函数,例如:prn = lambda x: x.summary(); ( packct.summar()函数返回的是对包的统计性信息 ),则抓取到每个包之后进行显示统计信息
- filter:过滤规则,bpf过滤器,可参考:BPF语法,比如filter=‘udp’、filter=‘tcp and ( port 80 or port 443)’、filter="( ip and dst 8.8.8.8) "
- L2socket:使用给定的 L2socket
- timeout:在给定的时间后停止嗅探,默认为 None,如果为0或不设置则会一直抓取,如果为3则是抓取此时到3秒之间的流量
- opened_socket:对指定的对象使用 .recv() 进行读取;
- stop_filter:定义一个函数,决定在抓到指定数据包后停止抓包,如:stop_filter = lambda x: x.haslayer(TCP);
- iface:指定抓包的接口,一般服务器的默认网卡是eth0,如果没有指定interface,则会在所有的interface上面进行嗅探
注意:如果不设置count和timeout的话,那么sniff会卡在那里一直运行着,sniff后面的代码是不会运行的,一般是会配合prn回调函数,且store建议设置为0,不保存到内存
保存成pcap
wrpcap('xxx.pcap', package)
读取pcap
rdpcap('xxx.pcap')
比如:
>>> from scapy.all import *
>>> packets = sniff(count=1) # 获取一个流量数据包
>>> wrpcap('test.pcpp', packets) # 保存成pcap
>>> pcap_packets = rdpcap('test.pcap') # 读取pcap文件
>>> pcap_packets
<test.pacp: TCP:1 UDP:0 ICMP:0 Other:0>
>>> pcap_packets[0]
<Ether dst=50:d2:f5:fd:85:e2 src=d4:6d:6d:1c:93:50 type=IPv4 |<IP version=4 ihl=5 tos=0x0 len=41 id=17191 fla
gs=DF frag=0 ttl=64 proto=tcp chksum=0x574f src=192.168.31.179 dst=14.215.177.38 |<TCP sport=2894 dport=https
seq=622811909 ack=621991385 dataofs=5 reserved=0 flags=A window=510 chksum=0x8563 urgptr=0 |<Raw load='\x00' |
>>> pcap_packets[0].src
'd4:6d:6d:1c:93:50'
>>> pcap_packets[0].dst
'50:d2:f5:fd:85:e2'
>>> pcap_packets[0].show()
###[ Ethernet ]###
dst = 50:d2:f5:fd:85:e2
src = d4:6d:6d:1c:93:50
type = IPv4
###[ IP ]###
version = 4
ihl = 5
tos = 0x0
len = 41
id = 17191
flags = DF
frag = 0
ttl = 64
proto = tcp
chksum = 0x574f
src = 192.168.31.179
dst = 14.215.177.38
\options \
###[ TCP ]###
sport = 2894
dport = https
seq = 622811909
ack = 621991385
dataofs = 5
reserved = 0
flags = A
window = 510
chksum = 0x8563
urgptr = 0
options = []
###[ Raw ]###
load = '\x00'
package[0]是查看第一个数据包的数据,package[0].show()是查看第一个数据包的详细信息
scapy是按照按照 TCP/IP 四层参考模型显示详细包信息的,即:链路层 [Ethernet]、网络层[IP]、传输层[TCP/UDP]、应用层[RAW]
通过上述输出结果,我们可以看出每个层的数据包有哪些属性可以取出。
这里P代表的是Ethernet层。P.dst (取出dst属性)、P.src (取出src属性)、P.type (取出type属性)
每一层都有一个 payload 属性,可以不断进入下一层。
p.payload:IP层(可用 p.payload.* 取出IP层的属性)
p.payload.payload:TCP/UDP层(可用 p.payload.payload.* 取出TCP/UDP层的属性)
p.payload.payload.payload:RAW层(可用 p.payload.payload.payload.* 取出RAW层的属性)
上面显示的是TCP的请求,下面的UDP的请求:
###[ Ethernet ]###
dst = 14:43:d0:65:ae:a2
src = 88:b8:5d:bd:0d:7f
type = IPv4
###[ IP ]###
version = 4
ihl = 5
tos = 0x0
len = 40
id = 28350
flags =
frag = 0
ttl = 128
proto = udp
chksum = 0x0
src = 192.168.1.56
dst = 192.144.236.192
\options \
###[ UDP ]###
sport = 59297
dport = ms_wbt_server
len = 20
chksum = 0x6f27
###[ Raw ]###
load = '\xae;\xf8\x01\x04\x00\x04\x04\x00\x01\x07\x00'
4. 发送数据包
发送数据包之前需要构建数据包
构建数据包
比如使用IP()就可以创建默认的数据包,也可以使用TCP()创建TCP包、使用UDP()创建UDP包等,具体可以通过ls()查看,使用 ls(IP()) 可以查看IP数据包可以有哪些参数
比如:
>>> ip_package = IP(dst='8.8.8.8') # 创建目的地址是8.8.8.8的数据包
>>> ip_package.show() # 查看数据包的信息
###[ IP ]###
version = 4
ihl = None
tos = 0x0
len = None
id = 1
flags =
frag = 0
ttl = 64
proto = ip
chksum = None
src = 192.168.31.179
dst = 8.8.8.8
\options \
>>> ip_package.summary() # 查看数据包的概要信息
'192.168.31.179 > 8.8.8.8 ip'
可以使用 ‘/’ 操作符来给数据包加上一层。比如构造一个TCP数据包,在IP层指明数据包的目的地址。在TCP层可以设定数据包的目的端口等等。UDP数据包同理
比如:
>>> ip_package = IP(dst='8.8.8.8')/TCP(dport=(53)) # 创建目的地址是8.8.8.8的数据包,目的端口为53
>>> ip_package.show()
###[ IP ]###
version = 4
ihl = None
tos = 0x0
len = None
id = 1
flags =
frag = 0
ttl = 64
proto = tcp
chksum = None
src = 192.168.31.179
dst = 8.8.8.8
\options \
###[ TCP ]###
sport = ftp_data
dport = (80, 443)
seq = 0
ack = 0
dataofs = None
reserved = 0
flags = S
window = 8192
chksum = None
urgptr = 0
options = []
发送数据包
构造完数据包就可以进行发送,有以下方法:
send(pkt):发送三层数据包,但不会受到返回的结果。
sr(pkt):发送三层数据包,返回两个结果,分别是接收到响应的数据包和未收到响应的数据包。
sr1(pkt):发送三层数据包,仅仅返回接收到响应的数据包。
sendp(pkt):发送二层数据包。
srp(pkt):发送二层数据包,并等待响应。
srp1(pkt):发送第二层数据包,并返回响应的数据包
比如:
>>> ip_package = IP(dst='8.8.8.8')/TCP(dport=(53))
>>> ans, unans = sr(ip_package)
Begin emission:
Finished sending 1 packets.
........*
Received 9 packets, got 1 answers, remaining 0 packets
>>> ans
<Results: TCP:1 UDP:0 ICMP:0 Other:0>
>>> unans
<Unanswered: TCP:0 UDP:0 ICMP:0 Other:0>
具体更多的例子可以查看:https://github.com/wizardforcel/scapy-docs-zh/blob/master/3.md
0x05. 嗅探例子
A. 嗅探单个流量数据包
#coding=utf-8
from scapy.all import *
sniff(prn=lambda x: x.show(), count=1)
B. 实时捕获TCP并且端口是80的数据包并且输出每个数据包的信息
#coding=utf-8
from scapy.all import *
# 数据包回调函数,显示TCP信息
def packet_callback(packet):
if packet[TCP].payload:
mail_packet = str(packet[TCP].payload)
if "user" in mail_packet.lower() or "pass" in mail_packet.lower():
print("[*] Server: %s" % packet[IP].dst)
print("[*] %s" % packet[TCP].payload)
sniff(filter="tcp port 80", prn=packet_callback, store=0)
输出:
# 使用get请求一下接口:
[*] Server: 172.16.6.173
[*] b'GET /api/v1/flow/flow_map?start_time=2020-10-29%2021:34:16 HTTP/1.1\r\nUser-Agent: PostmanRuntime/7.26.5\r\nAccept: */*\r\nCache-Control: no-cache\r\nPostman-Token: f86343d6-de6f-4242-a7b3-cda13b325636\r\nHost: clue.tdevops.svc.com\r\nAccept-Encoding: gzip, deflate, br\r\nConnection: keep-alive\r\n\r\n'
# 使用post请求一下接口:
[*] Server: 172.16.6.173
[*] b'POST /message/ajax.php?c=workWeixin HTTP/1.1\r\nsw6: 1-NjI0OS45MS4xNjA0MTMzMDk2NjUyNDAyOQ==-NjI0OS45MS4xNjA0MTMzMDk2NjUyNDAyOA==-0-6249-6249-MTM1-Iy9tZXNzYWdlL2FqYXgucGhw-Iy9tZXNzYWdlL2FqYXgucGhw\r\nContent-Length: 167\r\nContent-Type: text/plain; charset=UTF-8\r\nHost: ttools.svc.com\r\nConnection: Keep-Alive\r\nUser-Agent: auto\r\nCookie: PHPSESSID=u8e6g212n8qu6l81mq8fbiu6v6\r\nAccept-Encoding: gzip,deflate\r\n\r\n{"data":{"user":"xiaokun.wang,zhenggong.hou","msgtype":"text","content":"\xe3\x80\x90sit\xe6\xb5\x8b\xe8\xaf\x95\xe7\x8e\xaf\xe5\xa2\x83\xe3\x80\x91\xe3\x80\x90\xe4\xbf\xa1\xe8\xb4\xb7\xe6\xa0\xb8\xe5\xbf\x83\xe3\x80\x9120540110-20540111\xe4\xbf\xa1\xe8\xb4\xb7\xe6\x97\xa5\xe5\x88\x87\xe8\xb7\x91\xe6\x89\xb9\xe5\xbc\x80\xe5\xa7\x8b"},"m":"send"}'
C. 抓取30秒内的TCP协议的数据包并且保存
#coding=utf-8
from scapy.all import *
import os
# 数据包回调函数,显示TCP信息
def packet_callback(packet):
if packet[TCP].payload:
mail_packet = str(packet[TCP].payload)
if "user" in mail_packet.lower() or "pass" in mail_packet.lower():
print("[*] Server: %s" % packet[IP].dst)
print("[*] %s" % packet[TCP].payload)
# 开启嗅探器,过滤出tcp协议,一次抓30秒,
package=sniff(filter="tcp", timeout=30, prn=packet_callback, store=1)
flowName = "test.pcap"
wrpcap(flowName, package) #将抓取到的包保存为test.pcap文件
D. 读取pcap文件并解析HTTP请求
#coding=utf-8
from scapy.all import *
try:
import scapy.all as scapy
except ImportError:
import scapy
def parse_http_pcap(pcap_path):
pcap_infos = list()
packets = scapy.rdpcap(pcap_path)
for p in packets:
print("----")
# 判断是否包含某一层,用haslayer
if p.haslayer("IP"):
src_ip = p["IP"].src
dst_ip = p["IP"].dst
print("sip: %s" % src_ip)
print("dip: %s" % dst_ip)
if p.haslayer("TCP"):
#获取某一层的原始负载用.payload.original
raw_http = p["TCP"].payload.original
sport = p["TCP"].sport
dport = p["TCP"].dport
print("sport: %s" % sport)
print("dport: %s" % dport)
print("raw_http:\n%s" % raw_http)
if p.haslayer("HTTPRequest"):
host = p["HTTPRequest"].Host
uri = p["HTTPRequest"].Path
# 直接获取提取好的字典形式的http数据用fields
http_fields = p["HTTPRequest"].fields
http_payload = p["HTTPRequest"].payload.fields
print("host: %s" % host)
print("uri: %s" % uri)
print("http_fields:\n%s" % http_fields)
print("http_payload:\n%s" % http_payload)
parse_http_pcap("test.pcap")
输出:
# Get请求
sip: 172.30.219.206
dip: 172.16.6.173
sport: 11599
dport: 80
raw_http:
b'GET /api/v1/flow/flow_map?start_time=2020-10-29%2021:34:16 HTTP/1.1\r\nUser-Agent: PostmanRuntime/7.26.5\r\nAccept: */*\r\nCache-Control: no-cache\r\nPostman-Token: 0eb5d6bf-8758-4f3e-baee-b5e4cad11d25\r\nHost: clue.tdevops.svc.com\r\nAccept-Encoding: gzip, deflate, br\r\nConnection: keep-alive\r\n\r\n'
E. 实时获取并解析流量数据包(HTTP和HTTPS)
Python2和Python3都可以运行,并且能够自动检查scapy依赖是否安装,如果没安装则会进行自动安装
注意:python2的话,代码里面的with open()
里面要去掉encoding,否则会报错的
注意:当前python文件所在目录,需要有解析包的目录protocol,即https://github.com/HatBoy/Pcap-Analyzer里面的解析包:app/utils/protocol
注意:解析包里面是一些配置文件,对应着端口和对应协议或应用,比如3306:MySQL,不一定能够精确识别到对应的应用
#!/usr/bin/python
# -*- coding: utf-8 -*-
from __future__ import print_function
import sys
# protocol目录
PROTOCOL_PATH = './protocol'
# 监听的网卡,默认是eth0
NETWORK_INTERFACE = 'eth0'
# 需要安装的Python依赖包,注意:如果是开启上报且异步上报的话,还需要安装aiohttp
NEED_INSTALL = ['scapy']
class PcapDecode:
def __init__(self):
# ETHER:读取以太网层协议配置文件
with open('%s/ETHER' % PROTOCOL_PATH, 'r', encoding='UTF-8') as f:
ethers = f.readlines()
self.ETHER_DICT = {}
for ether in ethers:
ether = ether.strip().strip('\n').strip('\r').strip('\r\n')
self.ETHER_DICT[int(ether.split(':')[0])] = ether.split(':')[1] # 将配置文件中的信息(0257:Experimental)存入dict
# IP:读取IP层协议配置文件
with open('%s/IP' % PROTOCOL_PATH, 'r', encoding='UTF-8') as f:
ips = f.readlines()
self.IP_DICT = {}
for ip in ips:
ip = ip.strip().strip('\n').strip('\r').strip('\r\n')
self.IP_DICT[int(ip.split(':')[0])] = ip.split(':')[1] # 将配置文件中的信息(41:IPv6)存入dic
# PORT:读取应用层协议端口配置文件
with open('%s/PORT' % PROTOCOL_PATH, 'r', encoding='UTF-8') as f:
ports = f.readlines()
self.PORT_DICT = {}
for port in ports:
port = port.strip().strip('\n').strip('\r').strip('\r\n')
self.PORT_DICT[int(port.split(':')[0])] = port.split(':')[1] # 如:21:FTP
# TCP:读取TCP层协议配置文件
with open('%s/TCP' % PROTOCOL_PATH, 'r', encoding='UTF-8') as f:
tcps = f.readlines()
self.TCP_DICT = {}
for tcp in tcps:
tcp = tcp.strip().strip('\n').strip('\r').strip('\r\n')
self.TCP_DICT[int(tcp.split(':')[0])] = tcp.split(':')[1] # 465:SMTPS
# UDP:读取UDP层协议配置文件
with open('%s/UDP' % PROTOCOL_PATH, 'r', encoding='UTF-8') as f:
udps = f.readlines()
self.UDP_DICT = {}
for udp in udps:
udp = udp.strip().strip('\n').strip('\r').strip('\r\n')
self.UDP_DICT[int(udp.split(':')[0])] = udp.split(':')[1] # 513:Who
# 解析以太网层协议 ---ether_decode——ip_decode(tcp_decode or udp_decode)
def ether_decode(self, p):
data = {} # 解析出的信息以dict的形式保存
if p.haslayer("Ether"): # scapy.haslayer,将pcap包中的信息分层,再处理
data = self.ip_decode(p) # 解析IP层协议
return data
else:
data['time'] = time.strftime('%Y-%m-%d %H:%M:%S', time.localtime(p.time))
data['source_ip'] = ''
data['dest_ip'] = ''
data['protocol'] = ''
data['len'] = len(corrupt_bytes(p))
data['info'] = p.summary()
data['source_port'] = ''
data['dest_port'] = ''
return data
# 解析IP层协议
def ip_decode(self, p):
data = {}
if p.haslayer("IP"): # 2048:Internet IP (IPv4) ,分IPV4和IPV6和其他协议
ip = p.getlayer("IP")
if p.haslayer("TCP"): # 6:TCP
data = self.tcp_decode(p, ip)
return data
elif p.haslayer("UDP"): # 17:UDP
data = self.udp_decode(p, ip)
return data
else:
if ip.proto in self.IP_DICT: # 若ip分层中的协议信息在字典中,则提取ip分层中的源地址、目的地址、协议(转换)等
data['time'] = time.strftime('%Y-%m-%d %H:%M:%S', time.localtime(p.time))
data['source_ip'] = ip.src
data['dest_ip'] = ip.dst
data['protocol'] = self.IP_DICT[ip.proto]
data['len'] = len(corrupt_bytes(p))
data['info'] = p.summary()
data['source_port'] = 'UnKnow'
data['dest_port'] = 'UnKnow'
return data
else:
data['time'] = time.strftime('%Y-%m-%d %H:%M:%S', time.localtime(p.time))
data['source_ip'] = ip.src
data['dest_ip'] = ip.dst
data['protocol'] = 'IPv4'
data['len'] = len(corrupt_bytes(p))
data['info'] = p.summary()
data['source_port'] = 'UnKnow'
data['dest_port'] = 'UnKnow'
return data
elif p.haslayer("IPv6"): # 34525:IPv6
ipv6 = p.getlayer("IPv6")
if p.haslayer("TCP"): # 6:TCP
data = self.tcp_decode(p, ipv6)
return data
elif p.haslayer("UDP"): # 17:UDP
data = self.udp_decode(p, ipv6)
return data
else:
if ipv6.nh in self.IP_DICT:
data['time'] = time.strftime('%Y-%m-%d %H:%M:%S', time.localtime(p.time))
data['source_ip'] = ipv6.src
data['dest_ip'] = ipv6.dst
data['protocol'] = self.IP_DICT[ipv6.nh]
data['len'] = len(corrupt_bytes(p))
data['info'] = p.summary()
data['source_port'] = 'UnKnow'
data['dest_port'] = 'UnKnow'
return data
else:
data['time'] = time.strftime('%Y-%m-%d %H:%M:%S', time.localtime(p.time))
data['source_ip'] = ipv6.src
data['dest_ip'] = ipv6.dst
data['protocol'] = 'IPv6'
data['len'] = len(corrupt_bytes(p))
data['info'] = p.summary()
data['source_port'] = 'UnKnow'
data['dest_port'] = 'UnKnow'
return data
else:
if p.type in self.ETHER_DICT:
data['time'] = time.strftime('%Y-%m-%d %H:%M:%S', time.localtime(p.time))
data['source_ip'] = p.src
data['dest_ip'] = p.dst
data['protocol'] = self.ETHER_DICT[p.type]
data['len'] = len(corrupt_bytes(p))
data['info'] = p.summary()
data['source_port'] = 'UnKnow'
data['dest_port'] = 'UnKnow'
return data
else:
data['time'] = time.strftime('%Y-%m-%d %H:%M:%S', time.localtime(p.time))
data['source_ip'] = p.src
data['dest_ip'] = p.dst
data['protocol'] = hex(p.type) # 若在字典中没有改协议,则以16进制的形式显示
data['len'] = len(corrupt_bytes(p))
data['info'] = p.summary()
data['source_port'] = 'UnKnow'
data['dest_port'] = 'UnKnow'
return data
# 解析TCP层协议
def tcp_decode(self, p, ip):
data = {}
tcp = p.getlayer("TCP")
data['time'] = time.strftime('%Y-%m-%d %H:%M:%S', time.localtime(p.time))
data['source_ip'] = ip.src
data['dest_ip'] = ip.dst
data['len'] = len(corrupt_bytes(p))
data['info'] = p.summary()
data['source_port'] = str(ip.sport)
data['dest_port'] = str(ip.dport)
if tcp.dport in self.PORT_DICT: # 若端口信息在PORT_DICT\TCP_DICT中则转换为已知
data['protocol'] = self.PORT_DICT[tcp.dport]
elif tcp.sport in self.PORT_DICT:
data['protocol'] = self.PORT_DICT[tcp.sport]
elif tcp.dport in self.TCP_DICT:
data['protocol'] = self.TCP_DICT[tcp.dport]
elif tcp.sport in self.TCP_DICT:
data['protocol'] = self.TCP_DICT[tcp.sport]
else:
data['protocol'] = "TCP"
return data
# 解析UDP层协议
def udp_decode(self, p, ip):
data = {}
udp = p.getlayer("UDP")
data['time'] = time.strftime('%Y-%m-%d %H:%M:%S', time.localtime(p.time))
data['source_ip'] = ip.src
data['dest_ip'] = ip.dst
data['len'] = len(corrupt_bytes(p))
data['info'] = p.summary()
data['source_port'] = str(ip.sport)
data['dest_port'] = str(ip.dport)
if udp.dport in self.PORT_DICT: # 若端口信息在PORT_DICT\UDP_DICT中则转换为已知
data['protocol'] = self.PORT_DICT[udp.dport]
elif udp.sport in self.PORT_DICT:
data['protocol'] = self.PORT_DICT[udp.sport]
elif udp.dport in self.UDP_DICT:
data['protocol'] = self.UDP_DICT[udp.dport]
elif udp.sport in self.UDP_DICT:
data['protocol'] = self.UDP_DICT[udp.sport]
else:
data['protocol'] = "UDP"
return data
def get_host_ip():
"""
查询本机ip地址
:return: ip
"""
try:
s = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
s.connect(('8.8.8.8', 80))
ip = s.getsockname()[0]
finally:
s.close()
return ip
def get_host_name():
"""
查询本机hostname,如果是:xxx.novalocal,则只获取xxx
"""
hostname = socket.gethostname()
name = hostname.split('.')[0] if '.' in hostname else hostname
return name
def handle_flow(obj, data):
# 解析流量数据
data = obj.ether_decode(data)
print(data)
return data
def check_import(name):
try:
exec("import %s" % name)
return 0, 'success'
except:
return 1, 'Python模块%s未安装' % name
def check_init():
if sys.version_info < (3, 0):
return 1, '该脚本只能在Python3下运行,请使用Python3运行该脚本'
for app_name in NEED_INSTALL:
(code, msg) = check_import(app_name)
if code:
return 1, msg
print('Python模块%s已安装' % app_name)
return 0, 'success'
if __name__ == '__main__':
PD = PcapDecode()
host_ip = get_host_ip()
host_name = get_host_name()
print('------------------------------------------------------\r\n')
print('开始检测是否安装依赖')
(code, msg) = check_init()
if code:
print(msg)
sys.exit(1)
loop = None
from scapy.all import *
print('------------------------------------------------------\r\n')
print('主机IP: ', host_ip)
print('主机名称:', host_name)
print('监听网卡:', NETWORK_INTERFACE)
print('------------------------------------------------------\r\n')
print('开始监控流量数据')
print('------------------------------------------------------\r\n')
# 实时抓取流量数据并且每个数据包进行解析输出和上报
while True:
sniff(iface=NETWORK_INTERFACE, timeout=20, store=0, filter="tcp and ( port 80 or port 443 )", prn=lambda x: handle_flow(PD, x))
每有一条流量过来,会输出:
{"source_ip": "172.16.6.220","source_port": "41804","dest_ip": "172.16.6.173","dest_port": "80","protocol": "HTTP","len": 74,"info": "Ether / IP / TCP 172.16.6.173:41804 > 172.16.6.16:http S","time": "2020-10-29 11:56:28"}
能够获取到五元组,其实也可以做到获取完整的报文信息。