1. 本地文件
新建一个sh脚本,内容如下:
#!/bin/bash
MY_SAVEIFS=$IFS
IFS=$'\n'
for fileList in $(find $1 -type f);do
ls -l "$fileList"
done
IFS=$MY_SAVEIFS#!/bin/bash
MY_SAVEIFS=$IFS
IFS=$'\n'
for fileList in $(find $1 -type f);do
ls -l "$fileList"
done
IFS=$MY_SAVEIFS
运行结果:sudo filesize.sh HDTV > /tmp/list.txt
其中文件size是以字节为单位
上述结果拷贝到excel中,用文字替换,分列的方法得到本地表格,类似下面的:
2. 云侧文件
以189天翼云盘为例,新增如下函数,python递归遍历文件夹,打印文件名和文件size
def get_all_file_list(self, fid, folderName):
"""获取网盘目录fid下所有的文件列表"""
url = self._host_url + "/api/open/file/listFiles.action"
params = {
"folderId": str(fid),
}
resp = self._get(url, params=params)
if not resp:
logger.error(f"File list: {fid=}network error!")
return
resp = resp.text
if 'errorCode' in resp:
logger.error(f"File list: {fid=}server response error!")
return
root = ElementTree.fromstring(resp)
children_node = root.find('fileList')
if len(children_node) == 0:
logger.error(f"File list: {folderName=}({fid})xml parse error!")
return
for child in children_node:
##logger.debug(f"find a node: {child.tag}:{child.findtext('name')}:id={child.findtext('id')}")
if child.tag == "file":
name = child.findtext('name')
size = child.findtext('size')
logger.debug(f"FILE_FOUND: {folderName}/{name}#{size=}")
id_ = int(child.findtext('id'))
md5 = child.findtext('md5')
self._print_file_path.append(FileInfo(name=folderName+'/'+name, id=id_, size=size))
elif child.tag == "folder":
name = child.findtext('name')
self.get_all_file_list(int(child.findtext('id')), folderName+'/'+child.findtext('name'))
else:
#do nothing
pass
return
得到如下输出:
拷贝到excel,文字替换+分列,得到如下表格:
3. 处理数据
1)把云侧的size ,vlookup到本地表中
2)利用IF表达式比较本地文件和云侧文件的size是否相等
https://n2.urltosub.com/link/IdWEOsjoksAW04q0?list=shadowrocket&extend=1
4. 结论,189的云盘还是挺靠谱的,4T的数据,上传成功率是100%,后续可以把md5的校验加上,看看效果如何。