C#比较两个文件内容是否相同

最新推荐文章于 2023-07-30 00:11:26 发布

Unique_849997563

最新推荐文章于 2023-07-30 00:11:26 发布

阅读量4.1k

点赞数 1

分类专栏：计算机相关基础 C# 文章标签： C# 源码比较内容哈希算法基础案例

本文为博主原创文章，转载请附上原文出处链接和本声明。我的网站：https://www.hewen.site

本文链接：https://blog.csdn.net/qq_33461689/article/details/121499130

版权

C# 同时被 2 个专栏收录

49 篇文章 1 订阅

订阅专栏

计算机相关基础

21 篇文章 0 订阅

订阅专栏

今天看到项目的比较文件是先将所有字节读出来，然后逐一进行比较，想找找有没有可以优化的地方。在网上看了一下，有比较哈希码的，验证了一下，发现不管是文件开始就不相同，还是文件末尾才不相同，都比较耗时。

C#比较两个文本文件的内容是否相等 - 五点 - 博客园 (cnblogs.com)

测试文件大小：10000KB

一、比较哈希码

先将文件内容转成哈希码，然后进行比较。

    public static bool CompareFile(string sourceFilePath, string destFilePath)
    {
        if (string.IsNullOrEmpty(sourceFilePath) || string.IsNullOrEmpty(destFilePath))
        {
            return false;
        }
        if (!File.Exists(sourceFilePath)|| !File.Exists(destFilePath))
        {
            return false;
        }
        using (HashAlgorithm hash = HashAlgorithm.Create())
        {
            using (FileStream file1 = new FileStream(sourceFilePath, FileMode.Open), file2 = new FileStream(destFilePath, FileMode.Open))
            {
                byte[] hashByte1 = hash.ComputeHash(file1);//哈希算法根据文本得到哈希码的字节数组
                byte[] hashByte2 = hash.ComputeHash(file2);
                string str1 = BitConverter.ToString(hashByte1);//将字节数组装换为字符串
                string str2 = BitConverter.ToString(hashByte2);
                return str2 == str1;
            }
        }
        return false;
    }

在两个转哈希码的方法上加上测试代码。发现，比较耗时。

二、逐字符比较

  public static bool CompareFile(string sourceFilePath, string destFilePath)
    {
        if (string.IsNullOrEmpty(sourceFilePath) || string.IsNullOrEmpty(destFilePath))
        {
            return false;
        }
        if (!File.Exists(sourceFilePath) || !File.Exists(destFilePath))
        {
            return false;
        }
        byte[] _source = File.ReadAllBytes(sourceFilePath);
        byte[] _dest = File.ReadAllBytes(destFilePath);
        if (_source.Length != _dest.Length)
        {
            return false;
        }
        for (int i = 0; i < _source.Length; ++i)
        {
            if (_source[i] != _dest[i])
            {
                return false;
            }
        }
        return true;
    }

在逐字符的比较函数上加上测试代码。

如果文件开始就不相同，比较耗时约等于读取文件的耗时。

如果文件末尾才不相同，比较耗时也比先转哈希码再比较快很多。