[bash]fgrep

这篇博客讨论了如何在大型文件中高效地查找和处理数据。内容涉及多个程序,如程序1、2和3,其中程序1处理数据速度较慢。文章重点介绍了fgrep命令,尽管它不支持正则表达式,但在特定场景下比grep更适用。作者还分享了如何处理大型文件中特定行范围的方法,并引用了一个关于快速提取文件中指定行的链接。
摘要由CSDN通过智能技术生成

这篇日志的题目好想叫做<论编程不好是怎样一种体验>…

文件

1.input file (10万行)

"ClipID","yfcc100m_hash"
"YFCC100M53595ec67db890b7ad32dfa375196dec","53595ec67db890b7ad32dfa375196dec"
"YFCC100Md6d7f154d7ad0a367353b8a4c7f868","d6d7f154d7ad0a367353b8a4c7f868"
"YFCC100Mabfa822c14bcac4c0f45deb97d18b5f","abfa822c14bcac4c0f45deb97d18b5f"
"YFCC100Md665346779ab26f33f3e537462daf3a","d665346779ab26f33f3e537462daf3a"
"YFCC100M2c7480e2c372a568f5efe91e323ea022","2c7480e2c372a568f5efe91e323ea022"
"YFCC100Mf2b1ab759b93a444efcff0f1d8db86ad","f2b1ab759b93a444efcff0f1d8db86ad"
"YFCC100M3430141d3e43fd1dc903147ecbbe3c","3430141d3e43fd1dc903147ecbbe3c"
"YFCC100M3fd1fb7996b492bc324f50fe6510cd31","3fd1fb7996b492bc324f50fe6510cd31"
"YFCC100M5bab830a1651bbcd45476a2a589d8","5bab830a1651bbcd45476a2a589d8"
"YFCC100Me967f4132a9490dea9890d551d4313e","e967f4132a9490dea9890d551d4313e"
"YFCC100M7617ffd88e62dcec4563f958eb879","7617ffd88e62dcec4563f958eb879"
"YFCC100Mc397a76d6b461ab0319bbadb2ee7997","c397a76d6b461ab0319bbadb2ee7997"
"YFCC100Mb44854aeaa7ab878b606699bf46a94a","b44854aeaa7ab878b606699bf46a94a"
"YFCC100M4db1d3a7dafad886ebaf4121aff4a921","4db1d3a7dafad886ebaf4121aff4a921"
"YFCC100M716414c3a7cc5dca529a9dd32a8d9bed","716414c3a7cc5dca529a9dd32a8d9bed"
"YFCC100M721b1d73bfad7edb75e364bce5a7ba88","721b1d73bfad7edb75e364bce5a7ba88"
"YFCC100Md6452731b2e6fd49bdc81791b522","d6452731b2e6fd49bdc81791b522"
"YFCC100Mdccadda6f254e2c0612b563cc0fc2ff8","dccadda6f254e2c0612b563cc0fc2ff8"
"YFCC100Mf0db32256c23de67e3f42824b622d1d5","f0db32256c23de67e3f42824b622d1d5"

2.hash file (4G)
//identifier, hash value

37941   f81e7c3af3799b60b7dbd514c67184af
37946   dfdb5ed692372b7fde1be75a17b45fcc
38121   df85d12e831af19b1e5ccff79169df45
38129   d919459df4a5d7b0a2ae56bbfcffa9dd
38143   32d6f03a3dc5ad04af8919e333e95d7
38225   f4281b87d46177294726c7d933366a2
38389   7ff9a834d0d25f47d50a3d9bec68dd
38395   017fdbe9f414c3933cb92dc64fc6d56
38397   716b13e1bfa95c40569b817a5ebccf0
38408   e05beeaa8528a98742ceae5ac4f2718
38414   c239c75be9558cedd65a6b1a5e6deee8
38425   516754853cb6f40682f5d28d3458f
69471   ecea8b8d25ba22c3326136666927e0
69500   f02cb5ad1113481e9368bd3aa09b971a
69604   7ebf6e2e9ccda758a525a1af70123f
69606   50c1c3d43ea4d1dd70579681b4adf15
69725   65ec648d1525d9158c5f6b5fce2ee8
69802   e377a4faf03e6a26cd845fff5cce44
69895   53e6707335874eac29b69b5782f94061
69941   ae0915a8b9233286ebf29e5a981c7d
70023   3ef140ec0f976448fa85e779fdcdde
70110   7c1db88bea789d5e238580f9c776f99e
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值