java跨文件调用函数_java – 处理一个巨大的文件并快速调用文件的每一行上的函数...

最新推荐文章于 2023-06-20 07:30:00 发布

Keq Chen

最新推荐文章于 2023-06-20 07:30:00 发布

阅读量531

点赞数

文章标签： java跨文件调用函数

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/weixin_34766547/article/details/114666251

版权

我有一个大约10.000.000行文本的文件(是的,我有足够的内存).

现在我想要一个MyClass列表(构造函数是MyClass(String s)与文件的每一行.现在我这样做：

List help = Files.lines(Paths.get(s))

.parallel()

.map(MyClass::new)

.collect(Collectors.toList());

但需要数年才能取得进展.关于如何加快这个问题的任何想法？

最佳答案首先,来自

Collectors.toList()文档的相关摘录：

[…]There are no guarantees on the type, mutability, serializability, or thread-safety of the List returned; if more control over the returned List is required, use toCollection(Supplier)

现在,让我们更深入地了解收藏家characteristics;我们发现这个：

public static final Collector.Characteristics CONCURRENT

Indicates that this collector is concurrent, meaning that the result container can support the accumulator function being called concurrently with the same result container from multiple threads.

If a CONCURRENT collector is not also UNORDERED, then it should only be evaluated concurrently if applied to an unordered data source.

现在,没有什么能保证Collectors.toList()返回的收集器完全是并发的.

尽管启动你的新类别可能需要一段时间,但这里的安全赌注是假设这个收集器不是并发的.但幸运的是,我们有一种方法可以使用并发集合,如javadoc中所述.那么,让我们试试：

.collect(

Collector.of(CopyOnWriteArrayList::new,

List::add,

(o, o2) -> { o.addAll(o2); return o; },

Function.>identity(),

Collector.Characteristics.CONCURRENT,

Collector.Characteristics.IDENTITY_FINISH

)

)

这可能会加快速度.

现在,你有另一个问题.你不关闭你的流.

这个鲜为人知,但Stream(无论是任何类型还是{Int,Double,Long}流)都实现了AutoCloseable.您想要关闭I / O绑定的流,而Files.lines()就是这样的流.

所以,试试这个：

final List list;

try (

final Stream lines = Files.lines(...);

) {

list = lines.parallel().map(MyClass::new)

.collect(seeAbove);

}

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。