Parallel parameter tuning in Python: scikit-learn grid_search

Levana Dong

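Copyright notice: This is an original article by the author, released under the CC 4.0 BY-SA license. Reposts must include a link to the original source and this notice.

Article link: https://blog.csdn.net/weixin_42356460/article/details/113968362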

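The previous post, on text classification with scikit-learn, used 20newsgroups as an example to show three ways of extracting text features from the training and test sets. That leaves several open questions: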

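How many words should the vectorizer keep?

Preprocessing filters out words whose document frequency exceeds max_df. What should max_df be?

Should TfidfTransformer use tf alone, or tf together with idf?

How many iterations should the classifier run, and what learning rate should it use?

...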

"Just loop over them and try each one"... well, fine, that is how it is done in MATLAB...
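To see why trying the values one by one gets tedious, the sketch below simply enumerates every combination of a small grid. The parameter names follow scikit-learn conventions (`max_features`, `max_df`, `use_idf`, `max_iter`, `eta0`), but the candidate values are made up for illustration and do not come from the original post.

```python
from itertools import product

# Hypothetical candidate values for the questions above (illustrative only).
grid = {
    "max_features": [5000, 10000, None],  # how many words the vectorizer keeps
    "max_df": [0.5, 0.75, 1.0],           # drop words found in more than this share of documents
    "use_idf": [True, False],             # tf-idf, or tf only
    "max_iter": [10, 50],                 # classifier iterations
    "eta0": [0.01, 0.1],                  # initial learning rate (assumed SGD-style classifier)
}

# Build one dict per combination, in the order product() yields them.
names = list(grid)
combinations = [dict(zip(names, values)) for values in product(*grid.values())]

print(len(combinations))  # 3 * 3 * 2 * 2 * 2 = 72
print(combinations[0])
```

Even this modest grid needs 72 separate train-and-evaluate runs, and each one would normally be cross-validated, so a hand-written loop quickly becomes slow and error-prone.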

Fortunately, scikit-learn provides pipeline (for chaining estimators) and grid_search (for searching for the best parameters), which together make it possible to tune parameters in parallel.
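A minimal sketch of how the two pieces fit together is shown below. It makes several assumptions: a tiny made-up corpus stands in for 20newsgroups, `SGDClassifier` is chosen only as an example classifier, and the grid values are illustrative. In current scikit-learn releases `GridSearchCV` is imported from `sklearn.model_selection`; the older `sklearn.grid_search` module named in the title was removed in version 0.20.

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfTransformer
from sklearn.linear_model import SGDClassifier
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline

# Tiny made-up corpus standing in for 20newsgroups (illustrative only).
docs = [
    "the gpu renders the image quickly",
    "new graphics card renders 3d image",
    "image rendering on the graphics card",
    "the gpu driver improves rendering speed",
    "the team won the hockey game",
    "hockey players score in the final game",
    "the goalie saved the game for the team",
    "fans cheered as the team scored",
]
labels = [0, 0, 0, 0, 1, 1, 1, 1]

# Chain feature extraction and classification into a single estimator.
pipeline = Pipeline([
    ("vect", CountVectorizer()),
    ("tfidf", TfidfTransformer()),
    ("clf", SGDClassifier(random_state=0)),
])

# Grid keys are "<step name>__<parameter name>"; values are illustrative.
param_grid = {
    "vect__max_df": [0.5, 1.0],
    "vect__max_features": [None, 20],
    "tfidf__use_idf": [True, False],
    "clf__max_iter": [1000],
    "clf__alpha": [1e-4, 1e-3],
}

# n_jobs=-1 evaluates candidates on all available CPU cores;
# cv=2 only because the toy corpus is so small.
search = GridSearchCV(pipeline, param_grid, cv=2, n_jobs=-1)
search.fit(docs, labels)

print(search.best_score_)
print(search.best_params_)
```

After fitting, `search.best_params_` holds the winning combination and `search.cv_results_` holds the scores for every candidate, so no hand-written loop is needed.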

The official documentation describes Pipeline as follows:

Pipeline can be used to chain multiple estimators into one. This is useful as there is often a fixed sequence of steps in processing the data, for example feature selection, normalization and classification. Pipeline serves two purposes here:

Convenience : You only have to call fit and predict once on your data to fit a whole sequence of estimators.

Joint parameter selection : You can grid search
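The two purposes quoted above can be illustrated with a short sketch. It assumes a made-up toy corpus and uses `MultinomialNB` purely as an example classifier; neither comes from the original post.

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfTransformer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import Pipeline

# Made-up training data (illustrative only): 0 = graphics, 1 = sports.
train_docs = [
    "the gpu renders the image quickly",
    "new graphics card renders 3d image",
    "image rendering on the graphics card",
    "the gpu driver improves rendering speed",
    "the team won the hockey game",
    "hockey players score in the final game",
    "the goalie saved the game for the team",
    "fans cheered as the team scored",
]
train_labels = [0, 0, 0, 0, 1, 1, 1, 1]

text_clf = Pipeline([
    ("vect", CountVectorizer()),
    ("tfidf", TfidfTransformer()),
    ("clf", MultinomialNB()),
])

# Convenience: one fit call runs counting, tf-idf weighting, and training in order.
text_clf.fit(train_docs, train_labels)

# One predict call applies the same fitted transformations before classifying.
predicted = text_clf.predict(["the team scored in the hockey game"])
print(predicted)

# Parameters of every step are exposed under "<step>__<param>" names,
# which is what lets a grid search tune all steps at once.
tunable = sorted(name for name in text_clf.get_params() if "__" in name)
print(tunable)

# The same naming works for setting a parameter on a nested step.
text_clf.set_params(tfidf__use_idf=False)
```

The step names (`vect`, `tfidf`, `clf`) are arbitrary labels chosen when building the pipeline; whatever labels are used become the prefixes of the parameter names.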
