java ML回归预测_Java机器学习库ML之七分类预测输出概率值

这篇博客展示了如何使用Java Machine Learning Library (Java-ML) 中的RandomForest分类器进行概率分类。通过采样80%的Iris数据集进行训练,然后在剩余的20%数据上验证,输出每个实例属于各类别的概率分布,如Iris-setosa、Iris-versicolor和Iris-virginica。
摘要由CSDN通过智能技术生成

场景:一般分类预测直接输出类别标记,不过有些情况需要输出对应类别的概率值,比如判定为正例的概率是0.6,而判定为负例的概率是0.3,那自然标记为正例,这里就是看ML用classDistribution输出各类别的概率值。参考代码如下:

/**

* This file is part of the Java Machine Learning Library

*

* The Java Machine Learning Library is free software; you can redistribute it and/or modify

* it under the terms of the GNU General Public License as published by

* the Free Software Foundation; either version 2 of the License, or

* (at your option) any later version.

*

* The Java Machine Learning Library is distributed in the hope that it will be useful,

* but WITHOUT ANY WARRANTY; without even the implied warranty of

* MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the

* GNU General Public License for more details.

*

* You should have received a copy of the GNU General Public License

* along with the Java Machine Learning Library; if not, write to the Free Software

* Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA

*

* Copyright (c) 2006-2012, Thomas Abeel

*

* Project: http://java-ml.sourceforge.net/

*

*/

package com.gddx;

import java.io.File;

import java.util.Map;

import java.util.Random;

import be.abeel.util.Pair;

import net.sf.javaml.classification.Classifier;

import net.sf.javaml.classification.tree.RandomForest;

import net.sf.javaml.core.Dataset;

import net.sf.javaml.core.DenseInstance;

import net.sf.javaml.core.Instance;

import net.sf.javaml.sampling.Sampling;

import net.sf.javaml.tools.data.FileHandler;

/**

* Tutorial for the random forest classifier.

*

* @author Thomas Abeel

*

*/

public class TutorialRandomForest {

/**

* Shows the default usage of the random forest algorithm.

*/

public static void main(String[] args) throws Exception {

/* Load a data set */

Dataset ori_data = FileHandler.loadDataset(new File("D:\\tmp\\javaml-0.1.7-src\\UCI-small\\iris\\iris.data"), 4, ",");

Sampling s = Sampling.SubSampling;

Pair sam_data = s.sample(ori_data, (int) (ori_data.size() * 0.8));

/*

* Contruct a RF classifier that uses 5 neighbors to make a decision.

*/

Classifier rf = new RandomForest(50, false, 3, new Random());

rf.buildClassifier(sam_data.x());//80%样本训练

/* 输出预测的类别概率 */

for(Instance inst:sam_data.y()){ //20%样本验证

Map mprob=rf.classDistribution(inst);//输出类别的概率,[0,1]

System.out.println(mprob);

}

}

}

执行结果:

{Iris-virginica=0.0, Iris-setosa=1.0000000000000004, Iris-versicolor=0.0}

{Iris-virginica=0.0, Iris-setosa=1.0000000000000004, Iris-versicolor=0.0}

{Iris-virginica=0.0, Iris-setosa=1.0000000000000004, Iris-versicolor=0.0}

{Iris-virginica=0.0, Iris-setosa=1.0000000000000004, Iris-versicolor=0.0}

{Iris-virginica=0.0, Iris-setosa=1.0000000000000004, Iris-versicolor=0.0}

{Iris-virginica=0.0, Iris-setosa=1.0000000000000004, Iris-versicolor=0.0}

{Iris-virginica=0.0, Iris-setosa=1.0000000000000004, Iris-versicolor=0.0}

{Iris-virginica=0.0, Iris-setosa=1.0000000000000004, Iris-versicolor=0.0}

{Iris-virginica=0.0, Iris-setosa=1.0000000000000004, Iris-versicolor=0.0}

{Iris-virginica=0.0, Iris-setosa=1.0000000000000004, Iris-versicolor=0.0}

{Iris-virginica=0.0, Iris-setosa=0.7400000000000003, Iris-versicolor=0.25999999999999995}

{Iris-virginica=0.0, Iris-setosa=1.0000000000000004, Iris-versicolor=0.0}

{Iris-virginica=0.0, Iris-setosa=0.0, Iris-versicolor=1.0000000000000004}

{Iris-virginica=0.0, Iris-setosa=0.0, Iris-versicolor=1.0000000000000004}

{Iris-virginica=0.0, Iris-setosa=0.0, Iris-versicolor=1.0000000000000004}

{Iris-virginica=0.08, Iris-setosa=0.0, Iris-versicolor=0.9200000000000005}

{Iris-virginica=0.46000000000000013, Iris-setosa=0.0, Iris-versicolor=0.5400000000000001}

{Iris-virginica=0.0, Iris-setosa=0.0, Iris-versicolor=1.0000000000000004}

{Iris-virginica=0.0, Iris-setosa=0.0, Iris-versicolor=1.0000000000000004}

{Iris-virginica=0.0, Iris-setosa=0.0, Iris-versicolor=1.0000000000000004}

{Iris-virginica=0.0, Iris-setosa=0.0, Iris-versicolor=1.0000000000000004}

{Iris-virginica=1.0000000000000004, Iris-setosa=0.0, Iris-versicolor=0.0}

{Iris-virginica=1.0000000000000004, Iris-setosa=0.0, Iris-versicolor=0.0}

{Iris-virginica=0.32, Iris-setosa=0.0, Iris-versicolor=0.6800000000000003}

{Iris-virginica=1.0000000000000004, Iris-setosa=0.0, Iris-versicolor=0.0}

{Iris-virginica=1.0000000000000004, Iris-setosa=0.0, Iris-versicolor=0.0}

{Iris-virginica=1.0000000000000004, Iris-setosa=0.0, Iris-versicolor=0.0}

{Iris-virginica=1.0000000000000004, Iris-setosa=0.0, Iris-versicolor=0.0}

{Iris-virginica=1.0000000000000004, Iris-setosa=0.0, Iris-versicolor=0.0}

{Iris-virginica=0.9000000000000005, Iris-setosa=0.0, Iris-versicolor=0.1}

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值