Decision Tree C4.5: A Worked Example

Garrison2012

Categories: Machine Learning, Data Mining

Article link: https://blog.csdn.net/Garrison2012/article/details/41170099

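This article uses a worked example to show how to classify data with the C4.5 decision tree algorithm, covering data preprocessing, feature selection, and model construction, to help readers understand how decision trees work.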

The following is a MATLAB implementation of C4.5 found online.

function [tree, test_targets] = C4_5(train_patterns, train_targets, test_patterns, inc_node, Nu)

% Classify using Quinlan's C4.5 algorithm
% Inputs:
%	train_patterns      - Train patterns (rows are features, columns are samples)
%	train_targets       - Train targets (1 row; one column per training sample)
%	test_patterns       - Test patterns (rows are features, columns are samples)
%	inc_node            - Percentage of incorrectly assigned samples at a node.
%	                      Guards against overfitting: recursion stops once a node holds
%	                      fewer samples than this threshold; 5-10 is a reasonable setting
%	Nu                  - Decides whether a variable is discrete or continuous (always set to 10)
%
% Outputs
%	test_targets        - Predicted targets (1 row, m columns; m is the number of test samples)

%NOTE: In this implementation it is assumed that a pattern vector with fewer than 10 unique values (the parameter Nu)
%is discrete, and will be treated as such. Other vectors will be treated as continuous

[Ni, M]	    = size(train_patterns); % train_patterns is Ni-by-M: Ni is the feature dimension, M the number of training samples
inc_node    = inc_node*M/100;

%Find which of the input patterns are discrete, and discretize the corresponding
%dimension on the test patterns
discrete_dim = zeros(1,Ni);
for i = 1
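
The NOTE in the function header states the rule that decides which features count as discrete: a feature with fewer than Nu unique values is treated as discrete, and any other feature as continuous. The following is a minimal standalone sketch of that rule only, not the remainder of the listing above. The toy matrix, the small value of Nu, and the variable is_discrete are assumptions made for illustration.

```matlab
% Toy data (hypothetical): rows are features, columns are samples,
% matching the layout expected by C4_5.
train_patterns = [1   2   1   2   1   2;      % feature 1: 2 unique values
                  0.1 0.5 0.9 1.3 1.7 2.1];   % feature 2: 6 unique values
Nu = 3;   % small threshold chosen for this toy example only

Ni = size(train_patterns, 1);     % number of features
is_discrete = false(1, Ni);       % hypothetical flag vector, one entry per feature
for i = 1:Ni
    % Count the distinct values feature i takes across all samples
    n_unique = numel(unique(train_patterns(i, :)));
    % Rule from the NOTE: fewer than Nu unique values means discrete
    is_discrete(i) = n_unique < Nu;
end
disp(is_discrete)
```

With these toy values, feature 1 is flagged as discrete and feature 2 as continuous. With the default Nu of 10, both features would be treated as discrete, because neither has 10 or more unique values.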


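1. Training sample set. This article introduces the C4.5 algorithm by building a C4.5 decision tree on a worked example. The training set is given first: its first four columns x are the attributes and the last column y is the class label. Introduction to C4.5: C4.5 is another decision tree classification algorithm in machine learning. Building a decision tree amounts to choosing, at each step, a good feature and split point as the test at the current node. C4.5 is an improved version of ID3, and its main improvements over ID3 are: 1. It uses the information gain ratio to select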
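
The information gain ratio named above is the information gain of an attribute divided by its split information, which is the entropy of the attribute's own value distribution. The following is a minimal sketch of that computation for a single discrete attribute. The toy vectors x and y and the helper names entropy_bits and dist are assumptions for illustration, not part of the C4_5 listing.

```matlab
% Hypothetical toy data: one discrete attribute x and binary class labels y
x = [1 1 1 2 2 2 3 3];
y = [0 0 1 1 1 1 0 0];

% Shannon entropy in bits of a probability vector (zero entries are skipped)
entropy_bits = @(p) -sum(p(p > 0) .* log2(p(p > 0)));
% Empirical distribution of the values that occur in a vector
dist = @(v) arrayfun(@(u) mean(v == u), unique(v));

H_y = entropy_bits(dist(y));           % entropy of the class labels

% Conditional entropy of y given x: weighted entropy of each branch
vals = unique(x);
H_cond = 0;
for k = 1:numel(vals)
    mask = (x == vals(k));             % samples falling into branch k
    H_cond = H_cond + mean(mask) * entropy_bits(dist(y(mask)));
end

info_gain  = H_y - H_cond;             % criterion used by ID3
split_info = entropy_bits(dist(x));    % penalizes attributes with many values
gain_ratio = info_gain / split_info;   % criterion used by C4.5

fprintf('gain = %.4f, split info = %.4f, gain ratio = %.4f\n', ...
        info_gain, split_info, gain_ratio);
```

Dividing by the split information counteracts the bias of plain information gain toward attributes with many distinct values, since such attributes have a large split information.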