
Semi-Supervised Learning Benchmark Dataset

The data sets come from:

Chapelle O, Schölkopf B, Zien A. Semi-Supervised Learning (Chapelle, O. et al., Eds.; 2006) [Book Reviews][J]. IEEE Transactions on Neural Networks, 2009, 20(3): 542-542.

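The online link is:

It took me a long time to find. Some of the links online no longer open, so I am saving the content here for future reference.

I also found a good blog that summarizes semi-supervised data sets.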

Self-Labeled Techniques for Semi-Supervised Learning: Taxonomy, Software and Empirical Study

The content of the first link follows:

The Benchmark Data Sets

For each data set, we provide 12 splits (exception: only 10 splits for data set 8) into labeled points and remaining unlabeled points. We ensure that each split contains at least one point of each class. Apart from this, there is no bias in the labeling process.

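In short: each data set has 12 splits (data set 8 is the exception, with only 10). Each split has either 10 or 100 labeled points, and the rest are unlabeled. Every split contains at least one point of each class.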
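The "at least one point of each class" property is easy to verify for any split. The following is a minimal Python sketch, not part of the benchmark distribution: the function name `split_covers_all_classes` and the toy labels are hypothetical, and the indices are assumed to be 1-based.

```python
def split_covers_all_classes(y, labeled_idx):
    """Return True if the labeled points of one split include every class in y.

    y           -- list of class labels, one per example
    labeled_idx -- 1-based indices of the labeled examples for this split
    """
    all_classes = set(y)
    # Convert each 1-based index to Python's 0-based indexing before lookup.
    labeled_classes = {y[i - 1] for i in labeled_idx}
    return labeled_classes == all_classes


# Toy data (hypothetical, not taken from the benchmark): 6 points, 2 classes.
y = [-1, -1, 1, 1, -1, 1]
good_split = [1, 3]  # one labeled point from each class
bad_split = [1, 2]   # both labeled points come from class -1
```

Here `split_covers_all_classes(y, good_split)` holds, while `bad_split` fails the check because class +1 has no labeled point.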

The table contains individual files in matlab 5.0 format (.mat files).

You can also download all data sets and splits (excluding the extra unlabeled data of set 8) at once as archive files, in matlab format: gzipped TAR file, ZIP file; in ascii format: gzipped TAR file, ZIP file (here, only the indices of the labeled examples are provided -- all other examples are unlabeled). Data sets 8 and 9 are supplied in special formats: in set 8, all attributes are categorical and have to be expanded into a sparse binary vector (21 bits per attribute; cf to the matlab script); in set 9, the data are very sparse, and only non-zero values are supplied as a list of "index:value" pairs.
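The two special formats can be illustrated with a short Python sketch. It rests on assumptions that the text above does not confirm: that a set 9 record is a line of whitespace-separated "index:value" pairs, and that each categorical attribute in set 8 is stored as an integer code from 1 to 21. The function names are hypothetical, and the matlab script mentioned above remains the authoritative reference for set 8.

```python
def parse_sparse_line(line):
    """Parse whitespace-separated "index:value" pairs into a dict.

    Only non-zero entries are listed, so any index missing from the
    returned dict is implicitly zero.
    """
    features = {}
    for token in line.split():
        index_text, value_text = token.split(":", 1)
        features[int(index_text)] = float(value_text)
    return features


def expand_categorical(row, bits_per_attribute=21):
    """One-hot encode a row of categorical codes.

    Assumption (hypothetical): every code is an integer in
    1..bits_per_attribute. Returns the 1-based positions of the bits
    that are set in the expanded binary vector, which is a compact
    sparse representation of that vector.
    """
    active = []
    for attr_pos, code in enumerate(row):
        if not 1 <= code <= bits_per_attribute:
            raise ValueError(f"code {code} outside 1..{bits_per_attribute}")
        # Each attribute owns its own block of bits_per_attribute positions.
        active.append(attr_pos * bits_per_attribute + code)
    return active
```

Under these assumptions, `parse_sparse_line("3:0.5 17:1")` yields `{3: 0.5, 17: 1.0}`, and `expand_categorical([1, 21, 5])` yields `[1, 42, 47]`: one active bit per attribute, in a vector of 3 × 21 = 63 bits.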

X = matrix of input data; each row corresponds to one example

y = the labels (either {0,1} or {-1,+1} for binary problems)

idxLabs = each row contains the indices of the labeled points for a given split

idxUnls = the same for the unlabeled points

(all indices are 1-based as in matlab, not 0-based as in C)
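Because the indices are 1-based, they need shifting by one when used from a 0-based language. A minimal Python sketch of selecting the labeled and unlabeled examples for one split follows. It assumes the four variables have already been loaded into plain Python lists; the helper name `select_split` and the toy values are hypothetical.

```python
def select_split(X, y, idx_labs, idx_unls, split):
    """Return labeled rows, their labels, and unlabeled rows for one split.

    idx_labs / idx_unls -- one row of 1-based indices per split
    split               -- 0-based split number (a Python-side convention)
    """
    # Shift the 1-based indices to Python's 0-based indexing.
    labeled = [i - 1 for i in idx_labs[split]]
    unlabeled = [i - 1 for i in idx_unls[split]]
    X_lab = [X[i] for i in labeled]
    y_lab = [y[i] for i in labeled]
    X_unl = [X[i] for i in unlabeled]
    return X_lab, y_lab, X_unl


# Toy data (hypothetical): 4 examples, 2 features, 2 splits.
X = [[0.0, 1.0], [1.0, 0.0], [0.5, 0.5], [0.9, 0.1]]
y = [0, 1, 0, 1]
idxLabs = [[1, 2], [3, 4]]
idxUnls = [[3, 4], [1, 2]]

X_lab, y_lab, X_unl = select_split(X, y, idxLabs, idxUnls, split=0)
```

For the first split this picks examples 1 and 2 as labeled and examples 3 and 4 as unlabeled. Labels of the unlabeled points are deliberately not returned, since a semi-supervised learner should not see them during training.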

2020.3.3
