pytorch数据抽样 随机取样

WeightedRandomSampler

不放回只能用WeightedRandomSampler,其他都是放回取样

只需将权重都设为1(or相同值)

from torch.utils.data import WeightedRandomSampler,DataLoader

a = torch.Tensor([1, 2, 3, 4, 5])
dataloader = DataLoader(dataset=a,
                    batch_size=4,  # 组织成batch大小
                    sampler=WeightedRandomSampler(torch.ones(a.size()), 20))
for i in dataloader:
     print(i)

结果

tensor([1., 5., 1., 2.])
tensor([5., 1., 5., 2.])
tensor([1., 4., 2., 2.])
tensor([4., 5., 4., 3.])
tensor([5., 2., 2., 1.])

参数

Arguments:
        weights (sequence)   : a sequence of weights, not necessary summing up to one
        num_samples (int): number of samples to draw
        replacement (bool): if ``True``, samples are drawn with replacement.
            If not, they are drawn without replacement, which means that when a
            sample index is drawn for a row, it cannot be drawn again for that row.

RandomSampler编组输出

#emb_pos[5,1600]
for num in [1, 2, 3, 4]:
    print('num', num)
    emb_pos_ = emb_pos.repeat(num,1)
    loader_pos = DataLoader(dataset=emb_pos_,
                            batch_size=num,  # 组织成batch大小
                            sampler=RandomSampler(emb_pos_))
    for i in loader_pos:
        print(i.size())

结果

num 1
torch.Size([1, 1600])
torch.Size([1, 1600])
torch.Size([1, 1600])
torch.Size([1, 1600])
torch.Size([1, 1600])
num 2
torch.Size([2, 1600])
torch.Size([2, 1600])
torch.Size([2, 1600])
torch.Size([2, 1600])
torch.Size([2, 1600])
num 3
torch.Size([3, 1600])
torch.Size([3, 1600])
torch.Size([3, 1600])
torch.Size([3, 1600])
torch.Size([3, 1600])
num 4
torch.Size([4, 1600])
torch.Size([4, 1600])
torch.Size([4, 1600])
torch.Size([4, 1600])
torch.Size([4, 1600])
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值