自监督预训练(三)wav2vec 2.0原理剖析
一、整体流程二、feature encoder理解Conv1dnn.Conv1d(in_channels=5, out_channels=20, kernel_size=3, stride=2)假设输入input=(batch, in_channels, in_len)batch=1in_channels=5,对应向量大小,比如word embeddingin_len=10,对应word的个数cnn内部kernel=(in_channels, kernel_size)=(5,3),相当





