零样本、单样本和小样本学习

最新推荐文章于 2025-03-24 09:32:19 发布

Chen_Chance

最新推荐文章于 2025-03-24 09:32:19 发布

阅读量1.8k

点赞数 4

文章标签：学习

本文链接：https://blog.csdn.net/qq_44154915/article/details/140919329

版权

零样本、单样本和小样本学习

在机器学习和人工智能领域，零样本（Zero-Shot）、单样本（One-Shot）和小样本（Few-Shot）学习是三种重要的学习范式，旨在解决数据稀缺问题。下面我们将详细介绍这些概念，并通过实际案例帮助理解。

一、零样本学习（Zero-Shot Learning）

定义：零样本学习是指在没有见过特定类别的样本情况下，模型能够正确分类或识别该类别。这通常依赖于对类之间关系的推理和知识的泛化。

应用场景：图像分类、自然语言处理、推荐系统等。

实际案例：假设我们有一个图像分类模型，它已经学习了猫、狗和鸟的图片。现在我们希望它能够识别一种新的动物，例如斑马，而不提供任何斑马的图片。

实现方法：

预训练模型：首先在大量的已知类别上进行训练，学习特征和语义信息。
语义嵌入：利用描述性文本（如斑马的特征描述）生成语义嵌入。
匹配推理：在推理阶段，将输入图像的特征与斑马的语义嵌入进行匹配，从而实现分类。

from transformers import CLIPProcessor, CLIPModel

# 加载预训练的CLIP模型
model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

# 输入图像和文本描述
image = processor(images=image, return_tensors="pt")
text = processor(text=["A photo of a zebra"], return_tensors="pt")

# 获取图像和文本的嵌入
image_features = model.get_image_features(**image)
text_features = model.get_text_features(**text)

# 计算相似度
logits_per_image = image_features @ text_features.T
probs = logits_per_image.softmax(dim=-1)
print("Probability of zebra: ", probs[0].item())

二、单样本学习（One-Shot Learning）

定义：单样本学习是指模型在仅有一个训练样本的情况下，能够正确分类或识别该类别。它通过借鉴已知类别的知识来泛化到新类别。

应用场景：人脸识别、字符识别、物体检测等。

实际案例：假设我们有一个人脸识别系统，只给定一个新用户的单张照片，系统需要在未来的识别任务中正确识别该用户。

实现方法：

度量学习：通过训练一个度量学习模型（如Siamese Network），学习样本之间的相似度度量。
匹配推理：在推理阶段，通过计算新用户照片与已知用户照片之间的相似度，进行分类。

from tensorflow.keras.models import Model
from tensorflow.keras.layers import Input, Conv2D, Lambda, Dense, Flatten

# 构建Siamese Network
def build_siamese_model(input_shape):
    input = Input(shape=input_shape)
    x = Conv2D(64, (10, 10), activation='relu')(input)
    x = Conv2D(128, (7, 7), activation='relu')(x)
    x = Flatten()(x)
    x = Dense(128, activation='relu')(x)
    x = Lambda(lambda x: x / tf.norm(x, axis=1, keepdims=True))(x)
    return Model(input, x)

# 训练Siamese Network
# 略

# 新用户照片和已知用户照片
new_user_photo = preprocess_image("new_user.jpg")
known_user_photo = preprocess_image("known_user.jpg")

# 获取嵌入
embedding_new = siamese_model.predict(new_user_photo)
embedding_known = siamese_model.predict(known_user_photo)

# 计算相似度
similarity = np.dot(embedding_new, embedding_known.T)
print("Similarity score: ", similarity)

三、小样本学习（Few-Shot Learning）

定义：小样本学习是指模型在只有少量训练样本（通常为几到几十个）的情况下，能够正确分类或识别新类别。

应用场景：语音识别、手写文字识别、医学图像分析等。

实际案例：假设我们有一个医学图像分析系统，只有少量的新疾病标注数据，系统需要在未来的分析任务中正确识别该疾病。

实现方法：

元学习（Meta-Learning）：通过训练一个元学习模型，学习如何快速适应新任务。
任务适应：在新任务中，通过少量样本进行快速微调，提升模型的性能。

from transformers import BertForSequenceClassification, BertTokenizer
import torch

# 加载预训练的BERT模型
model = BertForSequenceClassification.from_pretrained('bert-base-uncased', num_labels=2)
tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')

# 预处理少量样本
texts = ["This is a positive example.", "This is a negative example."]
labels = [1, 0]
inputs = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
labels = torch.tensor(labels)

# 任务适应
model.train()
outputs = model(**inputs, labels=labels)
loss = outputs.loss
loss.backward()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-5)
optimizer.step()

# 新任务推理
new_text = "This is a new example."
inputs = tokenizer(new_text, return_tensors="pt")
model.eval()
outputs = model(**inputs)
prediction = torch.argmax(outputs.logits, dim=-1)
print("Prediction: ", prediction.item())