Conditional Random Field (CRF) Model

Latest recommended article published on 2024-06-07 09:59:26

yyyyyy66

Article tags: deep learning, machine learning, artificial intelligence

Article link: https://blog.csdn.net/yyyyyy66/article/details/129517152

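Experiment 2 is summarized as follows:

It introduces the principles and applications of the conditional random field (CRF) model. A CRF is a discriminative probabilistic model commonly used to label or analyze sequence data, such as natural-language text or biological sequences.
It gives the mathematical definition and derivation of the CRF, including the conditional probability distribution, feature functions, weight vector, normalization factor, log-likelihood function, and gradient descent.
It shows how to implement a CRF model in PyTorch and runs experiments on a Chinese named entity recognition (NER) task, comparing the effect of different optimizers and hyperparameters on model performance.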
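
To make the quantities in the summary concrete, here is a minimal sketch of the linear-chain CRF log-likelihood. It is not the code from the experiment. It assumes the score of a tag sequence is the sum of a start score, per-position emission scores, tag-to-tag transition scores, and an end score. The normalization factor log Z(x) is computed with the forward algorithm and checked against brute-force enumeration. The function names `sequence_score` and `log_partition` and the random toy tensors are illustrative assumptions.

```python
import itertools

import torch


def sequence_score(emissions, tags, start, end, trans):
    """Unnormalized score of one tag sequence (hypothetical helper).

    emissions: (seq_len, num_tags) tensor; tags: list of int tag indices.
    trans[i, j] is assumed to be the score of moving from tag i to tag j.
    """
    score = start[tags[0]] + emissions[0, tags[0]]
    for t in range(1, len(tags)):
        score = score + trans[tags[t - 1], tags[t]] + emissions[t, tags[t]]
    return score + end[tags[-1]]


def log_partition(emissions, start, end, trans):
    """log Z(x): log-sum-exp of the scores of all tag sequences (forward algorithm)."""
    alpha = start + emissions[0]  # shape (num_tags,)
    for t in range(1, emissions.size(0)):
        # Entry [i, j] of the sum is alpha[i] + trans[i, j];
        # reduce over the previous tag i, then add the emission for tag j.
        alpha = torch.logsumexp(alpha.unsqueeze(1) + trans, dim=0) + emissions[t]
    return torch.logsumexp(alpha + end, dim=0)


# Toy problem with random scores (assumed values, for illustration only).
torch.manual_seed(0)
seq_len, num_tags = 4, 3
emissions = torch.randn(seq_len, num_tags)
start = torch.randn(num_tags)
end = torch.randn(num_tags)
trans = torch.randn(num_tags, num_tags)
tags = [0, 2, 1, 1]

log_z = log_partition(emissions, start, end, trans)
log_likelihood = sequence_score(emissions, tags, start, end, trans) - log_z

# Brute-force check: enumerate every possible tag sequence.
all_scores = torch.stack([
    sequence_score(emissions, list(path), start, end, trans)
    for path in itertools.product(range(num_tags), repeat=seq_len)
])
brute_log_z = torch.logsumexp(all_scores, dim=0)
```

On a problem this small the two values of log Z agree up to floating-point error. The brute-force sum grows as `num_tags ** seq_len`, which is why the forward recursion is the usual choice in practice.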

import torch
import torch.nn as nn
from typing import List, Optional

"""
Conditional random field implementation class
"""
class CRF(nn.Module):
    """Conditional random field.
    This module implements a conditional random field [LMP01]_. The forward computation
    of this class computes the log likelihood of the given sequence of tags and
    emission score tensor. This class also has `~CRF.decode` method which finds
    the best tag sequence given an emission score tensor using `Viterbi algorithm`_.
    Args:
        num_tags: Number of tags.
        batch_first: Whether the first dimension corresponds to the size of a minibatch.
    Attributes:
        start_transitions (`~torch.nn.Parameter`): Start transition score tensor of size
            ``(num_tags,)``.
        end_transitions (`~torch.nn.Parameter`): End transition score tensor of size
            ``(num_tags,)``.
        transitions (`~torch.nn.Parameter`): Transition score tensor of size
            ``(num_tags, num_tags)``.
    .. [LMP01] Lafferty, J., McCallum, A., Pereira, F. (2001).
       "Conditional random fields: Probabilistic models for segmenting and
       labeling sequence data". *Proc. 18th International Conf. on Machine
       Learning*. Morgan Kaufmann. pp. 282–289.
    .. _Viterbi algorithm: https://en.wikipedia.org/wiki/Viterbi_algorithm
    """

    def __init__(self, num_tags: int, batch_first: bool = False) -> None:
        if num_tags <= 0:
            raise ValueError(f'invalid number of tags: {num_tags}')
        super().__init__()
        self.num_tags = num_tags
        self.batch_first = batch_first
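        # nn.Parameter is a subclass of torch.Tensor; wrapping a tensor in it marks the tensor as a trainable parameter of the nn.Module.
        # Parameters assigned as module attributes are registered in the module's parameter list, so an optimizer built from module.parameters() updates them during training.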
        self.start_transitions = nn.Parameter(torch.empty(num_tags))
        self.end_transitions = nn.Parameter(torch.empty(num_tags))
        self.transitions = nn.Parameter(torch.empty(num_tags, num_tags))

        self.reset_parameters()

    def reset_parameters(self) -> None:
        """Initialize the transition parameters.
        The parameters will be initialized randomly from a uniform distribution
        between -0.1 and 0.1.
        """
        # Initialize the parameters with the nn.init utilities
        nn.init.uniform_(self.start_transitions, -0.1, 0.1)
        nn.init.uniform_(self.end_transitions, -0.1, 0.1)
        nn.init.uniform_(self.transitions, -0.1, 0.1)

    def __repr__(self) -> str:
        return f'{self.__class__.__name__}(num_tags={self.num_tags})'

    def forward(self, emissions: torch.Tensor,
                tags: torch.LongTensor,
                mask: Optional[torch.ByteTensor] = None,
                reduction: str = 'mean') -> torch.Tensor:
        """Compute the conditional log likelihood of a sequence of tags given emission scores.
        Args:
            emissions (`~torch.Tensor`): Emission score tensor of size
                ``(seq_length, batch_size, num_tags)`` if ``batch_first`` is ``False``,
                ``(batch_size, seq_length, num_tags)`` otherwise.
            tags (`~torch.LongTensor`): Sequence of tags tensor of size
                ``(seq_length, batch_size)`` if ``batch_first`` is ``False``,
                ``(batch_size, seq_length)`` otherwise.
            mask (`~torch.ByteTensor`): Mask tensor of size ``(seq_length, batch_size)``
                if ``batch_first`` is ``False``, ``(batch_size, seq_length)`` otherwise.
            reduction: Specifies the reduction to apply to the output:
                ``none|sum|mean|token_mean``. ``none``: no reduction will be applied.
                ``sum
