Building an Article Classifier from Scratch with Logistic Regression (AG News)

This post shows how to build a text classifier from scratch using a log-linear model (logistic regression), walking through preprocessing, TF-IDF feature extraction, model training, and evaluation metrics (accuracy, precision, recall, and F1 score).

A Text Classifier Based on a Log-Linear Model

A simple model from scratch.

code repository

Dataset

AG News, a news dataset with four classes (World, Sports, Business, Sci/Tech); each sample has a title and a description.

Implementation

Preprocessing

  • Sample data to reduce dataset size
  • Merge the Title and Content
  • With the help of nltk (see the sketch after this list):
    • Remove punctuation and numbers
    • Remove URLs
    • Split the Content into words
    • Filter stopwords
    • Stem and lemmatize
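
A minimal preprocessing sketch using nltk is below; the `preprocess` helper and the lemmatize-then-stem order are illustrative assumptions, not fixed by the steps above.

```python
import re
import string

import nltk
from nltk.corpus import stopwords
from nltk.stem import PorterStemmer, WordNetLemmatizer

nltk.download("punkt", quiet=True)
nltk.download("stopwords", quiet=True)
nltk.download("wordnet", quiet=True)

STOPWORDS = set(stopwords.words("english"))
stemmer = PorterStemmer()
lemmatizer = WordNetLemmatizer()

def preprocess(title: str, content: str) -> list:
    # Merge the Title and Content into a single text field
    text = f"{title} {content}".lower()
    # Remove URLs, then punctuation and numbers
    text = re.sub(r"https?://\S+|www\.\S+", " ", text)
    text = re.sub(rf"[{re.escape(string.punctuation)}\d]+", " ", text)
    # Split into words and filter stopwords
    tokens = [t for t in nltk.word_tokenize(text) if t not in STOPWORDS]
    # Lemmatize, then stem
    return [stemmer.stem(lemmatizer.lemmatize(t)) for t in tokens]
```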

Feature Extraction

Use TF-IDF as the Feature

  • Keep only the most frequent words for a reasonable feature size
  • Calculate TF-IDF
    $$\mathrm{TF}(t,d) = \frac{\text{count}(t, d)}{\sum_k \text{count}(k, d)}$$
    where $\text{count}(t, d)$ is the count of term $t$ in document $d$.
    $$\mathrm{IDF}(t, D) = \log \frac{N + 1}{\text{num}(t, D) + 1} + 1$$
    where $\text{num}(t, D)$ is the number of documents in $D$ that contain term $t$, $D$ is the set of all documents, and $N = |D|$.
    $$\text{TF-IDF} = \mathrm{TF} \times \mathrm{IDF}$$
    Note that L2 normalisation is applied to the final TF-IDF vectors for better performance (see the sketch after this list).
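
Under the formulas above, TF-IDF for a document-term count matrix can be sketched in NumPy as follows (the `tf_idf` name and array layout are my own assumptions):

```python
import numpy as np

def tf_idf(counts: np.ndarray) -> np.ndarray:
    """counts: (N documents, V kept vocabulary terms) raw term counts."""
    N = counts.shape[0]
    # TF: term count normalised by document length
    doc_len = np.maximum(counts.sum(axis=1, keepdims=True), 1)
    tf = counts / doc_len
    # IDF with add-one smoothing, matching the formula above
    num_t = (counts > 0).sum(axis=0)         # num(t, D)
    idf = np.log((N + 1) / (num_t + 1)) + 1
    # Final TF-IDF with L2 normalisation per document
    tfidf = tf * idf
    norms = np.maximum(np.linalg.norm(tfidf, axis=1, keepdims=True), 1e-12)
    return tfidf / norms
```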

Log-Linear Model

  • Logistic Regression Model
    $$\hat{y} = \text{softmax}\big( X_{N,F}\, W_{F,C} + b_{C} \big)$$
    where $N$ is the size of the training data, $F$ is the number of features, and $C$ is the number of classes. (The model, loss, and gradients are sketched together in code after this list.)

  • Cross Entropy Loss
    $$\text{loss} = -\frac{1}{N} \sum_{i=1}^N \sum_{j=1}^{C} y_{i,j} \log \hat{y}_{i,j}$$
    where $y_{i,j} = 1$ if training text $i$ belongs to class $j$, and $0$ otherwise.

  • Gradients
    $$dW = \frac{1}{N} X^T \big(\hat{y} - y\big), \qquad db = \frac{1}{N} \sum_{i=1}^{N} \big(\hat{y}_i - y_i\big)$$
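
A NumPy sketch of the three pieces above; the helper names (`softmax`, `forward`, `cross_entropy`, `gradients`) are illustrative.

```python
import numpy as np

def softmax(z: np.ndarray) -> np.ndarray:
    # Subtract the row-wise max for numerical stability
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def forward(X: np.ndarray, W: np.ndarray, b: np.ndarray) -> np.ndarray:
    # X: (N, F), W: (F, C), b: (C,) -> y_hat: (N, C)
    return softmax(X @ W + b)

def cross_entropy(y_hat: np.ndarray, y: np.ndarray) -> float:
    # y is one-hot, shape (N, C); clip to avoid log(0)
    return float(-np.mean(np.sum(y * np.log(np.clip(y_hat, 1e-12, 1.0)), axis=1)))

def gradients(X: np.ndarray, y_hat: np.ndarray, y: np.ndarray):
    # dW: (F, C), db: (C,), matching the formulas above
    N = X.shape[0]
    dW = X.T @ (y_hat - y) / N
    db = (y_hat - y).sum(axis=0) / N
    return dW, db
```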

Update Algorithm

Gradient descent with a shrinking learning rate.
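
A sketch of the update loop, reusing the `forward` and `gradients` helpers above; the exact decay schedule is an assumption, since only a "shrinking" learning rate is specified.

```python
import numpy as np

def train(X, y_onehot, num_classes, epochs=200, lr0=0.5, decay=0.01):
    N, F = X.shape
    W = np.zeros((F, num_classes))
    b = np.zeros(num_classes)
    for epoch in range(epochs):
        # Shrinking learning rate; the 1 / (1 + decay * epoch) schedule is assumed
        lr = lr0 / (1 + decay * epoch)
        y_hat = forward(X, W, b)
        dW, db = gradients(X, y_hat, y_onehot)
        W -= lr * dW
        b -= lr * db
    return W, b
```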

Evaluation

  • Accuracy
    $$\text{Accuracy} = \frac{TP+TN}{TP+TN+FP+FN}$$
  • F1 Score (macro)
    $$\text{Precision} = \frac{TP}{TP+FP}, \qquad \text{Recall} = \frac{TP}{TP+FN}, \qquad \text{F1} = \frac{2PR}{P+R}$$
    $$\text{Macro F1} = \frac{1}{C} \sum_{i=1}^{C} \text{F1}_i$$
    where $P$ and $R$ are the per-class precision and recall, and the macro F1 averages the per-class F1 scores. (A sketch of both metrics follows.)
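
Both metrics can be computed from integer label vectors; a minimal sketch (the `evaluate` name is mine).

```python
import numpy as np

def evaluate(y_true: np.ndarray, y_pred: np.ndarray, num_classes: int):
    # Multiclass accuracy: fraction of correct predictions
    accuracy = float(np.mean(y_true == y_pred))
    f1_scores = []
    for c in range(num_classes):  # one-vs-rest per class
        tp = np.sum((y_pred == c) & (y_true == c))
        fp = np.sum((y_pred == c) & (y_true != c))
        fn = np.sum((y_pred != c) & (y_true == c))
        precision = tp / (tp + fp) if tp + fp > 0 else 0.0
        recall = tp / (tp + fn) if tp + fn > 0 else 0.0
        f1 = 2 * precision * recall / (precision + recall) if precision + recall > 0 else 0.0
        f1_scores.append(f1)
    # Macro F1: unweighted mean of the per-class F1 scores
    return accuracy, float(np.mean(f1_scores))
```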