TP5 敏感词

通过composer安装

composer require lustre/php-dfa-sensitive

然后再app/common/server 下建SensitiveWords.php

<?php

namespace app\common\server;
use DfaFilter\SensitiveHelper;

class SensitiveWords
{
    protected static $handle = null;
    private function __construct()
    {
    }
    private function __clone()
    {

    }
    /**
     * 获取实例
     */
    public static function getInstance($word_path = [])
    {
        if (!self::$handle) {
            //默认的一些敏感词库
            $default_path = [
                EXTEND_PATH.('dict/word.txt'),
                EXTEND_PATH.('dict/ms.txt'),
                EXTEND_PATH.('dict/qt.txt'),
                EXTEND_PATH.('dict/sq.txt'),
                EXTEND_PATH.('dict/tf.txt'),
            ];
            $paths = array_merge($default_path, $word_path);
            self::$handle = SensitiveHelper::init();
            if (!empty($paths)) {
                foreach ($paths as $path) {
                    self::$handle->setTreeByFile($path);
                }
            }
        }
        return self::$handle;
    }
    /**
     * 检测是否含有敏感词
     */
    public static function isLegal($content)
    {
        return self::getInstance()->islegal($content);
    }
    /**
     * 敏感词过滤
     */
    public static function replace($content, $replace_char = '', $repeat = false, $match_type = 1)
    {
        return self::getInstance()->replace($content, $replace_char, $repeat, $match_type);
    }
    /**
     * 标记敏感词
     */
    public static function mark($content, $start_tag, $end_tag, $match_type = 1)
    {
        return self::getInstance()->mark($content, $start_tag, $end_tag, $match_type);
    }
    /**
     * 获取文本中的敏感词
     */
    public static function getBadWord($content, $match_type = 1, $word_num = 0)
    {
        return self::getInstance()->getBadWord($content, $match_type, $word_num);
    }
}

调用

SensitiveWords::getBadWord($content);
if (!empty($bad_word)){
	return json_msg('包含敏感词',201,current($bad_word));
   }

敏感词文件
敏感词

敏感词过滤是指在文本中过滤掉一些敏感词汇,以避免给人带来不必要的困扰或者麻烦。在Java中实现敏感词过滤可以使用正则表达式或者Trie树等算法。这里提供一个简单的Trie树实现敏感词过滤的代码示例: ```java import java.util.HashSet; import java.util.Set; public class SensitiveWordsFilter { private TrieNode root; public SensitiveWordsFilter() { root = new TrieNode(); } public void addWord(String word) { TrieNode node = root; for (char c : word.toCharArray()) { if (!node.containsKey(c)) { node.put(c, new TrieNode()); } node = node.get(c); } node.setEnd(); } public Set<String> filter(String text) { Set<String> set = new HashSet<>(); for (int i = 0; i < text.length(); i++) { TrieNode node = root; for (int j = i; j < text.length(); j++) { char c = text.charAt(j); if (!node.containsKey(c)) { break; } node = node.get(c); if (node.isEnd()) { set.add(text.substring(i, j + 1)); } } } return set; } private static class TrieNode { private TrieNode[] children; private boolean isEnd; public TrieNode() { children = new TrieNode[256]; isEnd = false; } public boolean containsKey(char c) { return children[c] != null; } public void put(char c, TrieNode node) { children[c] = node; } public TrieNode get(char c) { return children[c]; } public void setEnd() { isEnd = true; } public boolean isEnd() { return isEnd; } } } ``` 使用示例: ```java SensitiveWordsFilter filter = new SensitiveWordsFilter(); filter.addWord("敏感词1"); filter.addWord("敏感词2"); Set<String> set = filter.filter("这是一段包含敏感词1和敏感词2的文本。"); System.out.println(set); // 输出 [敏感词1, 敏感词2] ```
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

时间轴-小文同学

你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值