福将～白鹿-CSDN博客

原创 BGE-m3 和 BCE-Embedding 模型对比分析

BGE-m3 和 BCE-Embedding 模型对比分析

2025-04-25 17:31:23 488

原创 openai.RateLimitError: Error code: 429 - {‘error‘: {‘message‘: ‘Your account co2faualnl9bb8bf99d0＜ak

根本问题，我TM的是KIMI的非续费用户，访问速率被严重限制了，日了。考虑到Kimi在内容理解上和百度文心一言的差距，果断续费了文心一言。

2024-04-25 10:56:14 3225 2

LoRA 模型是一种Stable Diffusion模型的小型模型，通过对标准检查点模型进行微小更改来实现。它们的大小通常比检查点模型小 10 到 100 倍，这使得它们对于拥有大量模型的人非常有吸引力。LoRA（Low-Rank Adaptation）是一种用于微调Stable Diffusion模型的训练技术。但我们已经有了其他的训练技术，例如 Dreambooth 和文本反转。那么 LoRA 有何特别之处呢？LoRA 在文件大小和训练能力之间取得了良好的平衡。

2024-04-24 12:53:27 95608 2

原创 RuntimeError: FlashAttention only supports Ampere GPUs or newer.

详细描述请查看：https://github.com/Dao-AILab/flash-attention。是否有解决方案，暂无，除非能搞到A100或者H100以及更高版本的机器；GPU机器配置低，不支持特斯拉-V100；哎，无奈手里机器不支持玩Llama 3；

2024-04-23 19:56:15 4683 1

原创 Llama网络结构介绍

LLaMA现在已经是开源社区里炙手可热的模型了，但是原文中仅仅介绍了其和标准Transformer的差别，并没有一个全局的模型介绍。因此打算写篇文章，争取让读者不参考任何其他资料把LLaMA的模型搞懂。

2024-04-23 17:30:06 9735 2

原创 2024年调研学习文档资料汇总

2、图文分类：https://huggingface.co/docs/transformers/model_doc/chinese_clip。3、多卡训练：https://blog.csdn.net/qq_51392112/article/details/129737803。18、腾讯预训练平台：https://github.com/Tencent/TencentPretrain/tree/main。1、chatGLM实践：https://zhuanlan.zhihu.com/p/622686205?

2024-04-03 11:34:10 544

原创 BUG:docker启动之后直接退出问题

妈的，竟然出现这错误，浪费我5分钟，记个笔记，加深印象。定位：未添加-it 交互模式启动镜像；

2024-02-04 20:38:37 986

原创 excel 文件分割

文件分割

2023-11-08 11:29:53 232

原创 OpenBLAS blas_thread_init: pthread_create failed for thread 1 of 40: Operation not permitted

容器权限不足

2023-10-24 15:26:55 6561 1

原创 Unexpected bus error encountered in worker. This might be caused by insufficient shared memory (shm)

容器共享内存资源不足

2023-10-24 15:20:32 540

原创 vim 常用快捷键

vim 常用快捷键

2023-05-19 17:00:40 803

原创 RuntimeError: The size of tensor a (631) must match the size of tensor b (512) at non-singleton dime

过滤下训练语料，将长度过长的数据直接丢弃；

2023-04-27 15:23:38 3904

原创 packaging.version.InvalidVersion: Invalid version: ‘0.10.1,＜0.11‘

packaging.version.InvalidVersion: Invalid version: '0.10.1,

2023-04-26 14:30:13 16137 6

原创 AttributeError: module ‘tensorflow._api.v2.train‘ has no attribute ‘Optimizer‘

我直接将TensorFlow从2.8降到了1.14。2、不降版本，直接改api–这个我没兴趣，真懒；版本迭代，相关方法被移除；

2023-04-25 19:53:25 1827

原创 AssertionError: The NVIDIA driver on your system is too old (found version 10010)

AssertionError 解决

2023-04-03 17:57:34 270

原创高效解决：remote: The project you were looking for could not be found.

remote: The project you were looking for could not be found.fatal: repository 'https://gitlab.vmic.xyz/72163948/game_category_rpc_server_dev.git/' not found

2023-03-09 19:51:05 366

原创 Error executing Jupyter command ‘notebook‘: [Errno 2] No such file or directory

Jupyter bug Error2

2023-02-17 17:21:57 919

原创 Command “python setup.py egg_info“ failed with error code 1 in /tmp/pip-build-30xnni_y/gensim/

python setup.py egg_info

2023-02-15 15:10:16 461

原创 anaconda3文件夹被移动之后，如何操作可以复用原有conda环境

解决anaconda文件被移动带来的bug

2023-02-15 11:56:16 1144

原创 SimBERT剖析

基于UniLM思想、融检索与生成于一体的BERT模型

2023-01-28 16:45:33 3158

原创 ERROR: Cannot uninstall ‘certifi‘. It is a distutils installed project and thus we cannot accurately

BUG原因：错误：无法卸载“证书”。这是一个 distutils 安装的项目，因此我们无法准确确定哪些文件属于它，这只会导致部分卸载。

2023-01-10 16:42:50 5181

原创解决BUG：error: metadata-generation-failed

在执行批量安装命令pip install -r requirements.txt -i https://pypi.tuna.tsinghua.edu.cn/simple之前，先执行pip install setuptools==57.5.0 -i https://pypi.tuna.tsinghua.edu.cn/simple 命令。

2023-01-10 16:17:43 58799 18

原创拉取分支代码到本地

git分支代码拉取

2022-12-07 11:24:58 1049

原创 torch与torchvision版本适配情况

版本适配情况

2022-12-05 16:21:50 868

原创 Bert中文词粒度级别[MASK]预训练总结

Bert中文全词mask预训练

2022-07-13 08:32:22 2657

原创 linux常见命令汇总（非常系统、非常全面）

Linux操作系统命令整理1. 今日梳理1.1 Linux操作系统1.2 Linux常见命令2. 知识点汇总描述2.1 快捷键：快速打开终端：Ctrl + Alt + t ；快速放大字体：Ctrl + shift + （+号）;快速缩小终端字体：Ctrl + （-号）；2.2 查看Linux版本：cat /proc/version2.3 目录操作相关命令：2.3.1 pwd：查看当前所停留的路径；2.3.2 ls 指定路径：查看制定目录下文件及文件夹信息；备注如果为加指定路径则默认为

2022-04-06 16:17:38 1507

空空如也

空空如也