OverLeaf教程(结合ESWA模版)
2025-11-07 10:11:43
1357
Expert Systems with Applications (ESWA)期刊模版说明
2025-11-04 22:42:27
1639
上下采样(步长/维度变化)
2025-10-18 10:06:10
810
Mixture-of-Experts MOE模块实现(代码)
2025-10-05 10:31:24
1088
SB3-Contrib(RecurrentPPO )
2025-10-03 14:30:39
848
Ensemble_StockTrading代码&论文
2025-09-28 17:17:12
858
多脚本大批量训练
2025-09-10 23:34:07
461
快速构建数据集-假数据(生成&划分)
2025-09-08 20:05:00
1164
PyTorch Lightning(训练评估框架)
2025-09-07 22:50:36
1220
现成的AI模型:训练+评估框架汇总
2025-09-07 22:43:16
920
训练+评估流程
2025-09-07 22:36:20
1079
HuggingFace Trainer(训练测试&自定义模型数据)
2025-09-06 14:46:22
1361
HuggingFace Trainer(回调&可视化)
2025-09-06 14:37:29
1329
SB3 PPO(回调&可视化)Stable Baselines3(SB3)
2025-09-05 15:54:42
1085
pytorch可视化工具(训练评估:Tensorboard、swanlab)
2025-09-03 21:35:01
1341
PPO代码实现说明(pytorch)
2025-08-24 23:17:52
877
Informer参数代码
2025-08-22 17:07:56
1254
Informer论文笔记
2025-08-22 17:05:59
1315
自定义数据集(pytorch&huggingface)
2025-08-15 22:37:04
969
4
CryptoMamba论文笔记
2025-08-12 20:52:57
986
Mamba 原理汇总2
2025-08-11 16:31:30
3921
Linux用户
2025-07-26 15:16:30
1144
Matplotlib和Plotly知识点(Dash+Plotly分页展示)
2025-07-19 15:49:11
1464
四六级英语作文模版
2025-06-14 12:28:36
1953
Pytorch知识点
2025-06-02 17:24:46
1215
Numpy知识点
2025-05-29 20:52:52
1527
GPU层次结构(Nvidia和Apple M芯片,从硬件到pytorch)
2025-05-29 17:23:18
2281
Mac完美终端(iterm2 + oh my zash + tmux+ControlMaster)
2025-05-28 17:08:31
1872
DAPO论文笔记(解耦剪辑与动态采样策略优化,GRPO的改进)
2025-05-19 11:13:30
2175
1
deepseek系列论文汇总(时至2025.5)
2025-05-18 10:44:45
7479
提示词工程框架:CoT、ToT、GoT、PoT( 链式提示)
2025-05-17 20:20:29
2393
DeepSeek-R1论文笔记
2025-05-17 20:15:04
1404
1
思维链Chain-of-Thought(CoT)论文笔记
2025-05-17 20:05:11
2387
1
多令牌预测Multi-Token Prediction(MTP)
2025-05-12 16:58:49
2449
DeepSeek-V3论文笔记
2025-05-12 16:57:49
1683
3
RoPE(旋转位置编码,参考:DeepSeek-V2)
2025-05-11 10:52:54
1615
Transformer KV缓存优化(MHA、MQA、GQA、MLA)
2025-05-11 10:38:34
1233
DeepSeek LLM论文笔记
2025-05-11 10:36:07
1786
1
DeepSeek-V2论文笔记
2025-05-11 09:57:22
1638
1
DeepSeekMath论文笔记(GRPO)
2025-05-10 21:57:23
1636
1