OverLeaf教程(结合ESWA模版)
2025-11-07 10:11:43
1537
Expert Systems with Applications (ESWA)期刊模版说明
2025-11-04 22:42:27
4013
上下采样(步长/维度变化)
2025-10-18 10:06:10
966
Mixture-of-Experts MOE模块实现(代码)
2025-10-05 10:31:24
1385
SB3-Contrib(RecurrentPPO )
2025-10-03 14:30:39
1073
Ensemble_StockTrading代码&论文
2025-09-28 17:17:12
925
多脚本大批量训练
2025-09-10 23:34:07
506
快速构建数据集-假数据(生成&划分)
2025-09-08 20:05:00
1272
PyTorch Lightning(训练评估框架)
2025-09-07 22:50:36
1308
现成的AI模型:训练+评估框架汇总
2025-09-07 22:43:16
1048
训练+评估流程
2025-09-07 22:36:20
1134
HuggingFace Trainer(训练测试&自定义模型数据)
2025-09-06 14:46:22
1491
HuggingFace Trainer(回调&可视化)
2025-09-06 14:37:29
1495
SB3 PPO(回调&可视化)Stable Baselines3(SB3)
2025-09-05 15:54:42
1187
pytorch可视化工具(训练评估:Tensorboard、swanlab)
2025-09-03 21:35:01
1455
PPO代码实现说明(pytorch)
2025-08-24 23:17:52
969
Informer参数代码
2025-08-22 17:07:56
1349
Informer论文笔记
2025-08-22 17:05:59
1450
自定义数据集(pytorch&huggingface)
2025-08-15 22:37:04
1080
4
CryptoMamba论文笔记
2025-08-12 20:52:57
1063
Mamba 原理汇总2
2025-08-11 16:31:30
4739
Linux用户
2025-07-26 15:16:30
1258
Matplotlib和Plotly知识点(Dash+Plotly分页展示)
2025-07-19 15:49:11
1540
四六级英语作文模版
2025-06-14 12:28:36
2522
Pytorch知识点
2025-06-02 17:24:46
1285
Numpy知识点
2025-05-29 20:52:52
1689
GPU层次结构(Nvidia和Apple M芯片,从硬件到pytorch)
2025-05-29 17:23:18
2561
Mac完美终端(iterm2 + oh my zash + tmux+ControlMaster)
2025-05-28 17:08:31
2345
DAPO论文笔记(解耦剪辑与动态采样策略优化,GRPO的改进)
2025-05-19 11:13:30
2417
1
deepseek系列论文汇总(时至2025.5)
2025-05-18 10:44:45
11088
提示词工程框架:CoT、ToT、GoT、PoT( 链式提示)
2025-05-17 20:20:29
3262
DeepSeek-R1论文笔记
2025-05-17 20:15:04
1631
1
思维链Chain-of-Thought(CoT)论文笔记
2025-05-17 20:05:11
2895
1
多令牌预测Multi-Token Prediction(MTP)
2025-05-12 16:58:49
3287
DeepSeek-V3论文笔记
2025-05-12 16:57:49
2129
3
RoPE(旋转位置编码,参考:DeepSeek-V2)
2025-05-11 10:52:54
1810
Transformer KV缓存优化(MHA、MQA、GQA、MLA)
2025-05-11 10:38:34
1441
DeepSeek LLM论文笔记
2025-05-11 10:36:07
1994
1
DeepSeek-V2论文笔记
2025-05-11 09:57:22
1832
1
DeepSeekMath论文笔记(GRPO)
2025-05-10 21:57:23
1850
1