算法
文章平均质量分 60
xuke_2022
这个作者很懒,什么都没留下…
展开
-
NVIDIA Tensor Core / DLA 资料汇总
Tensor Core / DLA 资料原创 2022-07-22 23:11:27 · 893 阅读 · 0 评论 -
[CUDA整理] CUDA优化小结
CUDA知识整理原创 2022-07-17 00:20:19 · 542 阅读 · 1 评论 -
[TensorRT] How to write code to using PluginV2
https://docs.nvidia.com/deeplearning/sdk/tensorrt-developer-guide/index.html#ipluginextIn the doc, it is written below:1. Create the Plugin and AddPlugin int the network.For example, to add a plugin layer to your network with plugin name set to plugi原创 2020-05-11 19:54:29 · 264 阅读 · 0 评论 -
A new way of Tree-All-Reduce (Mixed-Tree-All-Reduce)
1 Ring All Reduce详见:https://blog.csdn.net/gaofeipaopaotang/article/details/94028949 https://blog.csdn.net/dpppBR/article/details/80445569中的Ring-All-Reduce介绍2 Tree All Reduce详见...原创 2020-05-20 17:31:16 · 365 阅读 · 0 评论 -
[Summarry] Synchronized BatchNorm
MegDet 与 Synchronized BatchNormhttps://blog.csdn.net/yiran103/article/details/80820300[论文笔记]MegDet: A Large Mini-Batch Object Detectorhttps://blog.ddlee.cn/posts/e9b3289c/caffe:同步Batch Normali...原创 2020-04-08 17:33:45 · 781 阅读 · 0 评论 -
归一化/标准化 And 激活函数
归一化/标准化Batch Normalization详细解读https://blog.csdn.net/guo1988kui/article/details/83794343机器学习哪些算法需要归一化https://blog.csdn.net/qq_34872215/article/details/88363504机器学习数据归一化的的方法有哪些?适合于什么样的数据?...原创 2020-04-07 17:41:23 · 1203 阅读 · 0 评论 -
[TensorRT] Write your plugin which support FP16
Total Steps 3:1. In the plugin, 3 Interface need support FP162. Use cuda_fp16.hto writeKernel Function3. Use FP16 in the Engine===========================================1. In the plugi...原创 2020-04-03 14:28:58 · 636 阅读 · 0 评论