GitHub address
Model Analyzer installation
YOLOv4 performance analysis example
Introduction in a Chinese blog post
A classic explanation of server latency, concurrency, degree of concurrency, and throughput
Python client examples
These are tools for model repository management and performance testing.
1. Performance monitoring and optimization
The Model Analyzer section helps you understand a model's GPU memory utilization, so that you can decide how to run multiple models on a single GPU. A sketch of a typical invocation follows below.
It reports measurements such as: Concurrency: 1, throughput: 62.6 infer/sec, latency 21371 usec
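As a sketch, a Model Analyzer run that profiles a model might look like the following; the repository path and model name are placeholders, and flags vary between Model Analyzer releases, so check model-analyzer profile --help:

```
# Profile a model's GPU memory/compute usage (path and model name are placeholders).
model-analyzer profile \
    --model-repository /path/to/model_repository \
    --profile-models my_model
```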
2. Enabling the dynamic batcher
The dynamic batcher merges concurrent requests into a batch before running inference.
Stop Triton, add dynamic_batching { } to the model's configuration file, then restart Triton.
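As a sketch, the addition to the model's config.pbtxt could look like this; the preferred_batch_size and max_queue_delay_microseconds values below are illustrative, not recommendations:

```
dynamic_batching {
  # Batch sizes the scheduler should prefer to create (example values).
  preferred_batch_size: [ 4, 8 ]
  # How long a request may wait in the queue to be batched (example value).
  max_queue_delay_microseconds: 100
}
```

An empty dynamic_batching { } block also works; Triton then chooses the batching behavior from defaults.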
3. In general the benefit of the dynamic batcher and multiple instances is model specific, so you should experiment with perf_analyzer to determine the settings that best satisfy your throughput and latency requirements.
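For the multiple-instances half of that experiment, the model's config.pbtxt can request several execution instances; the count of 2 below is only an example value to sweep over:

```
instance_group [
  {
    # Run two copies of the model on the GPU (example value).
    count: 2
    kind: KIND_GPU
  }
]
```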
4. perf_analyzer -m inception_graphdef --concurrency-range 1:4 -f perf.csv
This sweeps the request concurrency from 1 to 4 and writes the measurements to perf.csv.
5. model_analyzer in detail
6. It can also generate throughput/latency curves and other plots; see the sketch below.
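As one way to do that, the perf.csv from step 4 can be plotted with pandas and matplotlib. The column names Concurrency and Inferences/Second are assumed from perf_analyzer's CSV header and may differ between versions, so verify against your file:

```python
# Plot throughput vs. concurrency from perf_analyzer's CSV output.
# Assumes perf.csv was produced with: perf_analyzer ... -f perf.csv
import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv("perf.csv")
df = df.sort_values("Concurrency")  # column name assumed; check your header

plt.plot(df["Concurrency"], df["Inferences/Second"], marker="o")
plt.xlabel("Concurrency")
plt.ylabel("Throughput (infer/sec)")
plt.title("perf_analyzer concurrency sweep")
plt.savefig("throughput_vs_concurrency.png")
```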
7. Deploying on Kubernetes (k8s)
8. Performance Analyzer
Model Analyzer uses the Performance Analyzer to measure a model's GPU memory and compute utilization.
By default perf_analyzer sends input tensor data and receives output tensor data over the network. You can instead instruct perf_analyzer to use system shared memory or CUDA shared memory to communicate tensor data. By using these options you can model the performance that you can achieve by using shared memory in your application. Use --shared-memory=system to use system (CPU) shared memory or --shared-memory=cuda to use CUDA shared memory.
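To try the same idea in an actual client, here is a minimal sketch using Triton's Python client with system shared memory, following the pattern of Triton's shared-memory client examples. The model name "simple", the tensor names INPUT0/OUTPUT0, and the shapes are placeholders to replace with your model's values:

```python
# Minimal sketch: pass input/output tensors via system shared memory
# with the Triton Python client. Model and tensor names are placeholders.
import numpy as np
import tritonclient.http as httpclient
import tritonclient.utils.shared_memory as shm
from tritonclient.utils import np_to_triton_dtype

client = httpclient.InferenceServerClient(url="localhost:8000")

input_data = np.ones((1, 16), dtype=np.float32)  # placeholder shape
byte_size = input_data.nbytes

# Create a system shared-memory region for the input, copy the data in,
# and register the region with the server.
shm_in = shm.create_shared_memory_region("input_region", "/input_shm", byte_size)
shm.set_shared_memory_region(shm_in, [input_data])
client.register_system_shared_memory("input_region", "/input_shm", byte_size)

# Create and register a region for the output (same size assumed here).
shm_out = shm.create_shared_memory_region("output_region", "/output_shm", byte_size)
client.register_system_shared_memory("output_region", "/output_shm", byte_size)

# Point the request at the shared-memory regions instead of sending bytes.
infer_input = httpclient.InferInput(
    "INPUT0", list(input_data.shape), np_to_triton_dtype(input_data.dtype))
infer_input.set_shared_memory("input_region", byte_size)
infer_output = httpclient.InferRequestedOutput("OUTPUT0")
infer_output.set_shared_memory("output_region", byte_size)

client.infer("simple", inputs=[infer_input], outputs=[infer_output])

# Read the result directly out of the output region.
out = shm.get_contents_as_numpy(shm_out, np.float32, input_data.shape)
print(out)

# Clean up: unregister with the server, then destroy the regions.
client.unregister_system_shared_memory()
shm.destroy_shared_memory_region(shm_in)
shm.destroy_shared_memory_region(shm_out)
```

perf_analyzer models this same data path when you pass --shared-memory=system or --shared-memory=cuda.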