OpenMMLab简介

skyeKing_trueKing

已于 2023-06-01 13:53:06 修改

阅读量1k

点赞数 1

文章标签：深度学习人工智能

于 2023-06-01 13:43:27 首次发布

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/weixin_38655947/article/details/130985876

版权

Github链接https://github.com/open-mmlab 官网链接https://openmmlab.com

目录

目标检测算法库MMDetection

目标检测算法库MMYOLO

文字检测识别算法库MMOCR

3D目标检测算法库MMDetection3D

旋转目标检测算法库MMRotate

图像分割算法库MMSegmentation

图像分类+预训练+多模态算法库MMPretrain

姿态估计算法库MMPose

三维人体姿态估计算法库MMHuman3D

视频动作识别算法库MMAction

目标追踪MMTracking

生成模型+底层视觉+AIGC算法库MMagic

模型部署工具箱MMDeploy

趣味应用游乐场Playground

算法库详解

目标检测算法库MMDetection

MMDetectionhttp://github.com/open-mmlab/mmdetection

MMDetection 支持了各种不同的检测任务，包括目标检测，实例分割，全景分割，以及半监督目标检测。可以按照自己需求做自己的数据集并使用现有的框架快速训练、推理。

算法框架

Object Detection	Instance Segmentation	Panoptic Segmentation	Other
Fast R-CNN (ICCV'2015) Faster R-CNN (NeurIPS'2015) RPN (NeurIPS'2015) SSD (ECCV'2016) RetinaNet (ICCV'2017) Cascade R-CNN (CVPR'2018) YOLOv3 (ArXiv'2018) CornerNet (ECCV'2018) Grid R-CNN (CVPR'2019) Guided Anchoring (CVPR'2019) FSAF (CVPR'2019) CenterNet (CVPR'2019) Libra R-CNN (CVPR'2019) TridentNet (ICCV'2019) FCOS (ICCV'2019) RepPoints (ICCV'2019) FreeAnchor (NeurIPS'2019) CascadeRPN (NeurIPS'2019) Foveabox (TIP'2020) Double-Head R-CNN (CVPR'2020) ATSS (CVPR'2020) NAS-FCOS (CVPR'2020) CentripetalNet (CVPR'2020) AutoAssign (ArXiv'2020) Side-Aware Boundary Localization (ECCV'2020) Dynamic R-CNN (ECCV'2020) DETR (ECCV'2020) PAA (ECCV'2020) VarifocalNet (CVPR'2021) Sparse R-CNN (CVPR'2021) YOLOF (CVPR'2021) YOLOX (CVPR'2021) Deformable DETR (ICLR'2021) TOOD (ICCV'2021) DDOD (ACM MM'2021) RTMDet (ArXiv'2022) Conditional DETR (ICCV'2021) DAB-DETR (ICLR'2022) DINO (ICLR'2023) DiffusionDet (ArXiv'2023) EfficientDet (CVPR'2020) Detic (ECCV'2022)	Mask R-CNN (ICCV'2017) Cascade Mask R-CNN (CVPR'2018) Mask Scoring R-CNN (CVPR'2019) Hybrid Task Cascade (CVPR'2019) YOLACT (ICCV'2019) InstaBoost (ICCV'2019) SOLO (ECCV'2020) PointRend (CVPR'2020) DetectoRS (ArXiv'2020) SOLOv2 (NeurIPS'2020) SCNet (AAAI'2021) QueryInst (ICCV'2021) Mask2Former (ArXiv'2021) CondInst (ECCV'2020) SparseInst (CVPR'2022) RTMDet (ArXiv'2022) BoxInst (CVPR'2021)	Panoptic FPN (CVPR'2019) MaskFormer (NeurIPS'2021) Mask2Former (ArXiv'2021)

模块组件

Backbones	Necks	Loss	Common
VGG (ICLR'2015) ResNet (CVPR'2016) ResNeXt (CVPR'2017) MobileNetV2 (CVPR'2018) HRNet (CVPR'2019) Generalized Attention (ICCV'2019) GCNet (ICCVW'2019) Res2Net (TPAMI'2020) RegNet (CVPR'2020) ResNeSt (ArXiv'2020) PVT (ICCV'2021) Swin (CVPR'2021) PVTv2 (ArXiv'2021) ResNet strikes back (ArXiv'2021) EfficientNet (ArXiv'2021) ConvNeXt (CVPR'2022) ConvNeXtv2 (ArXiv'2023)	PAFPN (CVPR'2018) NAS-FPN (CVPR'2019) CARAFE (ICCV'2019) FPG (ArXiv'2020) GRoIE (ICPR'2020) DyHead (CVPR'2021)	GHM (AAAI'2019) Generalized Focal Loss (NeurIPS'2020) Seasaw Loss (CVPR'2021)	OHEM (CVPR'2016) Group Normalization (ECCV'2018) DCN (ICCV'2017) DCNv2 (CVPR'2019) Weight Standardization (ArXiv'2019) Prime Sample Attention (CVPR'2020) Strong Baselines (CVPR'2021) Resnet strikes back (ArXiv'2021)

目标检测算法库MMYOLO

MMYOLO 中目前实现了目标检测和旋转框目标检测算法，但是相比 MMDeteciton 版本有显著训练加速，训练速度相比原先版本提升 2.6 倍。

MMYOLOhttps://github.com/open-mmlab/mmyolo

文字检测识别算法库MMOCR

MMOCRhttps://github.com/open-mmlab/mmocr

专注于文本检测，文本识别以及相应的下游任务，如关键信息提取。它是 OpenMMLab 项目的一部分。兼容中文，英文和数字。

3D目标检测算法库MMDetection3D

MMDection3Dhttps://github.com/open-mmlab/mmdetection3d

面向 3D 检测的平台

旋转目标检测算法库MMRotate

适合目标检测的框有方向的任务
GitHub - open-mmlab/mmrotate: OpenMMLab Rotated Object Detection Toolbox and BenchmarkOpenMMLab Rotated Object Detection Toolbox and Benchmark - GitHub - open-mmlab/mmrotate: OpenMMLab Rotated Object Detection Toolbox and Benchmarkhttps://github.com/open-mmlab/mmrotate

图像分割算法库MMSegmentation

语义分割，适合街景，无人驾驶，理疗影像等任务GitHub - open-mmlab/mmsegmentation: OpenMMLab Semantic Segmentation Toolbox and Benchmark.OpenMMLab Semantic Segmentation Toolbox and Benchmark. - GitHub - open-mmlab/mmsegmentation: OpenMMLab Semantic Segmentation Toolbox and Benchmark.https://github.com/open-mmlab/mmsegmentation

图像分类+预训练+多模态算法库MMPretrain

GitHub - open-mmlab/mmpretrain: OpenMMLab Pre-training Toolbox and BenchmarkOpenMMLab Pre-training Toolbox and Benchmark. Contribute to open-mmlab/mmpretrain development by creating an account on GitHub.https://github.com/open-mmlab/mmpretrain

图像分类：输入图像，AI给出类别的概率，概率最高的就是最后的类型
图像描述（Image Caption）：输入图片，AI使用文字描述图片
视觉问答（Visual Question Answering）：输入图片，问AI一个问题，AI根据图片给出答案
视觉定位（Visual Grounding）：输入图片，给出提示提，AI根据提示词换出框
检索（图搜图，图搜文，文搜图）

姿态估计算法库MMPose

GitHub - open-mmlab/mmpose: OpenMMLab Pose Estimation Toolbox and Benchmark.OpenMMLab Pose Estimation Toolbox and Benchmark. Contribute to open-mmlab/mmpose development by creating an account on GitHub.https://github.com/open-mmlab/mmpose

除了人体关键提检测也可以做自己标注数据的关键点检测

三维人体姿态估计算法库MMHuman3D

GitHub - open-mmlab/mmhuman3d: OpenMMLab 3D Human Parametric Model Toolbox and BenchmarkOpenMMLab 3D Human Parametric Model Toolbox and Benchmark - GitHub - open-mmlab/mmhuman3d: OpenMMLab 3D Human Parametric Model Toolbox and Benchmarkhttps://github.com/open-mmlab/mmhuman3d

视频动作识别算法库MMAction2

动作识别，时序动作检测，时空动作检测以及基于人体姿态的动作识别

GitHub - open-mmlab/mmaction2: OpenMMLab's Next Generation Video Understanding Toolbox and BenchmarkOpenMMLab's Next Generation Video Understanding Toolbox and Benchmark - GitHub - open-mmlab/mmaction2: OpenMMLab's Next Generation Video Understanding Toolbox and Benchmarkhttps://github.com/open-mmlab/mmaction2

无法通过单帧画面，需要结合上下文判断

目标追踪MMTracking

支持视频目标检测，多目标跟踪，单目标跟踪和视频实例分割等多种任务和算法。

GitHub - open-mmlab/mmtracking: OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework. - GitHub - open-mmlab/mmtracking: OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.https://github.com/open-mmlab/mmtracking

生成模型+底层视觉+AIGC算法库MMagic

支持了流行的图像修复、图文生成、3D生成、图像修补、抠图、超分辨率和生成等任务的应用。支持 Stable Diffusion 的微调和ControlNet 动画生成。也支持 GANs 的插值，投影，编辑和其他流行的应用。

文生图，图生图，图像填充（inpainting）
图像去噪，重建，超分辨率，去雾
视频插帧，超分辨率
黑白老照片上色
Matting（抠人）

GitHub - open-mmlab/mmagic: OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, image/video restoration/enhancement, etc.OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, image/video restoration/enhancement, etc. - GitHub - open-mmlab/mmagic: OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, image/video restoration/enhancement, etc.https://github.com/open-mmlab/mmagic

模型部署工具箱MMDeploy

为各算法库提供统一的部署体验

open-mmlab/mmdeploy: OpenMMLab Model Deployment Framework (github.com)https://github.com/open-mmlab/mmdeploy

趣味应用游乐场Playground

一个用于收集和展示 OpenMMLab 相关前沿和有趣应用的项目

https://github.com/open-mmlab/playgroundhttps://github.com/open-mmlab/playground

评论 1

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。