RTP audio video profile

The RTP audio/video profile (RTP/AVP) is a profile for the Real-time Transport Protocol (RTP) that specifies the technical parameters of audio and video streams. RTP defines a general-purpose packet format but does not specify how encoded media should use its features: which payload type value to put in the RTP header, which sampling rate and clock rate (the rate at which the RTP timestamp increments) to use, and so on. An RTP profile specifies these details. The RTP audio/video profile maps specific audio and video codecs and their sampling rates to RTP payload types and clock rates. It also specifies how to encode each data format as an RTP payload and how to describe these mappings using the Session Description Protocol (SDP).
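As a rough illustration of where the payload type sits in a packet, the sketch below decodes the 12-byte fixed RTP header and looks the payload type up in a small excerpt of the static assignments. The names `STATIC_PAYLOAD_TYPES`, `parse_rtp_header`, and `describe_payload` are invented for this example, and the lookup table covers only a handful of entries, so treat it as a minimal sketch rather than a complete implementation.

```python
import struct

# Excerpt of static assignments: payload type -> (encoding name, clock rate in Hz).
# Only a few entries are listed here; this is not the full table.
STATIC_PAYLOAD_TYPES = {
    0: ("PCMU", 8000),
    3: ("GSM", 8000),
    8: ("PCMA", 8000),
    9: ("G722", 8000),
    26: ("JPEG", 90000),
    34: ("H263", 90000),
}


def parse_rtp_header(packet: bytes) -> dict:
    """Decode the 12-byte fixed RTP header (CSRC list and extensions are ignored)."""
    if len(packet) < 12:
        raise ValueError("RTP packet shorter than the 12-byte fixed header")
    b0, b1, seq, timestamp, ssrc = struct.unpack("!BBHII", packet[:12])
    return {
        "version": b0 >> 6,          # top two bits of the first byte
        "marker": bool(b1 & 0x80),   # top bit of the second byte
        "payload_type": b1 & 0x7F,   # low seven bits of the second byte
        "sequence_number": seq,
        "timestamp": timestamp,
        "ssrc": ssrc,
    }


def describe_payload(packet: bytes) -> str:
    """Return a short description of the packet's payload type."""
    pt = parse_rtp_header(packet)["payload_type"]
    if pt in STATIC_PAYLOAD_TYPES:
        name, clock_rate = STATIC_PAYLOAD_TYPES[pt]
        return f"PT {pt}: {name}, clock rate {clock_rate} Hz"
    return f"PT {pt}: not in this excerpt (possibly dynamic, resolve via SDP)"


if __name__ == "__main__":
    # Hand-built header: version 2, no marker, PT 8, seq 1, timestamp 160.
    example = struct.pack("!BBHII", 0x80, 8, 1, 160, 0x1234ABCD)
    print(describe_payload(example))  # PT 8: PCMA, clock rate 8000 Hz
```

For payload types outside the static range, a receiver would normally consult the session description instead of a fixed table.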

RTP/AVP audio and video payload types

| Payload type (PT) | Name | Type | No. of channels | Clock rate (Hz)[note 1] | Frame size (ms) | Default packet size (ms) | Description | References |
|---|---|---|---|---|---|---|---|---|
| 0 | PCMU | audio | 1 | 8000 | any | 20 | ITU-T G.711 PCM µ-Law audio 64 kbit/s | RFC 3551 |
| 1 | reserved (previously FS-1016 CELP) | audio | 1 | 8000 |  |  | reserved, previously FS-1016 CELP audio 4.8 kbit/s | RFC 3551, previously RFC 1890 |
| 2 | reserved (previously G721 or G726-32) | audio | 1 | 8000 |  |  | reserved, previously ITU-T G.721 ADPCM audio 32 kbit/s or ITU-T G.726 audio 32 kbit/s | RFC 3551, previously RFC 1890 |
| 3 | GSM | audio | 1 | 8000 | 20 | 20 | European GSM Full Rate audio 13 kbit/s (GSM 06.10) | RFC 3551 |
| 4 | G723 | audio | 1 | 8000 | 30 | 30 | ITU-T G.723.1 audio | RFC 3551 |
| 5 | DVI4 | audio | 1 | 8000 | any | 20 | IMA ADPCM audio 32 kbit/s | RFC 3551 |
| 6 | DVI4 | audio | 1 | 16000 | any | 20 | IMA ADPCM audio 64 kbit/s | RFC 3551 |
| 7 | LPC | audio | 1 | 8000 | any | 20 | Experimental Linear Predictive Coding audio 5.6 kbit/s | RFC 3551 |
| 8 | PCMA | audio | 1 | 8000 | any | 20 | ITU-T G.711 PCM A-Law audio 64 kbit/s | RFC 3551 |
| 9 | G722 | audio | 1 | 8000[note 2] | any | 20 | ITU-T G.722 audio 64 kbit/s | RFC 3551, Page 14 |
| 10 | L16 | audio | 2 | 44100 | any | 20 | Linear PCM 16-bit Stereo audio 1411.2 kbit/s,[2][3][4] uncompressed | RFC 3551, Page 27 |
| 11 | L16 | audio | 1 | 44100 | any | 20 | Linear PCM 16-bit audio 705.6 kbit/s, uncompressed | RFC 3551, Page 27 |
| 12 | QCELP | audio | 1 | 8000 | 20 | 20 | Qualcomm Code Excited Linear Prediction | RFC 2658, RFC 3551 |
| 13 | CN | audio | 1 | 8000 |  |  | Comfort noise. Payload type used with audio codecs that do not support comfort noise as part of the codec itself, such as G.711, G.722.1, G.722, G.726, G.727, G.728, GSM 06.10, Siren, and RTAudio. | RFC 3389 |
| 14 | MPA | audio | 1, 2 | 90000 | 8–72 |  | MPEG-1 or MPEG-2 audio only | RFC 3551, RFC 2250 |
| 15 | G728 | audio | 1 | 8000 | 2.5 | 20 | ITU-T G.728 audio 16 kbit/s | RFC 3551 |
| 16 | DVI4 | audio | 1 | 11025 | any | 20 | IMA ADPCM audio 44.1 kbit/s | RFC 3551 |
| 17 | DVI4 | audio | 1 | 22050 | any | 20 | IMA ADPCM audio 88.2 kbit/s | RFC 3551 |
| 18 | G729 | audio | 1 | 8000 | 10 | 20 | ITU-T G.729 and G.729a audio 8 kbit/s; Annex B is implied unless the annexb=no parameter is used | RFC 3551, Page 20; RFC 3555, Page 15 |
| 19 | reserved (previously CN) | audio |  |  |  |  | reserved, previously comfort noise | RFC 3551 |
| 25 | CELB | video |  | 90000 |  |  | Sun CellB video[5] | RFC 2029 |
| 26 | JPEG | video |  | 90000 |  |  | JPEG video | RFC 2435 |
| 28 | nv | video |  | 90000 |  |  | Xerox PARC's Network Video (nv)[6] | RFC 3551, Page 32 |
| 31 | H261 | video |  | 90000 |  |  | ITU-T H.261 video | RFC 4587 |
| 32 | MPV | video |  | 90000 |  |  | MPEG-1 and MPEG-2 video | RFC 2250 |
| 33 | MP2T | audio/video |  | 90000 |  |  | MPEG-2 transport stream | RFC 2250 |
| 34 | H263 | video |  | 90000 |  |  | H.263 video, first version (1996) | RFC 3551, RFC 2190 |
| 72–76 | reserved |  |  |  |  |  | reserved because RTCP packet types 200–204 would otherwise be indistinguishable from RTP payload types 72–76 with the marker bit set | RFC 3550, RFC 3551 |
| dynamic | H263-1998 | video |  | 90000 |  |  | H.263 video, second version (1998) | RFC 3551, RFC 4629, RFC 2190 |
| dynamic | H263-2000 | video |  | 90000 |  |  | H.263 video, third version (2000) | RFC 4629 |
| dynamic (or profile) | H264 AVC | video |  | 90000 |  |  | H.264 video (MPEG-4 Part 10) | RFC 6184, previously RFC 3984 |
| dynamic (or profile) | H264 SVC | video |  | 90000 |  |  | H.264 video | RFC 6190 |
| dynamic (or profile) | H265 | video |  | 90000 |  |  | H.265 video (HEVC) | RFC 7798 |
| dynamic (or profile) | theora | video |  | 90000 |  |  | Theora video | draft-barbato-avt-rtp-theora |
| dynamic | iLBC | audio | 1 | 8000 | 20, 30 | 20, 30 | Internet low Bitrate Codec 13.33 or 15.2 kbit/s | RFC 3952 |
| dynamic | PCMA-WB | audio | 1 | 16000 | 5 |  | ITU-T G.711.1 A-law | RFC 5391 |
| dynamic | PCMU-WB | audio | 1 | 16000 | 5 |  | ITU-T G.711.1 µ-law | RFC 5391 |
| dynamic | G718 | audio |  | 32000 (placeholder) | 20 |  | ITU-T G.718 | draft-ietf-payload-rtp-g718 |
| dynamic | G719 | audio | (various) | 48000 | 20 |  | ITU-T G.719 | RFC 5404 |
| dynamic | G7221 | audio |  | 16000, 32000 | 20 |  | ITU-T G.722.1 and G.722.1 Annex C | RFC 5577 |
| dynamic | G726-16 | audio | 1 | 8000 | any | 20 | ITU-T G.726 audio 16 kbit/s | RFC 3551 |
| dynamic | G726-24 | audio | 1 | 8000 | any | 20 | ITU-T G.726 audio 24 kbit/s | RFC 3551 |
| dynamic | G726-32 | audio | 1 | 8000 | any | 20 | ITU-T G.726 audio 32 kbit/s | RFC 3551 |
| dynamic | G726-40 | audio | 1 | 8000 | any | 20 | ITU-T G.726 audio 40 kbit/s | RFC 3551 |
| dynamic | G729D | audio | 1 | 8000 | 10 | 20 | ITU-T G.729 Annex D | RFC 3551 |
| dynamic | G729E | audio | 1 | 8000 | 10 | 20 | ITU-T G.729 Annex E | RFC 3551 |
| dynamic | G7291 | audio |  | 16000 | 20 |  | ITU-T G.729.1 | RFC 4749 |
| dynamic | GSM-EFR | audio | 1 | 8000 | 20 | 20 | ITU-T GSM-EFR (GSM 06.60) | RFC 3551 |
| dynamic | GSM-HR-08 | audio | 1 | 8000 | 20 |  | ITU-T GSM-HR (GSM 06.20) | RFC 5993 |
| dynamic (or profile) | AMR | audio | (various) | 8000 | 20 |  | Adaptive Multi-Rate audio | RFC 4867 |
| dynamic (or profile) | AMR-WB | audio | (various) | 16000 | 20 |  | Adaptive Multi-Rate Wideband audio (ITU-T G.722.2) | RFC 4867 |
| dynamic (or profile) | AMR-WB+ | audio | 1, 2 or omit | 72000 | 13.3–40 |  | Extended Adaptive Multi Rate – WideBand audio | RFC 4352 |
| dynamic (or profile) | vorbis | audio | (various) | (various) |  |  | Vorbis audio | RFC 5215 |
| dynamic (or profile) | opus | audio | 1, 2 | 48000[note 3] | 2.5–60 | 20 | Opus audio | RFC 7587 |
| dynamic (or profile) | speex | audio | 1 | 8000, 16000, 32000 | 20 |  | Speex audio | RFC 5574 |
| dynamic | mpa-robust | audio | 1, 2 | 90000 | 24–72 |  | Loss-Tolerant MP3 audio | RFC 5219 (previously RFC 3119) |
| dynamic (or profile) | MP4A-LATM | audio |  | 90000 or others |  |  | MPEG-4 Audio | RFC 6416 (previously RFC 3016) |
| dynamic (or profile) | MP4V-ES | video |  | 90000 or others |  |  | MPEG-4 Visual | RFC 6416 (previously RFC 3016) |
| dynamic (or profile) | mpeg4-generic | audio/video |  | 90000 or other |  |  | MPEG-4 Elementary Streams | RFC 3640 |
| dynamic | VP8 | video |  | 90000 |  |  | VP8 video | RFC 7741 |
| dynamic | VP9 | video |  | 90000 |  |  | VP9 video | draft-ietf-payload-vp9 |
| dynamic | L8 | audio | (various) | (various) | any | 20 | Linear PCM 8-bit audio with 128 offset | RFC 3551, Section 4.5.10 and Table 5 |
| dynamic | DAT12 | audio | (various) | (various) | any | 20 (by analogy with L16) | IEC 61119 12-bit nonlinear audio | RFC 3190, Section 3 |
| dynamic | L16 | audio | (various) | (various) | any | 20 | Linear PCM 16-bit audio | RFC 3551, Section 4.5.11; RFC 2586 |
| dynamic | L20 | audio | (various) | (various) | any | 20 (by analogy with L16) | Linear PCM 20-bit audio | RFC 3190, Section 4 |
| dynamic | L24 | audio | (various) | (various) | any | 20 (by analogy with L16) | Linear PCM 24-bit audio | RFC 3190, Section 4 |
| dynamic | raw | video |  | 90000 |  |  | Uncompressed Video | RFC 4175 |
| dynamic | ac3 | audio | (various) | 32000, 44100, 48000 |  |  | Dolby AC-3 audio | RFC 4184 |
| dynamic | eac3 | audio | (various) | 32000, 44100, 48000 |  |  | Enhanced AC-3 audio | RFC 4598 |
| dynamic | t140 | text |  | 1000 |  |  | Text over IP | RFC 4103 |
| dynamic | EVRC, EVRC0, EVRC1 | audio |  | 8000 |  |  | EVRC audio | RFC 4788 |
| dynamic | EVRCB, EVRCB0, EVRCB1 | audio |  | 8000 |  |  | EVRC-B audio | RFC 4788 |
| dynamic | EVRCWB, EVRCWB0, EVRCWB1 | audio |  | 16000 |  |  | EVRC-WB audio | RFC 5188 |
| dynamic | jpeg2000 | video |  | 90000 |  |  | JPEG 2000 video | RFC 5371 |
| dynamic | UEMCLIP | audio |  | 8000, 16000 |  |  | UEMCLIP audio | RFC 5686 |
| dynamic | ATRAC3 | audio |  | 44100 |  |  | ATRAC3 audio | RFC 5584 |
| dynamic | ATRAC-X | audio |  | 44100, 48000 |  |  | ATRAC3+ audio | RFC 5584 |
| dynamic | ATRAC-ADVANCED-LOSSLESS | audio |  | (various) |  |  | ATRAC Advanced Lossless audio | RFC 5584 |
| dynamic | DV | video |  | 90000 |  |  | DV video | RFC 3189 |
| dynamic | BT656 | video |  |  |  |  | ITU-R BT.656 video | RFC 3555 |
| dynamic | BMPEG | video |  |  |  |  | Bundled MPEG-2 video | RFC 2343 |
| dynamic | SMPTE292M | video |  |  |  |  | SMPTE 292M video | RFC 3497 |
| dynamic | RED | audio |  |  |  |  | Redundant Audio Data | RFC 2198 |
| dynamic | VDVI | audio |  |  |  |  | Variable-rate DVI4 audio | RFC 3551 |
| dynamic | MP1S | video |  |  |  |  | MPEG-1 Systems Streams video | RFC 2250 |
| dynamic | MP2P | video |  |  |  |  | MPEG-2 Program Streams video | RFC 2250 |
| dynamic | tone | audio |  | 8000 (default) |  |  | tone | RFC 4733 |
| dynamic | telephone-event | audio |  | 8000 (default) |  |  | DTMF tone | RFC 4733 |
| dynamic | aptx | audio | 2 – 6 | (equal to sampling rate) | 4000 ÷ sample rate | 4[note 4] | aptX audio | RFC 7310 |

  1. The "clock rate" is the rate at which the timestamp in the RTP header is incremented, which need not be the same as the codec's sampling rate. For instance, video codecs typically use a clock rate of 90000 so their frames can be more precisely aligned with the RTCP NTP timestamp, even though video sampling rates are typically in the range of 1–60 samples per second.
  2. Although the sampling rate for G.722 is 16000, its clock rate is 8000 to remain backwards compatible with RFC 1890, which incorrectly used this value.[1]
  3. Because Opus can change sampling rates dynamically, its clock rate is fixed at 48000, even when the codec will be operated at a lower sampling rate. The maxplaybackrate and sprop-maxcapturerate parameters in SDP can be used to indicate hints/preferences about the maximum sampling rate to encode/decode.
  4. For aptX, the packetization interval must be rounded down to the nearest packet interval that can contain an integer number of samples. So at sampling rates of 11025, 22050, or 44100, a packetization rate of "4" is rounded down to 3.99.
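The relationship between clock rate and packet duration described in these notes can be worked through numerically. The sketch below computes how many RTP timestamp ticks one packet spans, and, as one possible reading of note 4, rounds a nominal aptX packet interval down to a whole number of samples. The function names `timestamp_increment` and `aptx_packet_ms` are made up for this example, and the aptX rounding rule is an assumption based only on the wording of the note.

```python
from fractions import Fraction


def timestamp_increment(clock_rate_hz: int, packet_ms) -> Fraction:
    """RTP timestamp ticks spanned by one packet of the given duration in ms."""
    return Fraction(clock_rate_hz) * Fraction(packet_ms) / 1000


def aptx_packet_ms(sample_rate_hz: int, nominal_ms: int = 4) -> Fraction:
    """Round a nominal interval down to a whole number of samples.

    Assumption: "an integer number of samples" is read as flooring the
    sample count that fits in the nominal interval.
    """
    whole_samples = sample_rate_hz * nominal_ms // 1000
    return Fraction(whole_samples * 1000, sample_rate_hz)


if __name__ == "__main__":
    # PCMU and G722 share an 8000 Hz clock, so a 20 ms packet advances the
    # timestamp by 160 in both cases, although G.722 samples at 16000 Hz.
    print(timestamp_increment(8000, 20))                   # 160
    # Opus always uses a 48000 Hz clock.
    print(timestamp_increment(48000, 20))                  # 960
    # Video at 30 frames per second on the 90000 Hz clock.
    print(timestamp_increment(90000, Fraction(1000, 30)))  # 3000
    # aptX at 44100 Hz: 176 whole samples fit in a nominal 4 ms.
    print(round(float(aptx_packet_ms(44100)), 2))          # 3.99
```

Exact fractions are used so that frame rates such as 30 per second do not introduce floating-point rounding into the tick count.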

For each payload type, RFC 3551 either gives the details of the payload format or points to a reference that does. Payload type identifiers 96–127 are used for payloads defined dynamically during a session. The document recommends dynamically assigned port numbers, although ports 5004 and 5005 have been registered for use with the profile when a dynamically assigned port is not required. The standard also describes the process for registering new payload types with IANA.
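To make the dynamic range concrete, the following sketch classifies a payload type number and builds the SDP lines that bind a dynamic payload type to a codec for one session. The helper names `classify_payload_type`, `second_header_byte`, and `sdp_audio_media` are hypothetical, the choice of 96 for Opus is arbitrary within 96–127, and the output covers only the `m=` and `a=rtpmap` lines of a session description.

```python
def classify_payload_type(pt: int) -> str:
    """Place a 7-bit payload type number into the ranges described above."""
    if not 0 <= pt <= 127:
        raise ValueError("payload type must fit in 7 bits (0-127)")
    if 72 <= pt <= 76:
        return "reserved"
    if 96 <= pt <= 127:
        return "dynamic"
    return "static or unassigned"


def second_header_byte(pt: int, marker: bool) -> int:
    """Second byte of an RTP header: marker bit followed by the payload type."""
    return (0x80 if marker else 0x00) | pt


def sdp_audio_media(port: int, formats: list) -> list:
    """Build the m= line and one a=rtpmap line per format.

    Each format is (payload_type, encoding_name, clock_rate_hz, channels).
    """
    pts = " ".join(str(fmt[0]) for fmt in formats)
    lines = [f"m=audio {port} RTP/AVP {pts}"]
    for pt, name, clock_rate, channels in formats:
        # The channel count is appended only when there is more than one.
        suffix = f"/{channels}" if channels > 1 else ""
        lines.append(f"a=rtpmap:{pt} {name}/{clock_rate}{suffix}")
    return lines


if __name__ == "__main__":
    # Payload type 72 with the marker bit set yields byte value 200, which is
    # why 72-76 are kept clear of RTCP packet types 200-204.
    print(second_header_byte(72, marker=True))  # 200
    for line in sdp_audio_media(5004, [(0, "PCMU", 8000, 1), (96, "opus", 48000, 2)]):
        print(line)
```

The receiver learns from the `a=rtpmap:96` line that payload type 96 means Opus for this session only; another session could bind 96 to a different codec.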

Applications operating under this profile should always support PCMU (payload type 0). Previously, DVI4 (payload type 5) was also recommended, but this recommendation was removed in August 2013 by RFC 7007 because "many RTP deployments do not support DVI4, and there is little reason to use it when much more modern codecs are available."

References

  1. RFC 3551, RTP Profile for Audio and Video Conferences with Minimal Control, H. Schulzrinne, S. Casner, The Internet Society (July 2003).
  2. "RFC 2586 - The Audio/L16 MIME content type". May 1999. Retrieved 2010-03-16.
  3. "RFC 3108 - Conventions for the use of the Session Description Protocol (SDP) for ATM Bearer Connections". May 2001. Retrieved 2010-03-16.
  4. "RFC 4856 - Media Type Registration of Payload Formats in the RTP Profile for Audio and Video Conferences - Registration of Media Type audio/L16". March 2007. Retrieved 2010-03-16.
  5. XIL Programmer's Guide, Chapter 22 "CellB Codec". August 1997. Retrieved on 2014-07-19.
  6. nv - network video on Henning Schulzrinne's website; Network Video on The University of Toronto's website. Retrieved on 2009-07-09.
