Introduction

Audio source descriptors provide details about the source of the audio content. In general, such characteristics of audio are logically independent of the compression format. In the context of DASH, they are typically used to select an appropriate Adaptation Set. This section focuses on the description of the source format.

In the context of DASH, a specific source descriptor exists: the AudioChannelConfiguration element signals the channel configuration for Representations that contain an audio component.

In addition, source data properties may be expressed by one of the following two descriptor elements: the EssentialProperty descriptor or the SupplementalProperty descriptor.

A multitude of source properties may be expressed using these generic descriptors, some of them using the definition in ISO/IEC 23001-8 [4]. These are listed below.

Audio Channel Configuration

For the element AudioChannelConfiguration the @schemeIdUri attribute is used to identify the audio channel configuration scheme employed.
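As an illustration of how a client might read this signaling, the following minimal sketch collects the @schemeIdUri and @value of every AudioChannelConfiguration element in an MPD. The sample MPD fragment and the function name `channel_configurations` are hypothetical and exist only for this example; the namespace is the DASH MPD namespace from ISO/IEC 23009-1.

```python
import xml.etree.ElementTree as ET

# DASH MPD namespace (ISO/IEC 23009-1).
MPD_NS = {"mpd": "urn:mpeg:dash:schema:mpd:2011"}
CICP_CHANNEL_CONFIGURATION = "urn:mpeg:mpegB:cicp:ChannelConfiguration"

# Hypothetical MPD fragment, for illustration only.
SAMPLE_MPD = """<MPD xmlns="urn:mpeg:dash:schema:mpd:2011">
  <Period>
    <AdaptationSet id="1" contentType="audio" lang="en">
      <AudioChannelConfiguration
          schemeIdUri="urn:mpeg:mpegB:cicp:ChannelConfiguration" value="2"/>
    </AdaptationSet>
    <AdaptationSet id="2" contentType="audio" lang="en">
      <AudioChannelConfiguration
          schemeIdUri="urn:mpeg:mpegB:cicp:ChannelConfiguration" value="6"/>
    </AdaptationSet>
  </Period>
</MPD>
"""


def channel_configurations(mpd_text):
    """Return (AdaptationSet id, schemeIdUri, value) for each descriptor found."""
    root = ET.fromstring(mpd_text)
    found = []
    for aset in root.iterfind(".//mpd:AdaptationSet", MPD_NS):
        # Search below the Adaptation Set so that descriptors placed on
        # Representation elements are picked up as well.
        for acc in aset.iterfind(".//mpd:AudioChannelConfiguration", MPD_NS):
            found.append((aset.get("id"), acc.get("schemeIdUri"), acc.get("value")))
    return found


if __name__ == "__main__":
    for aset_id, scheme, value in channel_configurations(SAMPLE_MPD):
        print(aset_id, scheme, value)
```

A client could then filter the returned tuples by the schemes it understands before choosing an Adaptation Set.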

The following URIs identify specific audio channel configuration schemes that may be signaled in the @schemeIdUri attribute of the AudioChannelConfiguration element:

| Identifier | Reference | Section | Comment |
|---|---|---|---|
| urn:mpeg:mpegB:cicp:ChannelConfiguration | ISO/IEC 23091-3 | clause 6.2 | ChannelConfiguration specifies the channel configuration. This defines the number of audio channels and their associated loudspeaker positions according to Table 8. The name, abbreviation, and general position of each loudspeaker can be deduced from Table 7 and Figure 9. Preferred scheme for all codecs. |
| urn:mpeg:dash:23003:3:audio_channel_configuration:2011 | ISO/IEC 23009-1 | section 5.8.5.4 | The URI `urn:mpeg:dash:23003:3:audio_channel_configuration:2011` is defined to indicate the channel configuration as defined by Table 68 (‘Channel Configurations, meaning of channelConfigurationIndex, mapping of channel elements to loudspeaker positions’) of ISO/IEC 23003-3. The @value is the value as defined by that table. |
| tag:dolby.com,2014:dash:audio_channel_configuration:2011 | ETSI TS 102 366 | clause I.1.2 | Dolby audio channel configuration information. The @value attribute shall contain a four-digit hexadecimal representation of the 16-bit field that describes the channel assignment as defined by Table E.5 in ETSI TS 102 366 [14], where the left channel is the MSB. For example, for a stream with L, C, R, Ls, Rs, LFE, the value shall be “F801” (hexadecimal equivalent of the binary value 1111 1000 0000 0001). Legacy use, discouraged for new deployments. |
| tag:dolby.com,2015:dash:audio_channel_configuration:2015 | ETSI TS 103 190-2 | clause G.3.1 | Dolby audio channel configuration information. The @value attribute shall contain a six-digit hexadecimal representation of the 24-bit field as defined by clause G.3.1 in ETSI TS 103 190-2. For example, for a stream with a 3/2/2 (5.1.2) immersive audio channel configuration using loudspeakers L, R, C, Ls, Rs, TL, TR, LFE, the value shall be “0000C7” (hexadecimal equivalent of the binary value 0000 0000 0000 0000 1100 0111). Legacy use, discouraged for new deployments. |
| tag:dts.com,2014:dash:audio_channel_configuration:2012 | ETSI TS 102 114 | Annex G | DTS-HD audio channel configuration information. The @value attribute shall be set to the total number of discrete output channels represented in the stream, including LFE channels. The value shall be a whole decimal number in the range of 1 to 32. Legacy use, discouraged for new deployments. |
| tag:dts.com,2018:uhd:audio_channel_configuration | ETSI TS 103 491 | Annex D | DTS-UHD audio channel configuration information. The @value attribute shall be a string representing the ChannelMask, presented in hexadecimal form. Legacy use, discouraged for new deployments. |
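The @value formats of the three legacy schemes with a fully specified syntax above (four-digit hexadecimal, six-digit hexadecimal, and a decimal number from 1 to 32) can be checked mechanically. The following is a minimal sketch of such a check; the function names `hex_field_to_bits` and `check_channel_value` are hypothetical, and the sketch only validates the syntax described in the table. It does not interpret individual bits, which requires the tables in the referenced ETSI specifications.

```python
import re

DOLBY_2011 = "tag:dolby.com,2014:dash:audio_channel_configuration:2011"
DOLBY_2015 = "tag:dolby.com,2015:dash:audio_channel_configuration:2015"
DTS_2012 = "tag:dts.com,2014:dash:audio_channel_configuration:2012"


def hex_field_to_bits(value, digits):
    """Validate a fixed-width hex string and return its bits in groups of four."""
    if not re.fullmatch(r"[0-9A-Fa-f]{%d}" % digits, value):
        raise ValueError("expected %d hexadecimal digits, got %r" % (digits, value))
    bits = format(int(value, 16), "0%db" % (digits * 4))
    return " ".join(bits[i:i + 4] for i in range(0, len(bits), 4))


def check_channel_value(scheme_id_uri, value):
    """Check @value against the syntax stated for the given scheme.

    Returns the bit string for the Dolby schemes and the channel count
    for the DTS-HD scheme; raises ValueError otherwise.
    """
    if scheme_id_uri == DOLBY_2011:
        return hex_field_to_bits(value, 4)   # 16-bit field
    if scheme_id_uri == DOLBY_2015:
        return hex_field_to_bits(value, 6)   # 24-bit field
    if scheme_id_uri == DTS_2012:
        if not re.fullmatch(r"[0-9]+", value) or not 1 <= int(value) <= 32:
            raise ValueError("expected a decimal number from 1 to 32, got %r" % value)
        return int(value)
    raise ValueError("scheme not handled by this sketch: %r" % scheme_id_uri)


if __name__ == "__main__":
    print(check_channel_value(DOLBY_2011, "F801"))    # 1111 1000 0000 0001
    print(check_channel_value(DOLBY_2015, "0000C7"))  # 0000 0000 0000 0000 1100 0111
```

The two printed bit strings match the binary values given in the examples for the Dolby schemes.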

Generic Audio Source Descriptors

The following defines a set of URIs that identify audio source descriptions to be used with the Essential Property and/or the Supplemental Property descriptor:

| Identifier | Reference | Section | Comment |
|---|---|---|---|
| urn:mpeg:mpegB:cicp:OutputChannelPosition | ISO/IEC 23091-3 | clause 6.1 | OutputChannelPosition specifies the association of loudspeaker positions to a given audio channel according to Table 7. Table 7 indicates the loudspeaker position in the 3D environment of the listener. To ease the understanding of loudspeaker positions, Table 7 also contains loudspeaker positions according to IEC 62574, which are listed here for information to the interested reader. |
| urn:mpeg:mpegB:cicp:ProgramLoudness | ISO/IEC 23091-3 | clause 6.4 | ProgramLoudness indicates the overall loudness of the corresponding audio program. It should be measured in such a way that, if two pieces of content have their loudness normalized using the value of ProgramLoudness, the resulting audio will have consistent loudness. Decoders evaluate this value and apply an appropriate level shift to the audio to maintain a consistent loudness of output audio, matched with a user-given target loudness. The ProgramLoudness should be the overall loudness measured in LKFS according to Rec. ITU-R BS.1770. |
| urn:mpeg:mpegB:cicp:AnchorLoudness | ISO/IEC 23091-3 | clause 6.5 | AnchorLoudness indicates the loudness of the anchor elements of the corresponding audio signal (usually the dialog). The method of identifying the anchor elements is chosen by the content author and is out of scope for this part of ISO/IEC 23001. It should be chosen in such a way that, if two pieces of content have their loudness normalized using the values of AnchorLoudness, the anchor elements of the resulting audio will have consistent loudness. The AnchorLoudness should be the loudness of the anchor content of the program, measured in LKFS according to Rec. ITU-R BS.1770. |
| tag:dolby.com,2018:dash:EC3_ExtensionType:2018 | ETSI TS 103 420 | clause D.2 | Presence of the enhanced AC-3 extension is signalled with a DASH supplemental property descriptor using the schemeIdUri tag:dolby.com,2018:dash:EC3_ExtensionType:2018. The value of this DASH descriptor shall be the three-character string JOC. |
| tag:dolby.com,2018:dash:EC3_ExtensionComplexityIndex:2018 | ETSI TS 103 420 | clause D.2 | Complexity of the enhanced AC-3 extension is signalled with a DASH supplemental property descriptor. The value of this DASH descriptor shall be the decimal representation of the eight-bit element complexity_index_type_a in the EC3SpecificBox of an enhanced AC-3 audio track. |
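To show how the two enhanced AC-3 descriptors might be consumed together, the sketch below takes (schemeIdUri, value) pairs, as a client could extract them from SupplementalProperty elements, and checks them against the constraints stated above: the extension type must be the string JOC, and the complexity index must be the decimal form of an eight-bit value. The function name `ec3_extension_info` and the shape of the returned dictionary are assumptions made for this example only.

```python
import re

EC3_EXTENSION_TYPE = "tag:dolby.com,2018:dash:EC3_ExtensionType:2018"
EC3_COMPLEXITY_INDEX = "tag:dolby.com,2018:dash:EC3_ExtensionComplexityIndex:2018"


def ec3_extension_info(descriptors):
    """Summarize enhanced AC-3 extension signaling.

    descriptors: iterable of (schemeIdUri, value) pairs taken from
    SupplementalProperty elements. Unrelated descriptors are ignored.
    """
    info = {"joc": False, "complexity_index": None}
    for scheme, value in descriptors:
        if scheme == EC3_EXTENSION_TYPE:
            if value != "JOC":
                raise ValueError("extension type must be 'JOC', got %r" % value)
            info["joc"] = True
        elif scheme == EC3_COMPLEXITY_INDEX:
            # An eight-bit element can only hold values from 0 to 255.
            if not re.fullmatch(r"[0-9]+", value) or int(value) > 255:
                raise ValueError("complexity index must be 0..255, got %r" % value)
            info["complexity_index"] = int(value)
    return info


if __name__ == "__main__":
    # Hypothetical descriptor values, for illustration only.
    print(ec3_extension_info([
        (EC3_EXTENSION_TYPE, "JOC"),
        (EC3_COMPLEXITY_INDEX, "16"),
    ]))
```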