INTERNATIONAL ORGANISATION FOR STANDARDISATION
ORGANISATION INTERNATIONALE DE NORMALISATION
ISO/IEC JTC 1/SC 29/WG 11
CODING OF MOVING PICTURES AND AUDIO
ISO/IEC JTC 1/SC 29/WG 11 N8913
San José, CA, US – April 2007
Source: Leonardo Chiariglione
Title: Report of 80th meeting
Status: Report of 80th meeting
1 Opening
The 80th meeting was held at the invitation of ANSI at the San José Double Tree Hotel.
2 Roll call of participants
Annex 1 gives the attendance list
3 Approval of agenda
Annex 2 gives the approved agenda
4 Allocation of contributions
Annex 3 gives the list of input documents
5 Communications from Convenor
There was no specific communication
6 Report of previous meeting
This was approved
7 Processing of NB Position Papers
NB position papers were considered and responses provided where appropriate
8940 Response to National Bodies
8 Work plan
8.1 Media coding
8.1.1 MPEG-4 Visual Simple Profile Level 6
The following documents were approved
8948 Disposition of Comments on ISO/IEC 14496-2:2004/PDAM4
8949 Text of ISO/IEC 14496-2:2004/FPDAM4 Simple Profile Level 6
8.1.2 Scalable Video Coding
The following documents were approved
8962 Study Text (version 3) of ISO/IEC 14496-10:2005/FPDAM3 Scalable Video Coding
8963 Joint Scalable Video Model (JSVM) 10
8964 JSVM 10 Software
8965 Draft SVC Verification Test Plan Version 3.0
8.1.3 Multiview Video Coding
The following documents were approved
8966 Working Draft 3 of ISO/IEC 14496-10:2005/Amd.4 Multiview Video Coding
8967 Joint Multiview Video Model (JMVM) 4
8968 JMVM 4 Software
8.1.4 AAC-ELD
The following documents were approved
9072 DoC on ISO/IEC 14496-3:2005/PDAM 9 Request for Amendment.
9073 DoC on ISO/IEC 14496-3:2005/PDAM 9
9074 ISO/IEC 14496-3:2005/FPDAM 9, AAC-ELD
8.1.5 Geometry and Shadow
The following documents were approved
9136 WD 2.0 of ISO/IEC 14496-16:2006/AMD2 (Frame-based Animated Mesh Compression)
9150 Request for ISO/IEC 14496-16:2006/AMD3 (3D MultiResolution Profile)
9137 WD 1.0 of ISO/IEC 14496-16:2006/AMD3 (3D MultiResolution Profile)
9138 3D Graphics Core Experiments Description
8.1.6 Video Tool Library
The following document was approved
8984 WD 4 of ISO/IEC 23002-4
8.1.7 Bitstream Syntax Description Language
The following documents were approved
9127 Text of ISO/IEC 23001-5 FDIS Bitstream Syntax Description Language
8.1.8 Fixed point implementation of DCT/IDCT
The following documents were approved
8982 Disposition of Comments on ISO/IEC CD 23002-2
8983 Text of ISO/IEC FCD 23002-2 Fixed-point 8x8 IDCT and DCT
8.1.9 Spatial Audio Object Coding
The following documents were approved
9099 Final Spatial Audio Object Coding Evaluation Procedures and Criterion
9090 DoC ISO/IEC 23003-1:2007/PDAM 1
8.1.10 Free Viewpoint TV coding
The following documents were approved
8944 FTV Model and Requirements
8.1.11 Audio and speech coding
The following documents were approved
9095 Framework for Exploration of Speech and Audio Coding
9096 Workplan for Exploration of Speech and Audio Coding
8.2 Composition coding
8.2.1 Lightweight Scene Representation
The following documents were approved
9028 DoC on ISO/IEC 14496-20/FPDAM1 (LASeR Extensions)
9029 Text of ISO/IEC 14496-20/FDAM1 (LASeR Extensions)
9030 Request for ISO/IEC 14496-20/Amd.2 (SVGT1.2 Support)
9031 Text of ISO/IEC 14496-20/FPDAM2 (SVGT1.2 Support)
9032 TuC for ISO/IEC 14496-20/Amd2
9033 WD3.0 of ISO/IEC 14496-20 2nd Edition (1st Ed. + Cor + Amd.1)
9034 Ideas under Consideration (IuC) for LASeR
8.2.2 Symbolic Music Representation
The following documents were approved
9088 DoC ISO/IEC FCD 14496-23
9089 ISO/IEC FDIS 14496-23:200x, Symbolic Music Representation
8.3 Description Coding
8.3.1 Schema definition
The following documents were approved
9102 Schema Files for MPEG-7
8.3.2 Visual Descriptor Extensions
The following documents were approved
8970 MPEG-7 Visual XM Document version 30.0
8971 Description of Core Experiments for MPEG-7 New Visual Extensions
8.3.3 Improvements to Geographic Descriptor
The following documents were approved
9129 DoC on ISO/IEC PDAM/3 15938-5 Improvements to Geographic Descriptor
9100 ISO/IEC FPDAM/3 15938-5 Improvements to Geographic Descriptor
8.3.4 MPEG-7 Query Format
The following documents were approved
9151 Request for subdivision ISO/IEC 15938-12 MPEG-7 Query Format
9103 ISO/IEC 15938-12 CD MPEG-7 Query Format
9104 Technologies Under Consideration for MPEG-7 Query Format
8.4 Systems support
8.4.1 Fragments Request Unit
The following documents were approved
9050 DoC on ISO/IEC 23001-2/FCD (Fragment Request Unit)
9051 Text of ISO/IEC 23001-2/FDIS (Fragment Request Unit)
8.5 IPMP
8.5.1 IPMP XML Messages
The following documents were approved
9052 Text of ISO/IEC 23001-3/FCD (IPMP XML Messages)
9144 TuC for IPMP XML Messages
8.5.2 MPEG-21 IPMP Component Base Profile
The following documents were approved
9105 DoC of ISO/IEC 21000-4 FPDAM/1 IPMP Components Base Profile
9106 Text of ISO/IEC 21000-4 FDAM/1 IPMP Components Base Profile
8.5.3 REL Open Release Profile
The following documents were approved
9107 DoC of ISO/IEC 21000-5 PDAM/3 ORC (Open Release Content) Profile
9108 ISO/IEC 21000-5 FPDAM/3 ORC (Open Release Content) Profile
8.5.4 REL Distribution and Capture Profile
The following document was approved
9109 Interoperability between MPEG-21 REL DAC Profile and other Rights Information Standards
8.6 Digital Item
8.6.1 Digital Item Adaptation
The following document was approved
9113 Text of ISO/IEC 21000-7 FDIS Second edition
8.7 Transport and File Format
8.7.1 Transport of MPEG Surround data in AAC
The following documents were approved
9066 DoC ISO/IEC 13818-7:2006/FPDAM 1
9067 ISO/IEC 13818-7:2006/FDAM 1, Transport of MPEG Surround data in AAC
8.7.2 Flute Hint Track
The following documents were approved
9022 DoC on ISO/IEC 14496-12/FPDAM2 (Flute Hint Track)
9023 Text of ISO/IEC 14496-12/FDAM2 (Flute Hint Track)
9025 TuC for ISO/IEC 14496-12 & 15444-12
8.7.3 AVC File Format extensions for SVC
The following documents were approved
9026 Study Text of ISO/IEC 14496-15/PDAM2 (SVC File Format)
8.7.4 MP4FF box for Original Audio File Information
The following documents were approved
9070 DoC on ISO/IEC 14496-3/PDAM 8
9071 ISO/IEC 14496-3/FPDAM 8, MP4FF Box for Original Audio File Information
8.7.5 Digital Item File Format
The following documents were approved
9035 Request of ISO/IEC 21000-9/Amd.1
9036 Text of ISO/IEC 21000-9/PDAM.1 Mime Type Registration
8.7.6 Digital Item Streaming
The following documents were approved
9119 DoC of ISO/IEC 21000-18/PDAM 1
9120 ISO/IEC 21000-18/FPDAM/1 Simple fragmentation rule
8.8 Multimedia architecture
8.8.1 M3W Component Download
The following document was approved
9053 Text of ISO/IEC 23004-5/FDIS Component Download
8.8.2 M3W Fault Management
The following document was approved
9054 Text of ISO/IEC 23004-6/FDIS Fault Management
8.8.3 M3W System Integrity Management
The following document was approved
9055 Text of ISO/IEC 23004-7/FDIS System Integrity Management
8.8.3.1 Codec Configuration Representation
The following documents were approved
8979 WD 4 of ISO/IEC 23001-4
8985 Description of Core Experiments in RVC
8986 RVC Simulation Model (RSM) V4.0
8987 RVC Work Plan
8989 Description of Exploration Experiments for Toolbox Extensions
8.8.4 3D Graphics Compression Models
The following documents were approved
9141 Request for Subdivision of ISO/IEC 14496: Part 25 - 3D Graphics Compression Model
9142 WD 1.0 for ISO/IEC 14496-25
8.8.5 Media Streaming MAF Protocols
The following documents were approved
9058 DoC on ISO/IEC 29116-1/CD Media Streaming MAF Protocol
9059 Text of ISO/IEC 29116-1/FCD Media Streaming MAF Protocol
8.8.6 Extensible Multimedia Platform
The following documents were approved
9060 A project to exploit MPEG standards in tune with industry practices and needs
8.9 Application formats
8.9.1 Protected Music Player MAF
The following documents were approved
9121 DoC of ISO/IEC 23000-2 FCD Music Player Application Format 2nd Edition
9122 Text of ISO/IEC 23000-2 FDIS Music Player Application Format 2nd Edition
8.9.2 Musical Slide Show MAF
The following documents were approved
9037 DoC of ISO/IEC FCD 23000-4 (Musical Slide Show MAF)
9038 Text of ISO/IEC FDIS 23000-4 (Musical Slide Show MAF)
9040 WD1.0 of ISO/IEC 23000-4/Amd.2 Protected Musical Slide Show
8.9.3 Media Streaming MAF
The following documents were approved
9123 DoC on ISO/IEC CD 23000-5 Media Streaming Player
9124 ISO/IEC FCD 23000-5 Media Streaming Player
8.9.4 Open Release Application Format
The following documents were approved
9125 DoC of ISO/IEC 23000-7 CD Open release MAF
9126 ISO/IEC 23000-7 FCD Open release MAF
8.9.5 Portable Video Player
The following documents were approved
9041 Text of ISO/IEC 23000-8/CD (Portable Video Player MAF)
8.9.6 Digital Multimedia Broadcasting Application Format
The following documents were approved
9042 DoC on ISO/IEC 23000-9/CD (MAF for DMB)
9043 Text of ISO/IEC 23000-9/FCD (MAF for DMB)
9044 TuC on MAF for DMB
8.9.7 Video Surveillance Application Format
The following documents were approved
9045 Request for ISO/IEC 23000-10
9046 WD1.0 on ISO/IEC 23000-10 (Video Surveillance MAF)
8.10 Reference implementation
8.10.1 File Format Reference Software
The following documents were approved
9019 DoC of ISO/IEC 14496-5/FPDAM12 File Format Reference Soft.
9020 Text of ISO/IEC 14496-5/FDAM12 File Format Reference Software
8.10.2 Reference Hardware Description
The following documents were approved
8994 Status of HDL submissions and commitments for MPEG
8995 Study of ISO/IEC DTR 14496-9
8.10.3 Geometry and Shadow Reference Software
The following documents were approved
9149 DoC of ISO/IEC 14496-5:2001/PDAM13 (Geometry and Shadow RefSoft)
9135 Text of ISO/IEC 14496-5:2001/FPDAM13 (Geometry and Shadow RefSoft)
8.10.4 MPEG-J GFX Reference Software
The following documents were approved
9148 DoC of ISO/IEC 14496-5:2001/FPDAM11 (MPEG-J GFX RefSoft)
9134 Text of ISO/IEC 14496-5:2001/FDAM11 (MPEG-J GFX RefSoft)
8.10.5 New Profiles for Professional Applications Reference Software
The following documents were approved
8958 Request for ISO/IEC 14496-5:2001/Amd.18
8959 Working Draft 1 of ISO/IEC 14496-5:2001/Amd.18 Reference Software for new Profiles for Professional Applications
8.10.6 SVC Reference Software
The following documents were approved
8960 Request for ISO/IEC 14496-5:2001/Amd.19
8961 Working Draft 1 of ISO/IEC 14496-5:2001/Amd.19 Reference Software for SVC
8.10.7 BSAC Reference Software
The following documents were approved
9086 Request for Amendment, MPEG-1/2 on MPEG-4 Ref. Software
9087 Text of ISO/IEC 14496-5:2001/PDAM 20, MPEG-1/2 on MPEG-4 Ref. Software
8.10.8 Perceptual 3D Shape Reference Software
The following documents were approved
8974 Disposition of Comments on ISO/IEC 15938-6:2003/FPDAM2
8975 Text of ISO/IEC 15938-6:2003/FDAM2 (Perceptual 3D Shape)
8.10.9 Rights Expression Language Reference Software
The following documents were approved
9110 REL/RDD Reference Software Development Plan v.6
8.10.10 Digital Item Reference Software
The following documents were approved
9114 Preliminary DoC of preliminary comments of ISO/IEC 21000-8 FCD Reference Software
9115 Study text of ISO/IEC 21000-8 FCD Reference Software
8.10.11 Rights Data Dictionary Reference Software
The following documents were approved
9110 REL/RDD Reference Software Development Plan v.6
8.10.12 Photo Player MAF Reference Software
The following documents were approved
8978 Study Text of ISO/IEC 23000-3/PDAM1 Reference Software for Photo Player MAF
8.10.13 Musical Slide Show MAF Reference Software
9039 Workplan for Musical Slide Show MAF Conformance and Ref. Software
8.10.14 Prefixes and wild card extensions Reference Software
The following documents were approved
9047 Study Text of ISO/IEC 23001-1/FPDAM2 (Prefixes and of wild cards extensions)
8.10.15 Integer IDCT Accuracy Testing Reference Software
The following documents were approved
8980 Disposition of Comments on ISO/IEC 23002-1/PDAM1
8981 Text of ISO/IEC 23002-1/FPDAM1 Software for Integer IDCT Accuracy Testing
8.10.16 MPEG Surround Reference Software
The following documents were approved
9093 ISO/IEC 23003-1:2007/FPDAM 2, MPEG Surround Reference Software
9094 Defect Report of ISO/IEC 23003-1:2007
8.10.17 M3W Reference Software
The following documents were approved
9056 WD2.0 of ISO/IEC 23004-8 Reference Software and Conformance
9057 M3W Reference Software and Conformance Plan
8.11 Conformance
8.11.1 File Format Conformance
The following documents were approved
9013 DoC on ISO/IEC 14496-4/PDAM 24 File Format Conformance
9014 Text of ISO/IEC 14496-4/FPDAM 24 File Format Conformance
8.11.2 Geometry and Shadow Conformance
The following documents were approved
9147 DoC of ISO/IEC 14496-4:2001/ PDAM21 (Geometry and Shadow Conformance)
9133 Text of ISO/IEC 14496-4:2001/ FPDAM21 (Geometry and Shadow Conformance)
8.11.3 Synthesised Texture Conformance
The following documents were approved
8999 DoC on ISO/IEC 14496-4/PDAM 23 Synthesised Texture Conformance
9012 Text of ISO/IEC 14496-4/FPDAM 23 Synthesised Texture Conformance
8.11.4 MPEG-J GFX Conformance
The following documents were approved
9146 DoC of ISO/IEC 14496-4:2001/ FPDAM16 (MPEG-J GFX Conformance)
9132 Text of ISO/IEC 14496-4:2001/ FDAM16 (MPEG-J GFX Conformance)
8.11.5 Laser Conformance
The following documents were approved
9015 DoC on ISO/IEC 14496-4/PDAM 25 LASeR V1 Conformance
9016 Text of ISO/IEC 14496-4/FPDAM 25 LASeR V1 Conformance
8.11.6 Open Font Format Conformance
The following documents were approved
9017 Request for ISO/IEC 14496-4/Amd.26
9018 Text of ISO/IEC 14496-4/PDAM 26 Open Font Format Conformance
8.11.7 Visual Simple Profile Level 6 Conformance
The following documents were approved
8952 Disposition of Comments on ISO/IEC 14496-4:2004/PDAM28
8953 Text of ISO/IEC 14496-4:2004/FPDAM28 Visual Simple Profile Level 6 Conformance Testing
8.11.8 New Profiles for Professional Applications Conformance
The following documents were approved
8954 Request for ISO/IEC 14496-4:2004/Amd.30
8955 Working Draft 1 of ISO/IEC 14496-4:2004/Amd.30 Conformance Testing for new Profiles for Professional Applications
8.11.9 SVC Profiles Conformance
The following documents were approved
8956 Request for ISO/IEC 14496-4:2004/Amd.31
8957 Working Draft 1 of ISO/IEC 14496-4:2004/Amd.31 Conformance Testing for SVC Profiles
8.11.10 MPEG-1 and -2 Audio in MPEG-4 Conformance
The following documents were approved
9078 DoC ISO/IEC 14496-4:2004/FPDAM 18
9079 ISO/IEC 14496-4:2004/FDAM 18, MPEG-1 and -2 on MPEG-4 Conformance
8.11.11 BSAC Conformance
The following documents were approved
9076 DoC on ISO/IEC 14496-4:2004/FPDAM 14
9077 ISO/IEC 14496-4:2004/FDAM 14, BSAC Extensions Conformance
8.11.12 Audio Lossless Conformance
The following documents were approved
9080 DoC ISO/IEC 14496-4:2004/FPDAM 19
9081 ISO/IEC 14496-4:2004/FDAM 19, ALS Conformance
8.11.13 Perceptual 3D Shape Conformance
The following documents were approved
8976 Disposition of Comments on ISO/IEC 15938-7:2003/FPDAM3
8977 Text of ISO/IEC 15938-7:2003/FPDAM3 (Perceptual 3D Shape)
8.11.14 Improvements to Geographic Descriptor Conformance
9130 DoC on ISO/IEC PDAM/4 15938-7 Improvements to Geographic Descriptor Conformance
9101 ISO/IEC FPDAM/4 15938-7 Improvements to Geographic Descriptor Conformance
8.11.15 Digital Item Conformance
The following documents were approved
9116 DoC of ISO/IEC 21000-14 Conformance
9117 Text of ISO/IEC FDIS 21000-14 Conformance
8.11.16 Musical Slide Show MAF Conformance
The following document was approved
9039 Workplan for Musical Slide Show MAF Conformance and Ref. Software
8.11.17 MPEG Surround Conformance
The following document was approved
9091 ISO/IEC 23003-1:2007/FPDAM 1, MPEG Surround Conformance
9092 DoC ISO/IEC 23003-1:2007/PDAM 2
8.11.18 Codec Configuration Representation Conformance
The following document was approved
8988 RVC Conformance Testing Working Draft 1.0
8.12 Maintenance
8.12.1 Systems coding standards
The following documents were approved
8998 Text of ISO/IEC 13818-1:2003/DCOR1.2 (AVC Referencing and PS Signalling)
9021 Text of ISO/IEC 14496-11/COR.6 (AudioFXProto correction and Bitwrapper)
9024 Text of ISO/IEC 14496-12/COR.3
9027 ISO/IEC 14496-20/DCOR2
9140 Text of ISO/IEC 14496-21:2006/COR1
8972 Disposition of Comments on ISO/IEC 15938-6:2003/Amd.1:2006/DCOR 1
8973 Text of ISO/IEC 15938-6:2003/Amd.1:2006/Cor.1 (Color Temperature)
9048 DoC on ISO/IEC 23001/DCOR2
9049 Text of ISO/IEC 23001/COR2
8.12.2 Video coding standards
The following documents were approved
9064 DoC on ISO/IEC 11172-5:199x/DCOR 1
9065 ISO/IEC 11172-5:199x/Cor. 1
8950 Text of ISO/IEC 14496-4:2004/DCOR4
8951 Text of ISO/IEC 14496-4:2004/Amd.1/DCOR2
8.12.3 Audio coding standards
The following documents were approved
9068 ISO/IEC 14496-3:2005/DCOR 5 (DST and MP3on4)
9069 ISO/IEC 14496-3:2005/DCOR 6 (SLS)
9085 Text of ISO/IEC 14496-5:2001/AMD 10:2007/DCOR 1, BSAC and SLS
8.12.4 Visual description coding standards
The following documents were approved
8969 Text of ISO/IEC 15938-3:2002/Amd.2:2006/Cor.1 (Perceptual 3D Shape)
8.12.5 Digital Item standards
9111 Disposition of Comments on ISO/IEC 21000-7:2004/DCOR 1
9118 ISO/IEC 21000-15:2006/DCOR1 MPEG-21 Event Reporting
9 Liaison matters
The following output liaisons were issued
8919 Liaison Statement to WG1
8920 Liaison Statement to IETF
8921 Liaison Statement to Khronos
8922 Liaison Statement to ISO TC184 SC4
8923 Liaison Statement to 3GPP
8924 Liaison Statement to W3C
8925 Liaison Statement to ITU-T FG/IPTV concerning M3W
8926 Liaison Statement to ITU-T FG IPTV
8927 Liaison Statement to SMPTE
8928 Liaison Statement to DVD Forum
8929 Liaison Statement to ETSI
8930 Liaison Statement to SMPTE re file format
8931 Liaison Statement to DVB
8932 Liaison Statement to JCP
8933 Liaison Statement to CEA
8934 Liaison Statement to ATIS
8935 Liaison Statement to SMPTE re RVC
8936 Liaison Statement to 3D Consortium
8937 Liaison Statement to FLOForum
8938 Liaison Statement to TC46/SC9/WG7
8939 Liaison Statement to AVS
8941 Liaison Statement to DVB
10 Organisation of this meeting
10.1 Tasks for subgroups
The following tasks were assigned to subgroups
S
P
A
4
16
20
2
3
4
5
10
Y
Z
4
3D compression profiling
Laser profiling
New DID
MAFs under consideration: Protected Photo Player
MAFs under consideration: Protected Musical Slide Show
MAFs under consideration: Digital Cinema
MAFs under consideration: Surveillance
Stereoscopic MAF
Cross media interactive presentation
RVC Toolbox Extension
MPEG URNs
MAF Awareness Event
FTV
4 22
23
24
25
26
2x
5 12
14
16
17
12 2
15 1
20 1
9 1
4
1
2
8
9
10
1 2
2
3
Audio BIFS conformance
Synthesised texture conformance
File format conformance
Laser conformance
Open Font Format Conformance
Laser v.2 conformance
File Format Reference Software
Open Font Format Reference Software
Symbolic Music Representation Reference Software
Laser Reference Software
FLUTE hint track
SVC File Format
Lightweight Scene Representation
Mime type registration
Musical Slide Show MAF
Musical Slide Show MAF conformance & RS
Protected Musical Slide Show MAF
Portable Video Player MAF
DMB MAF
Video Surveillance MAF
Extension on encoding of wild cards
Fragment Request Unit
Binary to XML mapping of IPMP-X
MPEG Multimedia Middleware
Reqs
21
A
C
Systems
4
21
A
B
E
5
6
7
8
Reference Software
29116
1
X
MS MAF Protocols
Joint management of content description and presentation
E2E Multimedia Platform
MDS
7
21
12
4 1
8 1
14
18 1
A
2
5
6
7
4
7
A
2 4
3 3
3 1
2
4
2
4
Query Format
Schemas
IPMP Components Amendment 1
Reference software
IPMP Components
DIA
DIP
ER
FID
DIS
Conformance
IPMP Components
DIA
DIP
ER
FID
Digital Item Streaming
Schemas
Protected Music Player MAF
Media Streaming MAF
Professional Archival MAF
Open Release MAF
Video
B
C
JVT
Audio
4
2
4
10 3
4
7 1
3 8
3 9
3 5
4 14
18
19
20
29
Simple Profile level 6
Visual Signature Tools
Photo Player Reference Software
Photo Player Conformance
Reconfigurable Video Coding
Fixed-point 8x8 IDCT and DCT
Reconfigurable Video Coding
New AVC Profiles for Professional Applications Conformance
New AVC Profiles for Professional Applications Reference SW
Scalable Video Coding Conformance
Scalable Video Coding Reference SW
Scalable Video Coding
Multi-View Video Coding
Transport of MPEG Surround data in AAC
MP4 box for original audio file information
AAC-ELD
BSAC extensions and transport of MPEG Surround
BSAC conformance
MPEG-1 and -2 on MPEG-4 conformance
ALS conformance
SLS conformance
SMR Conformance
5 16
15
23
2 1
2
6
1 1
2
3
SMR Reference Software
BSAC and SLS Reference Software
SMR
Music Player MAF Conformance and reference software
Protected Music Player MAF
Professional Archival MAF
MPEG Surround Reference Software
MPEG Surround Conformance
Spatial Audio Object Coding
Audio and Speech Coding
4
4 16
21
5 11
13
16 2
3
25
Conformance MPEG-J GFX
Conformance of Geometry and shadow
Reference software MPEG-J GFX
Reference Software of Geometry and shadow
Frame-based animated mesh compression
3D Multiresolution profile
3D Graphics Compression model
4
10 3
SVC verification tests
4
9 2
3
6
A
D
X
3DG
Test
ISG
7
Reference Hardware Description
Reference Hardware Description
Reference software
Liaison
JPEG
IPMP-JPSEC
JPSearch - MP7QF
JPSearch – Photo Player MAF
10.2 Joint meetings
The following joint meetings were held
Groups                     What                       Day   Where   Time
Req, Mds                   URN, DID, MP7QF            Tue   Req     09:00-11:00
Mds, Sys                   DI FF issues               Tue   Mds     11:00-12:00
Req, ISG, Vid              RVC and AVS                Tue   Req     12:00-12:30
Req, Mds, Sys, Vid, Aud    MAFs under cons.           Tue   Req     14:00-18:00
Sys, JPEG                  JPSEC-IPMP                 Tue   Sys     09:00-10:30?
Sys, Aud                   Mp4 FF                     Wed   Aud     11:30-12:00
Req, 3dg                   3D compr. Prof.            Wed   3dg     12:00-12:30
Req, Vid                   FTV, MVC                   Wed   Jvt     14:00-15:30
Vid, Jvt, Req              Video metadata carriage    Wed   Jvt     15:30-16:00
Mds, Sys                   MP21-Laser                 Wed   Sys     16:00-17:00
Req, Sys                   Laser prof.                Thu   Req     09:00-09:30
Vid, JPEG                  PP MAF, JPSearch           Thu   Vid     10:00-11:00
Mds, JPEG                  MP7QF                      Thu   Mds     11:00-12:00
Mds, Req                   DID                        Thu   Mds     15:00-16:00
11 Administrative matters
11.1 Schedule of future MPEG meetings
The following meeting schedule was approved
#    City       Country   yy   mm      dd-dd
80   San José   US        07   04      23-27
81   Lausanne   CH        07   07      02-06
82   Shenzhen   CN        07   10      22-26
83   Antalya    TR        08   01      14-18
84   Geneva?    CH?       08   04-05   28-02
85   Hannover   DE        08   07      21-25
86   Seoul      KR        08   10      13-17
11.2 Promotional activities
The press release from the 80th meeting was approved
8915 San José press release
12 Planning of future activities
The following ad hoc groups were established
9063 Ad Hoc Group on MAF Under Development in Systems
9062 Ad Hoc Group on MPEG File Formats
9061 Ad Hoc Group on Scene Representation
8997 AHG for Video Annotation
9143 AHG on 3DG documents, experiments and software maintenance
9097 AHG on Audio Standards Maintenance
8947 AHG on FTV
8990 AHG on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance
8996 AHG on MPEG-4 Part 9: Reference Hardware Description Phase 1 and 2.
9128 AHG on MPEG-7 Query Format
8992 AHG on MPEG-7 Visual and Photo Player MAF
8991 AHG on Reconfigurable Video Coding
8946 AHG on Review of MPEG-21 DID
9098 AHG on SAOC CfP, AAC-ELD and Speech and Audio Exploration
8993 AHG on SVC Verification Test
13 Resolutions of this meeting
These were approved
14 A.O.B
There was no other business
15 Closing
The meeting closed at 2007/04/27T22:40
Annex A – Attendance list
First name
Ian
Gerrard
Michael
Christian
Dan
Jan
Saar
Rik
Michael
Patrick
Wa James
Last name
Burnett
Drury
Ransburg
Timmerer
Cernea
De Cock
De Zutter
Van de Walle
Gallant
Rault
Tam
Liang
Zhang
Weizhong
Quqing
Lou
Yongying
Wei-Hung
Yu-Wen
Junyan
Gwo Giun (Chris)
Sixin
Yang
Honggang
Chen
Chen
Dongsheng
Gao
Huang
Huang
Huo
Lee
Lin
Ping
Qi
Cliff
Lianhuan
Xiaozhong
Haitao
Lu
Xiaozhen
Lihua
Ying
Miska
Huopaniemi
Jani
Justin
Kemal
Mauri
Vincent
Arnaud
Nathalie
Sylvain
Nicolson
Julien
Jean-Claude
Patrick
Marc
Joel
Mohamed-Chaker
Reader
Xiong
Xu
Yang
Yu
Zheng
Zhu
Chen
Hannuksela
Jyri
Lainema
Ridge
Ugur
Vaananen
Bottreau
Bourge
Cammas
Devillers
Didier
Dubois
Dufourd
Gioia
Guez Vucher
Jung
Larabi
Affiliation
University of Wollongong
University of Wollongong
Klagenfurt University
Klagenfurt University
ETRO - VUB
Ghent University
Ghent University
Ghent University - IBBT
LSI Logic
Quartics
Communications Research Centre Canada (CRC)
Communications Research Centre Canada (CRC)
Huawei Technologies Co., Ltd.
Thomson Broadband R&D (Beijing) Co. Ltd.
China Electronics Standardization Institute
Thomson Corporate Research Beijing
MediaTek
MediaTek
Xidian University
National Cheng Kung University
Huawei Tech. Co. Ltd
Tsinghua University
Institute of Computing Technology, Chinese Academy of Sciences
Self
Huawei Technologies Co., Ltd.
Tsinghua University
Xidian University
Zhejiang University
Huawei Technologies Co., Ltd.
Thomson Inc
Tampere Univ. Tech.
Nokia
Nokia
Nokia
Nokia
Nokia
Nokia
Thomson
NXP Semiconductors
Orange-France Telecom R&D
France Telecom
Thales
University Burgundy
Streamezzo
France Telecom
SCPP
Orange-France Telecom R&D
SIC, University of Poitiers
Country
Australia
Australia
Austria
Austria
Belgium
Belgium
Belgium
Belgium
Canada
Canada
Canada
Canada
China
China
China
China
China
China
China
China
China
China
China
China
China
China
China
China
China
China
Finland
Finland
Finland
Finland
Finland
Finland
Finland
France
France
France
France
France
France
France
France
France
France
France
Anne
Khaled
LeBris
Mammou
Patrice
Stephane
Pierrick
Marius
David
Jerome
Matthias
Peter
Gero
Klaus
Mario
Ralf
Bernhard
Oliver
Juergen
Tilman
Karsten
Markus
Matthias
Jens-Rainer
Joern
Thomas
Thomas
Juergen
Andreas
Markus
Florian
Heiko
Alsosa
Ralph
Herbert
Thomas
Mathias
Steffen
Pierfrancesco
Filippo
Leonardo
Giovanni
Davide
Kohtaro
Yukihiro
Mark
Takeshi
Toshiaki
Junichi
Noboru
Satoshi
Takashi
Kota
Itaru
Hideaki
Takahiro
Abe
Onno
Pateux
Philippe
Preda
Thevenin
Vieron
Gruhne
Amon
Bäse
Diepold
Doeller
Geiger
Grill
Hellmuth
Herre
Liebchen
Müller
Multrus
Narroschke
Ohm
Ostermann
Rathgen
Schierl
Schmidt
Schneider
Schnell
Schreiner
Schwarz
Smolic
Sperschneider
Thoma
Wedi
Wien
Wittmann
Bellini
Chiariglione
Chiariglione
Cordara
Rogai
Asai
Bandoh
Callow
Chujoh
Fujii
Hara
Harada
Ito
Itoh
Iwamoto
Kaneko
Kimata
Kimoto
Kiyofumi
France Telecom
ARTEMIS Departement Institut National des Télécommunications
Canon Research Centre France SAS
Orange-France Telecom R&D
Orange Labs
iNT
Expway
Thomson R&D
Fraunhofer IDMT
Siemens AG
Siemens AG
Technische Universitaet Muenchen
University of Passau
Fraunhofer IIS
Fraunhofer IIS
Fraunhofer IIS
Fraunhofer IIS
LG Electronics
Fraunhofer HHI
Fraunhofer IIS
University of Hannover
RWTH Aachen University
University of Hannover
Ilmenau Technical University
Fraunhofer IIS
Thomson Inc.
Coding Technologies GmbH
Fraunhofer IIS
Technische Universität München
Fraunhofer HHI
Fraunhofer IIS
Fraunhofer IIS
Fraunhofer IIS
Panasonic
RWTH Aachen University
Panasonic
University of Florence - DISIT-DSI
CEDEO.net
CEDEO.net
Telecom Italia Lab
University of Florence - DISIT-DSI
Mitsubishi Electric Corporation
NTT
HI Corporation
Toshiba Corporation
Nagoya University
Ricoh Company, Ltd.
NTT
Toshiba Corporation
Fujitsu Laboratories Ltd.
NEC Corporation
Tokyo Polytechnic University
NTT Corporation
NEC Corporation
Matsushita Electric Industrial Co., Ltd.
France
France
France
France
France
France
France
France
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Germany
Italy
Italy
Italy
Italy
Italy
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Takuyo
Takehiro
Tokumichi
Joji
Sei
Hiroya
Takashi
Toshiyuki
Takeshi
Ryoma
Yukiko
Satoru
Masanori
Kazushi
Shun-ichi
Takanori
Masato
Osamu
Shinya
Taichiro
Ken
Teruhiko
Masashi
Seishi
Kogure
Moriya
Murakami
Naito
Naito
Nakamura
Nishi
Nomura
Norimatsu
Oami
Ogura
Sakazume
Sano
Sato
Sekiguchi
Senoh
Shima
Shimada
Shimizu
Shiodera
Sugiyama
Suzuki
Takahashi
Takamura
Masayuki
Akiyuki
Yoichi
Akio
Yoshihisa
Tomoo
Tomoyuki
Takahiro
Yoshiyuki
Jeong-Hwan
Sunguk
Hyouk Jean
Jihun
Seo
Ayoung
Byeongho
Hae Chul
Miran
Woong Il
Yungho
Jong Bum
Hyon-Gon
Hyon-Gon
Sung-Moon
Woo-Jin
Ki Hun
Min Cheol
Chi Jung
Lee
Euee Seon
Byeong Moon
Tanimoto
Tanizawa
Yagasaki
Yamada
Yamada
Yamakage
Yamamoto
Yamasaki
Yashima
Ahn
Baik
Cha
Cha
Chanwon
Cho
Choi
Choi
Choi
Choi
Choi
Choi
Choo
Choo
Chun
Han
Han
Hong
Hwang
James
Jang
Jeon
Matsushita Electric Industrial Co., Ltd.
NTT
Mitsubishi Electric Corporation
JVC
KDDI Corp.
JVC
Oki Electric Industry Co., Ltd.
NEC Corporation
Matsushita Electric Industrial Co., Ltd.
NEC Corporation
IPSJ/ITSCJ
Victor Company of Japan, Limited
NHK
Sony Corporation
Mitsubishi Electric Corporation
National Institute of Info & Comm Tech
Texas Instruments Japan
NEC Corporation
NTT
Toshiba Corporation
NEC Corporation
Sony Corp
Hitachi, Ltd
NTT Cyber Space Laboratories, NTT Corporation
Nagoya University
Toshiba Corporation
Sony Corp.
NEC Corporation
Mitsubishi Electric Corporation
Toshiba Corporation
Sharp Corporation
Oki Electric Industry Co., Ltd.
NTT Corporation
Samsung Electronics
Oniontech co.,ltd
LG Electronics
ETRI
Sejong University
Inha University
KETI
ETRI
ETRI
Samsung
SK Telecom
Samsung Electronics
ETRI
ETRI
ECT Inc.
Samsung Electronics
Sejong University
Soongsil University
ChungNam Univ
KETI
Hanyang University
LG Electronics
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Japan
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Byeungwoo
Yongjoon
Dongseak
Jie
Sung Ho
Sanhhyun
Ye Sun
Jaebum
Yang-Won
Bongsoo
Jung Won
Chang Ick
Do-Hyung
Dong Soo
Hae Kwang
Hui Yong
Hyun Mun
Hyungyu
Jae-Gon
Jingwoong
Jong Lak
Munchurl
So Young
Tae Hyeon
Taehyun
Yong Goo
Yong Han
Yong-Hwan
Dongkyun
Jae-Il
Han-Suh
Sang Hoon
SangHeon
Sun Young
Yung Lyul
Chungku
Sangyoun
YungKi
SungChang
Young-Kwon
Taebeom
Moon
Hack Youp
Weongeun
Henney
Kwan-Jung
Jeon
Jeon
Jeong
Jia
Jin
Joo
Joung
Jun
Jung
Jung
Kang
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Koo
Lee
Lee
Lee
Lee
Lee
Lee
Lee
Lim
Lim
Lim
Nam Mee
Noh
Oh
Oh
Oh
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Pang
Park
Park
Park
Park
Park
Sabirin
SKKU
LG Electronics
Inha University
Sejong University
Information and Communications University
ETRI
ETRI
Hanyang University
LG Electronics
SKKU
ETRI
Information and Communications University
Samsung Advanced Institute of Technology
LG Electronics
Sejong University
ETRI
Samsung AIT
Hanyang University
Hanbat National University
ETRI
DSP Group
Information and Communications University
Samsung Electronics
LG Electronics
DRM inside
SK Telecom
University of Seoul
KETI
Sejong University
Information and Communications University
LG Electronics
DSP Group
Seoul Nat'l Univ
Hanyang University
Sejong University
HUMAX Co.,Ltd.
Yonsei University
Sejong University
Sejong University
net&tv Inc.
Korea Electronics Technology Institute
Seoul University of Venture & Information
Korea
ETRI
LG Electronics
GIST (Gwangju Institute of Science and Technology)
LG Electronics
KETI
Kyung Hee University
Kwangwoon Univ.
LG Electronics
LG Electronics
Information and Communications University
Hee-Suk
Ji Ho
Min Woo
Seanae
Seung-Wook
DongHwan
Muhammad Syah
Houari
Jeongil
Seo
ETRI
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Jungdong
Hee-Cheol
Juheon
Woo Sung
Donggyu
Jaeyeon
Doug Young
Jong-Yeul
Hendry
Jungyoup
Jeong-Hyu
Won Keun
Alex Chungku
Jeong-ju
Jisang
Kyoungro
Sungyong
Jianhua
Sebastien
Jeroen
Fons
Jean H.A.
Johan
Werner
Gisle
Marian
Lukasz
Fernando
Kok Seng
Kwong Huang
Haibin
Kelvin
Te
Zhengguo
Chong Soon
Leong
Sua Hong
Susanto
Shengmei
Wei
Thiow Keng
Jaime
Ruben
Per
Kristofer
Heiko
Jonas
Rickard
Pierre
Touradj
Christophe
Marco
Tanya
Miroslaw
Leszek
Kate
Seo
Seo
Seo
Shim
Sim
Song
Suh
Suh
Tan
Yang
Yang
Yang
Yie
Yoo
Yoo
Yoon
Yoon
Zheng
Brangoulo
Breebaart
Bruls
Gelissen
Muskens
Oomen
Bjøntegaard
Muczko
Pikula
Pereira
Chong
Goh
Huang
Lee
Li
Li
Lim
Mun Kew
Neo
Rahardja
Sheng
Yao
Tan
Delgado
Tous
Fröjdh
Kjörling
Purnhagen
Rödén
Sjöberg
Davy
Ebrahimi
Lucarz
Mattavelli
Beech
Bober
Cieplinski
Grant
Yonsei University
ETRI
Sejong University
Samsung Electronics
Kwangwoon Univ.
Samsung Electronics
KHU
LG Electronics
Information and Communications University
Sungkyunkwan University
LG Electronics
ETRI
HUMAX Co.,Ltd.
ETRI
Kwangwoon University
Konkuk University
LG Electronics
Huawei Technologies Co., Ltd.
Joost Technologies
Philips Research
Philips
Philips Research
Philips Research
Philips Applied Technologies
Tandberg
Telekomunikacja Polska
Telekomunikacja Polska
IST-IT
Panasonic Singapore Laboratories
Institute for Infocomm Research
Institute for Infocomm Research
Institute for Infocomm Research
Institute for Infocomm Research
Institute for Infocomm Research
Panasonic Singapore Laboratories
Institute for Infocomm Research
Panasonic Singapore Laboratories
Institute for Infocomm Research
Panasonic Singapore Labs
Institute for Infocomm Research
NTT DoCoMo, Inc.
Universitat Politècnica de Catalunya
Universitat Politècnica de Catalunya
Ericsson
Coding Technologies AB
Coding Technologies AB
Coding Technologies AB
Ericsson
University of Geneva
EPFL
EPFL
EPFL
QinetiQ
Mitsubishi Electric Corporation
Mitsubishi Electric ITE-VIL
Nine Tiles
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Korea
Netherlands
Netherlands
Netherlands
Netherlands
Netherlands
Netherlands
Norway
Poland
Poland
Portugal
Singapore
Singapore
Singapore
Singapore
Singapore
Singapore
Singapore
Singapore
Singapore
Singapore
Singapore
Singapore
Singapore
Spain
Spain
Sweden
Sweden
Sweden
Sweden
Sweden
Switzerland
Switzerland
Switzerland
Switzerland
UK
UK
UK
UK
Mike
Robert
Ping
Jose Roberto
Yiliang
Lazar
Peter
Madhukar
Wo
Lulin
Yi-Jen
Hyukjune
Reha
Katie
Guy
Oscar
James
Alex
Matt
Onur
Oztan
Barry
Paul
Jones
Arianne
Danny
Shih-Ta
Walt
Faisal
Michael
Jorn
Sandeep
Mukta
Marta
Jae Hoon
Arkady
Shawmin
Athanasios
Vladimir
He-Yuan
Yuxin
Ning
Jiancong
Ajay
Sean
Jim
Debargha
Sam
Obianuju
Tokunbo
Purvin
Wen-Hsiao
Yolanda
Schuyler
Shankar
Nilsson
O'Callaghan
Wu
Alvarez
Bao
Bivolarski
Borgwardt
Budagavi
Chang
Chen
Chiu
Chung
Civanlar
Cornog
Cote
Divorra Escoda
Durham
Eleftheriadis
Fellers
Guleryuz
Harmanci
Haskell
Haskell
He
Hinds
Hong
Hsiang
Husak
Ishtiaq
Isnardi
Janneck
Kanumuri
Kar
Karczewicz
Kim
Kopansky
Lei
Leontaris
Levantovsky
Lin
Liu
Lu
Luo
Luthra
McCarthy
Meany
Mukherjee
Narasimhan
Ndili
Ogunfunmi
Pandit
Peng
Prieto
Quackenbush
Regunathan
BT
Mitsubishi Electric ITE-VIL
Tandberg Television
Mobilygen Corporation
Qualcomm
BrightScale, Inc
Motorola
Texas Instruments Inc.
NIST
Omneon Video Networks
Intel Corp.
Qualcomm Inc.
DoCoMo USA Labs
Avid Technology
Mobilygen Corporation
Thomson Inc.
UK
UK
UK
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
JITC / DISA
Layered Media, Inc.
Dolby Laboratories
DoCoMo USA Labs
DoCoMo USA Labs
Apple Inc.
Harmonic, Inc.
Freescale Semiconductor
IBM
Layered Media, Inc.
Motorola
Dolby Laboratories
Motorola
Sarnoff Corporation
Xilinx
DoCoMo USA Labs
CableLabs
Qualcomm
University of Southern California
Sarnoff Corporation
MediaTek
Dolby Laboratories
Monotype Imaging Inc.
NCKU
Hewlett Packard Company
Intel
Thomson Inc.
Motorola
Modulus Video
Boeing
Hewlett Packard Company
Motorola
Santa Clara University
Santa Clara University
Thomson Inc.
National Chiao-Tung University/ITRI
Freescale Semiconductor
Audio Research Labs
Microsoft Corporation
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
Yuriy
Charles
Arturo
Mike
Jesus
Andrew
Xiaojin
Prasanna
David
Ramin
Yeping
Gary
Huifang
Yasser
Ali
Andrew
Dong
Pankaj
Alexandros
Chun-Jen
Yi-Shin
James
Victor
Anthony
Eric
Mohammed
Wade
Haohong
Xianglin
Yong
Xin
Hitoshi
Samuel
Hsi-Jung
John
Yan
Peng
Haoping
Sheng
Reznik
Robinson
Rodriguez
Rubinfeld
Sampedro
Segall
Shi
Singamsetty
Singer
Soheili
Su
Sullivan
Sun
Syed
Tabatabai
Tescher
Tian
Topiwala
Tourapis
Tsai
Tung
Van Loo
Vedovato
Vetro
Viscito
Visharam
Wan
Wang
Wang
Wang
Wang
Watanabe
Wong
Wu
Wus
Ye
Yin
Yu
Zhong
Qualcomm Inc.
Dolby Laboratories
Cisco
NIST
Polycom, Inc.
Sharp
Apple Inc.
Intel Corporation
Apple
Seda Solutions Corporation
Sharp Labs of America
Microsoft Corporation
Mitsubishi Electric Research Labs
Hewlett Packard Company
Sony
Microsoft Corporation
Thomson Inc.
FastVDO
Dolby Laboratories
NCTU/ITRI
Setabox Technology Corporation
Microsoft Corporation
Microsoft Corporation
Mitsubishi Electric Corporation
eV Consulting
Sony
Broadcom Corporation
Marvell Semiconductors
Nokia
Motorola
ContentGuard, Inc.
Qpixel Technology, Inc.
Intel
Apple Inc
Panasonic
Qualcomm Inc
Thomson
Thomson Inc.
Broadcom Corporation
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
USA
Annex B – Agenda
Item
1 Opening
2 Roll call of participants
3 Approval of agenda
4 Allocation of contributions
5 Communications from Convenor
6 Report of previous meeting
7 Processing of NB Position Papers
8 1 Media coding
1 Fixed point implementation of DCT/IDCT
2 Advanced 4:4:4 Profile
3 Scalable Video Coding
4 Multiview Video Coding
5 BSAC Extensions
6 MPEG Surround
7 Geometry and Shadow
8 Reconfigurable Video Coding
9 Video Tool Library
10 Scalable audio and speech coding
2 Composition coding
1 Lightweight Scene Representation
2 Symbolic Music Representation
3 Description Coding
1 Schema definition
2 Visual Descriptor Extensions
3 MPEG-7 Query Format
4 Systems support
1 Fragments Request Unit
2 JPEG2000 support in MPEG-4 Systems
5 IPMP
1 MPEG-21 IPMP Component Base Profile
2 REL Profiles
6 Digital Item
1 Schema files for MPEG-21 standards
7 1 Transport and File Format
2 Transport of Auxiliary Video Data
3 Transport of MPEG Surround data in AAC
4 File Format extensions for Description of Timed Metadata
5 Flute Hint Track
6 AVC File Format extensions for FRExt
7 AVC File Format extensions for SVC
8 File Format Issues for Support of Audio Media
9 Digital Item Streaming
8 Multimedia architecture
1 M3W Component Download
2 M3W Fault Management
3 M3W System Integrity Management
4 M3W Reference Software
9 Application formats
1 Protected Music Player MAF
2 Photo Player MAF
3 Musical Slide Show MAF
4 Media Streaming MAF
5 Professional Archival MAF
6 Open Release Application Format
7 Portable Video Player
8 Digital Multimedia Broadcasting Application Format
9 Exploration
10 Reference implementation
1 File Format Reference Software
2 Reference Hardware Description
3 MPEG Surround Reference Software
4 Symbolic Music Representation
5 Morphing & Textures Reference Software
6 MPEG-J GFX Reference Software
7 MPEG-7 Systems Reference Software
8 Perceptual 3D Shape Reference Software
9 MPEG-21 REL Reference Software
10 MPEG-21 DIA Reference Software
11 Binary MPEG format for XML Reference Software
12 Prefixes and wild card extensions reference software
13 M3W Reference Software
11 Conformance
1 Audio BIFS v3 Conformance
2 MPEG-1 and -2 Audio in MPEG-4 Conformance
3 BSAC conformance
4 1-bit Oversampled Audio Conformance
5 Audio Lossless Conformance
6 Audio Scalable to Lossless conformance
7 MPEG Surround conformance
8 Symbolic Music Representation
9 Morphing & Textures Conformance
10 File Format conformance
11 Advanced Text and Graphics Conformance
12 MPEG-J GFX Conformance
13 Open Font Format conformance
14 Perceptual 3D Shape Conformance
15 IPMP Components Conformance
16 Event Reporting Conformance
17 Fragment Identification of MPEG Resources Conformance
18 Music Player Application Format Conformance
19 Binary MPEG format for XML Conformance
20 Prefixes and wild card extensions conformance
21 M3W Conformance
12 Maintenance
1 Systems coding standards
2 Video coding standards
3 Audio coding standards
4 Visual description coding standards
5 Audio description coding standards
6 MDS standards
9 Liaison matters
10 Organisation of this meeting
Tasks for subgroups
Joint meetings
11 Administrative matters
Schedule of future MPEG meetings
Promotional activities
12 Planning of future activities
13 Resolutions of this meeting
14 A.O.B
15 Closing
Annex C – Input contributions
No.
Authors
Title
14268 Wo Chang
Document Register for SC29/WG11 Meeting San Jose,
USA
Francisco Mor. Burgos (UPM)
14269 Jeong-Hwan Ahn
Mark Callow
AHG on 3DG documents, experiments and software
maintenance
Marco Mattavelli
G. Sullivan
14270 A. Hinds
Y. Reznik
P. Topiwala
AHG on Video IDCT Specification
14271
Yi-Shin Tung
Chung-Neng Wang
AHG on Maintenance of MPEG-4 Visual related
Documents, Reference Software and Conformance
14272
Euee S. Jang
Yoshihisa Yamada
AHG on Reconfigurable Video Coding
Sang-Kyun Kim
14273 Robert O'Callaghan
Akio Yamada
AHG on Maintenance of MPEG-7 Visual related
Documents, Reference Software and Conformance
Miroslaw Bober
Sang-Kyun Kim
14274
Akio Yamada
Wo Chang
AHG on MPEG-7 Visual and Photo Player MAF
14275 Wo Chang
AHG on MAFs Awareness Event
14276
Robert Turney
Marco Mattavelli
AHG on MPEG-4 Part 9 Reference Hardware
Description Phase 2 and 3
14277
Gerrard Drury
Peder Drege
AHG on MPEG-21 DIS
Filippo Chiariglione
14278 Christian Timmerer
Thomas Skjolberg
AHG on the Media Streaming MAF demo for the
MAF-AE
Stefan Kraegeloh
14279 Filippo Chiariglione
Noboru Harada
AHG on MDS MAFs Under Development
Wo Chang
14280 Kyoungro Yoon
Mario Doeller
AHG on MPEG-7 Query Format
14281 R. Sperschneider
AHG on Audio Standards Maintenance
14282 S. Quackenbush
AHG on SAOC CfP and AAC-ELD
Tobias Oelbaum
14283 Mathias Wien
Justin Ridge
AHG on SVC Verification Test
Vincent Bottreau
Nathalie Cammas
Alexandros Eleftheriadis
14284 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 14496-5:2001/FDAM 10
[SC 29 N 8174]
14285 W3C via SC 29 Secretariat
Liaison Statement from W3C [SC 29 N 8177]
14286 SC 29 Secretariat
Summary of Response to Proposal of Minor
Enhancement: 14496-3/Amd.9 [SC 29 N 8179]
14287 SC 29 Secretariat
Summary of Voting on ISO/IEC TR 11172-5:1998/DCOR 1 [SC 29 N 8178]
14288 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-3:2005/PDAM
9 [SC 29 N 8180]
14289 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/PDAM
24 [SC 29 N 8182]
14290 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/PDAM
25 [SC 29 N 8184]
14291 SC 29 Secretariat
Summary of Voting on ISO/IEC 21000-5:2004/PDAM
3 [SC 29 N 8190]
14292 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 14496-3:2005/FDAM 1
[SC 29 N 8207]
14293 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 13818-1:200X/FDAM 1
[SC 29 N 8211]
14294 SC 29 Secretariat
Summary of Voting on ISO/IEC 15938-5:2003/PDAM
3 [SC 29 N 8212]
14295 SC 29 Secretariat
Summary of Voting on ISO/IEC 15938-7:2003/PDAM
4 [SC 29 N 8213]
14296 SC 29 Secretariat
Summary of Voting on ISO/IEC CD 23002-2 [SC 29 N
8222]
14297 3GPP via SC 29 Secretariat
Liaison Statement from 3GPP [SC 29 N 8225]
14298 SC 29 Secretariat
Summary of Voting on ISO/IEC CD 23000-5 [SC 29 N
8226]
14299 SC 29 Secretariat
Summary of Voting on ISO/IEC CD 23001-3 [SC 29 N
8227]
14300 FG IPTV via SC 29 Secretariat
Liaison Statement from ITU-T IPTV Focus Group (FG
IPTV) [SC 29 N 8228]
Christophe Lucarz
Marco Mattavelli
14301 Andrew Kinane
Sunyoung Lee
Sinwook Lee
RVC Functional Units naming process proposal
14302 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC FDIS 14496-22 [SC 29 N
8234]
14303 SC 29 Secretariat
Summary of Voting on NWIP, Information technology
-- Supplemental media technologies [SC 29 N 8235]
14304 SC 29 Secretariat
14305
the DVD Forum WG-1 via SC 29
Secretariat
Summary of Voting on ISO/IEC CD 23005-1 [SC 29 N
8236]
Liaison Statement from the DVD Forum WG-1 [SC 29
N 8254]
14306 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 14496-4:2004/FDAM 12
[SC 29 N 8249]
14307 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 14496-5:2001/FDAM 9
[SC 29 N 8251]
14308 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 14496-11:2005/FDAM 5
[SC 29 N 8252]
14309 SC 29 Secretariat
Summary of Voting on ISO/IEC 23002-1/PDAM 1
[SC 29 N 8259]
Yuriy A. Reznik
14310 Gary Sullivan
Arianne T. Hinds
Study Text of ISO/IEC 23002 CD (editors input)
14311 Yuriy Reznik
Study Text of ISO/IEC 23002-1/PDAM1 (editors
input)
14312 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 13818-2:2000/FDAM 2
[SC 29 N 8260]
14313 IEC TC 100 via SC 29 Secretariat
IEC CDV 61937-3 [SC 29 N 8263]
14314 IEC TC 100 via SC 29 Secretariat
IEC CDV 61966-2-5 [SC 29 N 8264]
14315 Schuyler Quackenbush
Spatial Audio Object Coding Evaluation Procedures
and Criterion
14316 Schuyler Quackenbush
79th MPEG Audio Report
14317 Schuyler Quackenbush
Proposed Workplan for Speech and Audio Exploration
14318 Sylvain Devillers
Editors' input to draft text of 23001-5 (MPEG-B
BSDL)
14319 SC 29 Secretariat
Summary of Voting on ISO/IEC 13818-7:2006/FPDAM 1 [SC 29 N 8268]
14320 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 18 [SC 29 N 8269]
14321 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 19 [SC 29 N 8270]
14322 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 20 [SC 29 N 8271]
14323 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/PDAM
21 [SC 29 N 8272]
14324 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-5:2001/FPDAM 12 [SC 29 N 8273]
14325 SC 29 Secretariat
Summary of Voting on ISO/IEC 15938-6:2003/FPDAM 2 [SC 29 N 8274]
14326 SC 29 Secretariat
Summary of Voting on ISO/IEC 15938-7:2003/FPDAM 3 [SC 29 N 8275]
14327 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 14 [SC 29 N 8276]
14328 SC 29 Secretariat
Summary of Voting on ISO/IEC FCD 14496-23 [SC
29 N 8277]
14329 A. G. Tescher for USNB
USNB Contribution: Response to resolution 3.1.2 of
79-th WG 11 meeting
Thomas Skjølberg
Peder Drege
14330
Joseph Thomas-Kerr
Gerrard Drury
Report of CE on DIS TuC
14331
ETSI TC DECT via SC 29
Secretariat
Liaison Statement from ETSI TC DECT to ITU-T SG
12 and ETSI TC STQ
14332 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-5:2001/PDAM
13 [SC 29 N 8280]
14333 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 14496-12:2005/FDAM 1
[SC 29 N 8281]
14334 SC 29 Secretariat
Summary of Voting on ISO/IEC 21000-4:2006/FPDAM 1 [SC 29 N 8282]
14335 SC 29 Secretariat
Summary of Voting on ISO/IEC 21000-18/PDAM 1
[SC 29 N 8294]
14336 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-12:2005/FPDAM 2 and ISO/IEC 15444-12:2005/FPDAM 2 [SC 29 N 8297]
14337 SC 29 Secretariat
Summary of Voting on ISO/IEC FCD 23004-5 [SC 29
N 8298]
14338 SC 29 Secretariat
Summary of Voting on ISO/IEC FCD 23004-6 [SC 29
N 8299]
14339 SC 29 Secretariat
Summary of Voting on ISO/IEC FCD 23004-7 [SC 29
N 8301]
14340
Christophe Lucarz
Marco Mattavelli
Compression of the RVC DDL Decoder Description
with BiM (results of Core Experiment 1.3 in RVC)
Christian Timmerer
14341 Sylvain Devillers
Michael Ransburg
Editor's input on Draft MPEG-21 DIA 2nd edition
14342 CEA via SC 29 Secretariat
Liaison Statement from CEA [SC 29 N 8310]
14343 SC 29 Secretariat
Summary of Voting on ISO/IEC FCD 23000-4 [SC 29
N 8306]
14344 SC 29 Secretariat
Summary of Voting on ISO/IEC 23003-1/PDAM 1
[SC 29 N 8307]
14345 SC 29 Secretariat
Summary of Voting on ISO/IEC 23003-1/PDAM 2
[SC 29 N 8308]
14346 Yuriy Reznik
Updated 23002-1 IDCT precision testbed
14347
Yuriy Reznik
Arianne Hinds
Updated H.263-based IDCT testbed
14348 Arianne T. Hinds
Updated MPEG-4 IDCT Testbed
Gavin Schutz
14349 Teruhiko Suzuki
Michael Dolan
Liaison re w8559 Text of ISO/IEC 13818-1:200x/DCOR.1
Weon-Geun Oh
Dong-Seok Jeong
Ju-Kyoung Jin
14350 A-Young Cho
Jun-Woo Lee
Ik-Hwan Cho
Won-Keun Yang
Mathematical consideration on the degree of
geometrical modification
Saar De Zutter
14351 Jan De Cock
Rik Van de Walle
Conformance tests for DIDL documents - files
14352 James Orwell
Contribution to the Basic Video Surveillance MAF
14353 ATIS IIF via SC 29 Secretariat
Liaison Statement from ATIS IIF [SC 29 N 8317]
14354 ITU-T SG 16 via SC 29 Secretariat
Liaison Statement from ITU-T SG 16 [SC 29 N 8324]
14355 Ralph Sperschneider
WD on MPEG-4 Audio Fourth Edition
Saar De Zutter
Jan De Cock
14356 Rik Van de Walle
on behalf of the Belgian National
Body
BNB comments on ISO/IEC FCD 21000-14:
Conformance Testing
14357 Jungwon Lee
ISO/IEC JTC 1/SC 29/WG 11 N6702
14358
Yi-Shin Tung
Ja-Ling Wu
Additional fixes on MPEG-4 video conformance
bitstreams
14359
Yi-Shin Tung
Ja-Ling Wu
Consider row-transform-first IDCT in 23002-2
14360 A. G. Tescher for USNB
USNB Contribution: Issues relating to expiring patents
14361 SC 29 Secretariat
Summary of Voting on ISO/IEC FCD 21000-14 [SC
29 N 8332]
14362 DVB via SC 29 Secretariat
Liaison Statement from the DVB [SC 29 N 8326]
14363
Nikolce Stefanoski
Jörn Ostermann
Scalable Compression of Dynamic 3D Meshes
Pierfrancesco Bellini
Paolo Nesi
14364
Maurizio Campanai
Giorgio Zoia
Editors study on ISO/IEC 14496-23/FCD
Davide Rogai
14365 Paolo Nesi
Pierfrancesco Bellini
Experience on using MPEG-21 File Format for nested
and/or protected DIs
Paolo Nesi
14366 Pierfrancesco Bellini
Davide Rogai
Additional examples on Cross-Media Interactive
Presentation MAF
Paolo Nesi
Pierfrancesco Bellini
14367
Davide Rogai
Kia Ng (University of Leeds)
Proposal for a MAF on Cross-Media Interactive
Presentation: Application Scenarios
Paolo Nesi
14368 Pierfrancesco Bellini
Davide Rogai
Proposal for a MAF on Cross-Media Interactive
Presentation: Requirements
Davide Rogai
14369 Pierfrancesco Bellini
Paolo Nesi
Proposal for a MAF on Cross-Media Interactive
Presentation: Relationships with other MAFs
14370 Jean-Claude Dufourd
LASeR fixes requested by 3GPP DIMS
14371
Jean H.A. Gelissen (editor)
Johan Muskens
Contribution to M3W Reference Software for M3W
Parts 2, 3, 5, 6 & 7
14372 Jean-Claude Dufourd
Splitting LASeR AMD1
14373 Jean-Claude Dufourd
LASeR profiles adjustments
Gwo Giun Lee
14374 He-Yuan Lin
Ming-Jiun Wang
Functional units of inter-prediction under reasonable
system partition for RVC framework
Gwo Giun Lee
14375 He-Yuan Lin
Ming-Jiun Wang
Conformance test tools of RVC functional units
14376 SC 29 Secretariat
Summary of Voting on ISO/IEC FCD 23000-2
14377 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 16
14378 Jean-Claude Dufourd
Additions to LASeR AMD2 from 3GPP
14379 Arianne T. Hinds
Updated T.83 testbed for IDCTs
14380 Zhibo Ni
Updated MPEG-2 IDCT Testbed
14381 SC 29 Secretariat
Summary of Voting on ISO/IEC FCD 23001-2
14382 SC 29 Secretariat
Summary of Voting on ISO/IEC 13818-1:200X/DCOR
1
14383 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-2:2004/PDAM
4
14384 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-3:2005/PDAM
8
14385 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/PDAM
23
14386 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/PDAM
28
14387 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-11:2005/DCOR 6
14388 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-12:2005/DCOR 3 & ISO/IEC 15444-12:2005/DCOR 3
14389 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-21:2006/DCOR 1
14390 SC 29 Secretariat
Summary of Voting on ISO/IEC 15938-3:2002/Amd.2:2006/DCOR 1
14391 SC 29 Secretariat
Summary of Voting on ISO/IEC 15938-6:2003/Amd.1:2006/DCOR 1
14392 SC 29 Secretariat
Summary of Voting on ISO/IEC 21000-7:2004/DCOR
1
14393 SC 29 Secretariat
Summary of Voting on ISO/IEC CD 23000-7
14394 SC 29 Secretariat
Summary of Voting on ISO/IEC CD 23000-9
14395 SC 29 Secretariat
Summary of Voting on ISO/IEC 23001-1:2006/DCOR
2
14396 Jeong-Hwan Ahn
Conformance bitstream for Geometry & Shadow
14397 SC 29 Secretariat
Late Vote on ISO/IEC 21000-4:2006/FPDAM 1 [SC
29 N 8331]
14398 SC 29 Secretariat
Common Patent Policy for ITU-T/ITU-R/ISO/IEC, and
Guidelines for Implementation of the Common Patent
Policy [SC 29 N 8314]
14399
Eva Rodríguez
Jaime Delgado
Adding Integrity and authenticity to Event Reporting
information
14400
Jaime Delgado
Eva Rodríguez
Defect Report Proposal of ISO/IEC 21000-15
14401
Eva Rodríguez
Jaime Delgado
Contribution to REL MAM Profile Conformance
14402
Simon Daniels
Vladimir Levantovsky
Proposed conformance test methodology and
bitstreams for ISO/IEC 14496-22
14403 Arianne T. Hinds
14404
Jani Peltotalo
Miska M. Hannuksela
Updated TM5 MPEG-2 Testbed
Comments and suggestions regarding ISO/IEC 14496-12 Amd.2
14405 David Singer
Comments on the SVC File Format
Sangki Kim
14406 Hyobin Lee
Sangyoun Lee
CE Report for VCE-5
14407 Kelvin Lee
Status of SLS reference software update
Marius Preda
Benoit Le Bonhomme
14408
Son Tran
Françoise Prêteux
3dod.org goes multimedia: MyMultimediaWorld.com
Saar De Zutter
Jan De Cock
14409 Rik Van de Walle
on behalf of the Belgian National
Body
Preliminary BNB comments on ISO/IEC FCD 21000-8: Reference Software (2nd edition)
14410 Noboru Harada
Proposed revision for ISO/IEC 14496-3, AMD8:
Takehiro Moriya
Yutaka Kamamoto
MP4FF box for original audio file information
Noboru Harada
14411 Takehiro Moriya
Yutaka Kamamoto
Proposed text to WD of Professional Archival MAF
Weon-Geun Oh
14412 Won-Keun Yang
Dong-Seok Jeong
Modified GST Based Descriptor for MPEG-7 VCE-6
Complex Condition
14413 TTA via SC 29 Secretariat
Liaison Statement from TTA [SC 29 N 8333]
Kelvin Lee
14414 Te Li
Haibin Huang
Proposed Corrigenda to 14496-3:2005/AMD 3 (SLS)
Kisong Yoon
14415 Taehyun Kim
Hogab Kang
Interoperability between MPEG-21 REL DAC Profile
and Other Standards
14416
Jar-Sheng Chen
Chun-Jen Tsai
Implementation of B frame support in RVC CAL
Model
Masayuki Tanimoto
Toshiaki Fujii
14417
Hideaki Kimata
Shigeyuki Sakazawa
Proposal on Requirements for FTV
Jihun Cha
YeSun Joung
14418
Young-Kwon Lim
KyungAe Moon
Ideas on MPEG-21 and LASeR
Jihun Cha
Youngkwon Lim
14419
YeSun Joung
KyungAe Moon
Issues on the carriage of ISO/IEC 14496-20 contents
over MPEG-2
Hee-Cheol Seo
Miran Choi
Hyunki Kim
14420 Myung-Gil Jang
Soojong Lim
Jeong Heo
Kyoungro Yoon
CE Report for Query Expression of MPEG-7 Query
Format
Hee-Cheol Seo
Miran Choi
Hyunki Kim
14421 Myung-Gil Jang
Soojong Lim
Jeong Heo
Kyoungro Yoon
Revision of Proposed Input Query Format for MPEG-7
Query Format
Hyun-Kook Lee
Hee-Suk Pang
14422
Dong Soo Kim
Sung-Yong Yoon
Report on the SAOC test material provided by LGE
Henney Oh
Yang-Won Jung
Kwangcheol Choi
Sung-Moon Chun
Jaedo Kwak
14423 Seungheon Yang
Ji-Sang Yoo
Si-Hun Sung
Seong-Cheol Han
Requirements for Stereoscopic MAF
Jaedo Kwak
Si-Hun Sung
14424 Sung-Moon Chun
JinWoong Kim
Namho Hur
Whitepaper of Stereoscopic Project
Hui Yong Kim
14425 Hyon-Gon Choo
Munchurl Kim
(Editors Input) Updated Text of ISO/IEC 23000-9
MAF for DMB
Hui Yong Kim
Gun Bang
MyungSeok Ki
14426 Hyun Cheol Kim
Han-Kyu Lee
Jin Woo Hong
Young-Kwon Lim
Proposal for MPEG-2 TS Encapsulation with ISO/IEC
23000-9 MAF for DMB
Hui Yong Kim
Seung Jun Yang
Heekyung Lee
14427 Han-Kyu Lee
Jin Woo Hong
Munchurl Kim
Jinhan Kim
Proposal for Restrictions on TV-Anytime Metadata in
ISO/IEC 23000-9 MAF for DMB
14428 Tilman Liebchen
Proposed Text of ISO/IEC 14496-4:2004/FDAM 19,
Audio Lossless Coding (ALS) Conformance
14429 Tilman Liebchen
Updated Status of ALS Conformance
14430 Tilman Liebchen
Comments on Professional Archival MAF
Requirements
Yo-Sung Ho
14431 Cheon Lee
Kwan-Jung Oh
CE6: View Interpolation Prediction for Multi-view
Video Coding
Yo-Sung Ho
14432 Kwan-Jung Oh
Cheon Lee
Observations of Multi-view Test Sequences
Yo-Sung Ho
14433 Kwan-Jung Oh
Cheon Lee
CE5: Verification of JVT-W031
14434
Julien Dubois
Barthelemy Heyrman
Wildcard Platform Vs ML310
Marco Mattavelli
Johel Miteran
Hyouk Jean Cha
14435 Tae Hyeon Kim
Herbert Thoma
Proposed text of ISO/IEC 23000-8 CD Portable video
player MAF
14436 Ryoma Oami
CE report for VCE-3 on person identity-based photo
indexing
14437 Ryoma Oami
A proposal on metadata modification for Musical Slide
Show MAF
14438 Ryoma Oami
A proposal of an additional functionality to be
supported in Portable Video Player MAF
14439
Kota Iwamoto
Ryoma Oami
CE report for VCE-7 on video signature
14440
Kota Iwamoto
Ryoma Oami
Proposal of CE procedure for VCE-7
Oliver Hellmuth
14441 Juergen Herre
Thorsten Kastner
14442
Hyon-Gon Choo
Filippo Chiariglione
Proposed SAOC test items provided by Fraunhofer IIS
Proposed text of ISO/IEC 23000-5 FCD Media
Streaming MAF
Filippo Chiariglione(Editor)
14443 Hyon-Gon Choo(Editor)
Jooyoung Lee
Proposed text of ISO/IEC 23001-3 FCD Binary XML
to IPMP-X
Hyon-Gon Choo
14444 Filippo Chiariglione
Naito Joji
Proposed text of ISO/IEC 23005-1 FCD Media
Streaming MAF Protocol (Editor's Input)
Giseok Son
14445 Sinwook Lee
Euee S. Jang
Core Experiment Result on CDDL
14446
Hyungyu Kim
Euee S. Jang
Proposed Text of RVC CE
Jaebum Jun
14447 Sunyoung Lee
Euee S. Jang
Study on RVC Framework and Its Requirements
Yoshihisa Yamada
14448 Kenji Otoi
Kohtaro Asai
Proposed text of the RVC FUs for MPEG-4 AVC
(Results of CE 2.2)
Doeller
14449 Gruhne
Wolf
MP7QF CE Test Report
14450
David Thevenin
Philippe de Cuetos
Editor's study of 23001-1 FPDAM2
14451
David Thevenin
Philippe de Cuetos
Binary Conformance streams for MPEG-21
14452 Tokumichi Murakami
Requirement of Full-Color Video Coding for
Kohtaro Asai
Yoshihisa Yamada
Kristofer Kjörling
Jonas Rödén
14453 Jeroen Koppens
Erik Schuijers
Jeroen Breebaart
14454
Christophe Lucarz
Marco Mattavelli
Consumer Applications
Proposed draft corrigendum for MPEG Surround
Implementation of multiple reference frame support in
RVC CAL model
14455 Eunmi Oh
Evaluation of speech and audio coding scheme
Christian Timmerer
14456 Hermann Hellwagner
on behalf of Austrian NB
Austrian NB comments on ISO/IEC 21000-14 FCD
Ghislain Roquier
Maxime Pelcat
Mickaël Raulet
14457
Matthieu Wipliez
Jean-François Nezan
Olivier Déforges
A scheme for implementing MPEG-4 SP codec in the
RVC framework
Ingo Kofler
Christian Timmerer
14458
Hermann Hellwagner
on behalf of Austrian NB
Austrian NB comments on ISO/IEC 21000-7 Cor.1
Michael Eberhard
Christian Timmerer
14459
Hermann Hellwagner
on behalf of Austrian NB
Austrian NB comments on ISO/IEC 21000-8 FCD
Christian Timmerer
Hermann Hellwagner
Austrian NB comments on ISO/IEC CD XXXXX
Media Streaming MAF Protocols
Christian Timmerer
14461 Michael Ransburg
Hermann Hellwagner
Austrian NB comments on ISO/IEC 23000-5 CD
Michael Eberhard
14462 Christian Timmerer
Hermann Hellwagner
Update of gBSDtoBin and DIA Reference and Utility
Software Modules
Maxime Pelcat
Médéric Blestel
14463 Mickaël Raulet
Jean-François Nezan
Olivier Déforges
Evolutions of RVC so as to handle SVC decoding
14460
14464
Jeroen Breebaart
Werner Oomen
Proposed SAOC test items provided by Philips
14465
Erik Schuijers
Werner Oomen
Crosscheck FT enhanced LD AAC core experiment
14466
Patrick Gioia
Anne Le Bris
Report of CE2: Space Partitioning
Romain Cavagna
14467
Patrick Gioia
Olivier Aubault
Proposal for 3D Compression Profile
Nicola Adami
Riccardo Leonardi
14468
Pierangelo Migliorati
Claudia Tonoli
Performance of a Distributed Video Codec in Presence
of Transmission Errors
Honggang Qi
Wen Gao
14469
Debin Zhao
Siwei Ma
Crosscheck for IDCT CD
14470
Paul Brasnett
Miroslaw Bober
Improved Image Identifier (VCE6)
14471
Paul Brasnett
Miroslaw Bober
Modification of VCE6 Experimental Conditions
14472
Paul Brasnett
Miroslaw Bober
VCE7 Experimental Conditions
Honggang Qi
Wen Gao
14473
Tiejun Huang
Lu Yu
Extension to support non-MPEG standards (ICT/ZJU)
(Results of CE 1.6)
Honggang Qi
Wen Gao
Lu Yu
14474
Euee S. Jang
Marco Mattavelli
Andrew Kinane
Exploration experiments of AVS decoder description
in RVC framework
14475
Giovanni Cordara (on behalf of the ITNB)
Italian NB proposal to revisit MPEG-21 DID
14476 AVS Workgroup
Liaison Statement to MPEG on RVC
Hendry
14477 Houari Sabirin
Munchurl Kim
Updated Proposal for Protected Musical Slide Show
MAF with IPMP
Hendry
14478 Houari Sabirin
Munchurl Kim
Updated Proposal for Protected Photo Player MAF
with IPMP
Taehyun Kim
Jaime Delgado
14479
Florian Schreiner
Chris Barlas
Editor's study of ISO/IEC 21000-5/PDAM3
14480 Paul Schumacher
Implementation of MPEG-4 AVC Deblocking Filter in
RVC CAL model
14481
Hendry
Takafumi Ueno
14482 Hendry
Some Editorial Update for ISO/IEC 21000-4/FPDAM1
MPEG-21 IPMP Components Base Profile
Late comment for ISO/IEC 21000-4/FPDAM1 MPEG-21 IPMP Components Base Profile
14483
Hendry
Munchurl Kim
Kisong Yoon
14484 Taehyun Kim
Hogab Kang
14485
Zhibo Ni
Lu Yu
Houari Sabirin
14486 Jeongyeon Lim
Munchurl Kim
Contribution for MPEG-21 IPMP Components Base
Profile Conformance
A Study on Use Cases of Derivative Works with
MPEG-21 REL ORC Profile License
IDCT Core Experiment Results
A Proposal for Basic Video Surveillance Application
Format
14487
Michael Ransburg
Hermann Hellwagner
Contribution to Conformance for ISO/IEC 14496-12
AMD/1
14488
Jonas Engdegård
Barbara Resch
Description of SAOC test items provided by Coding
Technologies
Filippo Chiariglione
14489 Jooyoung Lee
Hyon-Gon Choo
Proposal of Modified IPMP XML messages for
ISO/IEC 23001-3 Binary XML to IPMP-X
Christophe Lucarz
Marco Mattavelli
14490
Joseph Thomas-Kerr
Jörn Janneck
Reconfigurability potential of the MPEG-4 SP decoder
(results of CE 1.1)
Khaled Mamou
Marius Preda
14491
Titus Zaharia
Françoise Prêteux
FAMC bitstream description
14492 Fredrik Henn
Cross check of FhG Core Experiment on LD-SBR
filterbank for AAC-ELD
Khaled Mamou
Karsten Müller
Detlev Marpe
14493
Titus Zaharia
Marius Preda
Françoise Prêteux
Frame-based Animated Mesh Compression:
integration of the CABAC arithmetic encoder
Thomas Rathgen
Michael Ransburg
14494 Peter Amon
Andreas Hutter
Hermann Hellwagner
Extraction path description
Michael Ransburg
Thomas Rathgen
14495 Peter Amon
Andreas Hutter
Hermann Hellwagner
Terms and definitions for the SVC file format
14496
Thomas Rathgen
Peter Amon
On the SVC file format
Andreas Hutter
14497
Philippe de Cuetos on behalf of
FNB
French NB comment on FCD 21000-14
Khaled Mamou
Titus Zaharia
14498
Marius Preda
Françoise Prêteux
FAMC with streaming support
Johannes Hilpert
Sascha Disch
14499
Heiko Purnhagen
Werner Oomen
Proposed MPEG Surround Level Enhancement
14500 Sylvain Devillers
Use of MPEG URN for identifying profiles and levels
14501 Anisse Taleb
Report on the Evaluation of MPEG-4 Enhanced Low
Delay AAC on Speech Content
Daniel Oancea
Pedro Carvalho
14502 Teresa Andrade
Christian Timmerer
Hermann Hellwagner
Defect Report on ISO/IEC 21000-15
Hélder Castro
Pedro Carvalho
14503 Teresa Andrade
Christian Timmerer
Hermann Hellwagner
A DID model for Media Streaming MAF
Heiko Purnhagen
Andreas Schneider
14504 Frans de Bont
Karsten Linzmeier
Ralph Sperschneider
Proposed Updates for MPEG Surround Conformance
Eva Rodríguez
Jaime Delgado
Contribution to MPEG-21 Reference Software:
Validation Rules Checker for the REL MAM Profile
14505
14506 Yuriy Reznik
14507
Eva Rodríguez
Jaime Delgado
Summary of core experiments on fixed point
IDCT/DCT
Contribution to the current version of the Open Release
MAF
Eva Rodríguez
14508 Jaime Delgado
Víctor Torres
Some issues on the generation and modification of
Event Reports in the MPEG-21 Event Reporting
14509 Yuriy Reznik
Cross-check of IDCT conformance tests
14510 Yuriy Reznik
Proposal for adding ISO/IEC 23002-2 in RVC tool
library
14511
Florian Schreiner
Chun Hui Suen
Overview of ISO/IEC 23000-7 CD Open Release MAF
(1-pager)
14512
Gary J. Sullivan
Regis Crinon
Proposed technical alternative to MPEG-2 Systems
DCOR 1 text WG 11 N 8859
14513
Florian Schreiner
Chun Hui Suen
Proposed text to ISO/IEC 23000-7 CD Open Release
MAF
14514
Markus Schnell
Ralf Geiger
Proposed FPDAM of AAC-ELD
Markus Schmidt
14515 Ralf Geiger
Markus Schnell
Cross-check report on Proposed FT Core Experiment
for AAC-ELD
Ralf Geiger
Markus Schnell
14516
Jürgen Herre
Kristofer Kjörling
Utilizing AAC-ELD for delayless mixing in frequency
domain
Markus Schnell
Jürgen Herre
14517 Ralf Geiger
Markus Schmidt
Markus Multrus
Proposed Core Experiment on AAC-ELD
Markus Schmidt
14518 Ralf Geiger
Markus Schnell
Additional information on quality of AAC-ELD
Catherine Colomes
14519 Pierrick Philippe
David Virette
Listening test results on instantaneous block switching
CE for AAC ELD
14520
Pierrick Philippe
David Virette
Updated description for AAC ELD instantaneous block
switching CE
Saar De Zutter
Frederik De Keukelaere
14521 Gerrard Drury
Christian Timmerer
Xin Wang
Editors' input to ISO/IEC FCD 21000-8 Reference
Software (2nd edition)
Juha Ojanperä
14522 Miikka Vilermo
On AAC LTP conformance
A-Young Cho
Ik-Hwan Cho
14523 Jun-Woo Lee
Weon-Geun Oh
Dong-Seok Jeong
New Visual Identifier for MPEG-7 VCE-6 Basic
Condition
14524 Saar De Zutter
Review of Core Experiment on query operation based
on text description
Ying Chen
14525 Ye-Kui Wang
Miska M. Hannuksela
Signaling of leading pictures in file format
14526
Ye-Kui Wang
Miska M. Hannuksela
On SVC file format
14527
Ye-Kui Wang
Miska M. Hannuksela
Signaling of temporal layer switching points in SVC
file format
14528
Ye-Kui Wang
Miska M. Hannuksela
Alternate group parameters in ISO file format Amd. 2
14529 David Singer
MP4 file format considerations for high sample-rate
audio
Henney Oh
Yang-Won Jung
14530 Hyo Jin Kim
Chang-Heon Lee
Hong-Goo Kang
Cross-check report on proposed FT Core Experiment
for AAC-ELD
14531 Arianne T. Hinds
Fixed-Point IDCT Conformance Tests
14532 Gerrard Drury
Contribution on URI assets and Requirements and
Structure of URNs
14533
the 3D Consortium via SC 29
Secretariat
Liaison Statement from the 3D Consortium [SC 29 N
8334]
14534
ISO TC 46/SC 9/WG 7 via SC 29
Secretariat
Liaison Statement from ISO TC 46/SC 9/WG 7 [SC 29
N 8335]
14535 JSR-287 EG via SC 29 Secretariat
Liaison Statement from JSR 287 Expert Group [SC 29
N 8336]
14536
Frans de Bont
Werner Oomen
Cor to 14496-3:2005 subpart 10, DST (lossless
oversampled audio)
Masanori Sano
14537 Hideki Sumiyoshi
Nobuyuki Yagi
Test report of CE on Query expression
Masanori Sano
14538 Hideki Sumiyoshi
Nobuyuki Yagi
Test report of CE on specification of the request of the
Output
Masanori Sano
14539 Hideki Sumiyoshi
Nobuyuki Yagi
Test report of CE on Query operation based on text
description
Seungkwon Beack
Jeongil Seo
14540
Taejin Lee
Kyungok Kang
Information on SAOC test items by ETRI
Annex D – Output documents
No.
Source
Title
8910 Convener
List of Documents from the San Jose, USA Meeting
8911 Convener
Resolutions of the San Jose, USA
8912 Convener
List of AHGs Established at the 80th Meeting in San Jose, USA
8913 Convener
Report of the 80th Meeting in San Jose, USA
8914 Convener
Guidelines for Electronic Distribution of MPEG and WG 11 Documents
8915 Convener
Press Release of the 80th Meeting in San Jose, USA
8916 Convener
Meeting Notice of the 81st Meeting in Lausanne, Switzerland
8917 HoD
Guide for WG 11 Meeting Hosts
8918 HoD
MPEG 101
8919 Liaison
Liaison statement to WG1
8920 Liaison
Liaison Statement to IETF
8921 Liaison
Liaison Statement to Khronos
8922 Liaison
Liaison Statement to ISO TC184 SC4
8923 Liaison
Liaison Statement to 3GPP
8924 Liaison
Liaison Statement to W3C
8925 Liaison
Liaison Statement to ITU-T FG/IPTV concerning M3W
8926 Liaison
Liaison Statement to ITU-T FG IPTV
8927 Liaison
Liaison Statement to SMPTE
8928 Liaison
Liaison Statement to DVD Forum
8929 Liaison
Liaison Statement to ETSI
8930 Liaison
Liaison Statement to SMPTE re File Format
8931 Liaison
Liaison Statement to DVB
8932 Liaison
Liaison Statement to JCP
8933 Liaison
Liaison Statement to CEA
8934 Liaison
Liaison Statement to ATIS
8935 Liaison
Liaison Statement to SMPTE re RVC
8936 Liaison
Liaison Statement to 3D consortium
8937 Liaison
Liaison Statement to FLOForum
8938 Liaison
Liaison Statement to TC46/SC9/WG7
8939 Liaison
Liaison Statement to AVS
8940 Liaison
Response to National Bodies
8941 Liaison
Liaison Statement to DVB
8942 Requirement
MAFs Overview
8943 Requirement
RVC Requirements
8944 Requirement
FTV Model and Requirements
8945 Requirement
Requirements on and Structure for Assignment of MPEG URNs
8946 Convenor
AHG on Review of MPEG-21 DID
8947 Convenor
AHG on FTV
8948 Video
Disposition of Comments on ISO/IEC 14496-2:2004/PDAM4
8949 Video
Text of ISO/IEC 14496-2:2004/FPDAM4 Simple Profile Level 6
8950 Video
Text of ISO/IEC 14496-4:2004/DCOR4
8951 Video
Text of ISO/IEC 14496-4:2004/Amd.1/DCOR2
8952 Video
Disposition of Comments on ISO/IEC 14496-4:2004/PDAM28
8953 Video
Text of ISO/IEC 14496-4:2004/FPDAM28 Visual Simple Profile Level 6
Conformance Testing
8954 Video
Request for ISO/IEC 14496-4:2004/Amd.30
8955 Video
Working Draft 1 of ISO/IEC 14496-4:2004/Amd.30 AVC Professional Profiles
Conformance Testing
8956 Video
Request for ISO/IEC 14496-4:2004/Amd.31
8957 Video
Working Draft 1 of ISO/IEC 14496-4:2004/Amd.31 SVC Conformance
Testing
8958 Video
Request for ISO/IEC 14496-5:2001/Amd.18
8959 Video
Working Draft 1 of ISO/IEC 14496-5:2001/Amd.18 Professional Profiles
Reference Software
8960 Video
Request for ISO/IEC 14496-5:2001/Amd.19
8961 Video
Working Draft 1 of ISO/IEC 14496-5:2001/Amd.19 SVC Reference Software
8962 Video
Study Text (version 3) of ISO/IEC 14496-10:2005/FPDAM3 Scalable Video
Coding
8963 Video
Joint Scalable Video Model (JSVM) 10
8964 Video
JSVM 10 Software
8965 Video
Draft SVC Verification Test Plan Version 3.0
8966 Video
Working Draft 3 of ISO/IEC 14496-10:2005/Amd.4 Multiview Video Coding
8967 Video
Joint Multiview Video Model (JMVM) 4
8968 Video
JMVM 4 Software
8969 Video
Text of ISO/IEC 15938-3:2002/Amd.2:2006/Cor.1 (Perceptual 3D Shape)
8970 Video
MPEG-7 Visual XM Document version 30.0
8971 Video
Description of Core Experiments for MPEG-7 New Visual Extensions
8972 Video
Disposition of Comments on ISO/IEC 15938-6:2003/ Amd.1:2006/DCOR 1
8973 Video
Text of ISO/IEC 15938-6:2003/Amd.1:2006/Cor.1 (Color Temperature)
8974 Video
Disposition of Comments on ISO/IEC 15938-6:2003/FPDAM2
8975 Video
Text of ISO/IEC 15938-6:2003/FDAM2 (Perceptual 3D Shape)
8976 Video
Disposition of Comments on ISO/IEC 15938-7:2003/FPDAM3
8977 Video
Text of ISO/IEC 15938-7:2003/FPDAM3 (Perceptual 3D Shape)
8978 Video
Study Text of ISO/IEC 23000-3/PDAM1 Reference Software for Photo Player
MAF
8979 Video
WD 4 of ISO/IEC 23001-4
8980 Video
Disposition of Comments on ISO/IEC 23002-1/PDAM1
8981 Video
Text of ISO/IEC 23002-1/FPDAM1 Software for Integer IDCT Accuracy
Testing
8982 Video
Disposition of Comments on ISO/IEC CD 23002-2
8983 Video
Text of ISO/IEC FCD 23002-2 Fixed-point Implementation of 8x8 IDCT and
DCT
8984 Video
WD 4 of ISO/IEC 23002-4
8985 Video
Description of Core Experiments in RVC
8986 Video
RVC Simulation Model (RSM) V4.0
8987 Video
RVC Work Plan
8988 Video
RVC Conformance Testing Working Draft 1.0
8989 Video
Description of Exploration Experiments for Toolbox Extensions
8990 Convenor
AHG on Maintenance of MPEG-4 Visual related Documents, Reference
Software and Conformance
8991 Convenor
AHG on Reconfigurable Video Coding
8992 Convenor
AHG on MPEG-7 Visual and Photo Player MAF
8993 Convenor
AHG on SVC Verification Test
8994 ISG
Status of HDL submissions and commitments for MPEG-4 Part-9
8995 ISG
Study of “ISO/IEC DTR 14496-9 3rd Edition Reference Hardware
Description”
8996 Convenor
AHG on MPEG-4 Part 9: Reference Hardware Description Phase 1 and 2.
8997 Convenor
AHG for Video Annotation
8998 Systems
Text of ISO/IEC 13818-1:2003/DCOR1.2 (AVC Referencing and PS
Signaling)
8999 Systems
DoC on ISO/IEC 14496-4/PDAM.23 Synthesised Texture Conformance
9000 Convenor
Terms of Reference
9001 Convenor
MPEG Standards
9002 Convenor
Table of unpublished FDIS
9003 Convenor
Work plan and time line
9004 Convenor
Work item assignment
9005 Convenor
MPEG Standard Editors
9006 Convenor
Software assets
9007 Convenor
Conformance assets
9008 Convenor
Content assets
9009 Convenor
URI assets
9010 Convenor
Standards under development for which a call for patent statements is issued
9011 Convenor
List of Organisations with which MPEG entertains liaisons
9012 Systems
Text of ISO/IEC 14496-4/FPDAM.23 Synthesised Texture Conformance
9013 Systems
DoC on ISO/IEC 14496-4/PDAM.24 File Format Conformance
9014 Systems
Text of ISO/IEC 14496-4/FPDAM.24 File Format Conformance
9015 Systems
DoC on ISO/IEC 14496-4/PDAM.25 LASeR V1 Conformance
9016 Systems
Text of ISO/IEC 14496-4/FPDAM.25 LASeR V1 Conformance
9017 Systems
Request for ISO/IEC 14496-4/Amd.26
9018 Systems
Text of ISO/IEC 14496-4/PDAM.26 Open Font Format Conformance
9019 Systems
DoC of ISO/IEC 14496-5/FPDAM12 File Format Reference Software
9020 Systems
Text of ISO/IEC 14496-5/FDAM12 File Format Reference Software
9021 Systems
Text of ISO/IEC 14496-11/COR.6 (AudioFXProto correction and Bitwrapper)
9022 Systems
DoC on ISO/IEC 14496-12/FPDAM2 (Flute Hint Track)
9023 Systems
Text of ISO/IEC 14496-12/FDAM2 (Flute Hint Track)
9024 Systems
Text of ISO/IEC 14496-12/COR.3
9025 Systems
TuC for ISO/IEC 14496-12 & 15444-12
9026 Systems
Study Text of ISO/IEC 14496-15/PDAM2 (SVC File Format)
9027 Systems
ISO/IEC 14496-20/DCOR2
9028 Systems
DoC on ISO/IEC 14496-20/FPDAM1 (LASeR Extensions)
9029 Systems
Text of ISO/IEC 14496-20/FDAM1 (LASeR Extensions)
9030 Systems
Request for ISO/IEC 14496-20/Amd.2 (SVGT1.2 Support)
9031 Systems
Text of ISO/IEC 14496-20/FPDAM2 (SVGT1.2 Support)
9032 Systems
TuC for ISO/IEC 14496-20/Amd2
9033 Systems
WD3.0 of ISO/IEC 14496-20 2nd Edition (1st Ed. + Cor + Amd.1)
9034 Systems
IuC for LASeR
9035 Systems
Request of ISO/IEC 21000-9/Amd.1
9036 Systems
Text of ISO/IEC 21000-9/PDAM.1 Mime Type Registration
9037 Systems
DoC of ISO/IEC 23000-4/FCD (Musical Slide Show MAF)
9038 Systems
Text of ISO/IEC 23000-4/FDIS (Musical Slide Show MAF)
9039 Systems
Workplan for Musical Slide Show MAF Conformance and Ref. Software
9040 Systems
WD1.0 of ISO/IEC 23000-4/Amd.2 Protected Musical Slide Show
9041 Systems
Text of ISO/IEC 23000-8/CD (Portable Video Player MAF)
9042 Systems
DoC on ISO/IEC 23000-9/CD (MAF for DMB)
9043 Systems
Text of ISO/IEC 23000-9/FCD (MAF for DMB)
9044 Systems
TuC on MAF for DMB
9045 Systems
Request for ISO/IEC 23000-10
9046 Systems
WD1.0 on ISO/IEC 23000-10 (Video Surveillance MAF)
9047 Systems
Study Text of ISO/IEC 23001-1/FPDAM2 (Prefixes and of wild cards
extensions)
9048 Systems
DoC on ISO/IEC 23001/DCOR2
9049 Systems
Text of ISO/IEC 23001/COR2
9050 Systems
DoC on ISO/IEC 23001-2/FCD (Fragment Request Unit)
9051 Systems
Text of ISO/IEC 23001-2/FDIS (Fragment Request Unit)
9052 Systems
Text of ISO/IEC 23001-3/FCD (IPMP XML Messages)
9053 Systems
Text of ISO/IEC 23004-5/FDIS Component Download
9054 Systems
Text of ISO/IEC 23004-6/FDIS Fault Management
9055 Systems
Text of ISO/IEC 23004-7/FDIS System Integrity Management
9056 Systems
WD2.0 of ISO/IEC 23004-8 Reference Software and Conformance
9057 Systems
M3W Reference Software and Conformance Plan
9058 Systems
DoC on ISO/IEC 29116-1/CD Media Streaming MAF Protocol
9059 Systems
Text of ISO/IEC 29116-1/FCD Media Streaming MAF Protocol
9060 Systems
A project to exploit MPEG standards in tune with industry practices and needs
9061 Convenor
Ad Hoc Group on Scene Representation
9062 Convenor
Ad Hoc Group on MPEG File Formats
9063 Convenor
Ad Hoc Group on MAF Under Development in Systems
9064 Audio
DoC on ISO/IEC 11172-5:199x/DCOR 1
9065 Audio
ISO/IEC 11172-5:199x/Cor. 1
9066 Audio
DoC ISO/IEC 13818-7:2006/FPDAM 1
9067 Audio
ISO/IEC 13818-7:2006/FDAM 1, Transport of MPEG Surround data in AAC
9068 Audio
ISO/IEC 14496-3:2005/DCOR 6, DST and MP3on4
9069 Audio
ISO/IEC 14496-3:2005/DCOR 7, SLS
9070 Audio
DoC on ISO/IEC 14496-3/PDAM 8
9071 Audio
ISO/IEC 14496-3/FPDAM 8, MP4FF Box for Original Audio File Information
9072 Audio
DoC on ISO/IEC 14496-3:2005/PDAM 9 Request for Amendment.
9073 Audio
DoC on ISO/IEC 14496-3:2005/PDAM 9
9074 Audio
ISO/IEC 14496-3:2005/FPDAM 9, AAC-ELD
9075 Audio
WD on MPEG-4 Audio Fourth Edition
9076 Audio
DoC on ISO/IEC 14496-4:2004/FPDAM 14
9077 Audio
ISO/IEC 14496-4:2004/FDAM 14, BSAC Extensions Conformance
9078 Audio
DoC ISO/IEC 14496-4:2004/FPDAM 18
9079 Audio
ISO/IEC 14496-4:2004/FDAM 18, MPEG-1 and -2 on MPEG-4 Conformance
9080 Audio
DoC ISO/IEC 14496-4:2004/FPDAM 19
9081 Audio
ISO/IEC 14496-4:2004/FDAM 19, ALS Conformance
9082 Audio
Study on ISO/IEC 14496-4:2004/FPDAM 20, SLS Conformance
9083 Audio
Status of MPEG-4 Audio Conformance
9084 Audio
Status of MPEG-4 SLS Conformance
9085 Audio
ISO/IEC 14496-5:2001/AMD 10:2007/DCOR 1, BSAC and SLS
9086 Audio
Request for Amendment, MPEG-1/2 on MPEG-4 Ref. Software
9087 Audio
ISO/IEC 14496-5:2001/AMD XX, MPEG-1/2 on MPEG-4 Ref. Software
9088 Audio
DoC ISO/IEC FCD 14496-23
9089 Audio
ISO/IEC FDIS 14496-23:200x, Symbolic Music Representation
9090 Audio
DoC ISO/IEC 23003-1:2007/PDAM 1
9091 Audio
ISO/IEC 23003-1:2007/FPDAM 1, MPEG Surround Conformance
9092 Audio
DoC ISO/IEC 23003-1:2007/PDAM 2
9093 Audio
ISO/IEC 23003-1:2007/FPDAM 2, MPEG Surround Reference Software
9094 Audio
Defect Report of ISO/IEC 23003-1:2007
9095 Audio
Framework for Exploration of Speech and Audio Coding
9096 Audio
Workplan for Exploration of Speech and Audio Coding
9097 Convenor
AHG on Audio Standards Maintenance
9098 Convenor
AHG on SAOC CfP, AAC-ELD and Speech and Audio Exploration
9099 Audio
Final Spatial Audio Object Coding Evaluation Procedures and Criterion
9100 MDS
ISO/IEC FPDAM/1 15938-5 Improvements to Geographic Descriptor
9101 MDS
ISO/IEC FPDAM/1 15938-7 Improvements to Geographic Descriptor
Conformance
9102 MDS
Schema Files for MPEG-7
9103 MDS
ISO/IEC 15938-12 CD MPEG-7 Query Format
9104 MDS
Technologies Under Consideration for MPEG-7 Query Format
9105 MDS
DoC of ISO/IEC 21000-4 FPDAM/1 IPMP Components Base Profile
9106 MDS
Text of ISO/IEC 21000-4 FDAM/1 IPMP Components Base Profile
9107 MDS
DoC of ISO/IEC 21000-5 PDAM/3 ORC (Open Release Content) Profile
9108 MDS
ISO/IEC 21000-5 FPDAM/3 ORC (Open Release Content) Profile
9109 MDS
Interoperability between MPEG-21 REL DAC Profile and other Rights
Information Standards
9110 MDS
REL/RDD Reference Software Development Plan v.6
9111 MDS
Disposition of Comments on ISO/IEC 21000-7:2004/DCOR 1
9112 MDS
Text of ISO/IEC 21000-7:2004/COR 1 MPEG-21 Digital Item Adaptation
9113 MDS
Text of ISO/IEC 21000-7 FDIS Second edition
9114 MDS
Preliminary DoC of preliminary comments of ISO/IEC 21000-8 FCD
Reference Software
9115 MDS
Study text of ISO/IEC 21000-8 FCD Reference Software
9116 MDS
DoC of ISO/IEC 21000-14 Conformance
9117 MDS
Text of ISO/IEC FDIS 21000-14 Conformance
9118 MDS
ISO/IEC 21000-15:2006/DCOR1 MPEG-21 Event Reporting
9119 MDS
DoC of ISO/IEC 21000-18/PDAM 1
9120 MDS
ISO/IEC 21000-18/FPDAM/1 Simple fragmentation rule
9121 MDS
DoC of ISO/IEC 23000-2 FCD Music Player Application Format 2nd Edition
9122 MDS
Text of ISO/IEC 23000-2 FDIS Music Player Application Format 2nd Edition
9123 MDS
DoC on ISO/IEC CD 23000-5 Media Streaming Player
9124 MDS
ISO/IEC FCD 23000-5 Media Streaming Player
9125 MDS
DoC of ISO/IEC 23000-7 CD Open release MAF
9126 MDS
ISO/IEC 23000-7 FCD Open release MAF
9127 MDS
Text of ISO/IEC 23001-5 FDIS Bitstream Syntax Description Language
9128 Convenor
AHG on MPEG-7 Query Format
9129 MDS
DoC ISO/IEC PDAM/1 15938-5 Improvements to Geographic Descriptor
9130 MDS
DoC ISO/IEC PDAM/1 15938-7 Improvements to Geographic Descriptor
Conformance
9131 Requirements
MPEG Profiles and Levels URIs
9132 3DGC
Text of ISO/IEC 14496-4:2001/ FDAM16 (MPEG-J GFX Conformance)
9133 3DGC
Text of ISO/IEC 14496-4:2001/ FPDAM21 (Geometry and Shadow
Conformance)
9134 3DGC
Text of ISO/IEC 14496-5:2001/ FDAM11 (MPEG-J GFX RefSoft)
9135 3DGC
Text of ISO/IEC 14496-5:2001/ FPDAM13 (Geometry and Shadow RefSoft)
9136 3DGC
WD 2.0 of ISO/IEC 14496-16:2006/AMD2 (Frame-based Animated Mesh
Compression)
9137 3DGC
WD 1.0 of ISO/IEC 14496-16:2006/AMD3 (3D MultiResolution Profile)
9138 3DGC
3D Graphics Core Experiments Description
9139 3DGC
3D Graphics Compression FAQ 19.0
9140 3DGC
Text of ISO/IEC 14496-21:2006/COR1
9141 3DGC
Request for Subdivision of ISO/IEC 14496: Part 25 - 3D Graphics
Compression Model
9142 3DGC
WD 1.0 for ISO/IEC 14496-25
9143 Convenor
AHG on 3DG documents, experiments and software maintenance
9144 Systems
TuC for IPMP XML Messages
9145 Convenor
Project Editors for ISO/IEC Certificate of Appreciation
9146 3DGC
DoC on ISO/IEC 14496-4:2001/ FDAM16 (MPEG-J GFX Conformance)
9147 3DGC
DoC on ISO/IEC 14496-4:2001/ FPDAM21 (Geometry and Shadow
Conformance)
9148 3DGC
DoC on ISO/IEC 14496-5:2001/ FDAM11 (MPEG-J GFX RefSoft)
9149 3DGC
DoC on ISO/IEC 14496-5:2001/ FPDAM13 (Geometry and Shadow RefSoft)
9150 3DGC
Request for ISO/IEC 14496-16:2006/AMD3 (3D MultiResolution Profile)
9151 MDS
Request of subdivision for MPEG-7 Query Format
9152 Systems
Disposition of Comments NWIP, Information technology -- Supplemental
media technologies
9153 Systems
Elements for a solution for storage of MPEG-2 TS in the MPEG-4 File Format
Annex E – Requirements report
Source: Fernando Pereira (Instituto Superior Técnico, Lisboa-Portugal)
Note: The Requirements agenda for the San Jose MPEG meeting is annexed at the end of this report.
16 Requirements documents approved at this meeting
8942 MAFs Overview
8943 RVC Requirements
8944 FTV Model and Requirements
8945 Requirements on and Structure for Assigning MPEG URNs
9131 MPEG Profiles and Levels URIs
17 MPEG Structure
17.1 MPEG URNs and URIs (joint with MDS)
14500, Sylvain Devillers, Use of MPEG URN for identifying profiles and levels
MPEG video and audio coding formats are used by a large number of standards developed by other
bodies such as DVB and 3GPP. Such standards may normatively reference a video or audio coding
format as a whole, but in some cases they reference a specific profile and level of that format. This
contribution proposed that, to promote the adoption of WG11 standards, it is the responsibility and in
the interest of WG11 to define, publish and maintain a list of unique identifiers for profiles and levels
of MPEG coding formats. Following this contribution, it was decided to create a document (N9131) with
MPEG profiles and levels URIs. This document will include unique URIs for all MPEG profiles
and levels. All the MPEG subgroups are kindly asked to review this document, especially the
parts regarding their own profiles and levels.
14532, Gerrard Drury, Contribution on URI assets and Requirements and Structure of URNs
The use of Uniform Resource Identifiers (URIs) within MPEG standards has become more
prevalent, particularly with the increased use of XML in MPEG standards. Because there was no
global standard structure for the URNs being used in MPEG standards, a document was created at
the last meeting (N8785) covering the motivation, objectives and process for defining URNs,
requirements on URNs, the definition of the required URN structure, and URN examples. This
contribution proposed some corrections and improvements to that document, which were
approved. Following this approval, a revised version of the Requirements on and
Structure for Assigning MPEG URNs document (N8945) has been issued.
18 MPEG-4
18.1 Metadata in AVC (joint with Video & JVT)
Some contributions regarding metadata in AVC were submitted to JVT at this meeting. During a
joint meeting with Video and JVT, it was concluded that AVC metadata shall be based on MPEG-7
tools and thus the issue is to be addressed in MPEG; coding-related metadata may need special
consideration when it targets coding efficiency. The next steps for this activity may
include:
1. Identification of requirements at various levels
2. Understanding if new MPEG-7 tools are needed to address requirements
3. Understanding if new MPEG-7 profile is needed
18.2 3D Compression Profiling (joint with 3DGC)
14467, Patrick Gioia, Olivier Aubault, Proposal for 3D Compression Profile
This contribution proposes profiles in the 3D area to address Google Earth-like applications, in real
time, with adaptive navigation. Following discussions at the last meeting, it was agreed that the full
picture across the three graphics-related profiling dimensions has to be kept in mind so that the
profiling space is well covered. Following the discussions, profiles in the three graphics-related
profiling dimensions will be defined, notably:
• Basic AFX in Scene Graph
• Basic AFX in Graphics
• Multires in 3D Compression (with 8 object types and 2 levels)
18.3 LASeR (joint with Systems)
14373, Jean-Claude Dufourd, LASeR profiles adjustment
LASeR version 1 currently includes the Mini and Full profiles. Following this contribution and
discussions at the last meeting, it was decided:
1. To correct the MINI profile to make it useful and hierarchical to Core
2. To remove the FULL profile because it is useless and ill-defined
3. To define a CORE profile (hierarchical to MINI)
4. To start studying a possible MAIN profile (hierarchical to CORE)
19 MPEG-7
19.1 MP7QF
14420, Hee-Cheol Seo, Miran Choi, Hyunki Kim, Myung-Gil Jang, Soojong Lim, Jeong Heo,
Kyoungro Yoon, CE Report for Query Expression of MPEG-7 Query Format
14421, Hee-Cheol Seo, Miran Choi, Hyunki Kim, Myung-Gil Jang, Soojong Lim, Jeong Heo,
Kyoungro Yoon, Revision of Proposed Input Query Format for MPEG-7 Query Format
14449, Doeller, Gruhne, Wolf, MP7QF CE Test Report
These contributions have been addressed by the MDS subgroup since they include technical inputs
related to an activity managed by MDS.
20 MPEG-21
20.1 Digital Item Declaration
14475, Giovanni Cordara (on behalf of the ITNB), Italian NB proposal to revisit MPEG-21 DID
This contribution states that “Italy believes that it would be beneficial to revisit the MPEG-21 DID
requirements on the basis of the experience gathered with ISO/IEC 21000-1 and propose a New
Project that aims at a new standard with the functionalities derived from the revisiting of the
MPEG-21 DID requirements and with the constraint that no IP contained in patents whose rights
are currently valid be required to implement the new standard or, if such IPR exists, it is licensed by
its holder royalty free.”
Following this contribution, a BoG was established to:
1. Identify possible DID deficiencies and possible solutions
2. Revisit DID requirements
3. Assess the feasibility of reaching the target proposed by ITNB
To continue the work from the BoG, an AHG has been established (N8946) with the following
mandates:
1. Investigate whether the current DID (ISO/IEC 21000-2) requirements fit with today’s
industry, and if not review the requirements.
2. Collect information on how DID is currently used.
3. Identify current deficiencies with DID and propose ways to address these deficiencies.
4. Investigate feasibility of producing royalty-free DID.
21 MPEG-A
21.1 Professional Archival MAF
14430, Tilman Liebchen, Comments on Professional Archival MAF Requirements
14411, Noboru Harada, Takehiro Moriya and Yutaka Kamamoto, Proposed text to WD of
Professional Archival MAF
Although this MAF is already under development by MDS, it was discussed in a joint meeting with
MDS to review the requirements and check the industry support. It was confirmed that there is
currently no significant industry support for the current set of requirements. The experts involved in
this MAF committed to bringing further requirements contributions and evidence of more industry
support to the next meeting.
21.2 Surveillance MAF
14352, James Orwell, Contribution to the Basic Video Surveillance MAF
This contribution was not presented because the author was not available.
14486, Houari Sabirin, Jeongyeon Lim, Munchurl Kim, A Proposal for Basic Video Surveillance
Application Format
Following these contributions and discussions in a BoG, it was decided to promote to ‘under
development’ a rather simple MAF to package surveillance video content, mainly including the
following tools: AVC file format, AVC video (Baseline profile) and some MPEG-7 metadata. It is
recognized that this simple MAF may be important for penetrating a rather new application domain
for MPEG: surveillance. Since there is support for creating a more complete MAF for
surveillance applications in the future, e.g. including audio, a surveillance-related MAF remains under
consideration, now renamed ‘Advanced surveillance’.
21.3 Protected Musical Slide Show MAF
14477, Hendry, Houari Sabirin, Munchurl Kim, Updated Proposal for Protected Musical Slide
Show MAF with IPMP
Following evidence of need and industry support, this MAF was promoted to ‘under development’.
This MAF adds protection capabilities to the Musical Slide Show already under development by the
Systems subgroup. It was agreed that the technical solution for the additional protection capabilities
will be similar to the solution used for the Music Player MAF.
21.4 Protected Photo Player MAF
14478, Hendry, Houari Sabirin, Munchurl Kim, Updated Proposal for Protected Photo Player
MAF with IPMP
Following the discussion, this MAF stays ‘under consideration’ since it still needs clear industry
support and must address the technical issues raised by the MPEG-7 Visual BoG.
21.5 Stereoscopic MAF
14423, Kwangcheol Choi, Sung-Moon Chun, Jaedo Kwak, Seungheon Yang, Ji-Sang Yoo, Si-Hun
Sung, Seong-Cheol, Han, Requirements for Stereoscopic MAF
14424, Jaedo Kwak, Si-Hun Sung, Sung-Moon Chun, JinWoong Kim, Namho Hur, Whitepaper of
Stereoscopic Project
Following evidence of market need and industry support, this MAF was promoted to ‘under
consideration’. Further contributions are expected at the next meeting (notably in terms of technical
solutions) so that further progress can be made.
21.6 Cross-Media Interactive Presentation MAF
14367, Paolo Nesi, Pierfrancesco Bellini, Davide Rogai, Kia Ng (University of Leeds), Proposal for
a MAF on Cross-Media Interactive Presentation: Application Scenarios
14368, Paolo Nesi, Pierfrancesco Bellini, Davide Rogai, Proposal for a MAF on Cross-Media
Interactive Presentation: Requirements
14369, Davide Rogai, Pierfrancesco Bellini, Paolo Nesi, Proposal for a MAF on Cross-Media
Interactive Presentation: Relationships with other MAFs
Although this new MAF proposal was discussed in three sessions, it was not possible to identify the
scope and main functional target of this MAF. There was also no clear industry support for this
MAF. Further progress on this MAF will require solving these two issues.
21.7 Summary on MAFs
The global MAF situation after the San Jose MPEG meeting is summarized in the MAFs Overview
document (N8942) as follows:
1. MAFs Finalized
a. Music Player MAF (including protection)
b. Photo Player MAF
2. MAFs Under Development
a. Photo Player MAF (under Video)
b. Musical Slide Show MAF, including protection (under Systems)
c. Media Streaming MAF (under MDS)
d. Professional Archival MAF (under MDS)
e. Open Release MAF (under MDS)
f. Portable Video Player MAF (under Systems)
g. MAF for Digital Multimedia Broadcasting (under Systems)
h. Video Surveillance MAF (under Systems)
3. MAFs Under Consideration
a. Advanced Surveillance MAF
b. Protected Photo Player MAF
c. Digital Video/Cinema MAF
d. Stereoscopic MAF
22 MPEG-B and MPEG-C
22.1 RVC (joint with Video/ISG)
14511, AVS Working Group, Liaison Statement to MPEG on RVC
Following this contribution from AVS, MPEG states that the RVC project is about developing:
• A full collection of MPEG individual coding tools organized in the MPEG video tool library
and
• A generic framework that can be used to make an implementation of any MPEG video coding
standard and additionally is capable of supporting the implementation of video coding
standards from other organizations with which a collaboration can be established.
As part of this project, an identification mechanism will be developed whereby MPEG video
coding tools will be identified by MPEG and video coding tools from other organizations can be
identified via a registration authority.
23 Explorations
23.1 Freeviewpoint Television (FTV)
14417, Masayuki Tanimoto, Toshiaki Fujii, Hideaki Kimata, Shigeyuki Sakazawa, Proposal on
Requirements for FTV
14533, Liaison from 3D Consortium
Based on these contributions, it was agreed that FTV is an important application domain which
MPEG has been trying to address for a long time. Following recent inputs, there is a need to
revisit the way MPEG may address this application domain using existing MPEG standards and,
very likely, adding new standards. In conclusion, FTV is currently an MPEG activity, which at
this stage aims to:
1. Identify an FTV architecture and model
2. Identify for which architectural modules normative technology should be specified, e.g.
FTV data format, decoding, rendering
3. Identify the requirements for each normative module from the visual, audio and systems
perspectives
After the issues above are clarified, the FTV roadmap will be defined, notably the relation with JVT
activities. A response to the 3D Consortium has been prepared describing the activities MPEG is
currently developing in this area. An AHG (N8947) has been created with the following mandates:
1. To refine the FTV architecture.
2. To refine the identification and definition of normative elements in the FTV architecture
3. To refine the FTV requirements
23.2 Full Colour Video Coding
14452, Tokumichi Murakami, Kohtaro Asai, Yoshihisa Yamada, Requirement of Full-Color Video
Coding for Consumer Applications
This contribution proposed requirements for a possible “full color” video coding standard adapted
for consumer applications. The discussion confirmed that these requirements are still at a draft
stage, and thus further contributions are welcome at the next meeting.
23.3 IPTV Requirements
23.4
This activity reviewed and answered the liaison contributions on IPTV
Requirements from ATIS/IIF IPTV, CEA and the ITU-T IPTV Focus Group. It was
agreed there is a need to continue identifying the relevant requirements for MPEG
from the inputs provided and checking the coverage of relevant requirements by
existing MAFs, notably the Media Streaming MAF.
23.5 Dual-Track Licensing Approach
14360, USNB Contribution: Issues relating to expiring patents
The USNB contribution states that “if it is technically possible to develop a standard which does
this (royalty free), the USNB prefers that it be done in WG 11 where there is expertise in doing it
well, and where such a putative standard could be made a 'family member' with other MPEG
standards (with an upgrade path, for example, or related technical ‘roots’ etc.)” and “the 'terms of
engagement' of a study on developing a process for royalty-free standards, and the results and
follow-on for such work, should be made more clear before more discussion is held at WG 11.”
After discussion and based on past experience on the dual-track approach, it was decided that no
further progress is possible in this activity until sufficient commitment is made available.
24 80th MPEG (San Jose) Agenda Requirements
25 Room: Oak
TIME
TOPIC
ROOM
Monday
Opening Plenary Meeting
9:00-end
DID
Reqs
11:00-12:00
14475, Giovanni Cordara (on behalf of the ITNB), Italian NB proposal to revisit MPEG-21 DID
Lunch
NEW MAF PROPOSALs
Stereoscopic MAF
14423, Kwangcheol Choi, Sung-Moon Chun, Jaedo Kwak, Seungheon Yang, Ji-Sang Yoo,
Si-Hun Sung, Seong-Cheol, Han, Requirements for Stereoscopic MAF
14424, Jaedo Kwak, Si-Hun Sung, Sung-Moon Chun, JinWoong Kim, Namho Hur,
Whitepaper of Stereoscopic Project
14:30-16:30
MAF on Cross-Media Interactive Presentation
Reqs
14367, Paolo Nesi, Pierfrancesco Bellini, Davide Rogai, Kia Ng (University of Leeds),
Proposal for a MAF on Cross-Media Interactive Presentation: Application Scenarios
14368, Paolo Nesi, Pierfrancesco Bellini, Davide Rogai, Proposal for a MAF on Cross-Media
Interactive Presentation: Requirements
14369, Davide Rogai, Pierfrancesco Bellini, Paolo Nesi, Proposal for a MAF on Cross-Media
Interactive Presentation: Relationships with other MAFs
16:30-18:00
BoGs
-
18:00-20:00
HoDs Meeting
HoD
Tuesday
Various (joint with MDS)
URNs
14500, Sylvain Devillers, Use of MPEG URN for identifying profiles and levels
14532, Gerrard Drury, Contribution on URI assets and Requirements and Structure of URNs
MP7QF
Reqs
9:00-11:00
14420, Hee-Cheol Seo, Miran Choi, Hyunki Kim, Myung-Gil Jang, Soojong Lim, Jeong Heo,
Kyoungro Yoon, CE Report for Query Expression of MPEG-7 Query Format
14421, Hee-Cheol Seo, Miran Choi, Hyunki Kim, Myung-Gil Jang, Soojong Lim, Jeong Heo,
Kyoungro Yoon, Revision of Proposed Input Query Format for MPEG-7 Query
Format
14449, Doeller, Gruhne, Wolf, MP7QF CE Test Report
RVC and AVS (joint with ISG & Video)
Reqs
12:00-13:00
14511, AVS Working Group, Liaison Statement to MPEG on RVC
Lunch
13:00-14:00
MPEG-A (joint with MDS, Systems, Audio and Video)
MAFs UNDER CONSIDERATION
Reqs
14:00-18:00
Surveillance MAF
14352, James Orwell, Contribution to the Basic Video Surveillance MAF
14486, Houari Sabirin, Jeongyeon Lim, Munchurl Kim, A Proposal for Basic Video
Surveillance Application Format
Protected Musical Slide Show MAF
14477, Hendry, Houari Sabirin, Munchurl Kim, Updated Proposal for Protected Musical Slide
Show MAF with IPMP
Protected Photo Player MAF
14478, Hendry, Houari Sabirin, Munchurl Kim, Updated Proposal for Protected Photo Player
MAF with IPMP
MAFs UNDER DEVELOPMENT
Professional Archival MAF
14430, Tilman Liebchen, Comments on Professional Archival MAF Requirements
14411, Noboru Harada, Takehiro Moriya and Yutaka Kamamoto, Proposed text to WD of
Professional Archival MAF
NEW MAF PROPOSALs
Stereoscopic MAF
14423, Kwangcheol Choi, Sung-Moon Chun, Jaedo Kwak, Seungheon Yang, Ji-Sang Yoo,
Si-Hun Sung, Seong-Cheol, Han, Requirements for Stereoscopic MAF
14424, Jaedo Kwak, Si-Hun Sung, Sung-Moon Chun, JinWoong Kim, Namho Hur,
Whitepaper of Stereoscopic Project
MAF on Cross-Media Interactive Presentation
14367, Paolo Nesi, Pierfrancesco Bellini, Davide Rogai, Kia Ng (University of Leeds),
Proposal for a MAF on Cross-Media Interactive Presentation: Application Scenarios
14368, Paolo Nesi, Pierfrancesco Bellini, Davide Rogai, Proposal for a MAF on Cross-Media
Interactive Presentation: Requirements
14369, Davide Rogai, Pierfrancesco Bellini, Paolo Nesi, Proposal for a MAF on Cross-Media
Interactive Presentation: Relationships with other MAFs
18:00-19:00
19:00-end
Liaison Meeting
Chairs Meeting
Wednesday
09:00-end
plenary
Plenary Meeting
Profiling (joint with 3DGC)
3DGC
12:00-12:30
14467, Patrick Gioia, Olivier Aubault, Proposal for 3D Compression Profile
Lunch
Various (joint with Video, JVT)
JVT
14:00-15:30
14417, Masayuki Tanimoto, Toshiaki Fujii, Hideaki Kimata, Shigeyuki Sakazawa, Proposal on
Requirements for FTV
14533, Liaison from 3D Consortium
14452, Tokumichi Murakami, Kohtaro Asai, Yoshihisa Yamada, Requirement of Full-Color
Video Coding for Consumer Applications
14360, USNB Contribution: Issues relating to expiring patents
15:30-16:00
Carriage of MPEG-7 metadata in AVC (joint with Video, JVT)
JVT
BoGs
Social Event
Thursday
LASeR (joint with Systems)
Reqs
9:00-9:30
14373, Jean-Claude Dufourd, LASeR profiles adjustment
Joint JPEG – MPEG on JPSearch
9:30-12:00
Reqs
Lunch
14:00-15:00
Feedback from IPTV Requirements
Reqs
15:00-16:00
Feedback from DID BoG (joint Reqs & MDS)
Reqs
MAFs BoG Feedback
16:00-17:00
Reqs
Surveillance MAF
Protected Photo Player MAF
Cross-media Interactive Presentation MAF
17:00-18:00
Reviewing FTV Requirements Doc
18:00-end
Chairs Meeting
Reqs
Friday
-
Concluding MPEG-4
Concluding MPEG-7
-
Reqs
MDS
Concluding MPEG-21
9:00-9:15
Response to Italian NB on new DID technologies – Giovanni
Reqs
AHG on New DID Technologies - Gerrard
Concluding MPEG-A
9:15-9:45
Reqs
MAFs Overview - Florian
RVC (MPEG-B & MPEG-C)
Reqs
9:45-10:00 RVC Requirements - Euee
Explorations
IPTV related Liaisons – Xin, Anthony
Response to US NB on royalty free standards
Revised Doc with URNs structure – Christian
Reqs
10:00-11:00
FTV Model and Requirements – Tanimoto-san
AHG on FTV
Response to Liaison from 3D Consortium
12:00-14:00
Lunch
14:00-end
plenary
Plenary Meeting
Annex F – Systems report
Source:
Systems Chair and Break-out group Chairs
Contributors: David Singer (Apple), Young-Kwon Lim (Net&TV), Jean Gelissen (Philips)
1
Overview
The main outputs of the meeting from the Systems Sub-group perspective are:
No.  Title  TBP
13818-1 Systems
8998  Text of ISO/IEC 13818-1:2003/DCOR1.2 (AVC Referencing and PS Signaling)  No
14496-4 Conformance testing
8999  DoC on ISO/IEC 14496-4/PDAM 23 Synthesised Texture Conformance  No
9012  Text of ISO/IEC 14496-4/FPDAM 23 Synthesised Texture Conformance  No
9013  DoC on ISO/IEC 14496-4/PDAM 24 File Format Conformance  No
9014  Text of ISO/IEC 14496-4/FPDAM 24 File Format Conformance  No
9015  DoC on ISO/IEC 14496-4/PDAM 25 LASeR V1 Conformance  No
9016  Text of ISO/IEC 14496-4/FPDAM 25 LASeR V1 Conformance  No
9017  Request for ISO/IEC 14496-4/Amd.26  No
9018  Text of ISO/IEC 14496-4/PDAM 26 Open Font Format Conformance  No
14496-5 Reference Software
9019  DoC of ISO/IEC 14496-5/FPDAM 12 File Format Reference Software  No
9020  Text of ISO/IEC 14496-5/FDAM 12 File Format Reference Software  No
14496-11 Scene Description and Application Engine
9021  Text of ISO/IEC 14496-11/COR.6 (AudioFXProto correction and Bitwrapper)  No
14496-12 ISO Base Media File Format
9022  DoC on ISO/IEC 14496-12/FPDAM 2 (Flute Hint Track)  No
9023  Text of ISO/IEC 14496-12/FDAM 2 (Flute Hint Track)  No
9024  Text of ISO/IEC 14496-12/COR.3  No
9025  TuC for ISO/IEC 14496-12 & 15444-12  No
14496-15 AVC File Format
9026  Study Text of ISO/IEC 14496-15/PDAM 2 (SVC File Format)  No
14496-20 Lightweight Application Scene Representation
9027  ISO/IEC 14496-20/DCOR 2  No
9028  DoC on ISO/IEC 14496-20/FPDAM 1 (LASeR Extensions)  No
9029  Text of ISO/IEC 14496-20/FDAM 1 (LASeR Extensions)  No
9030  Request for ISO/IEC 14496-20/Amd.2 (SVGT1.2 Support)  No
9031  Text of ISO/IEC 14496-20/FPDAM 2 (SVGT1.2 Support)  Yes
9032  TuC for ISO/IEC 14496-20/Amd.2  No
9033  WD3.0 of ISO/IEC 14496-20 2nd Edition (1st Ed. + Cor + Amd.1)  Yes
9034  IuC for LASeR  No
21000-9 File Format
9035  Request of ISO/IEC 21000-9/Amd.1  No
9036  Text of ISO/IEC 21000-9/PDAM 1 Mime Type Registration  No
23000-4 Musical Slide Show MAF
9037  DoC of ISO/IEC 23000-4/FCD (Musical Slide Show MAF)  No
9038  Text of ISO/IEC 23000-4/FDIS (Musical Slide Show MAF)  No
9039  Workplan for Musical Slide Show MAF Conformance and Ref. Software  No
9040  WD1.0 of ISO/IEC 23000-4/Amd.2 Protected Musical Slide Show  No
23000-8 Portable Video Player
9041  Text of ISO/IEC 23000-8/CD (Portable Video Player MAF)  No
23000-9 Digital Multimedia Broadcasting Application Format
9042  DoC on ISO/IEC 23000-9/CD (MAF for DMB)  No
9043  Text of ISO/IEC 23000-9/FCD (MAF for DMB)  No
9044  TuC on MAF for DMB  No
23000-10 Video Surveillance MAF
9045  Request for ISO/IEC 23000-10  No
9046  WD1.0 on ISO/IEC 23000-10 (Video Surveillance MAF)  No
23001-1 Binary MPEG Format for XML
9047  Study Text of ISO/IEC 23001-1/FPDAM2 (Prefixes and of wild cards extensions)  No
9048  DoC on ISO/IEC 23001/DCOR2  No
9049  Text of ISO/IEC 23001/COR2  No
23001-2 Fragment Request Unit
9050  DoC on ISO/IEC 23001-2/FCD (Fragment Request Unit)  No
9051  Text of ISO/IEC 23001-2/FDIS (Fragment Request Unit)  No
23001-3 IPMP XML Messages
9052  Text of ISO/IEC 23001-3/FCD (IPMP XML Messages)  No
9144  TuC for IPMP XML Messages  No
23004-5 Component Download
9053  Text of ISO/IEC 23004-5/FDIS Component Download  No
23004-6 Fault Management
9054  Text of ISO/IEC 23004-6/FDIS Fault Management  No
23004-7 Systems Integrity Management
9055  Text of ISO/IEC 23004-7/FDIS System Integrity Management  No
23004-8 Reference Software
9056  WD2.0 of ISO/IEC 23004-8 Reference Software and Conformance  No
9057  M3W Reference Software and Conformance Plan  No
29116-1 Media Streaming MAF Protocol
9058  DoC on ISO/IEC 29116-1/CD Media Streaming MAF Protocol  No
9059  Text of ISO/IEC 29116-1/FCD Media Streaming MAF Protocol  No
Exploration
9060  A project to exploit MPEG standards in tune with industry practices and needs  No
2
General issues
2.1
General
The meeting report from Hangzhou has been approved.
The following demonstrations have been made:
- None.
2.2
List of standards under development
Pr
2
Pt
1
Edit. Project
2000 Cor.1
4
4
2004 Amd.22
4
4
2004 Amd.23
4
4
4
4
4
4
2004 Amd.24
2007 Amd.25
2007 Amd.26
4
4
2007 Amd.27
4
4
5
5
2007 Amd.14
2007 Amd.16
4
4
4
4
21
A
A
5
15
20
20
9
4
4
2007
2005
2004
2004
200x
200x
200x
A
A
8
9
200x 1st Ed.
200x 1st Ed.
A
B
10
1
200x 1st Ed.
200x Amd.2
B
E
X
3
8
1
200x 1st Ed.
200x 1st Ed.
200x
Amd.17
Amd.2
Cor.2
Amd.2
Amd.1
Amd.1
Amd.2
Description
CfP
Reference to AVC
Specification
Audio BIFS v3
conformance
Synthesized Texture
conformance
File Format Conformance
LASeR V1 Conformance
Open Font Format
Conformance
LASeR Amd.1
Conformance
Open Font Format Ref. Soft
Symbolic Music Rep. Ref.
Soft
LASeR Ref. Soft
SVC File Format Extensions
Profile Removal
SVGT1.2 Support
MP21 Mime Type
MSS MAF Conf. and Soft
Protected Musical Slide
Show
Portable Video Player MAF
Digital Multi. Broadcasting
MAF
Video Surveillance MAF
Exten. On encoding of wild
cards
IPMP XML Messages
Ref. Soft. and Conformance
Media Streaming MAF
Protocols
WD
CD
FCD
07/04
FDIS
07/10
06/04 06/07 07/01 07/07
06/07 07/01 07/04 07/10
06/04 06/10 07/04 07/10
06/04 06/10 07/04 07/10
07/04 07/10 08/01 08/07
06/10 07/07 07/10 08/04
07/07 07/10 08/01 08/04
06/10 07/01 07/07 08/01
06/10 07/01 07/07
05/10 06/07 07/07
07/04
05/10 07/04
07/04 07/07
07/07 07/10 08/01
07/04 07/07 07/10
08/01
08/01
07/10
07/10
07/10
08/04
08/04
06/10 07/04 07/10 08/01
06/10 07/01 07/04 07/10
07/04 07/07 07/10 08/04
06/04 06/07 07/01 07/07
06/10 07/04 07/10
07/01 07/07 07/10 08/01
06/10 07/04 07/10
2.3
Standing Documents
Pr  Pt  Documents  No.  Meeting
1  1  MPEG-1 White Paper – Multiplex Format  N7675  05/07 Nice
1  1  MPEG-1 White Paper – Terminal Architecture  N7676  05/07 Nice
1  1  MPEG-1 White Paper – Multiplexing and Synchronization  N7677  05/07 Nice
2  1  MPEG-2 White Paper – Multiplex Format  N7678  05/07 Nice
2  1  MPEG-2 White Paper – Terminal Architecture  N7679  05/07 Nice
2  1  MPEG-2 White Paper – Multiplexing and Synchronization  N7680  05/07 Nice
2  11  MPEG-2 White Paper – MPEG-2 IPMP  N7503  05/07 Poznan
4  1  MPEG-4 White Paper – MPEG-4 Systems  N7504  05/07 Poznan
4  1  MPEG-4 White Paper – Terminal Architecture  N7610  05/10 Nice
4  1  MPEG-4 White Paper – M4MuX  N7921  06/01 Bangkok
4  1  MPEG-4 White Paper – OCI  N8148  06/04 Montreux
4  6  MPEG-4 White Paper – DMIF  N8149  06/04 Montreux
4  11  MPEG-4 White Paper – BIFS  N7608  05/10 Nice
4  12  MPEG-4 White Paper – ISO File Format  N8150  06/04 Montreux
4  14  MPEG-4 White Paper – MP4 File Format  N7923  06/01 Bangkok
4  15  MPEG-4 White Paper – AVC FF  N7924  06/01 Bangkok
4  13  White Paper on MPEG-4 IPMP  N7505  05/07 Poznan
4  13  MPEG IPMP Extensions Overview  N6338  04/03 München
4  17  White Paper on Streaming Text  N7515  05/07 Poznan
4  18  White Paper on Font Compression and Streaming  N7508  05/07 Poznan
4  20  Presentation Material on LASER  N6969  05/01 HongKong
4  20  White Paper on LASeR  N7507  05/07 Poznan
4  22  White Paper on Open Font Format  N7519  05/07 Poznan
7  1  MPEG-7 White Paper - MPEG-7 Systems  N7509  05/07 Poznan
7  1  MPEG-7 White Paper – Terminal Architecture  N8151  06/04 Montreux
21  9  MPEG-21 White Paper – MPEG-21 File Format  N7925  06/01 Bangkok
B  X  MPEG-B White Paper – BinXML  N7922  06/01 Bangkok
E  X  MPEG Multimedia Middleware Context and Objectives  N6335  04/03 München
E  X  1st M3W White Paper  N7510  05/07 Poznan
E  X  2nd M3W White Paper : Architecture  N8152  06/04 Montreux
E  X  Tutorial on M3W  N8153  06/04 Montreux
E  X  M3W White Paper : Multimedia Middleware Architecture  N8687  06/10 Hangzhou
E  X  M3W White Paper : Multimedia API  N8688  06/10 Hangzhou
E  X  M3W White Paper : Component Model  N8689  06/10 Hangzhou
E  X  M3W White Paper : Resource and Quality Management  N8690  06/10 Hangzhou
E  X  M3W White Paper : Component Download  N8691  06/10 Hangzhou
E  X  M3W White Paper : Fault Management  N8692  06/10 Hangzhou
E  X  M3W White Paper : System Integrity Management  N8693  06/10 Hangzhou
2.4
Mailing Lists Reminder
Topic: General Systems List (kindly managed by University of Klagenfurt)
List Reflector: gen-sys@lists.uni-klu.ac.at
List-Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/gen-sys
mailto:gen-sys-request@lists.uni-klu.ac.at?subject=subscribe
List-Archive: http://lists.uni-klu.ac.at/pipermail/gen-sys
List-Help: mailto:gen-sys-request@lists.uni-klu.ac.at?subject=help
Topic: BiM (kindly managed by University of Klagenfurt)
List Reflector: mpeg7-sys@lists.uni-klu.ac.at
List-Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/mpeg7-sys
mailto:mpeg7-sys-request@lists.uni-klu.ac.at?subject=subscribe
List-Archive: http://lists.uni-klu.ac.at/pipermail/mpeg7-sys
List-Help: mailto:mpeg7-sys-request@lists.uni-klu.ac.at?subject=help
Topic: File Format (kindly managed by University of Klagenfurt)
List Reflector: mp4-sys@lists.uni-klu.ac.at
List-Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/mp4-sys
mailto:mp4-sys-request@lists.uni-klu.ac.at?subject=subscribe
List-Archive: http://lists.uni-klu.ac.at/pipermail/mp4-sys
List-Help: mailto:mp4-sys-request@lists.uni-klu.ac.at?subject=help
Topic: LASeR (kindly managed by University of Klagenfurt)
List Reflector: mpeg-laser@lists.uni-klu.ac.at
List-Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/mpeg-laser
mailto:mpeg-laser-request@lists.uni-klu.ac.at?subject=subscribe
List-Archive: http://lists.uni-klu.ac.at/pipermail/mpeg-laser
List-Help: mailto:mpeg-laser-request@lists.uni-klu.ac.at?subject=help
Topic: MAF (kindly managed by University of Klagenfurt)
List Reflector: maf-sys@lists.uni-klu.ac.at
List-Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/maf-sys
mailto:maf-sys-request@lists.uni-klu.ac.at?subject=subscribe
List-Archive: http://lists.uni-klu.ac.at/pipermail/maf-sys
List-Help: mailto:maf-sys-request@lists.uni-klu.ac.at?subject=help
2.5
FAQ
The FAQ were updated as needed.
2.6
AOB
None.
3
MPEG-2 Systems (13818-1)
3.1
13818-1:2005 Amd.3 Carriage of SVC
3.1.1
Topics
1.
Transport of Scalable Video Coding
3.1.2
Contributions
M14305, M14349, M14329, M14382, M14512: Various input contributions related to the current
DCOR, all of them requesting a solution that would not break backward compatibility. All
of them were carefully reviewed during the meeting, and the proponents worked hard
together to propose new text for this DCOR. This was successfully achieved: a new DCOR has
been issued and submitted to ballot, replacing the previous DCOR, which will be abandoned.
Technical Work in Progress.
4
MPEG-4 Conformance (14496-4)
4.1
14496-4 Amd.22
4.1.1
Topics
1.
Audio BIFS Conformance
4.1.2
Contributions
None.
Technical Work in Progress.
4.2
14496-4 Amd.23
4.2.1
Topics
1.
Synthesized Texture Conformance
4.2.2
Contributions
M14385: Summary of Voting on ISO/IEC 14496-4:2004/PDAM 23. No comment. Text of FPDAM
produced.
Technical Work in Progress.
4.3
14496-4 Amd.24
4.3.1
Topics
1.
File Format Conformance
4.3.2
Contributions
M14487: Contribution to Conformance for ISO/IEC 14496-12 AMD.1. Accepted and integrated in
the text of the FPDAM.
M14289: Summary of Voting on ISO/IEC 14496-4:2004/PDAM 24 [SC 29 N 8182]. All comments
have been disposed of. See DoC.
-- only FR boiler-plate comment
-- see 8648 (Hangzhou)
Updated with one new file (timed meta-data), from Michael.
Technical Work in Progress.
4.4
14496-4 Amd.25 LASeR V1 Conformance
4.4.1
Topics
1.
LASeR Conformance
4.4.2
Contributions
M14290: Summary of Voting on ISO/IEC 14496-4:2004/PDAM 25 [SC 29 N 8184]. All comments
have been disposed of. See DoC.
Technical Work in Progress.
4.5
14496-4 Amd.26 Open Font Format Conformance
4.5.1
Topics
2.
Open Font Format Conformance
4.5.2
Contributions
M14402: Proposed conformance test methodology and bitstreams for ISO/IEC 14496-22. Taken as
the basis for the production of the PDAM.
Technical Work in Progress.
4.6
14496-4 Amd.27 LASeR V2 Conformance
4.6.1
Topics
1.
LASeR V2 Conformance
4.6.2
Contributions
None.
Technical Work in Progress.
5
MPEG-4 Reference Software (14496-5)
5.1
14496-5 Amd.12
5.1.1
Topics
1.
ISO File Format Reference Software
5.1.2
Contributions
M14324 : Summary of Voting on ISO/IEC 14496-5:2001/FPDAM 12 [SC 29 N 8273]. All
comments have been disposed of. See DoC.
-- only FR boiler-plate comment
-- see 8653 (Hangzhou)
Updated with bug fixes, more support for 3G and better sample entry support, from Dave.
Technical Work Finalized.
5.2
14496-5 Amd.14
5.2.1
Topics
1.
Open Font Format Reference Software
5.2.2
Contributions
None.
Technical Work in Progress.
5.3
14496-5 Amd.16
5.3.1
Topics
1.
Symbolic Music Representation Reference Software
5.3.2
Contributions
None.
Technical Work in Progress.
5.4
14496-5 Amd.17
5.4.1
Topics
1.
LASeR Reference Software
5.4.2
Contributions
None.
Technical Work in Progress.
6
Scene Representation (14496-11)
6.1
14496-11:2005 Cor.6
6.1.1
Topics
1.
AudioFX Proto
6.1.2
Contributions
M14387: Summary of Voting on ISO/IEC 14496-11:2005/DCOR 6. No comment. COR produced.
Technical Work Finalized.
7
ISO File Format (14496-12)
7.1
14496-12/Amd.2
7.1.1
Topics
1.
Flute Hint Track
7.1.2
Contributions
14122 ISO Base Media File Format Branding
14336 Summary of Voting on ISO/IEC 14496-12:2005/FPDAM 2 & ISO/IEC 15444-12:2005/FPDAM 2.
US and SE comments only. See the disposition of comments report. We have a potential issue wrt
referring to I-Ds.
14404 Comments and suggestions regarding ISO/IEC 14496-12 Amd.2. Thank you for the careful
read and the editorial improvements.
7.2
Miscellaneous
14529: MP4 file format considerations for high sample-rate audio. We will see what the
conformance files do, but perhaps the Corr. we issued in Marrakech is enough.
14525: Signaling of leading pictures in file format. This is neat. We like it. But we think we can fit
it into the sample dependency table, using the two reserved bits (which are also available in the movie
fragments). They would signal an “is, is-not, unknown, reserved” leading picture, where a leading picture
is defined with respect to the previous sample marked as an I picture. At this meeting we propose a new
output document, “technologies under consideration” for Part 12, in which we hope to collect other
amendment-ready material and then issue it sometime soon.
Technical Work Finalized.
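The two-bit leading-picture signal discussed under 14525 can be pictured with a small sketch. It is an illustration only: the numeric code given to each of the four states, the position of the two bits within the byte, and all names below are assumptions and are not taken from this report.

```python
from enum import IntEnum


class LeadingState(IntEnum):
    # Hypothetical code assignment for the four states named in the report.
    UNKNOWN = 0
    IS_LEADING = 1
    IS_NOT_LEADING = 2
    RESERVED = 3


LEADING_SHIFT = 6                      # assumed: the two bits are the top two of the byte
LEADING_MASK = 0b11 << LEADING_SHIFT   # 0xC0


def set_leading(entry: int, state: LeadingState) -> int:
    """Return the one-byte entry with its two leading-picture bits replaced."""
    if not 0 <= entry <= 0xFF:
        raise ValueError("entry must fit in one byte")
    # Clear the two bits, keep the other six, then write the new state.
    return (entry & ~LEADING_MASK & 0xFF) | (int(state) << LEADING_SHIFT)


def get_leading(entry: int) -> LeadingState:
    """Read the two leading-picture bits back out of a one-byte entry."""
    return LeadingState((entry & LEADING_MASK) >> LEADING_SHIFT)
```

Because only two otherwise unused bits are touched, the remaining six bits of each entry keep whatever dependency information they already carried.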
7.3
14496-12/Cor.3
7.3.1
Topics
1.
Misc. Correction on File Format
7.3.2
Contributions
14264 AAC SBR timescales and sample rates
M14388: Summary of Voting on ISO/IEC 14496-12:2005/DCOR 3 & ISO/IEC 15444-12:2005/DCOR 3. No comment. COR produced.
-- 19 approve, no disapprove.
-- dealt with audio fields in MP4 files (8873)
Technical Work Finalized.
8
MPEG-4 AVC File Format (14496-15)
8.1
14496-15:2004/Amd.2
8.1.1
Topics
1.
SVC File Format Extensions
14405: Comments on the SVC File Format. Thank you. Text adjusted.
14494: Extraction path description. This seems interesting, but also quite complex. It is interesting
to try to describe extraction paths and their consequences, but we’re not sure of the description.
14495: Terms and definitions for the SVC file format . Excellent, thank you.
14496: On the SVC File format. Yes, extractors need to be temporally mis-aligned, and we agree to
have a sample offset (+/- sample count), and be careful about defining temporally aligned. Yes, we
need to adjust for prefix/suffix, and for FGS, tl0 etc. Thank you; the toolsets have been adjusted as well. On ROI,
we understand the desire, but it does seem a little integration and description work may be needed.
E.g., how do I know what ‘object’ each ROI is tracking?
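The agreed sample offset (+/- sample count) for extractors can be illustrated with a minimal sketch. The function name, the zero-based indexing and the out-of-range behaviour are assumptions made for illustration; the report does not define them.

```python
def resolve_extractor_sample(aligned_index: int, sample_offset: int, sample_count: int) -> int:
    """Return the index of the sample an extractor points at in the referenced track.

    aligned_index is the (zero-based, hypothetical) index of the temporally
    aligned sample; sample_offset is a signed count of samples relative to it
    (0 means aligned, negative means earlier, positive means later).
    """
    target = aligned_index + sample_offset
    if not 0 <= target < sample_count:
        raise IndexError("extractor refers to a sample outside the track")
    return target
```

An offset of zero reproduces the temporally aligned case, so only mis-aligned extractors need to carry a non-zero value.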
14526: On SVC file format. HRD done, thank you! The quality information goes next to the
scalabilityinfoSEIbox (or maybe a Tier); we choose the first for now. We don’t link it to meta-data
or anything.
We don’t think we need a slice header meta-data statement (yet). We would welcome re-timing
information possibly using sample groups, and/or time-parallel meta-data, or new boxes in the
sample table.
14527: Signaling of temporal layer switching points in SVC file format. Accepted, thank you.
14550: Addendum to ISO/IEC 14496-15 AMD2: File Format Support for Scalable Video Coding.
Yes, version 1 of sample groups is required.
JVT Joint Meeting. The SVC specification will formally be a study text, produced over a long
editing period following this meeting. That study text should be available 2-3 weeks before the
next meeting. It is intended that a minor variant of it will be approved as the final text at the
next meeting. Given that the high-level syntax is still not firm, it would be imprudent for the
file format text to go to ballot at this meeting. In particular, the exact
NAL types, and the use of prefixes, are still under discussion.
8.2
JPSEC/FFSEC Joint meeting
We had a good exchange of designs and the motivation for those designs. We’re going to
encourage FFSEC people to join the MP4 reflector, and in email correspondence between now and
Lausanne work on the aspects of the FFSEC design that could be more general. At some point
these pieces could (should) be moved into Part 12 and Corr’d out of FFSEC, but they can start in
FFSEC.
We also shared some information on IPMP.
We think it is too late to make significant changes to the FLUTE amendment.
Areas that look fruitful include:
a) general design for layered protection (asked for at this meeting by an MPEG-21 person)
b) some kind of item reference box, like a track reference box (typed references), that would allow
for ‘annotation’ or linking of items
c) a better design than the ‘xml box’ for putting item data inside the meta-box
d) maybe some kind of sub-item information/structure box
e) some kind of support for general scalability, not codec specific (SVC extractors are specific to
SVC); perhaps also a ‘scalable RTP hint track’
There may be other areas.
8.3
MDS Joint meeting
We considered the input contribution M14365, and a number of ideas were raised. First, it is
possible to use the item protection provisions at file format level as well as at the DIDL level.
Second, it is possible to embed a digital item as a file item resource to another digital item, and
protect it whole. Third, the layered protection design being done with FFSEC (above) might help in
future. There did seem to be a tension between including something ‘as an item’, and wanting to
protect it ‘as an item’, yet still wanting to see its structure.
Technical Work in Progress.
9
LASeR (14496-20)
9.1
14496-20/Amd. 1
9.1.1
Topics
1.
Lightweight Application Scene Representation (LASeR Extensions)
9.1.2
Contributions
M14373: LASeR profiles adjustments. Accepted after discussion and integrated into the FDAM.
M14551: Proposal for a new LASeR Profile. Integrated, after discussion, into the profiles under
consideration.
Technical Work Finalized.
9.2
14496-20/Amd. 2
9.2.1
Topics
1.
Lightweight Application Scene Representation (SVGT1.2 Support)
9.2.2
Contributions
M14372 This contribution proposes splitting the current AMD into two pieces, since SVGT1.2, the
technology on which the LASeR scene description is based, does not seem likely to be finished by the
July meeting. Therefore, the elements not related to SVGT1.2 are promoted to FPDAM at this meeting,
and a new AMD will be started at this meeting to hold the remainder.
M14370 This contribution proposes changes to AMD1 for harmonization between LASeR and
3GPP DIMS.
- Additional width and height fields for rectClip, containing the same values as those the
size field represents (if the two values do not match, the last values will be used).
- Renaming of updateSource to updates and addition of syncReference.
- Reduction of the number of rotation cases by two, because the orientation of the screen is always
the top left corner of the screen resulting from the rotation (only portrait or landscape matters);
no semantic changes, only names.
The proposed modifications will be implemented in AMD1.
M14378 This contribution lists new technologies coming from 3GPP.
- Immediate Script Execution, for a script that is executed immediately without inserting a script
node and is removed after execution.
- A new command, “seek”, for seeking across the boundary of a presentation regardless of
the scene time being reset at the execution of a NewScene command.
The proposed technologies will be included in AMD2.
M14418 This contribution analyses the relationship between MPEG-21 and LASeR. It finds that
converting a DID into LASeR for presentation cannot be done easily. It is therefore proposed to use LASeR
as a presentation description for DIs, and this was agreed with the MDS subgroup during the joint meeting.
It was decided to include this contribution in the output document on items under consideration in
LASeR.
M14419 This contribution analyses the possibilities and the potential issues of carrying ISO/IEC
14496-20 content over MPEG-2. It was decided to include this contribution in the output document
on items under consideration in LASeR.
Technical Work in Progress.
9.3
14496-20/Cor 2
9.3.1
Topics
1.
Profile Removal
9.3.2
Contributions
None.
Technical Work in Progress.
10 21000-09 MPEG-21 File Format
10.1
MPEG-21 File Format Amendment
10.1.1
Topics
1.
Mime Type
10.1.2 Contributions
M14555: MIME Type registration for MPEG-21 File Format. Accepted and used as the basis for the
production of the PDAM text.
11 21000-14 Conformance
11.1
MPEG-21 File Format Conformance
11.1.1
Topics
1.
Conformance
11.1.2 Contributions
M14497: French NB comment on FCD 21000-14. All comments have been addressed. See DoC.
M11451: Binary Conformance streams for MPEG-21. Accepted. Integrated in text of FDIS.
12 MPEG-A MAF (23000)
12.1
23000-4 Musical Slide Show MAF
12.1.1
Topics
2.
Musical Slide Show MAF
12.1.2
Contributions
M14343: Summary of Voting on ISO/IEC FCD 23000-4 [SC 29 N 8306]. All comments have been
disposed of. See DoC.
M14437: A proposal on metadata modification for Musical Slide Show MAF. Accepted. Will be
included in FDIS.
Technical Work Finalized
12.2
23000-8 Portable Video Player MAF
12.2.1
Topics
1.
Portable Video Player MAF
12.2.2 Contributions
14435: Proposed text of ISO/IEC 23000-8 CD Portable video player MAF. Taken as a basis for the
documentation of the CD.
14438 : A proposal of an additional functionality to be supported in Portable Video Player MAF.
Accepted.
Technical Work in Progress.
12.3
23000-9 Digital Multimedia Broadcasting MAF
12.3.1
Topics
1.
Digital Multimedia Broadcasting MAF
12.3.2 Contributions
M14394 : Summary of Voting on ISO/IEC CD 23000-9. All comments have been addressed and
documented in DoC.
M14425: (Editors Input) Updated Text of ISO/IEC 23000-9 MAF for DMB. Taken as input for
producing text of FCD.
M14426 This contribution proposes a method to store an MPEG-2 TS in an MP4 file. It was identified during the
discussion that DVB is working on the same problem, so it was decided to send a liaison letter to DVB and to try to
find a harmonized solution before adopting a specific solution for this MAF. The proposed method will be included
in the TuC.
M14427 This contribution presents the draft list of TVA features appropriate for use in the MAF for
DMB. Since the selection is not complete and the schema is not yet validated, this will be included
in the Technologies under Consideration.
Technical Work in Progress.
13 MPEG-B
13.1
23001-1 Binary Format Amd.2
13.1.1
Topics
1.
Extension on Encoding of Wild Cards
13.1.2 Contributions
M14450: Editor's study of 23001-1 FPDAM2. Taken into account for the production of the study text.
Technical Work in Progress.
13.2
23001-1 Binary Format Cor.2
13.2.1
Topics
1.
Misc. Editorial Corrections on MPEG-B Part 1
13.2.2 Contributions
M14395 : Summary of Voting on ISO/IEC 23001-1:2006/DCOR 2. See DoC.
Technical Work in Progress.
13.3
23001-2 Fragment Request Unit
13.3.1
Topics
1.
Fragment Request Unit
13.3.2 Contributions
M14381: Summary of Voting on ISO/IEC FCD 23001-2. See DoC.
Technical Work Finalized
13.4
23001-3 Binary to XML Mapping of IPMP-X
13.4.1
Topics
1.
Binary to XML Mapping of IPMP-X
13.4.2 Contributions
M14299: Summary of Voting on ISO/IEC CD 23001-3 [SC 29 N 8227]. No comments. Text of
FCD was produced.
M14443: Proposed text of ISO/IEC 23001-3 FCD Binary XML to IPMP-X. Taken as input for the
production of the FCD.
M14498: Proposal of Modified IPMP XML messages for ISO/IEC 23001-3 Binary XML to IPMP-X.
Approved and included in the FCD.
Technical Work in Progress.
14 MPEG-E Multimedia Middleware (23004)
14.1
Multimedia Middleware
14.1.1
Topics
1.
MPEG Multimedia Middleware
14.1.2 Contributions
At the 80th MPEG Meeting in San Jose, California, USA (April 23 – 27, 2007) MPEG has
promoted the remaining three parts (Part 5: Component Download, Part 6: Fault Management and
Part 7: System Integrity Management) of M3W (ISO/IEC 23004, MPEG-E (Multimedia
Middleware)) to the FDIS (Final Draft International Standard) stage. Please note that the first four
parts (Part 1: Architecture, Part 2: Multimedia API, Part 3: Component Model and Part 4: Resource
and Quality Management) have already reached this stage at the previous MPEG meeting in
January 2007. This implies that all seven parts of M3W are now completed.
At the April MPEG Meeting a second version of the WD (Working Draft) for the reference
software and conformance testing (Part 8: Reference Software and Conformance) was also released.
The reference software and conformance testing includes the implementation of the logical
components and optional frameworks, supporting tools and a sample application demonstrating the
functionality of the individual parts; this then feeds into the conformance testing process. The
associated plan for the delivery of the reference software and conformance testing (M3W Reference
Software and Conformance Plan) has been updated to reflect the current status and future planned
activities.
M14337 : Summary of Voting on ISO/IEC FCD 23004-5 [SC 29 N 8298]. See DoC.
M14338 : Summary of Voting on ISO/IEC FCD 23004-6 [SC 29 N 8299]. See DoC.
M14339 : Summary of Voting on ISO/IEC FCD 23004-6 [SC 29 N 8299]. See DoC.
M14371 : Contribution to M3W Reference Software for M3W Parts 2, 3, 5, 6 & 7. Taken as input
for the production of WD2.0 of ISO/IEC 23004-8 Reference Software and Conformance.
Technical Work in Progress.
15 Supplementary Media Technology (29116-1)
15.1
Media Streaming MAF Protocols
15.1.1
Topics
1.
Media Streaming MAF Protocols
15.1.2 Contributions
M14304: Summary of Voting on ISO/IEC CD 23005-1 [SC 29 N 8236]. All comments have been
disposed of. See DoC.
M14460: Austrian NB comments on ISO/IEC CD XXXXX Media Streaming MAF Protocols. See
DoC.
M14444: Proposed text of ISO/IEC 23005-1 FCD Media Streaming MAF Protocol (Editor's Input).
Taken as input to produce text of the FCD.
Technical Work in Progress.
16 Exploration
M14418: Ideas on MPEG-21 and LASeR. Follow-up on the discussion we had at the previous meeting.
1. Exploration of the conversion of MPEG-21 digital items into LASeR. It is hard
to convert MPEG-21 into LASeR.
2. Add a LASeR representation in MPEG-21 and a DIBO.
The document on first ideas on LASeR was updated.
17 Latest References and Publication Status
Pr
Pt
Standard
No.
2
2
2
2
2
1
1
1
1
1
ISO/IEC 13818-1/Amd.7
ISO/IEC 13818-1:2000/COR1 (FlexMux Descr.)
ISO/IEC 13818-1:2000/COR2 (FlexMuxTiming_ descriptor)
ISO/IEC 13818-1:2000/Amd.1 (Metadata on 2) & COR1 on Amd.1
N3844
N4404
N5867
2
2
1
1
ISO/IEC 13818-1:2000/Amd.2 (Support for IPMP on 2)
ISO/IEC 13818-1:2000/Amd.3 (AVC Carriage on MPEG-2)
N5604
N5771
2
2
1
1
ISO/IEC 13818-1:2000/Amd.4 (Metadata Application CP)
ISO/IEC 13818-1:2000/Amd.5 (New Audio P&L Sig.)
N6847
N6585
2
2
2
1
1
1
ISO/IEC 13818-1:2000/COR3 (Correction for Field Picture)
ISO/IEC 13818-1:2000/COR4 (M4MUX Code Point)
ISO/IEC 13818-1:2000/COR5 (Corrections related to 3rd Ed.)
N6845
N7469
N7895
2
2
1
1
ISO/IEC 13818-1:2006 (MPEG-2 Systems 3rd Edition)
ISO/IEC 13818-1:2006/Amd.1 (Transport of Streaming text)
N8369
2
1
ISO/IEC 13818-1:2006/Amd.2 (Carriage of Auxiliary Video Data)
N8798
2
4
4
4
11
1
1
1
ISO/IEC 13818-1:2003 (IPMP on 2)
ISO/IEC 14496-1 (MPEG-4 Systems 1st Ed.)
ISO/IEC 14496-1/Amd.1 (MP4, MPEG-J)
ISO/IEC 14496-1/Cor.1
N5607
N2501
N3054
N3278
Issue
ISO/IEC 13818-1:2000 (MPEG-2 Systems 2nd Edition)
00/12
01/01 Pisa
01/12 Pattaya
03/07
Trondheim
03/03 Pattaya
03/07
Trondheim
04/10 Palma
04/07
Redmond
04/10 Palma
05/07 Poznan
06/01
Bangkok
06/xx
06/07
Klagenfurt
07/01
Marrakech
03/03 Pattaya
98/10 Atl. City
99/12 Hawaii
00/03
Status
Doc. With
Purpose
Published
Published
Published
Published
Published
2000/12
2000/12
2002/03
2002/12
2003/12
ISO
Award
Done
Proposed
N/A
N/A
Proposed
Published
Published
2004/03
XXXX
N/A
Proposed
FDAM
FDAM
ITTF
ITTF
to be published
to be published
N/A
N/A
COR
COR
COR
ITTF
ITTF
ITTF
to be published
to be published
to be published
N/A
N/A
N/A
Published
FDAM
ITTF
ITTF
to be published
TBP
TBP
FDAM
ITTF
to be published
TBP
Published
Published
Published
Published
2003/12
1999/12
2001/11
2001/11
Proposed
Done
Done
N/A
1
1
1
1
1
ISO/IEC 14496-1:2001 (MPEG-4 Systems 2nd Ed.)
ISO/IEC 14496-1:2001/Amd.1 (Flextime)
ISO/IEC 14496-1:2001/Cor.1
ISO/IEC 14496-1:2001/Cor.2
ISO/IEC 14496-1:2001/Cor.3
N3850
4
1
ISO/IEC 14496-1:2001/Amd.2 (Textual Format)
N4698
4
1
ISO/IEC 14496-1:2001/Amd.3 (IPMP Extensions)
N5282
4
4
1
1
ISO/IEC 14496-1:2001/Amd.4 (SL Extension)
ISO/IEC 14496-1:2001/Amd.7 (AVC on 4)
N5471
N5976
4
4
1
1
ISO/IEC 14496-1:2001/Amd.8 (ObjectType Code Points)
ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors)
N6202
N7229
4
4
1
1
ISO/IEC 14496-1:200x/Cor4 (Node Coding Table)
Ed.)
N7473
N5277
4
1
ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors)
N7229
4
1
ISO/IEC 14496-1:200x/Cor.1 (Clarif. On audio codec behavior)
N8117
4
1
ISO/IEC 14496-1:200x/Amd.2 (3D Profile Descriptor Extensions)
N8372
4
1
ISO/IEC 14496-1:200x/Cor.2 (OD Dependencies)
N8646
4
1
ISO/IEC 14496-1:200x/Amd.3 (JPEG 2000 support in Systems)
N8860
4
4
ISO/IEC 14496-1:200x/Amd.17 (ATG Conformance)
N8861
4
4
4
4
4
ISO/IEC 14496-1 (MPEG-4 Systems
3rd
Noordwijk.
01/01 Pisa
01/07 Sydney
02/10 Shanghai
04/07
Redmond
02/03 Jeju
Island
02/10
Shanghai
02/12 Awaji
03/10
Brisbane
03/12 Hawaii
05/04 Busan
N4264
N5275
N6587
05/07 Poznan
02/10
Shanghai
05/04 Busan
06/04
Montreux
06/07
Klagenfurt
06/10
Hangzhou
07/01
Marrakech
07/01
Marrakech
Published
Published
COR
COR
COR
2001/11
2002/10
ITTF
ITTF
ITTF
N/A
Done
N/A
N/A
N/A
AMD
ITTF
N/A
Published
2004-05
N/A
Published
Published
2003/12
2004-08
N/A
N/A
AMD
PDAM
ITTF
ITTF
PDAM
IS
ITTF
ITTF
PDAM
ITTF
COR
ITTF
PDAM
to be published
Final Text
Editing
to be published
to be published
N/A
N/A
N/A
Proposed
N/A
ITTF
Final Text
Editing
Final Text
Editing
to be published
COR
ITTF
to be published
N/A
PDAM
ITTF
to be published
N/A
PDAM
ITTF
to be published
N/A
N/A
N/A
4
4
4
4
5
6
8
11
ISO/IEC 14496-1:200x/Amd.12 (File Format)
ISO/IEC 14496-6:2000
ISO/IEC 14496-8 (MPEG-4 on IP Framework)
4
11
ISO/IEC 14496-11/Amd.1 (AFX)
4
11
ISO/IEC 14496-11/Amd.2 (Advanced Text and Graphics)
4
4
11
11
ISO/IEC 14496-11/Cor.1
4
11
ISO/IEC 14496-11/Amd.3 Audio BIFS Extensions
N6591
4
11
ISO/IEC 14496-11/Amd.4 XMT and MPEG-J Extensions
N6959
4
11
ISO/IEC 14496-11/Cor.3 (Audio BIFS Integrated in 3rd Edition)
N7230
4
11
ISO/IEC 14496-11/Cor.5 (Misc Corrigendum)
N8383
4
11
ISO/IEC 14496-11/Amd.5 Symbolic Music
Representation
N8657
4
4
11
12
ISO/IEC 14496-11/Cor.6 (AudioFx Correction)
ISO/IEC 14496-12 (ISO Base Media File Format)
N9021
N5295
4
12
ISO/IEC 14496-12/Amd.1 ISO FF Extension
N6596
4
12
N7232
4
12
ISO/IEC 14496-12/Cor.1 (Correction on File Type
Box)
ISO/IEC 14496-12/Cor.2 (Miscellanea)
4
12
ISO/IEC 14496-12/Cor.3 (Miscellanea)
N9024
N9020
07/04 San Jose
N4712
N6960
N5480
02/03 Jeju
05/01
HongKong
02/12 Awaji
N6205
N6203
ISO/IEC 14496-11/Cor.3 Valuator/AFX related correction N6594
ISO/IEC 14496-11 (MPEG-4 Scene Description 3rd
Edition)
FDAM
ITTF
03/12 Hawaii
FDAM
ITTF
03/12 Hawaii
04/07
Redmond
04/07
Redmond
05/01
HongKong
05/04 Busan
COR
COR
SC29
ITTF
FDAM
ITTF
FDAM
ITTF
COR
ITTF
COR
SC29
N/A
FDAM
ITTF
TBP
COR
Published
SC29
2004-02
N/A
Proposed
FDAM
ITTF
FDAM 04/11/30
N/A
COR
ITTF
N/A
COR
ITTF
COR
ITTF
Final Text
Editing
Final Text
Editing
Final Text
06/01
Bangkok
07/04 San Jose
N/A
N/A
Proposed
Proposed
ITTF
2000/12
2004-05
SC29
06/07
Klagenfurt
06/10
Hangzhou
07/04 San Jose
02/10
Shanghai
04/07
Redmond
05/04 Busan
N7901
to be published
PDAM
Published
Published
FDIS
Final Text
Editing
Integration in 1st
Ed.
Integration in 1st
Ed.
Integration in 1st
Ed.
Integration in 1st
Ed.
Integration in 1st
Ed.
Final Text
Editing
N/A
N/A
N/A
N/A
Proposed
N/A
N/A
N/A
N/A
Editing
4
12
4
4
N8659
12
13
ISO/IEC 14496-12/Amd.1 (Description of timed
metadata)
ISO/IEC 14496-12/Amd.2 (Flute Hint Track)
ISO/IEC 14496-13 (IPMP-X)
4
14
ISO/IEC 14496-14 (MP4 File Format)
N5298
4
14
ISO/IEC 14496-14/Cor.1 (Audio P&L Indication)
N7903
4
15
ISO/IEC 14496-15 (AVC File Format)
N5780
4
15
ISO/IEC 14496-15/Amd.1 (Support for FREXT)
N7585
4
4
15
15
ISO/IEC 14496-15/Cor.1
ISO/IEC 14496-15/Cor.2 (NAL Unit Restriction)
N7575
N8387
4
4
4
17
18
18
N7479
N6215
N8664
4
4
4
19
20
20
4
4
20
22
ISO/IEC 14496-17 (Streaming Text)
ISO/IEC 14496-18 (Font Compression and Streaming)
ISO/IEC 14496-18/Cor.1 (Misc. corrigenda and
clarification)
ISO/IEC 14496-19 (Synthesized Texture Stream)
ISO/IEC 14496-20 (LASeR)
ISO/IEC 14496-20/Cor.1 (Misc. corrigenda and
clarification)
ISO/IEC 14496-20/Amd.1 (LASeR Extension)
ISO/IEC 14496-22 (Open Font Format)
7
7
7
7
1
1
1
1
ISO/IEC 15938-1 (MPEG-7 Systems)
N4285
N6326
N6328
N7490
ISO/IEC 15938-1/Amd.1 (MPEG-7 Systems Extensions)
ISO/IEC 15938-1/Cor.1 (MPEG-7 Systems Corrigendum)
ISO/IEC 15938-1/Cor.2 (MPEG-7 Systems Corrigendum)
06/10
Hangzhou
07/04 San Jose
02/10
Shanghai
02/10
Shanghai
06/01
Bangkok
03/07
Trondheim
05/10 Nice
N9023
N5284
05/10 Nice
06/07
Klagenfurt
05/07 Poznan
03/12 Hawaii
06/10
Hangzhou
03/12 Hawaii
05/10 Nice
06/10
Hangzhou
07/04 San Jose
06/07
Klagenfurt
01/07 Sydney
04/03 Munich
04/03 Munich
05/07 Poznan
N6217
N7588
N8666
N9029
N8395
N/A
FDAM
ITTF
FDAM
IS
ITTF
ITTF
Published
2003-11
COR
ITTF
Published
2004-04
FDAM
ITTF
COR
COR
ITTF
ITTF
N/A
N/A
FDAM
Published
COR
ITTF
2004-07
ITTF
TBP
Proposed
N/A
Published
FDAM
COR
2004-07
Editor
ITTF
Proposed
TBP
N/A
FDAM
FDAM
ITTF
Editor
N/A
TBP
Published
FDAM
COR
COR
2002/07
ITTF
Editor
ITTF
to be published
N/A
Proposed
Proposed
Final Text
Editing
N/A
Proposed
Final Text
Editing
Final Text
Editing
FDAM 04/11/28
N/A
Done
N/A
N/A
N/A
7
7
7
1
2
7
ISO/IEC 15938-1/Amd.2 (BiM extension)
ISO/IEC 15938-7/Amd.2 (Fast Access Ext. Conformance)
N7532
N4288
N8672
21
9
ISO/IEC 21000-9 (MPEG-21 File Format)
N6975
21
A
B
B
16
1
1
1
N7247
N9037
N7597
N8680
B
1
B
1
ISO/IEC 21000-16 (MPEG-21 Binary Format)
ISO/IEC 23000-4 (Musical Slide Show MAF)
ISO/IEC 23001-1 (XML Binary Format)
ISO/IEC 23001-1/Cor.1 (Misc. Editorial and technical
clar.)
ISO/IEC 23001-1/Cor.2 (Misc. Editorial and technical
clar.)
ISO/IEC 23001-1/Amd.1 (Reference Soft. & Conf.)
B
E
2
1
ISO/IEC 23001-1 (Fragment Request Unit)
ISO/IEC 23008-1 Architecture
N9051
N8892
E
2
ISO/IEC 23008-2 Multimedia API
N8893
E
3
ISO/IEC 23008-3 Component Model
N8894
E
4
ISO/IEC 23008-4 Resource & Quality Management
N8895
E
E
E
5
6
7
ISO/IEC 23008-5 Component Download
ISO/IEC 23008-6 Fault Management
ISO/IEC 23008-7 System Integrity Management
N9053
N9054
N9055
ISO/IEC 15938-2 (MPEG-7 DDL)
N9049
N8886
05/10 Nice
01/07 Sydney
06/10
Hangzhou
05/01
HongKong
05/04 Busan
07/04 San Jose
05/10 Nice
06/10
Hangzhou
07/04 San Jose
FDAM
Published
FDAM
ITTF
2002/02
ITTF
N/A
Done
N/A
FDIS
ITTF
FDIS 05/01/21
Done
FDIS
FDIS
FDIS
COR
ITTF
ITTF
ITTF
ITTF
FDIS 05/04/22
TBP
TBP
TBP
N/A
COR
ITTF
N/A
07/01
Marrakech
07/04 San Jose
07/01
Marrakech
07/01
Marrakech
07/01
Marrakech
07/01
Marrakech
07/04 San Jose
07/04 San Jose
07/04 San Jose
FDAM
ITTF
N/A
FDIS
FDAM
ITTF
ITTF
TBP
N/A
FDAM
ITTF
N/A
FDAM
ITTF
N/A
FDAM
ITTF
N/A
FDAM
FDAM
FDAM
ITTF
ITTF
ITTF
N/A
N/A
N/A
18 Resolutions of Systems
Cf. WG11 resolution.
19 List of Reviewed Contributions
N°
Title
Authors
14289 Summary of Voting on ISO/IEC 14496-4:2004/PDAM 24 [SC 29 N 8182]
14290 Summary of Voting on ISO/IEC 14496-4:2004/PDAM 25 [SC 29 N 8184]
14297 Liaison Statement from 3GPP [SC 29 N 8225]
14299 Summary of Voting on ISO/IEC CD 23001-3 [SC
29 N 8227]
14304 Summary of Voting on ISO/IEC CD 23005-1 [SC
29 N 8236]
14305 Liaison Statement from the DVD Forum WG-1
[SC 29 N 8254]
14324 Summary of Voting on ISO/IEC 14496-5:2001/FPDAM 12 [SC 29 N 8273]
14329 USNB Contribution: Response to resolution 3.1.2
of 79-th WG 11 meeting
14336 Summary of Voting on ISO/IEC 14496-12:2005/FPDAM 2 and ISO/IEC 15444-12:2005/FPDAM 2 [SC 29 N 8297]
14337 Summary of Voting on ISO/IEC FCD 23004-5
[SC 29 N 8298]
14338 Summary of Voting on ISO/IEC FCD 23004-6
[SC 29 N 8299]
14339 Summary of Voting on ISO/IEC FCD 23004-7
[SC 29 N 8301]
14343 Summary of Voting on ISO/IEC FCD 23000-4
[SC 29 N 8306]
14349 Liaison re w8559 Text of ISO/IEC 13818-1:200x/DCOR.1
14362 Liaison Statement from the DVB [SC 29 N 8326]
14366 Additional examples on Cross-Media Interactive
Presentation MAF
14367 Proposal for a MAF on Cross-Media Interactive
Presentation: Application Scenarios
SC 29 Secretariat
SC 29 Secretariat
3GPP via SC 29
Secretariat
SC 29 Secretariat
SC 29 Secretariat
the DVD Forum WG-1
via SC 29 Secretariat
SC 29 Secretariat
A. G. Tescher for
USNB
SC 29 Secretariat
SC 29 Secretariat
SC 29 Secretariat
SC 29 Secretariat
SC 29 Secretariat
Gavin Schutz
Teruhiko Suzuki
Michael Dolan
DVB via SC 29
Secretariat
Paolo Nesi
Pierfrancesco Bellini
Davide Rogai
Paolo Nesi
Pierfrancesco Bellini
Davide Rogai
Kia Ng (University of
14368 Proposal for a MAF on Cross-Media Interactive
Presentation: Requirements
14369 Proposal for a MAF on Cross-Media Interactive
Presentation: Relationships with other MAFs
14370 LASeR fixes requested by 3GPP DIMS
14371 Contribution to M3W Reference Software for
M3W Parts 2, 3, 5, 6 & 7
14372 Splitting LASeR AMD1
14373 LASeR profiles adjustments
14378 Additions to LASeR AMD2 from 3GPP
14381 Summary of Voting on ISO/IEC FCD 23001-2
14382 Summary of Voting on ISO/IEC 13818-1:200X/DCOR 1
14385 Summary of Voting on ISO/IEC 14496-4:2004/PDAM 23
14387 Summary of Voting on ISO/IEC 14496-11:2005/DCOR 6
14388 Summary of Voting on ISO/IEC 14496-12:2005/DCOR 3 & ISO/IEC 15444-12:2005/DCOR 3
14394 Summary of Voting on ISO/IEC CD 23000-9
14395 Summary of Voting on ISO/IEC 23001-1:2006/DCOR 2
14402 Proposed conformance test methodology and bitstreams for ISO/IEC 14496-22
14404 Comments and suggestions regarding ISO/IEC 14496-12 Amd.2
14405 Comments on the SVC File Format
14413 Liaison Statement from TTA [SC 29 N 8333]
14418 Ideas on MPEG-21 and LASeR
14418 Ideas on MPEG-21 and LASeR
14419 Issues on the carriage of ISO/IEC 14496-20
contents over MPEG-2
14425 (Editors Input) Updated Text of ISO/IEC 23000-9
MAF for DMB
Leeds)
Paolo Nesi
Pierfrancesco Bellini
Davide Rogai
Davide Rogai
Pierfrancesco Bellini
Paolo Nesi
Jean-Claude Dufourd
Jean H.A. Gelissen
(editor)
Johan Muskens
Jean-Claude Dufourd
Jean-Claude Dufourd
Jean-Claude Dufourd
SC 29 Secretariat
SC 29 Secretariat
SC 29 Secretariat
SC 29 Secretariat
SC 29 Secretariat
SC 29 Secretariat
SC 29 Secretariat
Simon Daniels
Vladimir Levantovsky
Jani Peltotalo
Miska M. Hannuksela
David Singer
TTA via SC 29
Secretariat
Jihun Cha
YeSun Joung
Young-Kwon Lim
KyungAe Moon
Jihun Cha
YeSun Joung
Young-Kwon Lim
KyungAe Moon
Jihun Cha
Youngkwon Lim
YeSun Joung
KyungAe Moon
Hui Yong Kim
Hyon-Gon Choo
Munchurl Kim
14426 Proposal for MPEG-2 TS Encapsulation with
ISO/IEC 23000-9 MAF for DMB
14427 Proposal for Restrictions on TV-Anytime
Metadata in ISO/IEC 23000-9 MAF for DMB
14435 Proposed text of ISO/IEC 23000-8 CD Portable
video player MAF
14437 A proposal on metadata modification for Musical
Slide Show MAF
14438 A proposal of an additional functionality to be
supported in Portable Video Player MAF
14443 Proposed text of ISO/IEC 23001-3 FCD Binary
XML to IPMP-X
14444 Proposed text of ISO/IEC 23005-1 FCD Media
Streaming MAF Protocol (Editor's Input)
14450 Editor's study of 23001-1 FPDAM2
14451 Binary Conformance streams for MPEG-21
14460 Austrian NB comments on ISO/IEC CD XXXXX
Media Streaming MAF Protocols
14477 Updated Proposal for Protected Musical Slide
Show MAF with IPMP
14478 Updated Proposal for Protected Photo Player
MAF with IPMP
14487 Contribution to Conformance for ISO/IEC 14496-12 AMD/1
14489 Proposal of Modified IPMP XML messages for
ISO/IEC 23001-3 Binary XML to IPMP-X
14494 Extraction path description
Hui Yong Kim
Gun Bang
MyungSeok Ki
Hyun Cheol Kim
Han-Kyu Lee
Jin Woo Hong
Young-Kwon Lim
Hui Yong Kim
Seung Jun Yang
Heekyung Lee
Han-Kyu Lee
Jin Woo Hong
Munchurl Kim
Jinhan Kim
Hyouk Jean Cha
Tae Hyeon Kim
Herbert Thoma
Ryoma Oami
Ryoma Oami
Filippo
Chiariglione(Editor)
Hyon-Gon
Choo(Editor)
Jooyoung Lee
Hyon-Gon Choo
Filippo Chiariglione
Naito Joji
David Thevenin
Philippe de Cuetos
David Thevenin
Philippe de Cuetos
Hendry
Houari Sabirin
Munchurl Kim
Hendry
Houari Sabirin
Munchurl Kim
Michael Ransburg
Hermann Hellwagner
Filippo Chiariglione
Jooyoung Lee
Hyon-Gon Choo
Thomas Rathgen
Michael Ransburg
Peter Amon
Andreas Hutter
14495 Terms and definitions for the SVC file format
14496 On the SVC file format
14497 French NB comment on FCD 21000-14
14512 Proposed technical alternative to MPEG-2
Systems DCOR 1 text WG 11 N 8859
14525 Signaling of leading pictures in file format
14526 On SVC file format
14527 Signaling of temporal layer switching points in
SVC file format
14529 MP4 file format considerations for high sample-rate audio
14535 Liaison Statement from JSR 287 Expert Group
[SC 29 N 8336]
14551 Proposal for a new LASeR Profile
14555 MIME Type registration for MPEG-21 File Format
Hermann Hellwagner
Michael Ransburg
Thomas Rathgen
Peter Amon
Andreas Hutter
Hermann Hellwagner
Thomas Rathgen
Peter Amon
Andreas Hutter
Philippe de Cuetos on
behalf of FNB
Gary J. Sullivan
Regis Crinon
Ying Chen
Ye-Kui Wang
Miska M. Hannuksela
Ye-Kui Wang
Miska M. Hannuksela
Ye-Kui Wang
Miska M. Hannuksela
David Singer
JSR-287 EG via SC 29
Secretariat
Jean-Claude Dufourd
Annex G – MDS report
Source: Ian Burnett, Chair
1.0 Introduction
MDS commenced with an overview of the week's planned activities:
MPEG Multimedia Description Schemes (MDS) Sub-group
Kick-off Multimedia Description
Schemes (MDS) Activities
80th MPEG Meeting
San Jose, CA, USA
Ian S Burnett, Chair, MPEG MDS Group
April 23rd – 27th, 2007
July 24th, 2005
Overview of MDS Activities
MPEG-21 & MAFs:
• REL (OR Profile FPDAM)
• Reference s/w & Conformance (FDIS)
• IPMP Components (FPDAM/1, FPDAM/2)
• ER (Defect Reports??)
• DI Streaming (FPDAM/1)
• Media Streaming MAF – (FCD)
• OR MAF (FCD)
• Prof. Archival MAF (?)
• MAFs – joint meetings with Reqts/systems
• MPEG-21 Schema Doc
• DIA (COR/1)
• DIA 2nd edition (FDIS)
• BSDL (FDIS)
MPEG-7:
• MPEG-7 Query (CD)
July 19—24, 2004
|
69 th MPEG Meeting
|
© 2003 IBM Corporation
Redmond, WA USA
MPEG-7 Timeline
7 12 200x Query Format | 07/04 07/07 07/10 08/04
MPEG-21 & MPEG-A Timeline
21 4 2006 Amd.1 MPEG-21 IPMP base profile | 06/07 06/10 07/04
21 4 2006 Amd.2 Media streaming profile | 07/04 07/07 08/01
21 5 2004 Amd.3 ORC (Open Release Content) profile | 06/07 07/01 07/04 07/10
21 7 2004 Cor.1 | 07/04
21 8 200x Reference software | 06/07 07/01 07/07
21 14 200x 1st Ed. Conformance testing | 03/10 06/04 06/10 07/04
21 18 200x Amd.1 Simple fragmentation rule | 06/10 07/04 07/10
A 3 200x Amd.1 Reference software for photo player MAF | 06/10 07/01 07/07 08/01
A 5 200x 1st Ed. Media streaming player | 06/04 06/10 07/04 07/10
A 6 200x 1st Ed. Professional archival MAF | 06/10 07/04 07/10 08/04
A 7 200x 1st Ed. Open release MAF | 06/10 07/01 07/04 07/10
B 5 200x 1st Ed. BSDL | 07/04
Overview of Activities
80th MPEG meeting – Organization of work:
• Main MDS track (MPEG-21) – Room
• Break-out groups:
  – MAFs, MPEG-7 Query
• Joint meetings with other groups on MAFs & MPEG-21 DIDs/File Format
• MDS plenary meetings (Mon, Thurs)
• Single wrap-up meeting on Friday!!!!
• Thursday am – no scheduled activities – BoGs!
Major MDS goals of the week
MPEG-21 IPMP Components (Part 4):
• Base Profile, Media Streaming Profile
• No input on TuC from last meeting
• Output: FDAM/1, FPDAM/2
MPEG-21 REL (Part 5):
• Open Release Profile, Ref s/w
• Output: Ref s/w, FPDAM/3
MPEG-21 Digital Item Adaptation (Part 7):
• Inputs
• COR/1
• BSDL issues
• Draft 2nd edition
MPEG-21 Ref s/w 2nd edn (Part 8):
• Study of Reference Software FCD
MPEG-21 Conformance (Part 14):
• Inputs
• Output: FDIS
Major MPEG-21 & MAF goals of the week (cont.)
MPEG-21 DI Streaming (Part 18)
• Discussion of CE results/inputs
• Output: FPDAM/1
MPEG-21 Schemas output document
• Host on ITTF site
• Working Document – output kept up to date
MAF – Media Streaming
• Inputs, AHG inputs
• Output: FCD, Ref s/w
MAF – Professional Archival
• Inputs
• Output: ???????
MPEG-7 Query Format
• CE inputs
• Output: CD
Joint meetings schedule
Joint Meetings:
– MDS/Reqts Issues (9am-11am Tuesday)
– Proposed MAFs with Reqts (2.00pm-6.00pm Tuesday)
– MDS/Systems (11am-12pm Tuesday) MPEG-21/LASeR
– MDS/Systems (4pm-5pm Wednesday) DI/FF issues
2.0 Notes on discussions on Input Documents
These contemporaneous notes summarise the activities of the MDS subgroup during the 80th MPEG
meeting. Over the week, several short break-out activities dealt with specific tasks. The break-out groups
worked on the REL and Open Release MAF, Professional Archival work and MPEG-7 Query.
(Reports of the break-out groups are included at the end of this section.)
Following a short MPEG plenary, a joint meeting with Requirements considered the following from
11am to 12pm.
14475
Giovanni Cordara (on behalf of the ITNB)
Italian NB proposal to revisit MPEG-21 DID
Input:
This input from the Italian NB proposes the development of a new DID on a royalty-free
basis.
Actions:
There appears to be support for a royalty-free standard in MPEG, but one question is whether MPEG
can usefully create a new DID standard.
Issues:
1. Royalty Free
2. Technical Issues – agnosticism of the DI, application specific containers
A BoG was established to specifically consider the technical issues. The discussion will consider
the limitations of the current DID as a starting point.
MDS officially opened at 1.30pm with a run-through of the week's activities. Note that in the
following only MDS input documents are discussed. Joint meetings with Requirements and
Systems (see schedule) and the treatment of those documents considered in the joint meetings are
considered in the respective group reports.
14415
Kisong Yoon Taehyun Kim Hogab Kang
Interoperability between MPEG-21 REL DAC Profile and
Other Standards
Input:
This input considers how the DAC profile will provide interoperability with TV-Anytime, DVB and
OMA.
Actions:
There was agreement that this was a very useful analysis. MDS will investigate ways to publicise
this information.
14484
Kisong Yoon Taehyun Kim Hogab Kang
A Study on Use Cases of Derivative Works with MPEG-21
REL ORC Profile License
Input:
This input considers how the ORC profile will provide for derivative and aggregate works.
Actions:
The input proposes structures for licenses to provide effectively for derivative works. This was
discussed further in the BoG.
14507
Eva Rodríguez Jaime Delgado
Contribution to the current version of the Open Release MAF
Input:
This input considers the OR MAF and suggests adding descriptions of DIDL elements etc.
It wants the text to make the usage of the elements more specific.
Actions:
Discussions suggested that profiling of the DID wasn’t a solution. There were questions raised as to
why full DID descriptions were needed. It seems that it may be worthwhile improving the usage
explanations in the MAF text. The BoG will consider this suggestion further.
14511
Florian Schreiner Chun Hui Suen
Overview of ISO/IEC 23000-7 CD Open Release MAF (1-pager)
14513
Florian Schreiner Chun Hui Suen
Proposed text to ISO/IEC 23000-7 CD Open Release MAF
Input:
M14511 provides an overview of the OR MAF.
Actions:
The BoG will consider the overview and then MDS will create an output document of the overview
for the web site. One issue is whether the relationship to CC rights should be made explicit.
Input:
M14513 provides improved text for the CD.
Actions:
The BoG used this text as a basis for work during the week. The BoG also considered rights issues
brought out.
14503
Hélder Castro Pedro Carvalho Teresa Andrade
Christian Timmerer Hermann Hellwagner
A DID model for Media Streaming MAF
Input:
This input proposed a constrained DID – a model/profile – for use in the MS MAF. The model
contains Descriptors which cater for each stakeholder in the DI delivery chain. A possible problem
with referencing Digital Items was also identified.
Actions:
This input is related to the BoG activities on a new, improved, royalty-free DI. The requirements of
this application will be considered during those discussions. It is also envisaged that the input may
have impact on the MS MAF work and will be discussed in that BoG.
MDS Room FIR
MPEG-7 Query Format (16h30 - 18h00)
This was the first meeting in MDS of the MP7QF BoG.
14365
Davide Rogai Paolo Nesi Pierfrancesco Bellini
Experience on using MPEG-21 File Format for nested and/or
protected DIs
Input:
This input considered some problems that were encountered using the MPEG-21 FF and DIDs with
protected content requirements. One use case is where a piece of content has been protected once
and then protected with a second technology.
Actions:
There are various solutions involving layered protection in the ISO FF and then also through the use
of MPEG-21 IPMP Components. No further action at this meeting. The authors of the input will try
the layers of solutions and report back at a future meeting.
14351
Saar De Zutter Jan De Cock Rik Van de Walle
Conformance tests for DIDL documents - files
14356
Saar De Zutter Jan De Cock Rik Van de Walle on behalf of the Belgian National Body
BNB comments on ISO/IEC FCD 21000-14: Conformance Testing
14409
Saar De Zutter Jan De Cock Rik Van de Walle on behalf of the Belgian National Body
Preliminary BNB comments on ISO/IEC FCD 21000-8: Reference Software (2nd edition)
Input: 14351
These are the XML files for testing DIDs
Actions:
These should be attached to the Conformance FDIS
Input: 14356
Belgian NB comments on the conformance document. They recommend accepting the Study document
and note problems with the annex referencing and spacing, and that explanations in Annex A contain incomplete sentences.
Actions:
These changes should be incorporated into the Conformance FDIS.
Input: 14409
Preliminary comments on the Reference software from Belgium.
Actions:
These should be added to the study of the Reference software.
14462
Michael Eberhard Christian Timmerer Hermann Hellwagner
Update of gBSDtoBin and DIA Reference and Utility Software Modules
Input:
This input updates the gBSDtoBin and DIA Ref/Utility software modules.
Actions:
Add these modules to the reference software, replacing older modules
14505
Eva Rodríguez Jaime Delgado
Contribution to MPEG-21 Reference Software: Validation Rules Checker for the REL MAM Profile
14401
Eva Rodríguez Jaime Delgado
Contribution to REL MAM Profile Conformance
Input: 14401
This input suggests mechanisms for Conformance for the REL MAM profile. It suggests creating a
subset of the REL rules for conformance and one new rule. Reference software checking the rules is
available.
Actions:
Add these modules to the reference software, replacing older modules
Input: 14505
This software implements the rules specified in m14401.
Actions:
Add these modules to the reference software (study document).
14399
Eva Rodríguez Jaime Delgado
Adding Integrity and authenticity to Event Reporting information
14400
Jaime Delgado Eva Rodríguez
Defect Report Proposal of ISO/IEC 21000-15
Input: 14399
This input raises again the possibility of adding security to ER. It proposes using both MPEG and
non-MPEG standards. Integrity is provided using digital signatures, and data encryption using XML
Encryption.
Actions:
MDS agrees this is useful. However, modifications to the standard are only required if there is a use
case for protecting ‘part’ of an ER.
Input: 14400
This input raises problems with the inData element of ER. The standard is inconsistent between the
text and schema.
Actions:
Correct the schema in Annex A.
14508
Eva Rodríguez Jaime Delgado Víctor Torres
Some issues on the generation and modification of Event Reports in the MPEG-21 Event Reporting
Input:
This input raises issues with MPEG-21 ER. It suggests adding structure to Descriptor, a new child
element to modification or introducing multiple ER report elements.
Actions:
This would be a useful correction to the current ER specification, as support for multiple ER report
elements is important in various applications.
14502
Daniel Oancea Pedro Carvalho Teresa Andrade
Christian Timmerer Hermann Hellwagner
Defect Report on ISO/IEC 21000-15
Input:
This input raises three scenarios for the use of ER: terminal, service monitoring and network related.
It recommends removing the requirement for an ER Request. It also highlights several
inconsistencies in ER and notes that the schema documents are not extensible.
Actions:
This would require a Corrigendum on ER. MDS will issue a DCOR at this meeting.
14481
Hendry Takafumi Ueno
Some Editorial Update for ISO/IEC 21000-4/FPDAM1 MPEG-21 IPMP Components Base Profile
14482
Hendry
Late comment for ISO/IEC 21000-4/FPDAM1 MPEG-21 IPMP Components Base Profile
14483
Hendry Munchurl Kim
Contribution for MPEG-21 IPMP Components Base Profile Conformance
Input: 14481
This input describes a series of editorial updates for IPMP Components Base profile. These address
the comments of the Japanese National Body.
Actions:
MDS accepts the editorial changes and will issue these as part of the AMD/1
Input: 14482
This is a late comment supporting editorial changes and was withdrawn.
Input: 14483
This input provides the conformance for the IPMP Components Base Profile. It provides test
sequences according to the Base Profile restrictions. Each test sequence provides for testing of
instances one by one.
Actions:
This will be included in the Conformance FDIS and will be supported by an NB comment.
14557
Christian Timmerer (on behalf of ANB)
Late Austrian NB comments on ISO/IEC 21000-7 Cor.1
Input:
This input notes that FGS has been removed from the FDIS of SVC and hence the descriptors must
be adjusted in ISO/IEC 21000-7.
Actions:
This will be accepted and the adjustments made.
Note on the Professional Archival MAF
At the San Jose meeting, the support for the Professional Archival MAF was discussed and the
consensus was that at this stage there is not enough support for progression of this MAF on the
standards track. At this stage the MAF part is on hold pending increased support for the activity.
The Break Out group reports for the meeting are given below:
MPEG-7 Query
Report of BoG on MP7QF
Kyoungro Yoon
Konkuk University
April, 26, 2007
CD on Its Way (1/4)
• 4.2 Requirements For Input Query Format
• 4.2.1. Query-by-textual description (Not Yet)
• 4.2.2. Free text query (OK)
• 4.2.3. Query-by-example of various media types (OK)
• 4.2.4. Query-by-example segment of various media format (OK)
• 4.2.5. Query-by-“mixed example” of various media format (Not Yet)
• 4.2.6. Query-by-ID of various standardized unique identifiers (OK)
• 4.2.7. Query-by-descriptions specified by MPEG-7 standard (OK)
• 4.2.7.1. User Preferences and/or Usage History based query (Not Yet)
• 4.2.8. Various combinations of query conditions (Boolean only)
• 4.2.9. Empty query (OK)
CD on Its Way (2/4)
• 4.2.10. Specifying the use of personal information (Personalization) (NO)
• 4.2.11. Removal of personal information (NO)
• 4.2.12. Query based on spatio-temporal relationships (NO)
• 4.2.13. Specifying any specific data as the result set. (OK)
• 4.2.14. Specifying the media formats/types of the result set (NO)
• 4.2.15. Sorting and grouping parameters for the result set (NO)
• 4.2.16. Specification of the structure of the result set (OK)
• 4.2.17. Limiting the size of the result set (OK)
• 4.2.18. Paging the result set (OK)
CD on Its Way (3/4)
• 4.3 Requirements on Query Output Format (DONE)
• 4.3.1. Structure of the response containing the result set
• 4.3.2. Default response for the result set
• 4.3.3. Acknowledgement of Removal Request
CD on Its Way (4/4)
• 4.4 Requirements on Query Management Tools
• 4.4.1. Specification of the exceptions (Input)
• 4.4.2. Service selection (TuC)
• 4.4.3. Relevance feedback (Input)
• 4.4.4. Searching within the result set of the previous search (Input)
• 4.4.5. Querying server capabilities (TuC)
• 4.4.6. Providing time limit to the query response (TuC?)
• 4.4.7. Specifying the mode of operation (TuC?)
TuC Contains
• 4.4.2. Service selection
  – Tech. from FH_UP_TS
• 4.4.5. Querying server capabilities
  – Tech. from FH_UP_TS
• 4.4.6. Providing time limit to the query response (?)
  – Tech. from KETI: The right position for this tech?
• 4.4.7. Specifying the mode of operation (?)
  – Tech. from KETI: The right position for this tech?
Issues
• Place for “mode selection” tool
• Need more study on Operators
Place for “mode selection” tool (1/2)
• Original Proposal
<complexType name="MP7QFInputType">
  <sequence>
    <element name="QFDeclaration" type="mp7qf:QFDeclarationType" minOccurs="0"/>
    <element name="OutputDescription" type="mp7qf:OutputDescriptionType" minOccurs="0"/>
    <element name="QueryCondition" type="mp7qf:QueryConditionType" minOccurs="0"/>
  </sequence>
  <attribute name="previousAnswerID" type="anyURI" use="optional"/>
  <attribute name="syncMode" type="boolean" default="true"/>
  <attribute name="timeout" type="mpeg7:durationType" use="optional"/>
</complexType>
Place for “mode selection” tool (2/2)
• Alternative Solution
<complexType name="Mpeg7QueryType">
  <sequence>
    <element name="MP7QFMngt" type="mp7qf:QueryManagementType"/>
    <choice>
      <element name="MP7QFInput" type="mp7qf:MP7QFInputType"/>
      <element name="MP7QFOutput" type="mp7qf:MP7QFOutputType"/>
    </choice>
  </sequence>
  <attribute name="mp7qfID" type="anyURI"/>
</complexType>
<complexType name="QueryManagementType">
  <attribute name="syncMode" type="boolean" use="optional" default="true"/>
  <attribute name="timeOut" type="mpeg7:durationType" use="optional"/>
</complexType>
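To illustrate what the alternative placement means for a receiver, the following Python sketch reads the management settings from an instance document in which they sit on a separate MP7QFMngt element. It is an illustration only: the root element name Mpeg7Query, the absence of a namespace and the sample values are assumptions made here, and only MP7QFMngt, MP7QFInput, syncMode and timeOut are taken from the schema fragment.

```python
import xml.etree.ElementTree as ET

# Hypothetical instance document: the root element name, the ID and the
# timeOut value are assumptions, not part of the proposal.
SAMPLE = """
<Mpeg7Query mp7qfID="urn:example:query:1">
  <MP7QFMngt timeOut="PT30S"/>
  <MP7QFInput/>
</Mpeg7Query>
"""


def management_settings(xml_text):
    """Return (sync_mode, time_out) read from the MP7QFMngt element."""
    root = ET.fromstring(xml_text)
    mngt = root.find("MP7QFMngt")
    if mngt is None:
        # The schema fragment declares MP7QFMngt without minOccurs="0",
        # so the element is mandatory in the alternative layout.
        raise ValueError("MP7QFMngt element is missing")
    # ElementTree does not apply schema defaults, so the default="true"
    # declared for syncMode is reproduced by hand.
    sync_mode = mngt.get("syncMode", "true") in ("true", "1")
    time_out = mngt.get("timeOut")  # optional attribute, None when absent
    return sync_mode, time_out


print(management_settings(SAMPLE))
```

Under the original proposal the same two attributes would be read from the MP7QFInput element instead, so the practical difference is whether management settings can be processed without looking inside the query input.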
Need more study on Operators
• Most of the operators on the table, except Boolean, are premature.
• Through the CE process, we found that they are more complex in nature than we thought.
• We need to study further the functionality and syntax of operators such as arithmetic and comparison operators.
• Wish to establish an AhG at this meeting.
REL/Open Release Discussions
BoG of REL and OR MAF
April 24th, 2007
MPEG-21 Part 5: REL profiles
DAC
14415
2007-04-17
2007-04-16
MPEG-21
Kisong Yoon
Taehyun Kim
Hogab Kang
MDS
Interoperability between MPEG-21 REL DAC Profile
and Other Standards
1. MPEG white paper, with three weeks editing period.
2. OutputRegulation
a. with no occurrence, the behavior is to allow output of the source content signal (or into any
possible output signal) – Satoshi
b. with occurrence but no child elements, the behavior is to allow output/preserve the source
content signal – Taehyun
c. with occurrence and child elements, the behavior is to allow the output signal according to the
constraints in the child elements.
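The three cases above can be sketched as a small decision function. This is a hedged illustration in Python, not the REL processing model: the wrapper element name grant, the child element name analog, the absence of namespaces and the returned labels are all assumptions made here, and cases a and b are recorded in the notes as individual views rather than agreed text.

```python
import xml.etree.ElementTree as ET


def output_behaviour(license_xml):
    """Classify a fragment by how OutputRegulation occurs in it."""
    root = ET.fromstring(license_xml)
    regulation = root.find(".//OutputRegulation")
    if regulation is None:
        # Case a: no occurrence, output is allowed without restriction.
        return "unrestricted"
    constraints = [child.tag for child in regulation]
    if not constraints:
        # Case b: present but empty, output preserves the source signal.
        return "preserve-source"
    # Case c: present with children, output follows the listed constraints.
    return "constrained: " + ", ".join(constraints)


# Hypothetical fragments; 'grant' and 'analog' are placeholder names.
print(output_behaviour("<grant/>"))
print(output_behaviour("<grant><OutputRegulation/></grant>"))
print(output_behaviour("<grant><OutputRegulation><analog/></OutputRegulation></grant>"))
```

Keeping cases a and b as separate return values makes it easy to change either interpretation once the BoG settles the wording.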
ORC
1. Dispose NB comments from Spain and Korea
2.
For embedding: embed right for source content and enlarge or enhance right for target content
Use case
Right for Source
Embedding a source into a new target
Embedding a source into an existing target
Adapt
Aggregate
Embed
Right
Target
Adapt
Adapt
Aggregate
N/A
N/A
for
2. FPDAM/3
14479 Editor's study of ISO/IEC 21000-5/PDAM3 (Taehyun Kim, Jaime Delgado, Florian Schreiner, Chris Barlas; MPEG-21, MDS; 2007-04-18, 2007-04-16)
14484 A Study on Use Cases of Derivative Works with MPEG-21 REL ORC Profile License (Kisong Yoon, Taehyun Kim, Hogab Kang; MPEG-21, MDS; 2007-04-18, 2007-04-16)
1. Derivative works with their derived licenses
MPEG-21 Part 8: Reference Software (for REL profile sections)
14505 Contribution to MPEG-21 Reference Software: Validation Rules Checker for the REL MAM Profile (Eva Rodríguez, Jaime Delgado; MPEG-21, MDS; 2007-04-18, 2007-04-16)
1. Update the software plan and Part 8.
2. (informative) License creator is still missing for DAC. Need it for the next meeting.
3. Update the software plan (version 6) to include modules for ORC
MPEG-21 Part 14: Conformance Testing (for REL profiles sections)
14401 Contribution to REL MAM Profile Conformance (Eva Rodríguez, Jaime Delgado; MPEG-21, MDS; 2007-04-18, 2007-04-13)
1. Add the rules for MAM to the spec.
2. DAC may need new rules and thus will be considered at the next meeting.
OR MAF
1. DIDL input text
a. Incorporate the input text into text of 23000-7 FCD
2. Relationship with CC licenses
a. Why include an ORC license in an OR MAF content/package?
i. To provide a mechanical means to enable and help users to manage (use,
adapt and distribute) Open Release content
b. Why include an identifier, name, or link to a CC license in an OR MAF
content/package? Choose one of the following:
i. To indicate the intentions of the OR content creator as expressed in the CC
license
ii. To provide CC license information as metadata for the legal notification
purpose (depending on positive feedback from CC)
c. When both a CC license and an ORC license are present in the same OR package,
i. the CC license is for information only, and the ORC license is for usage
management
ii.
d. Can an OR package mention only CC license names, but not CC license links or content,
in order to avoid any legal issues?
3. CC comments from Mike.
The example includes
...
<CopyrightString>
Creative Commons (CC) License: Attribution Non-commercial No Derivatives (by-nc-nd)
</CopyrightString>
</Creation>
<RelatedMaterial>
<MaterialType>
<Name>Licensing Information Page</Name>
</MaterialType>
<MediaLocator>
<MediaUri>http://creativecommons.org/licenses/by-nd-nc/1.0/</MediaUri>
</MediaLocator>
</RelatedMaterial>
The content of <CopyrightString> is presumably a notice for humans.
Is the value of <Name>, that is "Licensing Information Page", from a controlled vocabulary?
Or is that just informational for humans as well?
In the Open Release MAF we currently use the MPEG-7 "RelatedMaterial" element to provide
related information such as the link to a related CC license. The "Name" in MaterialType is only a piece of
information for humans. To address your concern, we will consider how to indicate that the related
material is in fact a reference to the CC license.
Regarding the "Rights Expression Language, AMENDMENT 1: MPEG-21 REL profiles"
document, which the "Open Release MAF" document says "defines rights and conditions for
modelling creative-commons like licenses."
It looks like the right primitives are present, though I'm not sure I understand how each is
expressed. Take "Figure1 - m3x:governedAdapt Right" which is described as "any principal is
granted the right to play a movie clip, and the right to adapt the clip together with the same
license."
I don't see where in the example "with the same license" is expressed. Is this implicit?
I also do not see any means for explicitly identifying the license used.
Even if the rights associated with a CC license are accurately described the specific license
should be identified with a license URI.
Regarding the question on "with the same license": it means the same license that the original content has.
More precisely, when the right "m3x:governedAdapt" is exercised, another license must be created that is the
same as the original license of which the right is part. So currently it does not need a specific
identifier. However, if needed, an identifier can be specified for the original license.
4. More questions to CC
a. Merging two conflicting share-alike licenses (e.g., commercial and non-commercial).
Output Documents
1. DoC of 21000-5 PDAM/3 -- Done
2. Text of 21000-5 FPDAM/3, two weeks editing period – Taehyun and Jaime
3. Output on DAC interoperability with other rights information standards – Taehyun, Xin,
Jaime, Satoshi
4. REL/RDD reference software development plan v6 – Florian and Xin
5. DoC of 23000-7 CD – Florian
6. Text of 23000-7 FCD, with four weeks editing period – Florian
7. Contribution to MPEG-21 Parts 8 and 14 – Jaime and Xin
3.0 MDS Output Documents and Resolutions – San Jose 80th Meeting
The MDS subgroup recommends approval of the following documents
MPEG-7
No. Title TBP Available
15938-5 Multimedia Description Schemes
9129 DoC on ISO/IEC PDAM/1 15938-5 Improvements to Geographic Descriptor 07/04/27
9100 ISO/IEC FPDAM/1 15938-5 Improvements to Geographic Descriptor 07/04/27
15938-7 Conformance testing
9130 DoC on ISO/IEC PDAM/1 15938-7 Improvements to Geographic Descriptor Conformance 07/04/27
15938-10 Schema definition
9102 Schema Files for MPEG-7 07/04/27
1.1.3. The MDS subgroup notes that the document NXXXX is a first version of an ongoing
working document containing the ‘electronic’ versions of schemas for the current
MPEG-7 parts at IS/FDIS. The MDS subgroup requests that the versions of the
schemas be updated on the ITTF WWW site at a similar URL to the equivalent
MPEG-21 schemas.
1.1.4. The MDS subgroup also requests that users of the schemas who choose to create
reduced or profiled schemas input these to MPEG so MPEG might understand usage
of the MPEG-7 descriptors. Further details are provided with the schema files.
1.1.5. The MDS subgroup recommends appointing Robert O'Callaghan and Akio Yamada as
the editors of ISO/IEC 15938-10:2005/COR 1 and thanks them for taking
responsibility for that project.
No. Title TBP Available
15938-12 MPEG-7 Query Format
9103 ISO/IEC 15938-12 CD MPEG-7 Query Format 07/05/25
9104 Technologies Under Consideration for MPEG-7 Query Format 07/04/27
1.1.6. The MDS subgroup recommends appointing Kyoungro Yoon, Mario Doeller, Matthias
Gruhne, Ruben Tous, Masanori Sano, Miran Choi, Tae-Beom Lim, Jongseol James
Lee and Hee-Cheol Seo as the editors of ISO/IEC 15938-12 MPEG-7 Query Format
and thanks them for taking responsibility for that project.
MPEG-21
No. Title TBP Available
21000-4 IPMP Components
9105 DoC of ISO/IEC 21000-4 FPDAM/1 IPMP Components Base Profile 07/04/27
9106 Text of ISO/IEC 21000-4 FDAM/1 IPMP Components Base Profile 07/04/27
1.1.3. The MDS subgroup thanks the National Body of Japan for their useful comments on
ISO/IEC PDAM/1 21000-4.
No. Title TBP Available
21000-5 Rights Expression Language
9107 DoC of ISO/IEC 21000-5 PDAM/3 ORC (Open Release Content) Profile 07/04/27
9108 ISO/IEC 21000-5 FPDAM/3 ORC (Open Release Content) Profile 07/05/25
9109 Interoperability between MPEG-21 REL DAC Profile and other Rights Information Standards 07/05/18
9110 REL/RDD Reference Software Development Plan v.6 07/04/27
1.1.4. The MDS subgroup thanks the National Bodies of Korea, Japan and Spain for their
useful comments on ISO/IEC PDAM/3 21000-5.
No. Title TBP Available
21000-7 Digital Item Adaptation
9111 Disposition of Comments on ISO/IEC 21000-7:2004/DCOR 1 07/04/27
9112 Text of ISO/IEC 21000-7:2004/COR 1 MPEG-21 Digital Item Adaptation 07/05/25
9113 Text of ISO/IEC 21000-7 FDIS Second edition 07/05/25
The MDS subgroup recommends appointing Christian Timmerer (Klagenfurt University),
Sylvain Devillers (France Telecom), and Michael Ransburg (Klagenfurt University) as
the editors of ISO/IEC 21000-7 2nd edition and thanks them for taking responsibility
for that project.
The MDS subgroup recommends appointing Christian Timmerer (Klagenfurt University) as
the editor of ISO/IEC 21000-7:2004/COR and thanks him for taking responsibility for
that project.
1.1.3. The MDS subgroup thanks the National Bodies of Austria and France for their useful
comments on ISO/IEC DCOR/1 21000-7.
No. Title TBP Available
21000-8 Reference Software
9114 Preliminary DoC of preliminary comments of ISO/IEC 21000-8 FCD Reference Software 07/04/27
9115 Study text of ISO/IEC 21000-8 FCD Reference Software 07/04/27
21000-14 Conformance
9116 DoC of ISO/IEC 21000-14 Conformance 07/04/27
9117 Text of ISO/IEC FDIS 21000-14 Conformance 07/05/25
1.1.4. The MDS subgroup thanks the National Bodies of Australia, Austria, Belgium, France,
Korea, Spain and the US for their useful comments on ISO/IEC FCD 21000-14.
No. Title TBP Available
21000-15 Event reporting
9118 ISO/IEC 21000-15:2006/DCOR1 MPEG-21 Event Reporting 07/05/21
The MDS subgroup recommends appointing Christian Timmerer (Klagenfurt University) and
Jaime Delgado (DMAG) as the editors of ISO/IEC 21000-15:2006/DCOR1 and thanks
them for taking responsibility for that project.
No. Title TBP Available
21000-18 Digital Item Streaming
9119 DoC of ISO/IEC 21000-18/PDAM 1 07/04/27
9120 ISO/IEC 21000-18/FPDAM/1 Simple fragmentation rule 07/06/08
WG11 thanks the International Confederation of Societies of Authors and Composers
(CISAC) for its current role in serving as the Registration Authority (RA) for ISO/IEC
21000-3. WG11 requires the services of an RA for ISO/IEC 21000-18, has
determined the requirements to be compatible with those of the RA for ISO/IEC
21000-3, and has received a letter of in-principle agreement from CISAC to serve as RA
for ISO/IEC 21000-18. WG11 therefore requests the SC29 secretariat to issue the
ballot asking for CISAC to be appointed Registration Authority for ISO/IEC 21000-18.
MPEG-A
No. Title TBP Available
23000-2 Music Player Application Format
9121 DoC of ISO/IEC 23000-2 FCD Music Player Application Format 2nd Edition 07/04/27
9122 Text of ISO/IEC 23000-2 FDIS Music Player Application Format 2nd Edition 07/05/25
1.1.5. The MDS subgroup thanks the National Bodies of Germany, Japan and the UK for
their useful comments on ISO/IEC FCD 23000-2.
No. Title TBP Available
23000-5 Media Streaming MAF
9123 DoC on ISO/IEC CD 23000-5 Media Streaming Player 07/04/27
9124 ISO/IEC FCD 23000-5 Media Streaming Player 07/05/25
1.1.3. The MDS subgroup thanks the National Bodies of Austria, Korea and the UK for their
useful comments on ISO/IEC CD 23000-5.
No. Title TBP Available
23000-7 Open Release Application Format
9125 DoC of ISO/IEC 23000-7 CD Open release MAF 07/04/27
9126 ISO/IEC 23000-7 FCD Open release MAF 07/05/25
1.1.3. The MDS subgroup thanks the National Body of Spain for their useful comments on
ISO/IEC CD 23000-7.
No. Title TBP Available
23001-5 Bitstream Syntax Description Language
9127 Text of ISO/IEC 23001-5 FDIS Bitstream Syntax Description Language 07/07/01
The MDS subgroup recommends appointing Sylvain Devillers and Joe Thomas-Kerr as the
editors of ISO/IEC 23001-5 and thanks them for taking responsibility for that project.
9128 AHG on MPEG-7 Query Format
Mandate: To address the following issues:
1. Complete editing of the MPEG-7 Query Format CD
2. Consider improvements to the CD and TuC documents
3. Continue discussions on Server Selection and Capabilities
4. Study the functionality and syntax of operators
Chairman: Kyoungro Yoon (yoonk *at* konkuk.ac.kr)
Mario Doeller (Mario.doeller_*at*_uni_passau.de)
Duration: Until 81st Meeting
Meetings: AHG meeting will be held on the weekend prior to 81st meeting. Other business
will be conducted by e-mail or telephone conference.
Reflector: cbsearch@yahoogroups.com
Subscribe: To subscribe send email to cbsearch-subscribe@yahoogroups.com
4.0 MDS Final Schedule – San Jose 80th Meeting
MPEG MDS Chair: Ian S Burnett
Number
MPEG-7, MPEG-21,
MAF
v3.0
Source
Title
Monday Morning
(9h00-13h00)
MPEG Plenary
Plenary room
Monday
Afternoon
(13h30-20h00)
Kick-off of
MPEG MDS
activities
(13h30-14h00)
MDS Room FIR
Agenda, Goals and
Issues for the Week for
MDS Group
Review of AHG
resolutions, CE
results and
action points
(13h30-14h20)
14277
14278
14279
14280
MDS Room FIR
Gerrard Drury Peder
Drege
Filippo Chiariglione
Christian Timmerer
Thomas Skjolberg
Stefan Kraegeloh
Filippo Chiariglione
Noboru Harada
Wo Chang
Kyoungro Yoon
Mario Doeller
14539
Masanori Sano Hideki
Sumiyoshi Nobuyuki Yagi
Masanori Sano Hideki
Sumiyoshi Nobuyuki Yagi
Masanori Sano Hideki
Sumiyoshi Nobuyuki Yagi
14543
Ruben Tous Jaime Delgado
14524
Saar De Zutter
14330
Thomas Skjølberg Peder
Drege Joseph Thomas-Kerr
Gerrard Drury
14537
14538
14458
14459
Ian S Burnett
Ingo Kofler Christian
Timmerer Hermann
Hellwagner on
behalf of Austrian
NB
Michael Eberhard
Christian Timmerer
Hermann
Hellwagner on
AHG on MPEG-21 DIS
AHG on the Media Streaming MAF demo for the MAF-AE
AHG on MDS MAFs Under Development
AHG on MPEG-7 Query Format
Test report of CEs on MP7QF
Test report of CE on specification of the request of the Output
Test report of CE on Query operation based on text description
DMAG CE Report for CEs on MPEG-7 Query Format
Review of Core Experiment on query operation based on text
description
Report of CE on DIS TuC
Austrian NB comments on ISO/IEC 21000-7 Cor.1
Austrian NB comments on ISO/IEC 21000-8 FCD
behalf of Austrian
NB
14460
14461
Christian Timmerer
Hermann
Hellwagner
Christian Timmerer
Michael Ransburg
Hermann
Hellwagner
Define BoGs and
Mandates
(14h20-14h30)
Austrian NB comments on ISO/IEC CD XXXXX Media Streaming
MAF Protocols
Austrian NB comments on ISO/IEC 23000-5 CD
MDS Room FIR
BoG1 = San Carlos
MPEG-7 QF
Zinfandel
OR MAF
Tues am
Prof Archival
DID
Mon 4.30-6pm
MS MAF
Wed 4pm
REL Profiles
Tues am
DIA Futures
(14h00 14h30)
14318
14341
MDS Room FIR
Sylvain Devillers
Christian Timmerer
Sylvain Devillers
Michael Ransburg
REL (14h30 15h30)
MDS Room FIR
14479
Kisong Yoon Taehyun Kim
Hogab Kang
Taehyun Kim Jaime
Delgado Florian Schreiner
Chris Barlas
14484
Kisong Yoon Taehyun Kim
Hogab Kang
14415
Open Release
MAF/MS MAF
(15h30 16h30)
14507
14511
14513
14442
14503
MPEG-7 Query
Format (16h30
- 18h00)
Editors' input to draft text of 23001-5 (MPEG-B BSDL)
Editor's input on Draft MPEG-21 DIA 2nd edition
Interoperability between MPEG-21 REL DAC Profile and Other
Standards
Editor's study of ISO/IEC 21000-5/PDAM3
A Study on Use Cases of Derivative Works with MPEG-21 REL
ORC Profile License
MDS Room FIR
Eva Rodríguez Jaime
Delgado
Florian Schreiner Chun Hui
Suen
Florian Schreiner Chun Hui
Suen
Hyon-Gon Choo Filippo
Chiariglione
Hélder Castro Pedro
Carvalho Teresa Andrade
Christian Timmerer
Hermann Hellwagner
Contribution to the current version of the Open Release MAF
Overview of ISO/IEC 23000-7 CD Open Release MAF (1-pager)
Proposed text to ISO/IEC 23000-7 CD Open Release MAF
Proposed text of ISO/IEC 23000-5 FCD Media Streaming MAF
A DID model for Media Streaming MAF
MDS Room FIR
Tuesday Morning
(9h00-13h00)
MDS/Reqts
issues (09h00 11h00)
Reqts
14500
Sylvain Devillers
14532
Gerrard Drury
Giovanni Cordara
(on behalf of the
ITNB)
14475
14420
14421
14449
Hee-Cheol Seo Miran Choi
Hyunki Kim Myung-Gil
Jang Soojong Lim Jeong
Heo Kyoungro Yoon
Hee-Cheol Seo Miran Choi
Hyunki Kim Myung-Gil
Jang Soojong Lim Jeong
Heo Kyoungro Yoon
Doeller Gruhne Wolf
MDS/Systems
DID (11h00 12h00)
14365
Use of MPEG URN for identifying profiles and levels
Contribution on URI assets and Requirements and Structure of
URNs
Italian NB proposal to revisit MPEG-21 DID
CE Report for Query Expression of MPEG-7 Query Format
Revision of Proposed Input Query Format for MPEG-7 Query
Format
MP7QF CE Test Report
MDS
Davide Rogai Paolo Nesi
Pierfrancesco Bellini
Experience on using MPEG-21 File Format for nested and/or
protected DIs
Tuesday
Afternoon
(14h00-18h00)
MAFs (14h00 18h00)
14430
14411
14352
14486
Tilman Liebchen
Noboru Harada Takehiro
Moriya Yutaka Kamamoto
James Orwell James
Annesley
Houari Sabirin Jeongyeon
Lim Munchurl Kim
14424
Hendry Houari Sabirin
Munchurl Kim
Hendry Houari Sabirin
Munchurl Kim
Kwangcheol Choi SungMoon Chun Jaedo Kwak
Seungheon Yang Ji-Sang
Yoo Si-Hun Sung SeongCheol Han
Jaedo Kwak Si-Hun Sung
Sung-Moon Chun JinWoong
Kim Namho Hur
14367
Paolo Nesi Pierfrancesco
Bellini Davide Rogai Kia Ng
(University of Leeds)
14368
Paolo Nesi Pierfrancesco
Bellini Davide Rogai
14369
Davide Rogai Pierfrancesco
Bellini Paolo Nesi
14477
14478
14423
Comments on Professional Archival MAF Requirements
Proposed text to WD of Professional Archival MAF
Contribution to the Basic Video Surveillance MAF
A Proposal for Basic Video Surveillance Application Format
Updated Proposal for Protected Musical Slide Show MAF with
IPMP
Updated Proposal for Protected Photo Player MAF with IPMP
Requirements for Stereoscopic MAF
Whitepaper of Stereoscopic Project
Proposal for a MAF on Cross-Media Interactive Presentation:
Overview and Application Scenarios
Proposal for a MAF on Cross-Media Interactive Presentation:
Requirements
Proposal for a MAF on Cross-Media Interactive Presentation:
Relationships with other MAFs
Wednesday
Morning (09h00-13h00)
MPEG Plenary
(9h00-11h00)
Conformance
/Ref s/w
(11h00-12h00)
Plenary room
MDS Room FIR
14409
Saar De Zutter Jan De Cock
Rik Van de Walle
Saar De Zutter Jan De Cock
Rik Van de Walle on behalf
of the Belgian National Body
Saar De Zutter Jan De Cock
Rik Van de Walle on behalf
of the Belgian National Body
14462
Michael Eberhard
Christian Timmerer
Hermann
Hellwagner
14351
14356
14505
14401
Eva Rodríguez Jaime
Delgado
Eva Rodríguez Jaime
Delgado
MPEG-7
Discussions Schema (12h00-12h30)
14502
14508
Eva Rodríguez Jaime
Delgado Víctor Torres
IPMP
Components
(15h00 16h00)
14481
14482
14483
Adding Integrity and authenticity to Event Reporting information
Defect Report Proposal of ISO/IEC 21000-15
Defect Report on ISO/IEC 21000-15
Some issues on the generation and modification of Event
Reports in the MPEG-21 Event Reporting
MDS Room FIR
Hendry Takafumi Ueno
Hendry
Hendry Munchurl Kim
MPEG-21 &
LASeR (16h00 17h00)
14418
Contribution to MPEG-21 Reference Software: Validation Rules
Checker for the REL MAM Profile
Contribution to REL MAM Profile Conformance
MDS Room FIR
Eva Rodríguez Jaime
Delgado
Jaime Delgado Eva
Rodríguez
Daniel Oancea Pedro
Carvalho Teresa Andrade
Christian Timmerer
Hermann Hellwagner
14400
BNB comments on ISO/IEC FCD 21000-14: Conformance
Testing
Preliminary BNB comments on ISO/IEC FCD 21000-8: Reference
Software (2nd edition)
Update of gBSDtoBin and DIA Reference and Utility Software
Modules
MDS Room FIR
Wednesday
Afternoon
(14h00-17h45)
ER (14h00 15h00)
14399
Conformance tests for DIDL documents - files
Some Editorial Update for ISO/IEC 21000-4/FPDAM1 MPEG-21
IPMP Components Base Profile
Late comment for ISO/IEC 21000-4/FPDAM1 MPEG-21 IPMP
Components Base Profile
Contribution for MPEG-21 IPMP Components Base Profile
Conformance
SYSTEMS
Jihun Cha YeSun Joung
Young-Kwon Lim KyungAe
Moon
Ideas on MPEG-21 and LASeR
Thursday
Morning (9h00-12h30)
Breakout Issues
MPEG-7
QF (11h00-12h00)
Thursday
Afternoon
(14h00-19h00)
MPEG-7 Query
Discussions
(14h00 15h00)
DID
discussions (15h00-16h00)
Plenary MDS
and Reports of
BoG (16h00 18h00)
Reqts joint with JPEG
MDS Room FIR
Reqts
MDS Room FIR
Further review
of Output
documents,
AHGs, CEs,
DoC, Std
(18h00+++)
Friday Morning
(09h00-13h00)
MDS Room FIR
Wrapping up
(09h00 13h00)
MDS Room FIR
Approval of resolutions,
AHGs and Output
documents
Friday
Afternoon
(14h00-21h00)
MPEG Plenary
Contact: Ian S
Burnett
Plenary room
x
Annex H – Video report
Source: Jens-Rainer Ohm, Gary J. Sullivan (Video), Miroslaw Z. Bober (MPEG-7 Visual)
20 MPEG-4 Visual Simple Profile Level 6
The specification text and conformance part related to the new level 6 (720p resolution) of
MPEG-4 Visual Simple Profile have progressed as expected. Comments made by NBs had been
mostly of an editorial nature.
Documents reviewed:
14383 Summary of Voting on ISO/IEC 14496-2:2004/PDAM 4 (SC 29 Secretariat)
14386 Summary of Voting on ISO/IEC 14496-4:2004/PDAM 28 (SC 29 Secretariat)
Documents approved:
No. Title TBP Available
14496-2 Visual
8948 Disposition of Comments on ISO/IEC 14496-2:2004/PDAM4 No 07/04/27
8949 Text of ISO/IEC 14496-2:2004/FPDAM4 Simple Profile Level 6 No 07/04/27
8952 Disposition of Comments on ISO/IEC 14496-4:2004/PDAM28 No 07/04/27
8953 Text of ISO/IEC 14496-4:2004/FPDAM28 Visual Simple Profile Level 6 Conformance Testing No 07/04/27
21 MPEG-4 Video Conformance Corrigenda
Errors in MPEG-4 Video conformance bitstreams (incorrect signaling of low delay mode) were
reported in 14358. It was decided to issue a new corrigendum directly, without a preceding defect
report, because no other problems with the conformance streams are currently foreseen. In the same
context, an editorial error made when the 2004 edition was produced is corrected: bitstreams
relating to studio profile, FGS, ASP and new levels were attached to the new edition by mistake,
even though their description is included only in Amd.1 and Amd.3 of the new edition. As a
consequence, various studio profile and ASP streams would now have been missing from Amd.1,
so that another corrigendum on that part became necessary.
Documents reviewed:
14358 Additional fixes on MPEG-4 video conformance bitstreams (Yi-Shin Tung, Ja-Ling Wu)
Documents approved:
No. Title TBP Available
14496-4 Conformance testing
8950 Text of ISO/IEC 14496-4:2004/DCOR4 No 07/04/27
8951 Text of ISO/IEC 14496-4:2004/Amd.1/DCOR2 No 07/06/29
22 MPEG-7 Visual
22.1 MPEG-7 Visual related work in San Jose
The MPEG-7 breakout group was active during the whole week. Input documents related to the
Visual part in 15938-3 and Photo Player MAF (23000-3) are listed in the table below. All of
these documents were reviewed and discussed.
14350 Mathematical consideration on the degree of geometrical modification (Weon-Geun Oh, Ju-Kyoung Jin, A-Young Cho, Jun-Woo Lee, Ik-Hwan Cho, Won-Keun Yang, Dong-Seok Jeong)
14406 CE Report for VCE-5 (Sangki Kim, Hyobin Lee, Sangyoun Lee)
14412 Modified GST Based Descriptor for MPEG-7 VCE-6 Complex Condition (Weon-Geun Oh, Won-Keun Yang, Dong-Seok Jeong)
14436 CE report for VCE-3 on person identity-based photo indexing (Ryoma Oami)
14439 CE report for VCE-7 on video signature (Kota Iwamoto, Ryoma Oami)
14440 Proposal of CE procedure for VCE-7 (Kota Iwamoto, Ryoma Oami)
14470 Improved Image Identifier (VCE6) (Paul Brasnett, Miroslaw Bober)
14471 Modification of VCE6 Experimental Conditions (Paul Brasnett, Miroslaw Bober)
14472 VCE7 Experimental Conditions (Paul Brasnett, Miroslaw Bober)
14523 New Visual Identifier for MPEG-7 VCE-6 Basic Condition (A-Young Cho, Ik-Hwan Cho, Jun-Woo Lee, Weon-Geun Oh, Dong-Seok Jeong)
Summary of key work items:
• Review of the Core Experiment results and future planning
  - VCE-3 – Face-based Annotation
  - VCE-5 – Evaluation of MPEG-7 Face Recognition Technology on IR Images
  - VCE-6 – Image Signatures
  - VCE-7 – Video Signatures
• Photo Player MAF
  - S/W development – second version
  - Review of the Protected Photo Player proposal
  - Study Text of ISO/IEC 23000-3/PDAM1 Reference Software for Photo Player MAF
• Editorial work, Maintenance and Software development
• Joint meeting with JPEG-search
Results of Core Experiments:
Much of the time during the week was spent discussing the core experiment on Visual Identifiers
(VCE-6). An improved version of the current XM algorithm was presented, based on the Trace
Transform (see M14470). There was a competing contribution (M14523), using the “concentric
circle-based visual identifier", which showed inferior performance to M14470, despite exceeding
that of the previous XM version. Consequently, the proposed modifications of M14470 were
adopted. The experimental conditions were tightened for the continuing CE, in order that
differences between the algorithms may become more apparent, by the next meeting. In revising
the experimental conditions, account was taken of two other proposals (M14552 & M14471).
The image database used in the CE for independence testing has itself been found to contain
several duplicated images. Significant time and effort was dedicated to agreeing which pairs of
images are modified copies of one another and which are independent (i.e., different) images.
The final list will be agreed by consensus on the reflector.
There was also a contribution (M14412) to the other part ("Complex Condition") of VCE-6, but
this was deemed to not yet have performance sufficient for adoption into the XM.
There was one contribution each for VCE3 on person-identity-based photo clustering (M14436)
and VCE-5 on IR-sensor-based face recognition (M14406). Both experiments will continue.
The former has been hampered by the lack of source code for the (prospective) reference method,
from Samsung. In the latter, a key milestone in the coming period will be the distribution of an
IR face-image database for other participants to share.
Three contributions were made to VCE-7 on the Video Identifier (M14439 , M14440, and
M14472). Video sequences used previously in MPEG-7 visual core experiments were shared
amongst the participants for use in VCE-7; however, sufficient test material has not yet been
accumulated for the independence test (in which the recall-bias will be set to achieve a predetermined false positive rate). The experimental conditions were amended in response to the
inputs.
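Setting the operating point of an independence test for a predetermined false positive rate can be sketched as follows. This is a minimal illustration, not the CE procedure: it assumes that comparing two identifiers yields a scalar distance, that a pair is declared a match when its distance is strictly below the threshold, and that all distances passed in come from pairs known to be independent. The function names are hypothetical.

```python
import math


def threshold_for_target_fpr(independent_distances, target_fpr):
    """Largest threshold whose false positive rate stays within target_fpr."""
    if not independent_distances:
        raise ValueError("need at least one independent pair")
    if not 0.0 <= target_fpr <= 1.0:
        raise ValueError("target_fpr must lie in [0, 1]")
    ordered = sorted(independent_distances)
    # Number of independent pairs we may wrongly accept as matches.
    allowed = math.floor(target_fpr * len(ordered) + 1e-9)
    if allowed >= len(ordered):
        return math.inf
    # Distances strictly below this value number at most `allowed`.
    return ordered[allowed]


def false_positive_rate(independent_distances, threshold):
    """Fraction of independent pairs declared a match at this threshold."""
    hits = sum(1 for d in independent_distances if d < threshold)
    return hits / len(independent_distances)


distances = [0.9, 0.1, 0.5, 0.7, 0.3, 0.8, 0.6, 0.4, 0.2, 1.0]
threshold = threshold_for_target_fpr(distances, 0.25)
measured = false_positive_rate(distances, threshold)
```

With very low target rates the estimate is only as good as the number of independent pairs available, which is why the amount of accumulated test material matters for this test.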
New Amendment:
The working draft of a new amendment to 15938-3 (defining Visual identifiers for different
purposes) is planned for the July meeting.
Editorial work, Maintenance and Software:
• COR1 of 15938-3:2002/Amd.2 was produced (related to perceptual 3D shape)
• COR1 of 15938-6:2003/Amd.1 was produced (related to color temperature)
• FDAMs of software and conformance related to the Perceptual 3D Shape descriptor were
produced
• Study of PDAM1 of 23000-3 was produced (new stabilized version of reference software)
22.2 Output documents related to MPEG-7 Visual
No. Title TBP Available
15938-3 Visual
8969 Text of ISO/IEC 15938-3:2002/Amd.2:2006/Cor.1 (Perceptual 3D Shape) No 07/04/27
8970 MPEG-7 Visual XM Document version 30.0 No 07/04/27
8971 Description of Core Experiments for MPEG-7 New Visual Extensions No 07/04/27
15938-6 Reference Software
8972 Disposition of Comments on ISO/IEC 15938-6:2003/Amd.1:2006/DCOR 1 No 07/04/27
8973 Text of ISO/IEC 15938-6:2003/Amd.1:2006/Cor.1 (Color Temperature) No 07/04/27
8974 Disposition of Comments on ISO/IEC 15938-6:2003/FPDAM2 No 07/04/27
8975 Text of ISO/IEC 15938-6:2003/FDAM2 (Perceptual 3D Shape) No 07/04/27
15938-7 Conformance testing
8976 Disposition of Comments on ISO/IEC 15938-7:2003/FPDAM3 No 07/04/27
8977 Text of ISO/IEC 15938-7:2003/FDAM3 (Perceptual 3D Shape) No 07/04/27
22.3 Output documents related to MPEG-A Photo Player MAF
No. Title TBP Available
23000-3 Photo Player Application Format
8978 Study Text of ISO/IEC 23000-3/PDAM1 Reference Software for Photo Player MAF No 07/05/25
23 Miscellanea
14468 Performance of a Distributed Video Codec in Presence of Transmission Errors (Nicola Adami, Riccardo Leonardi, Pierangelo Migliorati, Claudia Tonoli)
The contribution reports that Distributed Video Coding (DVC) approaches currently lag
significantly behind conventional codecs. It is, however, claimed that DVC would have
good properties in terms of error resilience. For the experiments, a scenario is used in which key
frames are encoded by AVC and Wyner-Ziv coded frames are interpolated from them (similar to
B frames, but without motion information). Good performance is found when the “side
information” (key frames) is undistorted. When the side information is distorted, intra error
concealment is used in addition. It is shown that the Wyner-Ziv coded frames are quite robust,
while the key frames degrade significantly. The rate for Wyner-Ziv coded frames is significantly
higher than for key frames (the opposite of what would be the case for B frames).
It was noted to the contributors that it would be interesting to compare this against a version in
which the overhead rate caused by Wyner-Ziv coding is instead used for error protection, or for
unequal error protection as would be possible when using SVC with hierarchical B frames.
24 23002 MPEG-C Video Technologies
According to the current policy, any software and conformance supplement would be included in
the respective part of MPEG-C, and not concentrated in a dedicated part as had been the case
for previous standards. Therefore, a resolution was issued recommending that ITTF make the
reference software and conformance testing bitstreams of ISO/IEC 23002-x freely available on
the ITTF website.
24.1 23002-1
Again, an improved version of the software for IDCT conformance testing was provided for the
FPDAM1 text. It not only contains the software that makes it possible to perform the precision
test as described in the standard, but also allows the performance of an IDCT to be explored in a
full test bed, currently supporting MPEG-2 part 2 and MPEG-4 part 2.
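The general shape of such a precision test can be sketched in Python: a candidate integer-output IDCT is compared against a rounded floating-point reference over many random blocks, and error statistics are accumulated. This is only an illustration under stated assumptions. The coefficient range, block count and the truncating candidate are chosen for the example, and the pass/fail limits defined in the standard are not reproduced here.

```python
import math
import random

N = 8
# COS[x][u] = cos((2x + 1) * u * pi / 16), the 8-point IDCT basis.
COS = [[math.cos((2 * x + 1) * u * math.pi / (2 * N)) for u in range(N)]
       for x in range(N)]
SCALE = [math.sqrt(0.5) if u == 0 else 1.0 for u in range(N)]


def reference_idct(block):
    """Direct floating-point 8x8 IDCT of block[u][v]."""
    out = [[0.0] * N for _ in range(N)]
    for x in range(N):
        for y in range(N):
            total = 0.0
            for u in range(N):
                for v in range(N):
                    total += (SCALE[u] * SCALE[v] * block[u][v]
                              * COS[x][u] * COS[y][v])
            out[x][y] = total / 4.0
    return out


def truncating_idct(block):
    """Deliberately crude candidate: truncates instead of rounding."""
    return [[int(v) for v in row] for row in reference_idct(block)]


def random_blocks(count, low=-256, high=255, seed=1):
    """Reproducible random coefficient blocks (range is an assumption)."""
    rng = random.Random(seed)
    return [[[rng.randint(low, high) for _ in range(N)] for _ in range(N)]
            for _ in range(count)]


def accuracy_metrics(candidate_idct, blocks):
    """Peak, mean-square and mean error against the rounded reference."""
    peak, sq_sum, err_sum, count = 0, 0.0, 0.0, 0
    for block in blocks:
        ref = reference_idct(block)
        cand = candidate_idct(block)
        for x in range(N):
            for y in range(N):
                err = cand[x][y] - round(ref[x][y])
                peak = max(peak, abs(err))
                sq_sum += err * err
                err_sum += err
                count += 1
    return {"peak_error": peak,
            "mean_square_error": sq_sum / count,
            "mean_error": err_sum / count}


metrics = accuracy_metrics(truncating_idct, random_blocks(20))
```

A real test would feed the same blocks to the IDCT under test and compare the resulting statistics with the limits in the specification.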
No. Title TBP Available
23002-1 Accuracy specification for implementation of integer-output IDCT
8980 Disposition of Comments on ISO/IEC 23002-1/PDAM1 No 07/04/27
8981 Text of ISO/IEC 23002-1/FPDAM1 Software for Integer IDCT Accuracy Testing No 07/05/31
24.2 23002-2 Fixed-point DCT/IDCT
The progression to FCD happened very smoothly, without any major conflicts.
• Substantial input was provided by the editors for improving the overall quality of the text.
• Based on results from the CE, one small change in the algorithm was made (saving two shifts
without penalizing performance; see the ISG report for more details).
• It was decided to perform the row transform first, which has no impact on complexity or
performance but is more consistent with other DCT/IDCT algorithms in the market, so that
it may cause less drift in cases where a different DCT/IDCT is used at the other end.
• More investigations were made on the problem with the quarter-pel motion interpolation filter
in MPEG-4 part 2, which causes more critical drift when, in addition, different transforms
are used at encoder and decoder. It has been verified that the DCT/IDCT algorithm of
23002-2 is in fact more resistant to this phenomenon.
• It was planned to include a software implementation of the algorithm in the standard, and
such software was added to the FCD.
• The word “implementation” was removed from the title of the draft standard, to avoid the
impression that the standard requires a particular method of implementing the design (when
in fact it prescribes only the result to be obtained by an implementation).
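Why the order of the row and column passes can matter for drift, even though it has no effect on the exact result, can be illustrated with a toy model. The sketch below is not the 23002-2 algorithm: it uses a floating-point separable IDCT and, as a stand-in for limited intermediate precision, optionally rounds every value to an integer after each 1-D pass. In exact arithmetic both orders agree; with intermediate rounding each order stays close to the exact result, but the two are no longer guaranteed to be identical to each other.

```python
import math
import random

N = 8
# Orthonormal 1-D IDCT basis, BASIS[x][u].
BASIS = [[(math.sqrt(1.0 / N) if u == 0 else math.sqrt(2.0 / N))
          * math.cos((2 * x + 1) * u * math.pi / (2 * N))
          for u in range(N)] for x in range(N)]


def idct_1d(vec, quantize):
    """1-D IDCT; optionally round the outputs (toy precision model)."""
    out = [sum(BASIS[x][u] * vec[u] for u in range(N)) for x in range(N)]
    return [float(round(v)) for v in out] if quantize else out


def transpose(block):
    return [list(col) for col in zip(*block)]


def idct_2d(block, rows_first, quantize):
    """Separable 2-D IDCT with a selectable pass order."""
    if rows_first:
        stage1 = [idct_1d(row, quantize) for row in block]
        return transpose([idct_1d(col, quantize) for col in transpose(stage1)])
    stage1 = transpose([idct_1d(col, quantize) for col in transpose(block)])
    return [idct_1d(row, quantize) for row in stage1]


def max_abs_diff(a, b):
    return max(abs(p - q) for ra, rb in zip(a, b) for p, q in zip(ra, rb))


rng = random.Random(7)
coeffs = [[rng.randint(-256, 255) for _ in range(N)] for _ in range(N)]
exact_rows = idct_2d(coeffs, True, False)
exact_cols = idct_2d(coeffs, False, False)
rounded_rows = idct_2d(coeffs, True, True)
rounded_cols = idct_2d(coeffs, False, True)
```

Comparing rounded_rows with rounded_cols on many blocks gives a feel for how much mismatch an encoder and decoder using opposite pass orders could accumulate in a prediction loop.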
Documents reviewed:
14310 Study Text of ISO/IEC 23002 CD (editors input) (Yuriy A. Reznik, Gary Sullivan, Arianne T. Hinds)
14311 Study Text of ISO/IEC 23002-1/PDAM1 (editors input) (Yuriy Reznik)
14346 Updated 23002-1 IDCT precision testbed (Yuriy Reznik)
14347 Updated H.263-based IDCT testbed (Yuriy Reznik, Arianne Hinds)
14348 Updated MPEG-4 IDCT Testbed (Arianne T. Hinds)
14359 Consider row-transform-first IDCT in 23002-2 (Yi-Shin Tung, Ja-Ling Wu)
14379 Updated T.83 testbed for IDCTs (Arianne T. Hinds)
14380 Updated MPEG-2 IDCT Testbed (Zhibo Ni)
14403 Updated TM5 MPEG-2 Testbed (Arianne T. Hinds)
14469 Crosscheck for IDCT CD (Honggang Qi, Wen Gao, Debin Zhao, Siwei Ma)
14485 IDCT Core Experiment Results (Zhibo Ni, Lu Yu)
14506 Summary of core experiments on fixed point IDCT/DCT (Yuriy Reznik)
14509 Cross-check of IDCT conformance tests (Yuriy Reznik)
14531 Fixed-Point IDCT Conformance Tests (Arianne T. Hinds)
14544 On the Problem of Quarter Pixel Motion Compensation (Zhibo Ni, Lu Yu)
Output Documents:
No. Title TBP Available
23002-2 Fixed-point 8x8 IDCT and DCT
8982 Disposition of Comments on ISO/IEC CD 23002-2 No 07/04/27
8983 Text of ISO/IEC FCD 23002-2 Fixed-point 8x8 IDCT and DCT No 07/05/04
24.3 23001-4 and 23002-4 Reconfigurable Video Coding (RVC)
(High-level summary, for details on particular documents see ISG report)
MPEG-B related CE
14446 MPEG-B Proposed Text of RVC CE
Notes:
• reorganization of CE structure stated
• Move CE 1.1 -> CE 2 (implementation)
Recommendation: breakout meeting for each party
MPEG-B
Notes
MPEG-B
Notes
MPEG-B
Study on RVC Framework and Its Requirements
 Need to evaluate CE results with RVC requirements identified.
Core Experiment Result on CDDL
14445
 Compression results are given
Compression of the RVC DDL Decoder Description with BiM
(results of Core Experiment 1.3 in RVC)
14340
Notes
 Compression results are given
MPEG-B
Extension to support non-MPEG standards (ICT/ZJU) (Results of
CE 1.6)
14473
Notes
 Some modifications should be made on the design of syntax
parsing
 Restructuring of CE on MPEG-B part (CE 1) is done.
 Common ground of understanding and conducting CE is needed for better evaluation and
convergence between tools.
14447
MPEG-C related CE (discussed on Tuesday)
14301 MPEG-C RVC Functional Units naming process proposal
14375 MPEG-C Conformance test tools of RVC functional units
14374 MPEG-C Functional units of inter-prediction under reasonable system partition for RVC framework
14416 MPEG-C Implementation of B frame support in RVC CAL Model
14454 MPEG-C Implementation of multiple reference frame support in RVC CAL model
14448 MPEG-C Proposed text of the RVC FUs for MPEG-4 AVC (Results of CE 2.2)
14457 MPEG-C A scheme for implementing MPEG-4 SP codec in the RVC framework
14480 MPEG-C Implementation of MPEG-4 AVC Deblocking Filter in RVC CAL model
14490 MPEG-C Reconfigurability potential of the MPEG-4 SP decoder (results of CE 1.1)
- M14301: will be adopted in VTL WD. Further CEs will be continued.
- M14375: will be adopted in VTL conformance WD. Editor: Kris
- M14374: workplan will be updated for further FU implementation
- M14416: Bug fixing will be done in RSM implementation.
- M14454: will be adopted in VTL WD.
- M14448: FU textual description will be adopted in VTL WD & RSM. FU naming has to
follow the new naming rule
- M14457: For information.
- M14490: A figure should be added in the WD. The work will continue as CE 1.
- M14480: FU textual description will be adopted in VTL WD & RSM.
Exploration Experiments (EE) related
14474 MPEG-B Exploration experiments of AVS decoder description in RVC framework
- M14474: Will continue EE. EE should look at standardized token specification.
Comparison with FUs.
Other issues
14510 MPEG-C Proposal for adding ISO/IEC 23002-2 in RVC tool library
14463 MPEG-C Evolutions of RVC so as to handle SVC decoding
- M14510: will be integrated into VTL WD. FU implementation will continue till the next
meeting.
- M14463: welcomes the contribution and expects more development by the next meeting.
Regarding the work plan (which was very ambitious), the completeness of the tool library
23002-4 in particular has not yet reached the expected status. Currently, it is estimated that
only 20-30% of all MPEG video coding tools are fully described and implemented. It is of course
useful to concentrate the work on the most relevant standards; however, the tools of AVC baseline
had not been finished before the San Jose meeting as originally planned. MPEG-4 Simple Profile
(with fixed parser as FU) will be fully available at the next meeting (this is the minimum that
should go into the first version of the toolbox, or which may be added as future amendments).
From the new work plan, it is expected that all of the most relevant profiles of MPEG-2, MPEG-4
Visual and MPEG-4 AVC will not be fully implemented before January 2008. Therefore, it was
decided to delay production of the CD until July. In the case of 23001-4, one key issue is
still how parsers can be constructed, for which two different solutions are currently on
the table:
- via description of bitstream, possibly BSDL
- as CAL-based FU(s)
In particular for the latter possibility, it is still necessary to clarify which parts of the
parser will go into MPEG-B and MPEG-C. Further evaluation based on CE results will be necessary
to find out which is the best solution. Evaluation criteria for this are agreed (see report of
ISG for more details).
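To illustrate the first of the two options above (a parser driven by a description of the bitstream), the following is a minimal sketch in Python. It does not use BSDL, and it is not taken from any RVC working draft: the field table, the field names and the helper classes are hypothetical and serve only to show a generic parser whose behaviour is determined entirely by a declarative description.

```python
from typing import Dict, List, Tuple


class BitReader:
    """Reads unsigned integers MSB-first from a byte string."""

    def __init__(self, data: bytes) -> None:
        self.data = data
        self.pos = 0  # current position in bits

    def read(self, nbits: int) -> int:
        value = 0
        for _ in range(nbits):
            if self.pos >= 8 * len(self.data):
                raise EOFError("bitstream exhausted")
            byte = self.data[self.pos // 8]
            bit = (byte >> (7 - self.pos % 8)) & 1
            value = (value << 1) | bit
            self.pos += 1
        return value


# Hypothetical header description: (field name, width in bits).
# A real description language would also express conditions and loops.
HEADER_DESCRIPTION: List[Tuple[str, int]] = [
    ("start_code", 8),
    ("width_in_mbs", 6),
    ("height_in_mbs", 6),
    ("quant", 4),
]


def parse(description: List[Tuple[str, int]], data: bytes) -> Dict[str, int]:
    """Generic parser: the description, not the code, defines the syntax."""
    reader = BitReader(data)
    return {name: reader.read(width) for name, width in description}


if __name__ == "__main__":
    print(parse(HEADER_DESCRIPTION, bytes([0xB0, 0b00101100, 0b10010111])))
```

With the second option, the same syntax knowledge would instead be written directly into the code of a functional unit, which is why the split of the parser between MPEG-B and MPEG-C needs to be clarified for that case.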
Liaison with AVS
14541 MPEG-B/MPEG-C Liaison Statement to MPEG on RVC
- AVS has provided specification & software of their standard (as necessary for the current EE)
to MPEG
- Even though there is no necessity for a “joint standard” on RVC, AVS representative(s) are
highly welcome (as liaison) to participate in RVC development, in particular for the
possibility of using 23001-4 with non-MPEG toolboxes.
- It is in MPEG’s own interest that the framework is generically applicable to non-MPEG
standards
- Clear distinction between MPEG and non-MPEG toolboxes is necessary
- A registration mechanism for non-MPEG toolboxes will be needed
- To reflect the outcome of this discussion, the following wording is included in the RVC
project description: “The project is about developing a full collection of individual coding
tools organized in the video tool library and a generic framework that can be used to make an
implementation of any MPEG video coding standard and additionally is capable of
supporting the implementation of video coding standards from other organizations with
which a collaboration can be established.”
- As part of this project, an identification mechanism will be developed whereby MPEG video
coding tools will be identified by MPEG and video coding tools from other organizations can
be identified via a registration authority.
- A new version of the requirements document will be edited, including a statement like this as
well.
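The identification mechanism described above is not yet specified. As a rough sketch only, one possible shape is a registry in which MPEG tools live in a reserved namespace while tools from other organizations are accepted only after that organization has been registered. All class names, namespace strings and tool names below are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Set

MPEG_NAMESPACE = "MPEG"  # reserved namespace (illustrative assumption)


@dataclass(frozen=True)
class ToolId:
    namespace: str  # organization that defined the tool
    name: str       # tool name within that namespace


class ToolRegistry:
    """Keeps MPEG and non-MPEG coding tools clearly distinguished."""

    def __init__(self) -> None:
        self._external: Set[str] = set()
        self._tools: Dict[ToolId, str] = {}

    def register_namespace(self, namespace: str) -> None:
        """Stand-in for the registration authority step."""
        if namespace == MPEG_NAMESPACE:
            raise ValueError("the MPEG namespace is reserved")
        self._external.add(namespace)

    def add_tool(self, tool: ToolId, description: str) -> None:
        if (tool.namespace != MPEG_NAMESPACE
                and tool.namespace not in self._external):
            raise KeyError(f"namespace {tool.namespace!r} is not registered")
        self._tools[tool] = description

    def is_mpeg_tool(self, tool: ToolId) -> bool:
        return tool in self._tools and tool.namespace == MPEG_NAMESPACE


if __name__ == "__main__":
    registry = ToolRegistry()
    registry.add_tool(ToolId("MPEG", "idct8x8"), "example MPEG tool")
    registry.register_namespace("AVS")
    registry.add_tool(ToolId("AVS", "idct8x8"), "example non-MPEG tool")
    print(registry.is_mpeg_tool(ToolId("AVS", "idct8x8")))
```

Keeping the namespace as part of the identifier means that two organizations can use the same tool name without collision, and a decoder description can state unambiguously which toolbox each tool comes from.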
Documents reviewed:
14301 [Christophe Lucarz, Marco Mattavelli, Andrew Kinane, Sunyoung Lee, Sinwook Lee] RVC Functional Units naming process proposal
14340 [Christophe Lucarz, Marco Mattavelli] Compression of the RVC DDL Decoder Description with BiM (results of Core Experiment 1.3 in RVC)
14374 [Gwo Giun Lee, He-Yuan Lin, Ming-Jiun Wang] Functional units of inter-prediction under reasonable system partition for RVC framework
14375 [Gwo Giun Lee, He-Yuan Lin, Ming-Jiun Wang] Conformance test tools of RVC functional units
14416 [Jar-Sheng Chen, Chun-Jen Tsai] Implementation of B frame support in RVC CAL Model
14445 [Giseok Son, Sinwook Lee, Euee S. Jang] Core Experiment Result on CDDL
14446 [Hyungyu Kim, Euee S. Jang] Proposed Text of RVC CE
14447 [Jaebum Jun, Sunyoung Lee, Euee S. Jang] Study on RVC Framework and Its Requirements
14448 [Yoshihisa Yamada, Kenji Otoi, Kohtaro Asai] Proposed text of the RVC FUs for MPEG-4 AVC (Results of CE 2.2)
14454 [Christophe Lucarz, Marco Mattavelli] Implementation of multiple reference frame support in RVC CAL model
14457 [Ghislain Roquier, Maxime Pelcat, Mickaël Raulet, Matthieu Wipliez, Jean-François Nezan, Olivier Déforges] A scheme for implementing MPEG-4 SP codec in the RVC framework
14463 [Maxime Pelcat, Médéric Blestel, Mickaël Raulet, Jean-François Nezan, Olivier Déforges] Evolutions of RVC so as to handle SVC decoding
14473 [Honggang Qi, Wen Gao, Tiejun Huang, Lu Yu] Extension to support non-MPEG standards (ICT/ZJU) (Results of CE 1.6)
14474 [Honggang Qi, Wen Gao, Lu Yu, Euee S. Jang, Marco Mattavelli, Andrew Kinane] Exploration experiments of AVS decoder description in RVC framework
14480 [Paul Schumacher] Implementation of MPEG-4 AVC Deblocking Filter in RVC CAL model
14490 [Christophe Lucarz, Marco Mattavelli, Joseph Thomas-Kerr, Jörn Janneck] Reconfigurability potential of the MPEG-4 SP decoder (results of CE 1.1)
14510 [Yuriy Reznik] Proposal for adding ISO/IEC 23002-2 in RVC tool library
14546 [Jorn Janneck, Marco Mattavelli] Description of Tools for the RVC framework: editors, simulator, SW and HDL code generator
Output Documents:
No.   Title                                                           TBP  Available
      23001-4 Codec Configuration Representation
8979  WD 4 of ISO/IEC 23001-4                                         No   07/05/04
      23002-4 Video Tool Library
8984  WD 4 of ISO/IEC 23002-4                                         No   07/05/25
8985  Description of Core Experiments in RVC                          No   07/05/04
8986  RVC Simulation Model (RSM) V4.0                                 No   07/05/25
8987  RVC Work Plan                                                   No   07/05/04
8988  RVC Conformance Testing Working Draft 1.0                       No   07/05/14
8989  Description of Exploration Experiments for Toolbox Extensions   No   07/05/14
Annex I– JVT report
Source: Jens-Rainer Ohm, Gary J. Sullivan, Thomas Wiegand, and Ajay Luthra
1
Abstract
The Joint Video Team (JVT) of ITU-T Q.6/16 and ISO/IEC JTC 1/SC 29/WG 11 held its 23rd
meeting during April 21-27, 2007 in San Jose, CA, USA. The JVT meeting was held under the
chairmanship of Dr. Gary Sullivan (Microsoft/USA) and Dr. Jens-Rainer Ohm (RWTH
Aachen/Germany), and under the associate chairmanship of Dr. Thomas Wiegand (Fraunhofer
HHI/Germany) and Dr. Ajay Luthra (Motorola/USA). The JVT meetings opened at
approximately 14:30 on Saturday 21 April 2007 and closed at approximately 13:50 on Friday 27
April 2007. Approximately 185 people attended the JVT meetings and approximately 130 input
documents were discussed. The meetings took place in a co-located fashion with a meeting of
ISO/IEC JTC 1/SC 29/WG 11 (MPEG) – one of the two parent bodies of the JVT. The subject
matter of the JVT meeting activities consisted of work on video coding.
2
Contents
1. Abstract
2. Contents
3. Documents of the JVT meeting
3.1. Input documents
3.1.1 Administrative input contributions
3.1.2 Input liaison statements, WG 11 NB inputs and other noted WG 11 inputs
3.1.3 Non-administrative input contributions
3.1.4 Late-registered input contributions
3.2. Late document availability
3.3. Withdrawn document registrations
3.4. Major output documents
JVT-W200 Meeting report of the 23rd JVT meeting [07/05/20]
JVT-W201-M (WG 11 N8962) Joint Draft 10: Scalable Video Coding [07/05/31]
JVT-W202-M (WG 11 N8963) Joint Scalable Video Model (JSVM) 10 [07/05/31]
JVT-W203-M (WG 11 N8964) JSVM 10 Software [07/06/29]
JVT-W204-M (WG 11 N8955) WD 1 conformance test spec for Prof Prof (Teruhiko Suzuki) [07/06/29]
JVT-W205-M (WG 11 N8957) WD 1 conformance test for SVC (V. Bottreau) [07/06/29]
JVT-W206-M (WG 11 N8959) WD reference software for Prof Prof [07/06/29]
JVT-W207-M (WG 11 N8967) Joint Multi-view Video Model (JMVM) 4 [07/05/18]
JVT-W208-M (WG 11 N8968) JMVM 4 Software [07/05/31]
JVT-V209-M (WG 11 N8966) Joint draft 3 Multi-view Video Coding [07/02/09]
JVT-W211-M (WG 11 N8961) WD reference software for SVC [07/06/29]
JVT-W212-M (WG 11 N8965) Verification test plan for SVC [07/05/18]
3.5. JVT internal output documents
JVT-W210-M ITU-T Rec. H.264 | ISO/IEC 14496-10 Advanced Video Coding Defect Report [07/06/18]
3.6. SVC core experiment output documents
JVT-W301 CE 1 on SVC subband techniques
JVT-W302 CE 2 on SVC bit depth and chroma format scalability
3.7. MVC core experiment output documents
JVT-W303 CE 3 on MVC view interpolation/synthesis
4. JVT administrative and liaison topics
4.1. IPR policy reminder and update
4.2. Meeting opening remarks by the chairmen
4.3. JVT communication practices
4.4. Scheduling and logistics notes
4.5. Closing session notes
4.6. Administrative documents
JVT-W001 (Admin) [G. J. Sullivan, J.-R. Ohm, A. Luthra, T. Wiegand] AHG Report: Proj mgmt and errata
JVT-W002 (Admin) [K. Suehring, A. Tourapis, T. Suzuki] AHG Report: JM text, ref soft, bitstream, conf
JVT-W003 (Admin) [T. Suzuki] AHG Report: Professional applications
JVT-W004 (Admin) [J.-R. Ohm, T. Wiegand, M. Bober] AHG Report: Video annotation
JVT-W005 (Admin) [G. J. Sullivan, J. Luo] AHG Report: AVC splicing
JVT-W006 (Admin) [J. Vieron, M. Wien, H. Schwarz, T. Wiegand] AHG Report: JD & JSVM text and S/W
JVT-W007 (Admin) [A. Segall, S. Regunathan] AHG Report: SS resampling
JVT-W008 (Admin) [H. Schwarz, S. Regunathan, A. Eleftheriadis] AHG Report: SVC complexity reduction
JVT-W009 (Admin) [Y.-K. Wang, S. Pateux, P. Amon, T. Schierl] AHG Report: SVC high-level syntax, err resil
JVT-W010 (Admin) [Y. Gao, A. Segall, T. Wiegand] AHG Report: SVC bit depth and chroma format
JVT-W011 (Admin) [A. Vetro, P. Pandit] AHG Report: MVC high-level syntax & buffering
JVT-W012 (Admin) [H.-S. Koo] AHG Report: MVC motion/disparity vector coding
JVT-W013 (Admin) [H. Kimata, A. Smolic, P. Pandit, A. Vetro, C. Ying] AHG Report: JMVM & JD text editing
JVT-W014 (Admin) [H. Kimata, A. Smolic] AHG Report: MVC exper. framework & test cond
4.7. JVT liaison communications
M14548 WG 11 input [FLO Forum] Liaison statement from FLO Forum to WG 11
5. Scalable video coding
5.1. CE 1 & related docs: SVC FGS simplification
JVT-W090 (Prop 2.2/3.1) [H. Kirchhoffer, H. Schwarz, T. Wiegand] CE1: Simplified FGS
JVT-W115-QV (Late Info) [A. Segall] CE1: Verif JVT-W090 simplified FGS
JVT-W111 (Prop 2.2) [M. Karczewicz, S. Park, H. Chung] CE1: Report on FGS simplif
JVT-W124-QV (Late Info) [J. Ridge] CE1: Verif JVT-W111 FGS simplif
JVT-W121 (Prop 2.2.1/3.1) [J. Ridge, X. Wang] CE1: FGS refinement pass simplif
JVT-W119 (Prop 2.0/3.1) [Y. Bao, M. Karczewicz, X. Wang, J. Ridge, Y. Ye, W. J. Han, S. Y. Kim] CE1: FGS simplif
JVT-W120 (Info) [P. Yin] CE1: Verif JVT-W119 FGS simplif
5.2. CE 2 & related docs: SVC ESS improvement
JVT-W030 (Prop 2.2.1) [X. Wang, J. Ridge] CE2: Improvement of MB mode pred in ESS
JVT-W058 (Info) [E. Francois] CE2: Cross-check of JVT-W030 on ESS mode pred improvement
JVT-W117 (Prop 2.2/3.1) [Y. Ye, Y. Bao] CE2: Improved resid upsampling for ESS
JVT-W106-QV (Late Info) [X. Wang] CE2: Verif Qualcomm JVT-W117 improved resid upsamp for ESS
JVT-W105 (Prop 2.0) [X. Wang, J. Ridge] Study on residual upsampling without block boundary check under ESS
JVT-W109-LV (Late Info) [E. Francois] Verif JVT-W105 on residual upsampling without block boundary check under ESS
JVT-W123 (Prop NN2.2.1) [X. Wang, J. Ridge] Analysis of visual artifacts in ESS residual pred
5.3. CE 3 & related docs: SVC subband coding
JVT-W097 (Prop 2.2/3.1) [S.-T. Hsiang] CE3: Intra-frame dyadic spatial SVC based on subband/wavelet filter banks framework
JVT-W122-QV (Late Info) [J. Ridge] CE3: Verif JVT-W097 wavelet-based intra dyadic spatial SVC
5.4. CE 4 & related docs: SVC bit-depth scalability
JVT-W102 (Prop 2.2/3.1) [Y. Gao, Y. Wu] CE4: SVC bit-depth scalability simulation results
JVT-W116 (Info) [A. Segall] CE4: Verif JVT-W102 (Thomson prop)
JVT-W113 (Prop 2.2) [A. Segall, Y. Su] System for bit-depth scalable coding
JVT-W076 (Prop 2.2) [J. Jia, H. K. Kim, H. C. Choi, J. J. Yoo] SVC chroma format scalability
5.5. SVC high-level syntax
JVT-W020 (Prop 2.0) [Z. G. Li, S. Rahardja, S. L. Xie and W. Yao] Hypothetical reference decoder for video coding
JVT-W046 (Prop 2.2.1/3.1) [M. M. Hannuksela, Y.-K. Wang] Support for SVC header rewriting to AVC
JVT-W047 (Prop 2.2.1/3.1) [M. M. Hannuksela, Y.-K. Wang] Pictures not for output in SVC
JVT-W048 (Prop 2.2.1/3.1) [M. M. Hannuksela, Y.-K. Wang, Y. Chen] On SVC high-level syntax
JVT-W051 (Prop 2.2.1/3.1) [Y.-K. Wang, Y. Chen, M. M. Hannuksela] On SVC scalability information related SEI messages
JVT-W052 (Prop 2.2.1/3.1) [Y.-K. Wang, M. M. Hannuksela] SVC feedback based coding
JVT-W137-B (BoG) [I. Amonou, N. Cammas, S. Kervadec, S. Pateux] Revised syntax for quality layer SEI message
JVT-W053 (Prop 2.0/3.1) [M. M. Hannuksela, Y.-K. Wang, D. Singer, T. Rathgen] SVC priority_id value setting method indication
JVT-W064 (Prop 2.2/3.1) [J. Luo, L. Zhu, P. Yin, C. Gomila] VUI updates for SVC
JVT-W091 (Prop 2.2/3.1) [L. Cieplinski] HRD parameters for SVC bitstream rewriting
JVT-W114 (Prop 2.2) [A. Segall, J. Zhao] Showcase for transcoding scalability info SEI
JVT-W125 (Prop 2.2) [G. J. Sullivan] On SVC high-level syntax and HRD
JVT-W049 (Prop 2.2.1/3.1) [C. He, H. Liu, H. Li, Y.-K. Wang, M. M. Hannuksela] Redundant pictures in SVC
JVT-W050 (Prop 2.2.1/3.1) [Y.-K. Wang, M. M. Hannuksela] On tl0_pic_idx in SVC
JVT-W062 (Prop 2.2/3.1) [A. Eleftheriadis, S. Cipolli, J. Lennox] Improved error resilience using temporal level 0 picture index
JVT-W054 (Info) [I. Radulovic, Y.-K. Wang, S. Wenger, A. Hallapuro, M. M. Hannuksela] Multiple description coding using AVC redundant pictures
JVT-W068 (Prop 2.2/3.1) [C. Tu, S. Srinivasan, S. Regunathan, G. Sullivan] CE4: 4-tap MC interp for high-res SVC enh layers
JVT-W072 (Info) [H. Schwarz] Results comparing JSVM, 4-tap, and RCDO MC interp.
JVT-W027 (Info) [E. Francois, V. Bottreau, J. Vieron] Evaluation of 4 tap motion compensation interp
Discussion of potential rearrangement of NAL unit order
5.6. SVC applications and profiles
JVT-W075 (Prop 2.0/3.1) [M. Horowitz, A. Eleftheriadis] Max frame size for enh layers of SVC profiles <withdrawn>
JVT-W093 (Prop 2.2.1/3.1) [H. Chung, M. Karczewicz, J. Ridge, X. Wang, W. Han, S. Kim] SVC FGS profile
Profiles definition changes
5.7. SVC other normative design proposals
5.7.1 SVC restrictions on interlaced coding
JVT-W025 (Prop 2.0/3.1) [E. Francois, V. Bottreau, J. Vieron] Restrictions on interlaced coding in SVC
5.7.2 SVC smoothed reference prediction
JVT-W026 (Prop 2.0/3.1) [E. Francois, V. Bottreau, J. Vieron] Profile SVC B: Evaluation of smoothed ref pred
JVT-W118 (Prop 2.2) [Y. Ye, Y. Bao, W. J. Han, S. Y. Kim] Perf and complexity of smoothed ref pred
JVT-W126 (Info) [Z. He] Verif JVT-W118 perf and complexity of smoothed ref pred
JVT-W112-L (Late Prop 2.2) [A. Segall] Clarification of base_mode_flag <withdrawn>
5.7.3 SVC deblocking
JVT-W061 (Prop 2.2/3.1) [D. Hong, A. Eleftheriadis, O. Shapiro] Modified deblocking filter process in scalable extension
JVT-W063 (Prop 2.0/3.1 Layered Media, then 2.2 from Polycom) [D. Hong, A. Eleftheriadis, O. Shapiro] Deblocking filter for SVC to support multi-threading with slice boundary
JVT-W069 (Prop 2.2/3.1) [Z. He] Simplified H.264/AVC deblocking filter for SVC enh layer
JVT-W128-QV (Late Info) [Y. Ye] Verif of JVT-W069: Simplified deblocking for SVC enh layer
5.7.4 SVC spatial scalability resampling
JVT-W028 (Info) [E. Francois, V. Bottreau, J. Vieron] Evaluation of flexible 4-tap upsampling filters
JVT-W022 (Prop 2.2/3.1) [T. Tran, L. Liu, P. Topiwala] Dyadic spatial down- and up-sampling filters for SVC
JVT-W086 (Prop 2.2/3.1) [I. Amonou, N. Cammas, S. Kervadec, S. Pateux] Some consideration on the up-sampling position calculation
JVT-W136-B (BoG) [G. J. Sullivan, S. Pateux] BoG report on JVT-W086
5.8. SVC non-normative contributions
5.8.1 SVC editorial input
JVT-W070 (Text) [H. Schwarz, M. Wien] Editors input for SVC draft
JVT-W099 (Info) [J. H. Park, Y. H. Kim, B. H. Choi] Clarification of mb_qp_delta syntax
5.8.2 SVC tutorial material
JVT-W132-B (Requested Info) [T. Wiegand] Overview paper and presentation on SVC
5.8.3 SVC encoder and extractor optimization
JVT-W071 (Info) [H. Schwarz, T. Wiegand] Further results for an rd-opt. multi-loop SVC enc.
JVT-W029 (Info 2.2.1/3.1) [W.-H. Peng] Low-complexity mode decision algorithm for combined CGS and temporal scalability
JVT-W043 (Prop NN) [A. Leontaris, A. M. Tourapis] Rate Control for the Joint Scalable Video Model (JSVM)
5.9. SVC conformance
JVT-W138-B (BoG) [V. Bottreau] Toward an SVC conformance specification
5.10. SVC verification testing
JVT-W110 (Info) [E. Francois, V. Bottreau, J. Vieron] SVC verif test plan: Updated results for SVC High Profile intra
JVT-W131-B (Late Info) [D. Hong, A. Eleftheriadis] Verification bitstreams for SVC Profile A
JVT-W135-B (BoG) [I. Amonou, N. Cammas, S. Kervadec, S. Pateux] On SVC verif test plan
6. Multi-view coding
6.1. CE 5 & related docs: MVC illumination compensation
JVT-W024 (Prop 2.2/3.1) [W. S. Shim, M. W. Park, G. H. Park, D. Y. Suh, H. S. Song, Y. H. Moon, J. B. Choi] CE5 results- joint prop for MVC deblocking
JVT-W023 (Info) [S.-C. Lim, D.-H. Han, Y.-L. Lee] CE5: Verification of loop filtering in MVC
JVT-W031 (Prop 2.2) [J.-H. Yang] CE5: Illumination comp. info. derivation for MVC
JVT-W085 (Info) [Y. S. Ho, K. J. Oh, C. Lee, B. H. Choi, J. H. Park] CE5: Verification of JVT-W031 illumination comp. info. derivation
6.2. CE 6 & related docs: MVC view interpolation
JVT-W055 (Prop 2.2/3.1) [T. Senoh, M. Okui, K. Enami] Experimental results of camera-rotation-compensated prediction in CE6
JVT-W059 (Prop 2.2/3.1) [S. Yea, A. Vetro] CE6: View synthesis prediction
JVT-W084 (Info 2.2/3.1) [Y. S. Ho, K. J. Oh, C. Lee, B. H. Choi, J. H. Park] Observations of multi-view test sequences
JVT-W083 (Prop 2.2/3.1) [Y. S. Ho, C. Lee, K. J. Oh, B. H. Choi, J. H. Park] CE6: View interp pred for MVC
JVT-W103 (Info) [J.-H. Yang, S.-H. Lee] CE6: Verif GIST MVC contribution JVT-W083 MVC view interp pred
JVT-W096 (Prop 2.2/3.1) [S. Naito, A. Koike] CE6: Results on MVC
JVT-W087 (Prop 2.2/3.1) [S. Shimizu, H. Kimata] New view synthesis pred framework using resid pred
Anthony Vetro presents new CE6 work plan.
JVT-W133-B (BoG) [A. Vetro] BoG report on MVC view interpolation pred
6.3. MVC high-level syntax
184
JVT-W035 ( Prop 2.2.1/3.1) [Y. Chen, Y.-K. Wang, M. M. Hannuksela] On MVC JD 2.0 ................. 184
JVT-W036 ( Prop 2.2.1/3.1) [Y.-K. Wang, M. M. Hannuksela, Y. Chen] MVC output related
conformance .............................................................................................................................................. 185
JVT-W037 ( Prop 2.2.1/3.1) [Y. Chen, Y.-K. Wang, M. M. Hannuksela] View scalable SEI message for
MVC .......................................................................................................................................................... 185
JVT-W038 ( Prop 2.2.1/3.1) [Y. Chen, Y.-K. Wang, M. M. Hannuksela] Operation point and view
dependency changes SEI messages for MVC ........................................................................................... 185
JVT-W039 ( Prop 2.2.1/3.1) [Y.-K. Wang, Y. Chen, M. M. Hannuksela] Non-required pictures SEI
message for MVC ...................................................................................................................................... 186
JVT-W056 ( Prop 2.2) [J. B. Choi, W. S. Shim, H. S. Song, Y. H. Moon] Inter-view prediction reference
picture marking.......................................................................................................................................... 186
JVT-W066 ( Prop 2.2/3.1) [P. Pandit, P. Yin, C. Gomila] Ref pic list reordering for MVC ................. 186
JVT-W067 ( Prop 2.2/3.1) [P. Pandit, P. Yin, C. Gomila] H.264/AVC extension for MVC using SEI
message ..................................................................................................................................................... 187
JVT-W074 ( Prop 2.2) [H. S. Song, W. S. Shim, Y. H. Moon, J. B. Choi] Comments on view
dependency info ........................................................................................................................................ 187
JVT-W080 ( Prop 2.2.1) [K. Ugur, H. Liu, Y.-K. Wang] Showcase for parallel decoding info SEI
message for MVC ...................................................................................................................................... 187
JVT-W088 ( Prop 2.2) [S. Lin, S. Gao, Y. Liu, L. Xiong] H.264/AVC SEI extensions for MVC ........ 188
6.4. MVC other normative technical inputs 188
6.4.1 MVC motion/disparity vector coding ..................................................................... 188
JVT-W081 ( Prop 2.2) [H. S. Koo, Y. J. Jeon, B. M. Jeon] MVC motion skip mode ........................... 188
JVT-W139-B (BoG) [LG, Thomson] Break-out conclusions on JVT-W081............................................ 189
JVT-W073 ( Info) [K. Sohn, J. Seo] Verification of JVT-W081 LGE MVC motion skip contrib. ....... 189
JVT-W101 ( Prop 2.2) [H. Yan, J. Huo, Y. Chang, S. Lin, S. Gao, L. Xiong] MV/DV prediction based
on RDV ..................................................................................................................................................... 189
JVT-W104 ( Prop 2.2) [S.-H. Lee, S.-H. Lee, N.-I. Cho, J.-H. Yang] MVC disparity vector pred ....... 189
JVT-W107 ( Info) [K. Sohn, J. Seo] Verif JVT-W104 MVC disparity vector pred .............................. 189
6.4.2 MVC weighted prediction ....................................................................................... 190
JVT-W040 ( Prop 2.2.1/3.1) [S. Liu, Y. Chen, Y.-K. Wang, M. M. Hannuksela] Constraints on temporal
direct mode and weighted prediction in MVC........................................................................................... 190
JVT-W098 ( Prop 2.2) [J. H. Park, Y. H. Kim, J. W. Kim, B. H. Choi] Weighted prediction for MVC ... 190
6.4.3 MVC downsampled reference etc. .......................................................................... 190
JVT-W079 ( Prop 2.2/3.1) [H. Kimata, S. Shimizu, K. Kamikura, Y. Yashima] Inter-view prediction
with downsampled reference pictures ....................................................................................................... 190
JVT-W092 ( Prop 2.2/3.1) [P. Pandit, P. Yin, C. Gomila] Reduced resolution update for MVC .......... 191
JVT-W094 ( Info 2.2) [W. J. Tam] Image and depth quality of asymmetrically coded stereoscopic video
for 3D-TV .................................................................................................................................................. 191
6.4.4 MVC modes and other coding efficiency topics ..................................................... 192
JVT-W078 ( Prop 2.2/3.1) [H. Kimata, S. Shimizu, K. Kamikura, Y. Yashima] Co-located block
condition for inter-view prediction ............................................................................................................ 192
JVT-W082 ( Prop 2.2) [Y. J. Jeon, H. S. Koo, B. M. Jeon] Modified spatial direct mode in MVC ...... 192
JVT-W065 ( Prop 2.2/3.1) [P. L. Lai, A. Ortega, P. Pandit, P. Yin, C. Gomila] Adaptive reference
filtering for MVC ...................................................................................................................................... 192
6.4.5 MVC depth-based methods & displays .................................................................. 193
JVT-W095 ( Info 2.2) [W. J. Tam, L. Zhang] Depth map preproc and minimal content for 3D-TV using
depth-based rendering ............................................................................................................................... 193
JVT-W100 ( Prop 2.0/3.1) [A. Smolic, K. Mueller, P. Merkle, N. Atzpadin, C. Fehn, M. Mueller, O.
Schreer, R. Tanger, P. Kauff, T. Wiegand, T. Balogh, Z. Megyesi, A. Barsi] Multi-view video plus depth
(MVD) format for advanced 3D video systems......................................................................................... 193
JVT-W060 ( Prop 2.2/3.1) [A. Vetro, S. Yea, W. Matusik, H. Pfister, M. Zwicker] Anti-aliasing for 3D
displays ...................................................................................................................................................... 195
6.4.6 MVC view parallel processing ................................................................................ 195
JVT-W077 ( Prop 2.2) [P. Yang, X. Xu, G. Zhu, Y. He] View parallel processing on MVC................ 195
JVT-W108-QV (Late Info) [Q. Chen, Z. Chen] Verif JVT-W077 view parallel proc on MVC ............... 196
6.5. MVC reference software, common conditions, encoder optimization 196
7. AVC base specification and related topics 196
JVT-W041 ( Prop NN) [A. M. Tourapis, K. Suehring, G. J. Sullivan, A. Leontaris] H.264/MPEG-4 AVC
reference software (JM) manual ................................................................................................................ 196
JVT-W042 ( Prop NN) [A. Leontaris, A. M. Tourapis] Rate Control reorganization in the JM reference
software ..................................................................................................................................................... 197
JVT-W044 ( Info) [A. M. Tourapis, A. Leontaris, K. Suehring] New JM reference software
enhancements ............................................................................................................................................ 197
JVT-W057 (Late Info) [K. P. Lim] Improved JM text algorithm description ........................................... 198
JVT-W140-B (BoG) [T. Suzuki] Toward a professional profiles conformance specification .................. 198
8. Video annotation (jointly discussed with MPEG requirements 3:30 pm Wednesday 25 April) 198
JVT-W032 ( Info) [Q. Chen, C. Louis, Z. Chen] Requirements of video annotation in video coding ... 198
JVT-W033 ( Prop 2.2/3.1) [Q. Chen, Z. Chen, X. Gu] Video annotation SEI message ......................... 199
JVT-W034 ( Prop 2.2/3.1) [C. Louis, O. Lionel, L. Frederic, Z. Chen, Q. Chen] Fingerprint and video
structure for video annotation SEI message .............................................................................................. 199
9. AVC errata and clarification issues 200
10. Requirements joint discussions with WG 11 200
JVT-W134-Q (Late Prop 2.2) [S. Narasimhan] Splicing issues and some suggested changes ................. 200
M14452 WG 11 input [T. Murakami, K. Asai, Y. Yamada] Requirement of full-color video coding for
consumer applications ............................................................................................................................... 200
M14360 [USNB to WG 11] Issues relating to expiring patents ................................................................ 201
JVT-W127 ( Req) [M. Tanimoto, T. Fujii, H. Kimata, S. Sakazawa] Requirements for FTV (MPEG
M14417) .................................................................................................................................................... 201
11. JVT internal operating rules 202
12. List of adoptions 204
12.1. SVC normative adoptions into JD 204
12.2. SVC normative adoptions into JSVM 204
12.3. SVC non-normative adoptions 204
12.4. SVC software adoptions 205
12.5. MVC normative JD adoptions 205
12.6. MVC JMVM adoptions 205
12.7. MVC non-normative adoptions 205
12.8. JM non-normative adoptions 205
12.9. Other normative adoptions 205
12.10. Other non-normative adoptions 205
13. Software integration plan 205
14. SVC conformance work plan 205
15. SVC verification test plan 206
16. List of AHGs established 206
16.1. JVT project management and errata reporting 206
16.2. JM Text, reference software, bitstream exchange and conformance 206
16.3. AVC professional applications 206
16.4. SVC JD and JSVM text, software and conformance 207
16.5. SVC bit depth and chroma format scalability 207
16.6. SVC FGS applications and design simplification 207
16.7. MVC high-level syntax and buffer management 207
16.8. MVC JD and JSVM text and software 207
16.9. MVC experimental framework and testing conditions 208
16.10. MVC solutions using existing AVC decoders 208
16.11. MVC reduced resolution update, downsampled reference and adaptive reference filtering 208
17. Resolutions conveyed to MPEG parent body 208
17.1. Resolutions relating to ISO/IEC 14496-4 208
17.1.1 The JVT and the video subgroup recommend to approve the following documents 208
17.1.2 The JVT and the video subgroup thank the following companies for their
commitment to provide conformance testing streams for ISO/IEC 14496-4:2004/Amd.30:
Mitsubishi Electric Corp., Panasonic, Sejong University, Thomson. ................................. 209
17.1.3 The JVT and the video subgroup thank the following companies for their
commitment to provide conformance testing streams for ISO/IEC 14496-4:2004/Amd.31:
ETRI, FhG-HHI, France Telecom/Orange, Layered Media, Sharp, Thomson. .................. 209
17.2. Resolutions relating to ISO/IEC 14496-5 209
17.2.1 The JVT and the video subgroup recommend to approve the following documents 209
17.3. Resolutions relating to ISO/IEC 14496-10 209
17.3.1 The JVT and the video subgroup recommend to approve the following documents 209
17.3.2 The JVT and the video subgroup request WG 11 National Bodies to kindly consider
the SVC Study Document N8962 [JVT-W201] and if necessary provide additional
comments by the July 2007 meeting. .................................................................................. 209
17.4. Resolutions relating to future meeting scheduling 209
17.4.1 The JVT chairmen propose to hold a JVT meeting during June 29 through July 6,
2007 under the auspices of the meeting of ITU-T SG 16 in Geneva, CH. Further meetings
are proposed to be held during October 19-26, 2007 under WG 11 auspices in Shenzhen, CN,
and during January 11-18, 2008 under WG 11 auspices in Antalya, TR. .......................... 209
17.5. Resolutions relating to ad hoc group activities 210
17.5.1 The JVT provides the following list of JVT ad hoc groups appointed to progress
work in the interim period until the next JVT meeting: ...................................................... 210
18. Attendance 210
3 Documents of the JVT meeting
3.1 Input documents
3.1.1 Administrative input contributions
JVT-W000 (Admin) List of documents of San Jose meeting
JVT-W001 (Admin) [G. J. Sullivan, J.-R. Ohm, A. Luthra, T. Wiegand] AHG Report: Proj mgmt
and errata
JVT-W002 (Admin) [K. Suehring, A. Tourapis, T. Suzuki] AHG Report: JM text, ref soft,
bitstream, conf
JVT-W003 (Admin) [T. Suzuki] AHG Report: Professional applications
JVT-W004 (Admin) [J.-R. Ohm, T. Wiegand, M. Bober] AHG Report: Video annotation
JVT-W005 (Admin) [G. J. Sullivan, J. Luo] AHG Report: AVC splicing
JVT-W006 (Admin) [J. Vieron, M. Wien, H. Schwarz, T. Wiegand] AHG Report: JD & JSVM
text and S/W
JVT-W007 (Admin) [A. Segall, S. Regunathan] AHG Report: SS resampling
JVT-W008 (Admin) [H. Schwarz, S. Regunathan, A. Eleftheriadis] AHG Report: SVC
complexity reduction
JVT-W009 (Admin) [Y.-K. Wang, S. Pateux, P. Amon, T. Schierl] AHG Report: SVC high-level
syntax, err resil
JVT-W010 (Admin) [Y. Gao, A. Segall, T. Wiegand] AHG Report: SVC bit depth and chroma
format
JVT-W011 (Admin) [A. Vetro, P. Pandit] AHG Report: MVC high-level syntax & buffering
JVT-W012 (Admin) [H.-S. Koo] AHG Report: MVC motion/disparity vector coding
JVT-W013 (Admin) [H. Kimata, A. Smolic, P. Pandit, A. Vetro, C. Ying] AHG Report: JMVM
& JD text editing
JVT-W014 (Admin) [H. Kimata, A. Smolic] AHG Report: MVC exper. framework & test cond
3.1.2 Input liaison statements, WG 11 NB inputs and other noted WG 11 inputs
The following input documents to WG 11 were noted by the JVT and discussed jointly with
WG 11 (without JVT action).
M14360 WG 11 input [USNB to WG 11] Issues relating to expiring patents
M14452 WG 11 input [T. Murakami, K. Asai, Y. Yamada] Requirement of full-color video
coding for consumer applications
M14548 WG 11 input [FLO Forum] Liaison statement from the FLO Forum
3.1.3 Non-administrative input contributions
JVT-W020 ( Prop 2.0) [Z. G. Li, S. Rahardja, S. L. Xie and W. Yao] Hypothetical reference
decoder for video coding
JVT-W021 [withdrawn] <withdrawn>
JVT-W022 ( Prop 2.2/3.1) [T. Tran, L. Liu, P. Topiwala] Dyadic spatial down- and upsampling filters for SVC
JVT-W023 ( Info) [S.-C. Lim, D.-H. Han, Y.-L. Lee] CE5: Verification of loop filtering in
MVC
JVT-W024 ( Prop 2.2/3.1) [W. S. Shim, M. W. Park, G. H. Park, D. Y. Suh, H. S. Song, Y. H.
Moon, J. B. Choi] CE5 results- joint prop for MVC deblocking
JVT-W025 ( Prop 2.0/3.1) [E. Francois, V. Bottreau, J. Vieron] Restrictions on interlaced
coding in SVC
JVT-W026 ( Prop 2.0/3.1) [E. Francois, V. Bottreau, J. Vieron] Profile SVC B: Evaluation of
smoothed ref pred
JVT-W027 ( Info) [E. Francois, V. Bottreau, J. Vieron] Evaluation of 4 tap motion
compensation interp
JVT-W028 ( Info) [E. Francois, V. Bottreau, J. Vieron] Evaluation of flexible 4-tap upsampling
filters
JVT-W029 ( Info 2.2.1/3.1) [W.-H. Peng] Low-complexity mode decision algorithm for
combined CGS and temporal scalability
JVT-W030 ( Prop 2.2.1) [X. Wang, J. Ridge] CE2 report: Improvement of macroblock mode
prediction in ESS
JVT-W031 ( Prop 2.2) [J.-H. Yang] CE5: Illumination comp. info. derivation for MVC
JVT-W032 ( Info) [Q. Chen, C. Louis, Z. Chen] Requirements of video annotation in video
coding
JVT-W033 ( Prop 2.2/3.1) [Q. Chen, Z. Chen, X. Gu] Video annotation SEI message
JVT-W034 ( Prop 2.2/3.1) [C. Louis, O. Lionel, L. Frederic, Z. Chen, Q. Chen] Fingerprint and
video structure for video annotation SEI message
JVT-W035 ( Prop 2.2.1/3.1) [Y. Chen, Y.-K. Wang, M. M. Hannuksela] Comments to MVC
JD 2.0
JVT-W036 ( Prop 2.2.1/3.1) [Y.-K. Wang, M. M. Hannuksela, Y. Chen] MVC output related
conformance
JVT-W037 ( Prop 2.2.1/3.1) [Y. Chen, Y.-K. Wang, M. M. Hannuksela] View scalable SEI
message for MVC
JVT-W038 ( Prop 2.2.1/3.1) [Y. Chen, Y.-K. Wang, M. M. Hannuksela] Operation point and
view dependency changes SEI messages for MVC
JVT-W039 ( Prop 2.2.1/3.1) [Y.-K. Wang, Y. Chen, M. M. Hannuksela] Non-required pictures
SEI message for MVC
JVT-W040 ( Prop 2.2.1/3.1) [S. Liu, Y. Chen, Y.-K. Wang, M. M. Hannuksela] Constraints on
temporal direct mode and weighted prediction in MVC
JVT-W041 ( Prop NN) [A. M. Tourapis, K. Suehring, G. J. Sullivan, A. Leontaris]
H.264/MPEG-4 AVC reference software (JM) manual
JVT-W042 ( Prop NN) [A. Leontaris, A. M. Tourapis] Rate Control reorganization in the JM
reference software
JVT-W043 ( Prop NN) [A. Leontaris, A. M. Tourapis] Rate Control for the Joint Scalable
Video Model (JSVM)
JVT-W044 ( Info) [A. M. Tourapis, A. Leontaris, K. Suehring] New JM reference software
enhancements
JVT-W045 [withdrawn] <withdrawn>
JVT-W046 ( Prop 2.2.1/3.1) [M. M. Hannuksela, Y.-K. Wang] Support for SVC header
rewriting to AVC
JVT-W047 ( Prop 2.2.1/3.1) [M. M. Hannuksela, Y.-K. Wang] Pictures not for output in SVC
JVT-W048 ( Prop 2.2.1/3.1) [M. M. Hannuksela, Y.-K. Wang, Y. Chen] On SVC high-level
syntax
JVT-W049 ( Prop 2.2.1/3.1) [C. He, H. Liu, H. Li, Y.-K. Wang, M. M. Hannuksela] Redundant
pictures in SVC
JVT-W050 ( Prop 2.2.1/3.1) [Y.-K. Wang, M. M. Hannuksela] On tl0_pic_idx in SVC
JVT-W051 ( Prop 2.2.1/3.1) [Y.-K. Wang, Y. Chen, M. M. Hannuksela] On SVC scalability
information related SEI messages
JVT-W052 ( Prop 2.2.1/3.1) [Y.-K. Wang, M. M. Hannuksela] SVC feedback based coding
JVT-W053 ( Prop 2.0/3.1) [M. M. Hannuksela, Y.-K. Wang, D. Singer, T. Rathgen] SVC
priority_id value setting method indication
JVT-W054 ( Info) [I. Radulovic, Y.-K. Wang, S. Wenger, A. Hallapuro, M. M. Hannuksela]
Multiple description coding using AVC redundant pictures
JVT-W055 ( Prop 2.2/3.1) [T. Senoh, M. Okui, K. Enami] Experimental results of camera-rotation-compensated prediction in CE6
JVT-W056 ( Prop 2.2) [J. B. Choi, W. S. Shim, H. S. Song, Y. H. Moon] Inter-view prediction
reference picture marking
JVT-W057 (Late Info) [K. P. Lim] Improved JM text algorithm description
JVT-W058 ( Info) [E. Francois] CE2: Cross-check of JVT-W030 on ESS mode pred
improvement
JVT-W059 ( Prop 2.2/3.1) [S. Yea, A. Vetro] CE6: View synthesis prediction
JVT-W060 ( Prop 2.2/3.1) [A. Vetro, S. Yea, W. Matusik, H. Pfister, M. Zwicker] Anti-aliasing for 3D displays
JVT-W061 ( Prop 2.2/3.1) [D. Hong, A. Eleftheriadis, O. Shapiro] Modified deblocking filter
process in scalable extension
JVT-W062 ( Prop 2.2/3.1) [A. Eleftheriadis, S. Cipolli, J. Lennox] Improved error resilience
using temporal level 0 picture index
JVT-W063 ( Prop 2.0/3.1, then 2.2) [D. Hong, A. Eleftheriadis, O. Shapiro] Deblocking filter
for SVC to support multi-threading with slice boundary
JVT-W064 ( Prop 2.2/3.1) [J. Luo, L. Zhu, P. Yin, C. Gomila] VUI updates for SVC
JVT-W065 ( Prop 2.2/3.1) [P. L. Lai, A. Ortega, P. Pandit, P. Yin, C. Gomila] Adaptive
reference filtering for MVC
JVT-W066 ( Prop 2.2/3.1) [P. Pandit, P. Yin, C. Gomila] Ref pic list reordering for MVC
JVT-W067 ( Prop 2.2/3.1) [P. Pandit, P. Yin, C. Gomila] H.264/AVC extension for MVC
using SEI message
JVT-W068 ( Prop 2.2/3.1) [C. Tu, S. Srinivasan, S. Regunathan, G. J. Sullivan] CE4: 4-tap MC
interp for high-res SVC enh layers
JVT-W069 ( Prop 2.2/3.1) [Z. He] Simplified H.264/AVC deblocking filter for SVC enh layer
JVT-W070 ( Text) [H. Schwarz, M. Wien] Editors input for SVC draft
JVT-W071 ( Info) [H. Schwarz, T. Wiegand] Further results for an rd-opt. multi-loop SVC enc.
JVT-W072 ( Info) [H. Schwarz] Results comparing JSVM, 4-tap, and RCDO MC interp.
JVT-W073 ( Info) [K. Sohn, J. Seo] Verification of JVT-W081 LGE MVC motion skip contrib.
JVT-W074 ( Prop 2.2) [H. S. Song, W. S. Shim, Y. H. Moon, J. B. Choi] Comments on view
dependency info
JVT-W075 ( Prop 2.0/3.1) [M. Horowitz, A. Eleftheriadis] Max frame size for enh layers of
SVC profiles <withdrawn>
JVT-W076 ( Prop 2.2) [J. Jia, H. K. Kim, H. C. Choi, J. J. Yoo] SVC chroma format scalability
JVT-W077 ( Prop 2.2) [P. Yang, X. Xu, G. Zhu, Y. He] View parallel processing on MVC
JVT-W078 ( Prop 2.2/3.1) [H. Kimata, S. Shimizu, K. Kamikura, Y. Yashima] Co-located
block condition for inter-view prediction
JVT-W079 ( Prop 2.2/3.1) [H. Kimata, S. Shimizu, K. Kamikura, Y. Yashima] Inter-view
prediction with downsampled reference pictures
JVT-W080 ( Prop 2.2.1) [K. Ugur, H. Liu, Y.-K. Wang] Showcase for parallel decoding
information SEI message for MVC
JVT-W081 ( Prop 2.2) [H. S. Koo, Y. J. Jeon, B. M. Jeon] MVC motion skip mode
JVT-W082 ( Prop 2.2) [Y. J. Jeon, H. S. Koo, B. M. Jeon] Modified spatial direct mode in
MVC
JVT-W083 ( Prop 2.2/3.1) [Y. S. Ho, C. Lee, K. J. Oh, B. H. Choi, J. H. Park] CE6: View
interp pred for MVC
JVT-W084 ( Info 2.2/3.1) [Y. S. Ho, K. J. Oh, C. Lee, B. H. Choi, J. H. Park] Observations of
multi-view test sequences
JVT-W085 ( Info) [Y. S. Ho, K. J. Oh, C. Lee, B. H. Choi, J. H. Park] CE5: Verification of
JVT-W031 illumination comp. info. derivation
JVT-W086 ( Prop 2.2/3.1) [I. Amonou, N. Cammas, S. Kervadec, S. Pateux] Some
consideration on the up-sampling position calculation
JVT-W087 ( Prop 2.2/3.1) [S. Shimizu, H. Kimata] New view synthesis prediction framework
using residual prediction
JVT-W088 ( Prop 2.2) [S. Lin, S. Gao, Y. Liu, L. Xiong] H.264/AVC SEI extensions for MVC
JVT-W089 [withdrawn] <withdrawn>
JVT-W090 ( Prop 2.2/3.1) [H. Kirchhoffer, H. Schwarz, T. Wiegand] CE1: Simplified FGS
JVT-W091 ( Prop 2.2/3.1) [L. Cieplinski] HRD parameters for SVC bitstream rewriting
JVT-W092 ( Prop 2.2/3.1) [P. Pandit, P. Yin, C. Gomila] Reduced resolution update for MVC
JVT-W093 ( Prop 2.2.1/3.1) [H. Chung, M. Karczewicz, J. Ridge, X. Wang, W. Han, S. Kim]
SVC FGS profile
JVT-W094 ( Info 2.2) [W. J. Tam] Image and depth quality of asymmetrically coded
stereoscopic video for 3D-TV
JVT-W095 ( Info 2.2) [W. J. Tam, L. Zhang] Depth map preproc and minimal content for 3D-TV using depth-based rendering
JVT-W096 ( Prop 2.2/3.1) [S. Naito, A. Koike] CE6: Results on MVC
JVT-W097 ( Prop 2.2/3.1) [S.-T. Hsiang] CE3: Intra-frame dyadic spatial SVC based on
subband/wavelet filter banks framework
JVT-W098 ( Prop 2.2) [J. H. Park, Y. H. Kim, J. W. Kim, B. H. Choi] Weighted prediction for
MVC
JVT-W099 ( Info) [J. H. Park, Y. H. Kim, B. H. Choi] Clarification of mb_qp_delta syntax
JVT-W100 ( Prop 2.0/3.1) [A. Smolic, K. Mueller, P. Merkle, N. Atzpadin, C. Fehn, M.
Mueller, O. Schreer, R. Tanger, P. Kauff, T. Wiegand, T. Balogh, Z. Megyesi, A.
Barsi] Multi-view video plus depth (MVD) format for advanced 3D video systems
JVT-W101 ( Prop 2.2) [H. Yan, J. Huo, Y. Chang, S. Lin, S. Gao, L. Xiong] MV/DV
prediction based on RDV
JVT-W102 ( Prop 2.2/3.1) [Y. Gao, Y. Wu] CE4: SVC bit-depth scalability simulation results
JVT-W103 ( Info) [J.-H. Yang, S.-H. Lee] CE6: Verif GIST MVC contribution JVT-W083
MVC view interp pred
JVT-W104 ( Prop 2.2) [S.-H. Lee, S.-H. Lee, N.-I. Cho, J.-H. Yang] MVC disparity vector
pred
JVT-W105 ( Prop 2.0) [X. Wang, J. Ridge] Study on residual upsampling without block
boundary check under ESS
JVT-W106-QV (Late Info) [X. Wang] CE2: Verif Qualcomm JVT-W117 improved resid upsamp
for ESS
JVT-W107 ( Info) [K. Sohn, J. Seo] Verif JVT-W104 MVC disparity vector pred
JVT-W108-QV (Late Info) [Q. Chen, Z. Chen] Verif JVT-W077 view parallel proc on MVC
JVT-W109-LV (Late Info) [E. Francois] Cross-check of JVT-W105 on residual upsampling
without block boundary check under ESS
JVT-W110 ( Info) [E. Francois, V. Bottreau, J. Vieron] SVC verif test plan: Updated results for
SVC High Profile intra
JVT-W111 ( Prop 2.2) [M. Karczewicz, S. Park, H. Chung] CE1: Report on FGS simplif
JVT-W112-L (Late Prop 2.2) [A. Segall] Clarification of base_mode_flag <withdrawn>
JVT-W113 ( Prop 2.2) [A. Segall, Y. Su] System for bit-depth scalable coding
JVT-W114 ( Prop 2.2) [A. Segall, J. Zhao] Showcase for transcoding scalability info SEI
JVT-W115-QV (Late Info) [A. Segall] CE1: Verif JVT-W090 simplified FGS
JVT-W116 ( Info) [A. Segall] CE4: Verif JVT-W102 (Thomson prop)
JVT-W117 ( Prop 2.2/3.1) [Y. Ye, Y. Bao] CE2: Improved resid upsampling for ESS
JVT-W118 ( Prop 2.2) [Y. Ye, Y. Bao, W. J. Han, S. Y. Kim] Perf and complexity of smoothed
ref pred
JVT-W119 ( Prop 2.0/3.1) [Y. Bao, M. Karczewicz, X. Wang, J. Ridge, Y. Ye, W. J. Han, S. Y.
Kim] CE1: FGS simplif
JVT-W120 ( Info) [P. Yin] CE1: Verif JVT-W119 FGS simplif
JVT-W121 ( Prop 2.2.1/3.1) [J. Ridge, X. Wang] CE1: FGS refinement pass simplif
JVT-W122-QV (Late Info) [J. Ridge] CE3: Verif JVT-W097 wavelet-based intra dyadic spatial
SVC
JVT-W123 ( Prop NN2.2.1) [X. Wang, J. Ridge] Analysis of visual artifacts in ESS residual
pred
JVT-W124-QV (Late Info) [J. Ridge] CE1: Verif JVT-W111 FGS simplif
JVT-W125 ( Prop 2.2) [G. J. Sullivan] On SVC high-level syntax and HRD
JVT-W126 ( Info) [Z. He] Verif JVT-W118 perf and complexity of smoothed ref pred
JVT-W127 ( Req) [M. Tanimoto, T. Fujii, H. Kimata, S. Sakazawa] Requirements for FTV
(MPEG M14417)
3.1.4 Late-registered input contributions
JVT-W128-QV (Late Info) [Y. Ye] Verif of JVT-W069: Simplified deblocking for SVC enh
layer
JVT-W129 [withdrawn] <withdrawn>
JVT-W130 [withdrawn] <withdrawn>
JVT-W131-B (Late Info) [D. Hong, A. Eleftheriadis] Verification bitstreams for SVC Profile A
JVT-W132-B (Requested Info) [T. Wiegand] Overview paper and presentation on SVC
JVT-W133-B (BoG) [A. Vetro] BoG report on MVC view interpolation pred
JVT-W134-Q (Late Prop 2.2) [S. Narasimhan] Splicing issues and some suggested changes
JVT-W135-B (BoG) [I. Amonou, N. Cammas, S. Kervadec, S. Pateux] On SVC verif test plan
JVT-W136-B (BoG) [G. J. Sullivan, S. Pateux] BoG report on JVT-W086
JVT-W137-B (BoG) [I. Amonou, N. Cammas, S. Kervadec, S. Pateux] Revised syntax for
quality layer SEI message
JVT-W138-B (BoG) [V. Bottreau] Toward an SVC conformance specification
JVT-W139-B (BoG) [LG, Thomson] Break-out conclusions on JVT-W081
JVT-W140-B (BoG) [T. Suzuki] Toward a professional profiles conformance specification
3.2 Late document availability
Document numbers suffixed in this report with "-L", "-Q", or "-M" were classified as late. Such
documents will be considered as information documents only (unless agreed otherwise by
the group) if time permits, and consideration of them may be shifted to the end of the meeting as
determined appropriate by the group.
Furthermore, due to our difficulties with a large quantity of late-submitted contributions at recent
meetings, the JVT agreed at its preceding meeting that for this meeting, no late-uploaded
(non-AHG-report, non-liaison) contribution would be presented without having a
minimum of 4 JVT participants (working for organizations other than that of the primary
contribution author) recorded by name as supporting the allowance of such a presentation, in
addition to a consensus of the general JVT membership to allow the presentation. Such support
to allow a presentation is to be understood to not necessarily imply support of the adoption of the
content of the late contribution, but only as a positive expression that the document should be
allowed to be presented. Additionally, the provider of a presented late contribution shall send an
email apology to the JVT email reflector. This rule does not apply to material requested by the
JVT at the meeting (e.g., reports of JVT-authorized side activities).
Clarification: Does not apply to verification contributions.
Further clarification: The four people shall be from different organizations.
JVT decision: Agreed.
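As a minimal sketch only, the support rule above can be written as a small check. The function name, its parameters, and the placeholder organization labels are hypothetical and not part of any JVT tooling; the sketch assumes that each supporter is recorded together with an organization, and it leaves the separate obligation to send an email apology outside the check.

```python
def may_present_late(author_org, supporters, consensus,
                     is_verification=False, requested_by_jvt=False):
    """Return True if a late contribution may be presented under the rule above.

    supporters: list of (name, organization) pairs recorded by name.
    All names and parameters here are illustrative assumptions.
    """
    # Verification contributions and material requested by the JVT are exempt.
    if is_verification or requested_by_jvt:
        return True
    # Count distinct supporting organizations other than the author's own.
    orgs = {org for _, org in supporters if org != author_org}
    return len(orgs) >= 4 and bool(consensus)


# Hypothetical example: four supporters from four other organizations.
example = [("P1", "OrgA"), ("P2", "OrgB"), ("P3", "OrgC"), ("P4", "OrgD")]
allowed = may_present_late("OrgX", example, consensus=True)
```

Counting distinct organizations rather than individuals reflects the further clarification that the four supporters shall be from different organizations.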
A check mark (✓) indicates a contribution considered to be available on time.
The suffixes for contributions not marked as "✓" are explained below:
– "-L" indicates a contribution that was somewhat late but was available by the first meeting
day.
– "-Q" indicates a contribution that was later than that.
– "-M" were still missing at the time of preparation of this report.
– "-B" were break-out group discussion reports and other input requested during the meeting
Further suffixing by “V” indicates a verification contribution.
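The suffix scheme just described can be illustrated with a short parser. This is a sketch under stated assumptions: the regular expression, the function name `classify`, and the returned field names are hypothetical, and the pattern only covers document numbers of the form used in this report (for example JVT-W109-LV).

```python
import re

# Suffix meanings as described in this report; the dictionary layout is an assumption.
SUFFIX_MEANING = {
    "L": "somewhat late, available by the first meeting day",
    "Q": "later than the first meeting day",
    "M": "still missing when the report was prepared",
    "B": "break-out report or other input requested during the meeting",
}

# Hypothetical pattern: base number, optional status letter, optional "V".
DOC_ID = re.compile(r"^(JVT-[A-Z]\d{3})(?:-([LQMB])(V)?)?$")


def classify(doc_id):
    """Split a document number into base number, status, and verification flag."""
    m = DOC_ID.match(doc_id)
    if m is None:
        raise ValueError("unrecognized document number: " + doc_id)
    base, status, verif = m.groups()
    return {
        "base": base,
        # No suffix is treated here as "on time" (an assumption of this sketch).
        "status": SUFFIX_MEANING.get(status, "on time"),
        # Only -L, -Q and -M are classified as late; -B is requested input.
        "late": status in ("L", "Q", "M"),
        "verification": verif == "V",
    }
```

For instance, `classify("JVT-W109-LV")` reports a late verification contribution, while `classify("JVT-W140-B")` is not counted as late.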
Contribution JVT-W134 (from S. Narasimhan) was subject to lateness penalties. An apology for
the lateness of the contribution was sent to the JVT email reflector, and JVT members were
recorded by name requesting presentation as follows: Mukta Kar, Jian Zong, Katie Cornog, and
Wade Wan. Presentation of JVT-W134 was postponed to the last meeting day, and no immediate
action was taken in response to the contribution (other than to include it in a list of issues to be
considered for later action).
There were no objections to presentations of late documents at this meeting.
JVT-W112 (from A. Segall) was also late. Although supported for presentation by four JVT
members (requesting presentation: Miska Hannuksela, Mathias Wien, Peter Amon, Vincent
Bottreau), the contribution was withdrawn as moot after some discussion, in consideration of
action taken in response to other contributions.
It was noted that, with only one (non-withdrawn) contribution subject to lateness penalties (and
that one having no immediate action requested or taken), the situation surrounding the need for
on-time availability of contributions has substantially improved.
3.3 Withdrawn document registrations
JVT-W021 [withdrawn] <withdrawn>
JVT-W045 [withdrawn] <withdrawn>
JVT-W075 [M. Horowitz, A. Eleftheriadis] Max frame size for enh layers of SVC profiles
<withdrawn>
JVT-W089 [withdrawn] <withdrawn>
JVT-W112-L [A. Segall] Clarification of base_mode_flag <withdrawn>
JVT-W129 [withdrawn] <withdrawn>
JVT-W130 [withdrawn] <withdrawn>
3.4 Major output documents
Major output documents submitted to parent-body review included the following. (Dates listed
are planned dates of availability.)
3.4.1.1.1 JVT-W200 Meeting report of the 23rd JVT meeting [07/05/20]
3.4.1.1.2 JVT-W201-M (WG 11 N8962) Joint Draft 10: Scalable Video Coding
[07/05/31]
3.4.1.1.3 JVT-W202-M (WG 11 N8963) Joint Scalable Video Model (JSVM) 10
[07/05/31]
3.4.1.1.4 JVT-W203-M (WG 11 N8964) JSVM 10 Software [07/06/29]
3.4.1.1.5 JVT-W204-M (WG 11 N8955) WD 1 conformance test spec for Prof Prof
(Teruhiko Suzuki) [07/06/29]
3.4.1.1.6 JVT-W205-M (WG 11 N8957) WD 1 conformance test for SVC (V. Bottreau)
[07/06/29]
3.4.1.1.7 JVT-W206-M (WG 11 N8959) WD reference software for Prof Prof
[07/06/29]
3.4.1.1.8 JVT-W207-M (WG 11 N8967) Joint Multi-view Video Model (JMVM) 4
[07/05/18]
3.4.1.1.9 JVT-W208-M (WG 11 N8968) JMVM 4 Software [07/05/31]
3.4.1.1.10 JVT-W209-M (WG 11 N8966) Joint draft 3 Multi-view Video Coding
[07/02/09]
3.4.1.1.11 JVT-W211-M (WG 11 N8961) WD reference software for SVC [07/06/29]
3.4.1.1.12 JVT-W212-M (WG 11 N8965) Verification test plan for SVC [07/05/18]
3.5 JVT internal output documents
JVT internal output documents included the following. (Dates listed are planned dates of
availability.)
3.5.1.1.1 JVT-W210-M ITU-T Rec. H.264 | ISO/IEC 14496-10 Advanced Video Coding
Defect Report [07/06/18]
3.6 SVC core experiment output documents
Submission (to the JVT) of final description (and any data necessary for conducting experiment):
next meeting start – 3 weeks
Submission (to the CE partners) of final software and results: next meeting start – 2 weeks
3.6.1.1.1 JVT-W301 CE 1 on SVC subband techniques
Coordinator(s): Shih-Ta Hsiang
Participants: Motorola, Nokia, Qualcomm, HHI, Sharp, Microsoft, RWTH Aachen, Thomson,
FT/Orange, Huawei, Intel
Technology to be tested: JVT-W097 (and generalizations discussed – non-dyadic, interlaced,
etc.)
3.6.1.1.2 JVT-W302 CE 2 on SVC bit depth and chroma format scalability
Coordinator(s): Andrew Segall
Participants: Sharp, Thomson, HHI, Qualcomm, Mitsubishi, Microsoft, Intel, Huawei, Motorola,
NTT, ETRI
Technology to be tested: JVT-W102, JVT-V078, JVT-W113
3.7 MVC core experiment output documents
Submission (to the JVT) of final description (and any data necessary for conducting experiment):
next meeting start – 3 weeks
Submission (to the CE partners) of final software and results: next meeting start – 2 weeks
3.7.1.1.1 JVT-W303 CE 3 on MVC view interpolation/synthesis
Coordinator(s): Hideaki Kimata
Participants: Nokia, Qualcomm, Thomson, Microsoft, NTT, Samsung, KHU, Sejong Univ.,
KETI, GIST, Yonsei Univ., HHI, Sharp, Mitsubishi, Huawei
Technology to be tested: JVT-W059 and JVT-W087
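The core experiment deadlines in 3.6 and 3.7 are stated relative to the start of the next meeting. The following sketch works through that arithmetic, assuming the next meeting starts on 29 June 2007 as proposed in the scheduling resolution listed in this report; the function and key names are hypothetical.

```python
from datetime import date, timedelta


def ce_deadlines(next_meeting_start):
    """Return the two core experiment submission deadlines for a meeting start date."""
    return {
        # Final description (and data needed to conduct the experiment), to the JVT.
        "description_to_jvt": next_meeting_start - timedelta(weeks=3),
        # Final software and results, to the CE partners.
        "software_and_results_to_partners": next_meeting_start - timedelta(weeks=2),
    }


# Assumed start date of the next meeting (proposed, not confirmed here).
deadlines = ce_deadlines(date(2007, 6, 29))
# deadlines["description_to_jvt"] is 8 June 2007
# deadlines["software_and_results_to_partners"] is 15 June 2007
```

If the meeting dates change, only the argument passed to `ce_deadlines` needs to be updated.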
4 JVT administrative and liaison topics
4.1 IPR policy reminder and update
Participants were reminded of the IPR policy established by the parent organizations of the JVT
and were referred to the parent body web sites for further information. The IPR policy was
summarized for the participants.
Participants were particularly reminded of the need to supply a completed JVT IPR status
reporting form in all technical proposals for normative standardization. Participants were also
reminded of the need to formally report patent rights to the top-level parent bodies (using the
common reporting form found on the database listed below) and to make verbal and/or document
IPR reports within the JVT as necessary in the event that they are aware of unreported patents
that are essential to implementation of a standard or of a draft standard under development.
The JVT chair noted that the top-level parent bodies have agreed upon a new common patent
policy for ITU-T, ITU-R, ISO, and IEC.
Some relevant links for organizational and IPR policy information are provided below:
– http://www.itu.int/ITU-T/ipr/index.html (new common patent policy for ITU-T, ITU-R, ISO,
IEC and guidelines and forms for formal reporting to the parent bodies)
– http://ftp3.itu.int/av-arch/jvt-site (JVT contribution template for each meeting)
– http://www.itu.int/ITU-T/studygroups/com16/jvt/index.html (JVT founding charter)
– http://www.itu.int/ITU-T/dbase/patent/index.html (ITU-T IPR database)
– http://www.itscj.ipsj.or.jp/sc29/29w7proc.htm (SC29 Procedures)
The JVT chair noted that the ITU TSB director's AHG on IPR had recently issued a clarification
of the IPR reporting process for ITU-T standards, as follows (and as previously sent to the JVT
email reflector), per upcoming TD 327 (GEN/16):
“TSB has reported to the TSB Director’s IPR Ad Hoc Group that they are receiving
Patent Statement and Licensing Declaration forms regarding technology submitted in
Contributions that may not yet be incorporated in a draft new or revised Recommendation.
The IPR Ad Hoc Group observes that, while disclosure of patent information is strongly
encouraged as early as possible, the premature submission of Patent Statement and
Licensing Declaration forms is not an appropriate tool for such purpose.
In cases where a contributor wishes to disclose patents related to technology in
Contributions, this can be done in the Contributions themselves, or informed verbally or
otherwise in written form to the technical group (e.g. a Rapporteur’s group), disclosure
which should then be duly noted in the meeting report for future reference and record
keeping.
It should be noted that the TSB may not be able to meaningfully classify Patent Statement
and Licensing Declaration forms for technology in Contributions, since sometimes there
are no means to identify the exact work item to which the disclosure applies, or there is no
way to ascertain whether the proposal in a Contribution would be adopted into a draft
Recommendation.
Therefore, patent holders should submit the Patent Statement and Licensing Declaration
form at the time the patent holder believes that the patent is essential to the
implementation of a draft or approved Recommendation.”
The JVT chair noted (as also previously remarked on the JVT email reflector) that since we are
nearing completion of the SVC amendment project, it was suggested that now would be a good
time to file formal notices to the parent bodies for any patent rights that are believed to be
essential to the implementation of the SVC extensions (not to mention any notices not previously
filed relating to the new professional profiles or other previous projects).
It is suggested that, to enable proper interpretation of such formal notices, the SVC amendment
should be clearly identified in such formal notices. For example, as “ITU-T Rec. H.264 and
ISO/IEC 14496-10 Advanced video coding (2005 Ed.) Amendment 3 (2007): Scalable video
coding”. Notices pertaining to other efforts should be made with a similar degree of clarity of
identification of the specific standardization work item to which the declaration pertains.
The chair invited participants to make any necessary verbal reports of previously-unreported IPR
in draft standards under preparation and opened the floor for such reports: No such verbal reports
were made.
4.2 Meeting opening remarks by the chairmen
At the opening session of the meeting, the JVT chairs reminded participants of the relevant IPR
policy as described above, and reviewed the status and plans for the major projects under way in
the JVT. The two largest areas of activity are the scalable video coding (SVC) and multiview
video coding (MVC) extensions of the ITU-T Rec. H.264 | ISO/IEC 14496-10 Advanced
video coding (AVC) standard. Further work and additional needs regarding the development,
standardization, and maintenance of the base specification and the recently-completed
professional profiles, and of the associated reference software and conformance specifications,
were also noted.
The chair remarked that there were fewer late document uploads this time and that the submitted
documents seem to be adhering better to the JVT guidelines in terms of formatting, filenames,
etc., which is a good development, although further improvement (particularly in the formatting
conventions) is still needed. The new JVT operating rules on that subject, which were established
in Hangzhou and took effect at the preceding Marrakech meeting, may have helped.
4.3 JVT communication practices
JVT documents are available at http://ftp3.itu.int/av-arch/jvt-site.
These can also be accessed via ftp with the site name ftp3.itu.int, user ID avguest and password
Avguest. Upon login, documents will then be found in the directory "jvt-site". Uploading of
contributions is done by upload via ftp protocol to the "jvt-site/dropbox" directory.
JVT email lists are managed through the site http://mailman.rwth-aachen.de/mailman/options/jvt-xyz, and to send email to one of these reflectors, the email address is "jvt-xyz@lists.rwth-aachen.de", where "xyz" is
– "experts" for general experts group discussions
– "bitstream" for bitstream exchange activities
– "svc" for SVC work
– "mvc" for MVC work
4.4 Scheduling and logistics notes
Some parallel sessions were held during the meeting, particularly including some parallel review
of MVC and SVC contributions (prior to Thursday afternoon). Some “break-out group” (BoG)
side activities and informal study efforts were also conducted. Documents produced by break-out
group activities are listed in this report with the abbreviation “BoG” and are suffixed with “-B”.
4.5 Closing session notes
In the closing session there were no requests to reopen discussions of preceding agenda topics
and side activities recorded elsewhere in this report.
The JVT thanked the USNB to WG 11, and Julie Higgins, Betsy Bartlett and Scott Porter from
Meeting Planit for the organization of this meeting.
The JVT also thanked Apple, Microsoft and Mobilygen for providing financial support for the
meeting.
The meeting was closed at 1:50 pm on Friday 27 April 2007.
4.6 Administrative documents
4.6.1.1.1 JVT-W001 (Admin) [G. J. Sullivan, J.-R. Ohm, A. Luthra, T. Wiegand] AHG
Report: Proj mgmt and errata
This document (available late) is a report of the JVT Ad hoc group on project management and
errata reporting. Its purpose is to provide a high-level survey of the current state of JVT projects
and to report on recent events and progress made since the last meeting. This document’s content
consists primarily of a summary of high-level information found also in other inputs to the
meeting.
The primary JVT projects, as of our previous meeting, were reported to be the following:
– New profiles for professional applications
– Scalable video coding (SVC) extensions
– Multi-view video coding (MVC) extensions
Additional smaller efforts, such as definition of new SEI messages for various purposes, were
reported to also be under consideration.
Additionally, we have continuing efforts toward:
– Development of good conformance testing suites
– Development of good reference software for:
– Providing guidance to clarify proper decoder results
– Providing guidance to ease encoder and decoder product development
– Providing examples of encoding algorithms
– Providing examples of decoder error/loss concealment behavior
– Serving as a “touchstone” for development of future algorithms
– Aiding in verification testing of design capabilities
– Identification and aid toward support of JVT video coding specifications in relevant system
designs
JVT communication practices were reviewed and summarized.
Amendment 2, specifying new profiles (designed primarily) for professional applications, was
reported to have reached the following status:
– In ITU-T, Amd.2 reached full “in force” Recommendation status (i.e., final standardization
approval) on 6 April 2007, but is not yet published. The “last call” period resulted in one set
of sector member comments, which were from Microsoft requesting the latest developments
in the JVT to be incorporated. The specification draft was changed to address these
comments, and was posted for an “additional review” (AR) period. The AR period closed
without further comment.
– In ISO/IEC JTC 1, essentially the same Amd.2 draft text as was approved by ITU-T was
forwarded to the SC 29 secretariat as an FDIS. It will soon be subject to a final 2-month
FDIS approval ballot (the result of which will be either Yes or No – and almost certainly
Yes, without changes to the text).
Further relevant information on Amd.2 was reported to be found in JVT-W003.
The SVC Joint Draft (JD 9) and SVC Joint Scalable Video Model (JSVM 9) were reported to
have been submitted as JVT-V201 and JVT-V202, respectively.
The editors were reported to have further worked on the JD and JSVM text after providing JVT-V202. Updated versions of the texts were reported to have been provided as input document JVT-W070.
The JSVM 8 software was reported to have been delivered to the group at the end of the
Marrakech meeting. The JSVM software integration process was reported to have followed the
rules and procedures defined in the JSVM Software Manual available in the CVS server.
CVS reference:
host address: garcon.ient.rwth-aachen.de
user name: jvtuser password: jvt.Amd.2
authentication: pserver path: /cvs/jvt module name: jsvm_red
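The CVS parameters above can be combined into a repository root string. The following is a minimal sketch, assuming the conventional ":pserver:user@host:/path" CVSROOT layout and a standard command-line cvs client; the helper names cvs_root and checkout_commands are illustrative only and are not part of any JVT tooling. The password is not embedded here, since the client prompts for it at login.

```python
from typing import List


def cvs_root(user: str, host: str, path: str, method: str = "pserver") -> str:
    """Build a CVSROOT string in the conventional :method:user@host:/path form."""
    return f":{method}:{user}@{host}:{path}"


def checkout_commands(root: str, module: str) -> List[List[str]]:
    """Return the argument vectors for logging in and checking out one module.

    The commands are only assembled, not executed; the password is typed
    interactively when 'cvs login' prompts for it.
    """
    return [
        ["cvs", "-d", root, "login"],
        ["cvs", "-d", root, "checkout", module],
    ]


# Values taken from the CVS reference in this report.
ROOT = cvs_root("jvtuser", "garcon.ient.rwth-aachen.de", "/cvs/jvt")

if __name__ == "__main__":
    for argv in checkout_commands(ROOT, "jsvm_red"):
        print(" ".join(argv))
```

The same helper would apply to the other modules named in this report by changing only the module argument.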
Some integration work on text and software was reported to remain to be finalized.
Four SVC “core experiments” were reported to have been the subject of work since the
Marrakech meeting. Some of these are toward topics for the current first phase of SVC work and
some are for subjects identified as longer term “phase 2” study efforts, as follows:
– CE 1: FGS simplification (phase 2)
– CE 2: ESS improvement (phase 1)
– CE 3: Subband intra coding (phase 2)
– CE 4: Bit depth scalability (phase 2)
Input reports of work on these experiments have been provided as input contributions to this
meeting.
Further relevant information on SVC work was reported to be found in JVT-W006 (and JVT-W007, JVT-W008, JVT-W009, and JVT-W010).
The JMVM 3 and MVC JD 2 were reported to have been submitted to the JVT as JVT-V207 and
JVT-V209, respectively.
The JMVM 3 software was reported to have been delivered to the group on February 24th, 2007.
This release was reported to contain the integration of a new syntax element as described in JVT-V054,
reference list reordering commands for inter-view pictures as described in JVT-V043, bug
fixes and code clean-ups. Subsequently, two bug-fix versions tagged JMVM_3_0_1 and
JMVM_3_0_2 were reported to have been released, containing significant bug fixes that
addressed high memory usage and spatial direct mode.
CVS reference:
host address: garcon.ient.rwth-aachen.de
user name: jvtuser password: jvt.Amd.2
authentication: pserver path: /cvs/jvt module name: jmvm or jmvm_red
jmvm_red does not check out certain old folders related to SVC.
Two MVC “core experiments” were reported to have been the subject of work since the
Marrakech meeting, as follows:
– CE 5: Illumination compensation
– CE 6: View interpolation
Input reports of work on these experiments have been provided as input contributions to this
meeting.
Further relevant information on MVC work was reported to be found in JVT-W013 (and JVT-W011, JVT-W012, and JVT-W014).
The latest available state of errata reporting on the AVC base specification was reported to be
found in JVT-U210, plus relevant notes in the meeting report of the Marrakech meeting. The San
Jose input document JVT-W134 was also reported to be relevant.
As of the writing of the report, the latest errata list JVT-V210 (planned as a JVT internal output
document in Marrakech) had not yet been produced. Hope was expressed that it, or a further-updated
errata list prepared as an output document from the San Jose meeting, would be produced soon.
The latest JM algorithm description text was reported to have been submitted as JVT-W057.
JM software versions 12.1 and 12.2 were reported to have been released since the Marrakech
meeting.
Improvements to the JM software are described in JVT-W044.
The integration of the new 4:4:4 profiles had reportedly been started and was still a work in
progress.
The software and updated documentation are available at:
http://iphome.hhi.de/suehring/tml
The JM software manual had reportedly been updated to match the released version JM 12.2 and
had been submitted to this meeting as document JVT-W041.
A web based bug tracking system had been set up for keeping track of known issues and missing
features. The system is publicly accessible but requires registration for entering bug reports.
The system is located at
http://ipbt.hhi.de
A list of known issues and their state can be found at:
https://ipbt.hhi.de/mantis/view_all_bug_page.php
Further relevant information was reported to be found in JVT-W002.
The JVT, as a child organization with parents in ISO/IEC JTC 1 and ITU-T, is operated under the
top-level IPR policies of these organizations. Two recent noteworthy developments were
reported to have occurred in the IPR policies of these top-level organizations.
1) The top-level parent bodies have agreed upon a new common patent policy for ITU-T, ITU-R, ISO, and IEC. That policy, and guidelines and forms for formal reporting to the parent
bodies, can be found at http://www.itu.int/ITU-T/ipr/index.html.
2) The ITU TSB director's AHG on IPR had recently issued a clarification of the IPR reporting
process for ITU-T standards (as previously sent to the JVT email reflector), per upcoming
ITU-T TD 327 (GEN/16).
4.6.1.1.2 JVT-W002 (Admin) [K. Suehring, A. Tourapis, T. Suzuki] AHG Report: JM
text, ref soft, bitstream, conf
The JM reference text includes the adopted contribution of the document JVT-T046 on "Context
Adaptive Lagrange Multiplier (CALM) for Motion Estimation in JM - Improvement". It had been
submitted as document JVT-W057.
The integration of the 4:4:4 profiles had been started and was still work in progress. JM 12.2 (see
Software releases) already contained the code for the Tone Mapping and Post Filter hint SEI
messages as well as the Intra only profiles. The Independent Color Coding mode software had
been finished, but the code had not yet been released.
The JM versions 12.1 and 12.2 had been released since the Marrakech meeting. Besides the 4:4:4
features, the main focus of these releases was restructuring, code improvement and speedup. The
decoder runs more than twice as fast as previous versions.
The most important improvements are described in JVT-W044.
The complete list of changes can be found in the CHANGES.TXT file which is included in each
software archive.
The software and updated documentation are available at:
http://iphome.hhi.de/suehring/tml
The JM software manual had been updated to match the released version JM 12.2 and had been
submitted to this meeting as document JVT-W041. It was reportedly planned to add the manual
to the software archive in subsequent versions.
As the official H.264/AVC reference software, the JM should be a correct source for checking
implementations. This means the decoder should be able to decode all valid H.264/AVC
bitstreams and the encoder should never create non-conforming bitstreams (at least not without
generating warnings). This is currently not the case.
Depending on the configuration the JM encoder can create invalid bitstreams:
– Level constraints are not properly checked
– The 16-bit transform processing range requirements are not checked
– In Baseline/Main/Extended profile the restriction of CAVLC syntax elements needs proper
handling
The software coordinators encouraged all H.264/AVC experts to volunteer for fixing these issues.
A web based bug tracking system has been set up for keeping track of known issues and missing
features. The system is publicly accessible but requires registration for entering bug reports.
The system is located at
http://ipbt.hhi.de
This internet site contains some usage instructions.
Please note that the bug tracking system uses encrypted/secure http (https) to protect the
user’s login. The certificate used is self-signed and has to be imported into the user’s web
browser. The SHA-1 fingerprint of the certificate is
69:21:86:d9:3e:72:da:3f:e8:30:df:a8:dd:fa:a5:4c:ed:85:b5:09
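Before importing a self-signed certificate, its fingerprint can be compared against the value quoted above. The following is a minimal sketch, assuming the certificate is available as DER-encoded bytes; the function names are illustrative, and how the bytes are obtained (for example by exporting the certificate from a browser) is left to the reader.

```python
import hashlib

# Fingerprint as quoted in this report.
EXPECTED_FINGERPRINT = "69:21:86:d9:3e:72:da:3f:e8:30:df:a8:dd:fa:a5:4c:ed:85:b5:09"


def sha1_fingerprint(der_bytes: bytes) -> str:
    """Return the SHA-1 digest of the bytes as colon-separated lowercase hex pairs."""
    digest = hashlib.sha1(der_bytes).hexdigest()
    return ":".join(digest[i:i + 2] for i in range(0, len(digest), 2))


def fingerprint_matches(der_bytes: bytes, expected: str = EXPECTED_FINGERPRINT) -> bool:
    """Compare the computed fingerprint with the expected one, ignoring letter case."""
    return sha1_fingerprint(der_bytes) == expected.lower()
```

A certificate whose fingerprint does not match should not be imported.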
A list of known issues and their state can be found at:
https://ipbt.hhi.de/mantis/view_all_bug_page.php
A list of current bugs can also be found in the annex of the AHG report.
It was requested that certain rules should be followed before reporting any new bugs:
– The database should be searched to check whether the same issue was previously reported. If the
problem was reported before, but there is additional information, then this information
should be added to the original report.
– It should be specified if the problem is related to the encoder, decoder or both.
– The version of the software used should be specified.
– Description of the problem should be as precise as possible.
– The necessary steps to reproduce the problem should be described in detail.
– If available, the configuration files and/or command line syntax used to run the software
should be provided.
– The language of the standard should be used when referencing the text description.
– After filing the report, the user should check if he/she is requested to provide additional or
other information relating to this issue.
Communications related to this ad-hoc activity have taken place on the JVT bitstream exchange
reflector (“jvt-bitstream@lists.rwth-aachen.de”). The reflector of this AHG was moved from
IMTC to Univ. of Aachen some time ago. However, the AHG has not been very active since the last JVT
meeting.
The FTP area for downloading bitstream files is on the main JVT Experts FTP site:
ftp://ftp3.itu.int/jvt-site/bitstream_exchange/ (login: avguest, password Avguest).
The bitstreams can also be accessed from the following http site.
http://ftp3.itu.int/av-arch/jvt-site/bitstream_exchange/
To volunteer a bitstream for testing, please include it in a zip archive along with related files
(trace files, configuration, reconstructed frames) and upload it to the dropbox:
ftp://ftp3.itu.int/jvt-site/dropbox (login: avguest, password Avguest)
In general, the following naming convention is being followed for the bitstreams in the exchange:
FeatureCode_Source_VersionLetter
Please refer to the spreadsheet and files on the FTP site for examples.
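The naming convention above can be checked mechanically before upload. The following is a minimal sketch, assuming the three fields are separated by underscores, that the version is a single letter, and that any extra underscores belong to the feature code; these details are assumptions, and the spreadsheet on the FTP site remains the authoritative reference. The name used in the usage note is hypothetical.

```python
from typing import NamedTuple


class BitstreamName(NamedTuple):
    feature_code: str
    source: str
    version_letter: str


def parse_bitstream_name(name: str) -> BitstreamName:
    """Split a name of the form FeatureCode_Source_VersionLetter into its fields.

    Splitting from the right keeps any additional underscores inside the
    feature code (an assumption, not something the report specifies).
    """
    parts = name.rsplit("_", 2)
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected FeatureCode_Source_VersionLetter, got {name!r}")
    feature_code, source, version_letter = parts
    if len(version_letter) != 1 or not version_letter.isalpha():
        raise ValueError(f"version must be a single letter, got {version_letter!r}")
    return BitstreamName(feature_code, source, version_letter)
```

For instance, parse_bitstream_name("FEAT1_ExampleCo_A") yields the fields "FEAT1", "ExampleCo" and "A", while a name without a trailing version letter raises ValueError.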
Once a bitstream has been uploaded to the dropbox, send an e-mail to teruhiko@av.crl.sony.co.jp,
and/or the bitstream exchange reflector and it will be made available in the bitstream_exchange
directory.
To sign up for the bitstream exchange reflector, use the web address given below.
– Over the web: < http://mailman.rwth-aachen.de/mailman/listinfo/jvt-bitstream >
Conformance Activities and Corrigendum work:
No new conformance specification problems were reported since the last meeting. All known
problems must be fixed for the corrigendum of AVC conformance and FRExt conformance.
New conformance activity for the new professional profiles should be started at the San Jose meeting.
The AHG recommended
– to fix all bitstreams with conformance problems
– to encourage volunteers to provide more conformance streams
– to start activity of new conformance amendment work to support new professional profiles
A desire for corrigendum and software work was expressed – e.g., range of values checking and
avoiding allowing “hostile” non-conforming bitstream corner cases.
4.6.1.1.3 JVT-W003 (Admin) [T. Suzuki] AHG Report: Professional applications
The main JVT reflector (jvt-experts@lists.rwth-aachen.de) was used for the AHG activities. The
term “[4:4:4]” was inserted at the beginning of a subject field to identify the email related to this
AHG. The descriptions of the specifications were updated, e.g. description of independent color
mode, and the FDAM document was released to ISO and the corresponding AR text was released
to ITU-T. All remaining issues on the document were fixed in those documents. The remaining
issues are the reference software and conformance.
The volunteers to integrate JFVM software into the latest JM were identified. The order of
integration is as follows.
1) Tone Mapping SEI (Sharp)
2) 4:2:0/4:2:2 intra only coding & post-filter hint SEI (Panasonic)
3) Independent color coding (Mitsubishi)
4) 4:4:4 intra & predictive (Thomson)
5) Lossless coding (Sejong University)
The integration had not been finished by this meeting. The integration of 1), 2) and 3) had been
finished, however the schedule was delayed. 4) and 5) should be integrated after the San Jose
meeting. The schedule of formal integration remained to be defined during the San Jose meeting.
Work on conformance streams for the new profiles will start after the San Jose meeting. The following
volunteers were identified.
Profile | Volunteers
High 4:4:4 | Thomson; Mitsubishi (independent color coding mode); Sejong Univ. (lossless coding)
High 4:4:4 intra | Thomson; Mitsubishi (independent color coding mode); Sejong Univ. (lossless coding)
CAVLC 4:4:4 intra | Thomson
High 4:2:2 intra | Panasonic
High 10 intra | Panasonic
Other volunteers, in addition to the above organizations, were encouraged to generate
bitstreams. It was recommended that the schedule of the conformance work plan be defined at San
Jose.
The AHG recommended
– To finalize the work plan of the integration of JM software
– To finalize the conformance work plan
Various identified problems have been fixed.
Regarding the software – separate color plane coding (Thomson) and lossless coding still
remained to be done.
Regarding conformance – volunteers were listed – additional volunteers would be helpful, should
also tabulate further detail.
4.6.1.1.4 JVT-W004 (Admin) [J.-R. Ohm, T. Wiegand, M. Bober] AHG Report: Video
annotation
During the interim period since the last JVT meeting, some active email discussions of video
annotation were held on the JVT email reflector. These consisted primarily of an airing of views
regarding where it is best to carry video annotation data (i.e., at the systems level or within the
video bitstream as SEI messages or registered or unregistered user data SEI messages), where it
is best to specify the definition of such data (i.e., in a separate standard such as the MPEG-7
standard or in particular SEI message definition sections of the AVC standard), and how to deal
with an asserted confusion resulting from an asserted overabundance of defined types of such
data. Various views were expressed, along with pros and cons of each approach. No obvious
consensus was evident on those issues.
Various perspectives were expressed, ranging from doing nothing (letting people use user data
SEI or system level support) to selecting particular messages for definition in SEI. Discussion of
scope, system interaction, specification interaction, …
The JVT-W032, JVT-W033, and JVT-W034 input contributions to the San Jose meeting are
relevant to the subject.
4.6.1.1.5 JVT-W005 (Admin) [G. J. Sullivan, J. Luo] AHG Report: AVC splicing
Email to initiate the discussion was sent to the JVT reflector, but little discussion occurred there.
The normative requirements may often make concatenation/splicing of coded video sequences
rather difficult.
There is one AVC HRD related proposal in the San Jose meeting (JVT-W020). It may have some
relevance. The study of the issues should be continued and action items should be identified.
Contributions are needed to determine what can be done.
Basically, not much happened in this AHG.
4.6.1.1.6 JVT-W006 (Admin) [J. Vieron, M. Wien, H. Schwarz, T. Wiegand] AHG
Report: JD & JSVM text and S/W
The SVC Joint Draft (JD 9) and SVC Joint Scalable Video Model (JSVM 9) were reported to
have been submitted as JVT-V201 and JVT-V202, respectively. They were also reported to have
been submitted as MPEG output documents N8750 (Study Text of ISO/IEC 14496-10:2005/FPDAM3 Scalable Video Coding) and N8751 (Joint Scalable Video Model (JSVM) 9).
The provided Joint Draft 9 corresponds to JSVM 8 Annex G with FGS removed.
The JSVM 9 document includes a generic description of the principles used for scalable coding
in SVC to help people to get familiar with scalability principles. It also includes a description of
non-normative tools for the encoding process.
The JSVM 9 also includes an annex (Annex-G) corresponding to a modified version of the JD 9
including all tools adopted during the 22nd JVT meeting. The purpose of this additional
document is to serve as a base for the creation of the future JD 10.
The document JVT-V202_JSVM9.doc contains a new part (Annex A), that contains the draft text
for FGS, which was removed from the Joint Draft 9, including dedicated subclauses and the
specification of changes to subclauses in Annex G that are required for application of FGS in
SVC.
Presented. FGS moved to “Annex A”. Lots of work. Software integration not done. Not much
feedback from members on text. Feedback requested. Members requested to strictly respect the
rules and procedures.
Normative changes
– Moving FGS and AR-FGS from JD back to JSVM (not Annex G) [Editors]
– JVT-V032* [J. He] CE4: Disabling SVC chroma deblocking filter (as values of
disable_deblocking_idc)
– JVT-V035* [A. Segall] CE8: CGS SVC-to-AVC bitstream-rewriting (incl removal of IDCT
for base layer of MGS/CGS SNR scalability)
– Remove the use of nal_unit_type value 21, using spare bits in the current use of
nal_unit_type equal to 20 [Editors]
– constrained_intra_pred_flag must be 1 when Intra_base is used [Editors]
– disallow temporal direct for nal_unit_type = 20 or 21 [Editors]
– When nal_unit_type = 1,2,3,4, then disallow temporal direct when used for inter-layer
prediction [Editors]
– Number of base layer macroblocks that need to be decoded in order to form an IntraBL
predictor should be limited. [T. Wiegand, details TBD]
– suffix NAL unit – nal_ref_idc must be the same as the associated non-suffix NAL units [Y.-K. Wang]
– semantics of discardable_flag – is an indication of a lack of dependency for the current
access unit and all subsequent access units [Y.-K. Wang]
– Bitstream may require discarding of some NAL units with simple_priority_id … in order to
form a conforming subset bitstream [Y.-K. Wang]
– Smoothed reference prediction flag to be at slice level rather than in SPS [N. Cammas]
– Put a flag in the slice header to “skip” the entire slice (indicating that BLSkip flag is equal to
1 and residual_pred-flag is equal to 1 for all MBs in slice and no further information is sent)
[Editors]
– JVT-V068* [J. Luo] SVC hypothetical reference decoder (Details as recorded elsewhere), r4
SEI message for temporal subsets and other aspects
– JVT-V088* [A. Eleftheriadis] SVC error resil using frame index in NAL unit header (extra
byte for any D, Q; byte is moved from NAL unit header to slice header and suffix/prefix
NAL unit payload; flag for switching the byte stays in the NAL unit header)
– Suffix NAL unit – suggested change removes the ability to provide more than one of these.
Alternative suggestion – for the first NAL unit of the base layer, a NAL unit type 14 is
prefixed to convey the contents of the current suffix NAL unit. For other slices, we use the
suffix NAL units as they are (NAL unit type 20). [Y.-K. Wang]
– Prohibit MV refinement when store_base_layer_flag = 1 [Editors]
– JVT-V036* [A. Segall] Support for transcoding in scalability info SEI (r1)
– SVC profile and levels according to meeting notes (incl removal of profile C) [Editors]
For FGS (integrated into JVT-W070)
– JVT-V095* [M. Karczewicz] CE1: Improved coefficient coding (Tool 1: adopted; Tool 2:
change only for I and P but not B slices)
Issues (JD)
– Feedback provided by JVT members on the documents was very low. Comments received
from Nokia (Ye-Kui) and Microsoft (Gary)
– Clarify constraints for frame_num (inside a “layer”)
– Clarify usage of frame_num for base layers
– Move process 8.2.2 to Clause 7 (slice groups)
– Rewrite resampling G.8.6 (highly redundant)
– Ed. Notes to be solved
– (anything else that’s broken, needs to be clarified)
Issues (JSVM)
– FGS part needs to be reworked and improved
The editors further worked on the JD and JSVM text after providing JVT-V202. Updated
versions of the texts are provided as input document JVT-W070. They contain various changes
including the following:
JSVM:
– Integration of JVT-V095 (was missing in JVT-V202)
JD:
– Add/extend definitions in G.3
– Corrections, clarifications in G.6
– Update of G.7 (syntax and semantics) + several fixes
– Restructuring of G.8
– Several corrections, clarifications in G.8
– Correction of bugs in G.10 (profiles and levels)
The JSVM 8 software was delivered to the group at the end of the Marrakech meeting. The
JSVM software integration process has followed the rules and procedures defined in the JSVM
Software Manual available in the CVS server listed below.
Note that the integration process is more than one month behind the original
integration schedule. Various integrators encountered difficulties which were reportedly mainly
related to the FGS parts. There were still four software integrations to be done.
The last JSVM integration schedule is summarized in the table below.
Proposal | Company | Start date | Comments | JSVM Tag | Status
Cleaning + Memory leaks fixing + Improvement of FixedQPEncoder | Thomson | 01.02.2007 | Simple (1 day) | JSVM_8_0_1 | OK
[JVT-T037] - CE2: Progressive to interlace inter layer motion prediction | Samsung | 01.02.2007 | Moderate (2 days) | JSVM_8_1 | OK
[JVT-V088] - tl0_picture_idx | Layered Media | 07.02.2007 | Simple (2 days) | JSVM_8_2 | OK
[JVT-V125] - H241 RCDO | HHI | 12.02.2007 | Moderate (3 days) | JSVM_8_3 | OK
[JVT-V126] MGS Key pictures | HHI | 15.02.2007 | Moderate (3 days) | JSVM_8_4 | OK
[JVT-V074][JVT-V090] - Motion comp interpolation 4-tap and parameterized | Microsoft | 18.02.2007 | Moderate (4 days) | JSVM_8_5 | OK
[JVT-V058] - Smoothed reference flag + Interlaced Bug fixes (J. Vieron) | Orange | 26.02.2007 | Simple (1 day) | JSVM_8_6 | OK
[JVT-V032] - CE4: Disabling SVC chroma deblocking filter | Freescale | 27.02.2007 | Moderate (5 days) | JSVM_8_7 | OK
[JVT-V035] – Bitstream rewriting ([JVT-V036]) | Sharp | 02.03.2007 | Moderate (7 days) | JSVM_8_8 | OK
High-level syntax | Nokia | 16.03.2007 | Moderate (4 days) | JSVM_8_9 | OK
CGS/MGS residual prediction in transform domain | Sharp | 20.03.2007 | Difficult | JSVM_8_X | Started
[JVT-V068] - HRD and SEI message | Thomson | | Difficult / parallel | JSVM_8_X |
[JVT-V079] - SVC low complexity MB mode decision | ST Microelectronics | | Moderate (7 days) | JSVM_8_X |
FGS refinements | Qualcomm | | Moderate (4 days) | JSVM_8_X |
End of JSVM 9 integration | | | | JSVM_9_0 |
In order to improve the whole software integration process, the software integration guidelines
and rules have been refined as follows:
– The integrated software shall compile without warnings when using the provided VS 6,
VS .NET, and VS 2005 workspaces, as well as linux makefiles.
– Do not use variable declarations inside the header of for-loops (the scope for for-loops is not
correctly supported with all compilers).
– Follow the coding style of the JSVM software. Use 2 (two) spaces for indentation, no tab.
– Re-use code and integrate functionality where possible. Try to avoid redundant code.
– Do not change the meaning of existing input parameters but define new ones if necessary
(and applicable).
– Make sure that new parameters have meaningful default values. Tools should not be
switched on by default (unless decided otherwise by the JVT).
– Do not re-structure the output of the compiled binaries (unless decided otherwise by the JVT).
– Please change the JSVM version number macro (i.e. “_JSVM_VERSION_”) located in the
file “CommonDefs.h” to be in line with your integration tag.
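Two of the rules above (no tab characters, no variable declarations inside for-loop headers) lend themselves to an automated pre-check before an integration is submitted. The following is a minimal sketch, assuming a simple line-by-line regular-expression scan; it is a hypothetical helper and not part of the JSVM tooling, and a heuristic of this kind can miss unusual declarations or flag text inside comments.

```python
import re
from typing import List, Tuple

# Heuristic: "for (" followed by "<type> <name> =", e.g. "for( int i = 0; ...".
FOR_HEADER_DECL = re.compile(
    r"\bfor\s*\(\s*(?:const\s+)?(?:unsigned\s+)?"
    r"[A-Za-z_]\w*(?:\s*[*&])?\s+[A-Za-z_]\w*\s*=(?!=)"
)


def check_source(text: str) -> List[Tuple[int, str]]:
    """Return (line number, message) pairs for rule violations found in the text."""
    issues = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        if "\t" in line:
            issues.append((lineno, "tab character found (use two spaces)"))
        if FOR_HEADER_DECL.search(line):
            issues.append((lineno, "variable declared in for-loop header"))
    return issues


# Small illustrative input: line 3 declares a loop variable, line 4 uses a tab.
SAMPLE = (
    "void f()\n"
    "{\n"
    "  for( int i = 0; i < 4; i++ )\n"
    "\tg( i );\n"
    "  int j;\n"
    "  for( j = 0; j < 4; j++ )\n"
    "    g( j );\n"
    "}\n"
)

if __name__ == "__main__":
    for lineno, message in check_source(SAMPLE):
        print(f"line {lineno}: {message}")
```

The compile-without-warnings rule still has to be verified with the actual workspaces and makefiles.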
The AhG on SVC text and JSVM software recommended
1) To take the proposed version of the JD and JSVM text in JVT-W070 as basis for further
editing.
2) To carefully study the latest version of the JD (JVT-W070) and provide feedback to the
editors.
3) To follow the integration rules and procedure of validation described in the JSVM software
manual (found on the CVS server listed below).
4) All proponents to strictly respect these rules/guidelines. Sticking to these basic principles and
recommendations is mandatory and facilitates future integration and maintenance work.
5) To continue maintaining the JSVM Software Manual: each proponent is reminded that they are
responsible for updating the Software Manual by providing a description of each newly
introduced parameter and/or tool.
CVS reference:
host address: garcon.ient.rwth-aachen.de
user name: jvtuser password: jvt.Amd.2
authentication: pserver path: /cvs/jvt module name: jsvm_red
4.6.1.1.7 JVT-W007 (Admin) [A. Segall, S. Regunathan] AHG Report: SS resampling
A kick-off message was sent to the JVT reflector on 27 February 2007. The message requested
suggestions on upsamplers, down-samplers and sequences with "different source characteristics"
from interested experts.
Candidate down-samplers were identified for study and circulated on the reflector on 4 April
2007. The AhG decided to study filters from the paper: K. Turkowski “Filters for Common
Resampling Tasks”. (Online at
http://www.worldserver.com/turk/computergraphics/ResamplingFilters.pdf) The AhG
recommended studying the two Gaussian filters and two Lanczos windowed filters described in
the document. Furthermore, the AhG recommended combining the Gaussian/Lanczos filters with
an unsharp mask to better approximate common image enhancement techniques.
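As an illustration of the Lanczos-windowed filters mentioned above, the following is a minimal sketch of the standard Lanczos kernel and of one way to derive normalized decimation taps from it. The function names, the lobe count `a`, the tap range, and the normalization to unit gain are assumptions made for this example; they are not taken from the AhG report or from the JSVM down-sampler.

```python
import math


def sinc(x):
    """Normalized sinc: sin(pi*x) / (pi*x), with sinc(0) = 1."""
    if x == 0:
        return 1.0
    return math.sin(math.pi * x) / (math.pi * x)


def lanczos(x, a=3):
    """Lanczos kernel: a sinc windowed by a wider sinc, zero outside |x| < a."""
    if abs(x) >= a:
        return 0.0
    return sinc(x) * sinc(x / a)


def downsample_taps(ratio, a=3):
    """Hypothetical helper: taps for decimation by `ratio`.

    The kernel is stretched by the decimation ratio so that it also acts
    as an anti-alias low-pass filter, then normalized to unit DC gain.
    """
    half = int(math.ceil(a * ratio))
    raw = [lanczos(k / ratio, a) for k in range(-half, half + 1)]
    total = sum(raw)
    return [t / total for t in raw]
```

For example, `downsample_taps(2.0)` gives a symmetric 13-tap filter for the dyadic case; the 1.25 and 1.5 ratios would need polyphase handling that this sketch does not cover.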
Regarding the sample ratios, the AhG recommended that interested experts focus on the dyadic,
1.25 and 1.5 cases. This was to cover common NTSC->HD and PAL->HD (accounting
for the change in aspect ratio), as well as 720p->1080p and 1080i->1080p applications.
The AhG conducted an internal evaluation of the proposed down-samplers. The AhG did not
find evidence that additional upsampling filters are needed in the SVC specification.
Documents JVT-W022 and JVT-W028 relate to filter design. Focus was on dyadic, 1.25, and
1.5. A number of other documents were listed as relevant.
CE2 has a residual upsampling part.
CE3 is related to subband spatial scalability.
Relevant contributions:
– Resampling: JVT-W022, JVT-W028, JVT-W086
– Spatial scalability: JVT-W097, JVT-W122
– Inter-layer prediction: JVT-W105, JVT-W109, JVT-W117, JVT-W106, JVT-W130, JVT-W123
4.6.1.1.8 JVT-W008 (Admin) [H. Schwarz, S. Regunathan, A. Eleftheriadis] AHG
Report: SVC complexity reduction
Relevant contributions were listed and summarized, including: JVT-W027, JVT-W029, JVT-W061, JVT-W063, JVT-W068, JVT-W069, and JVT-W072.
4.6.1.1.9 JVT-W009 (Admin) [Y.-K. Wang, S. Pateux, P. Amon, T. Schierl] AHG
Report: SVC high-level syntax, err resil
There have been some discussions regarding signaling of full sets of HRD parameters for
rewritten bitstreams. One counter argument was reportedly that signaling full sets of HRD
parameters for rewritten bitstreams is somehow overkill, because SVC already supports signaling
of one full set of HRD parameters for each operation point or scalable layer, and SVC has not yet
supported signaling of HRD parameters for extracted bitstreams according to quality layer
information or priority_id values. JVT-W091 was reported to be related to this topic.
Nokia and University of Science and Technology of China were reported to have started the
following implementation work to the JSVM.
– Coding of multiple slices per picture
– Slice size of fixed number of macroblocks
– Slice size of fixed number of bytes
The implementation for slice size of fixed number of macroblocks was reported to have been
finished, and the other part was reported as ongoing.
Relevant contributions were listed (High-level syntax: JVT-W020, JVT-W046, JVT-W047, JVT-W048, JVT-W051, JVT-W052, JVT-W053, JVT-W064, JVT-W091, JVT-W114, and JVT-W125; Error resilience: JVT-W049, JVT-W050, JVT-W054, and JVT-W062).
4.6.1.1.10 JVT-W010 (Admin) [Y. Gao, A. Segall, T. Wiegand] AHG Report: SVC
bit depth and chroma format
Relevant contributions and the status of work on CE4 were noted.
The AhG sent a kick-off message to the JVT main reflector (jvt-experts@lists.rwth-aachen.de) on
7 March 2007. There were no other messages on the reflector. The work of the AhG consisted
of generating test conditions and test sequences for CE4. Test conditions were circulated by the
AhG in the kick-off message and utilized for testing within CE4. Sequences were generated
within CE4 by the CE partners. The procedure for generating the test sequences was provided in
an Appendix to the AHG report, and includes representative tone mapping and linear shifting
operations.
Test material discussion – test sequences were generated for tone mapping and linear shifting.
Used for CE4. An appendix to the report describes the creation process for test sequences for bit-depth scalability used in CE4.
The following contributions were noted in the AHG report:
– JVT-W102 and JVT-W113 on bit depth scalability
– JVT-W076 on chroma format scalability – there may be an issue with the notion of SNR
scalability happening in luma while spatial scalability is happening in the chroma. Design is
OK as long as luma spatial scalability is happening whenever chroma spatial scalability is
happening.
4.6.1.1.11 JVT-W011 (Admin) [A. Vetro, P. Pandit] AHG Report: MVC high-level
syntax & buffering
There was reportedly some reflector discussion on the subject of SPS and base views since the
last meeting. Based on this discussion, it was reported that some issues might need some further
discussion and possible clarification in the text, including the following.
– Need to further clarify and confirm the differences between IDR pictures, View-IDR
pictures, and anchor pictures. For an anchor picture, pictures later in decoding order but
earlier in output order than the anchor picture may refer to pictures earlier in decoding order
than the anchor picture. For an IDR picture, no picture later in decoding order than the IDR
picture may refer to pictures earlier in decoding order than the IDR picture.
– Need to clarify the marking of pictures as unused for reference. In AVC, for an IDR picture,
all previous pictures in decoding order are marked as "unused for reference". In JD2.0 of
MVC, V-IDR does this for a view.
– Is the SPS allowed to change at a P or B-picture? According to the current spec, the SPS
shall be changed in an IDR access unit only. However, this might need some additional
clarification in the MVC context since an access unit with an IDR picture might contain P or
B-pictures.
– How is the view_id for base view indicated? If the base-view is an independently decodable
view with NAL unit type 20, then NAL unit header includes the view_id. If the base-view is
an AVC-compatible view, then the prefix/suffix NAL unit will carry the view_id information
for that view, which the MVC decoder can decode.
4.6.1.1.12 JVT-W012 (Admin) [H.-S. Koo] AHG Report: MVC motion/disparity
vector coding
Relevant contributions to the meeting were classified into 3 categories:
– new inter prediction process,
– modification of motion vector predictor, and
– modification of spatial direct mode.
Results under common testing conditions: 0.18 dB average / 0.54 dB best case for category 1,
0.0x dB for category 2&3 combined.
The relationship with reference picture list reordering (RPLR) was reportedly being investigated.
4.6.1.1.13 JVT-W013 (Admin) [H. Kimata, A. Smolic, P. Pandit, A. Vetro, C. Ying]
AHG Report: JMVM & JD text editing
The JMVM3 and JD2 were submitted to JVT as JVT-V207 and JVT-V209, respectively. Text for
an SEI message for parallel processing was added. The JD text included high-level syntax and
decoding processes related to reference picture list reordering, text corresponding to the
hypothetical reference decoder for MVC and view coding order information in SPS.
Several other editorial improvements and clarifications had also been made to the JD and JMVM
text. For the JD text, some minor updates to the SPS semantics had been made and the document
included a revised definition of access unit that was reported to be in line with the latest versions
of the various AVC amendments. For the JMVM text, the SEI message on parallel processing
had been updated as well. These revisions should be considered as editor’s input to the meeting
and were included as an attachment to this AHG report.
Further issues that had been raised related to high-level syntax and buffering were reported in
another AhG report: JVT-W011.
The JMVM 3 software was delivered to the group on February 24th, 2007. This release contained
the integration of new syntax element as described in JVT-V054, reference list reordering
commands for inter-view pictures as described in JVT-V043, bug fixes and code clean-ups.
Subsequently two bug-fix versions tagged JMVM 3_0_1 and JMVM_3_0_2 were released which
contained significant bug-fixes which addressed the high memory usage and spatial direct mode.
The work on the software completed so far was summarized as follows:
– Add new syntax view_id in SPS to indicate view coding order
– Send reference list information in view coding order
– Reference Picture List Construction for MVC, including new RPLR (JVT-V043)
– Memory reduction for the decoder: remove useless code related to FGS, MCTF and reduce
memory usage to around 1/4
– Effective DPB allocation at the encoder
– Bug fix for spatial direct mode
– Encoder parameter file to read multiple inter-view ref
– Some code cleanup for software improvement.
Some software issues that were reported to still need to be addressed were:
– Disabled co-located condition for inter-view (limitation of s/w)
– An AVC compatible SPS needed to decode AVC compatible view only
– Output order of views is not sequential or parallel. It is on an as ready basis.
– All the macros need to be cleaned up & removed permanently
– Encoder/decoder trace file for each view needed
The manual had been added as part of the JMVM reference software module.
The AhG on JMVM and JD text editing recommended:
– To consider the editor’s input in preparing future versions of the JMVM and JD.
– To discuss the issues in the current version of the software as mentioned above
– To improve the manual created for the JMVM software
– To follow the same software integration guidelines present in JSVM (repeated below)
In order to improve the whole software integration process, the software integration guidelines
and rules were reported to be as follows:
– The integrated software shall compile without warnings when using the provided VC6 and
VS .NET workspaces, as well as linux makefiles.
– Do not use variable declarations inside the header of for-loops (the scope for for-loops is not
correctly supported with all compilers).
– Follow the coding style of the JMVM software. Use 2 (two) spaces for indentation, no tabs.
– Re-use code and integrate functionality as possible. Try to avoid redundant code.
– Do not change the meaning of existing input parameters but define new ones if necessary
(and applicable).
– Make sure that new parameters have meaningful default values. Tools should not be
switched on by default (if not decided different by the JVT).
– Do not re-structure the output of the compiled binaries (if not decided different by the JVT).
– Please change the JMVM version number macro (i.e. “_JMVM_VERSION_”) located in the
file “CommonDefs.h” to be in line with your integration tag.
CVS reference:
host address: garcon.ient.rwth-aachen.de
user name: jvtuser password: jvt.Amd.2
authentication: pserver path: /cvs/jvt module name: jmvm or jmvm_red
jmvm_red does not check out certain old folders related to SVC.
The report included a proposed update of text with clarifications, also software updates & bug
fixes were proposed. A plan to install a bug reporting system for the software was described.
4.6.1.1.14 JVT-W014 (Admin) [H. Kimata, A. Smolic] AHG Report: MVC exper.
framework & test cond
Some discussions on the subjects of “combination of MVC and SVC” and “Multi-view Video
plus Depth” were made. Especially for the first topic, spatial scalability in MVC was discussed.
These discussions were to initiate new directions of MVC.
Discussions were held on the reflector and there were several input contributions; no conclusion yet.
The AHG on MVC experimental framework and testing conditions recommended discussing
these new directions of MVC based on relevant input contributions.
4.7 JVT liaison communications
4.7.1.1.1 M14548 WG 11 input [FLO Forum] Liaison statement from FLO Forum to
WG 11
M14548 from FLO Forum to WG 11 was noted – It reports the adoption of ISO/IEC 14496-10 /
ITU-T H.264 (AVC) Extended Profile Level 1.3 for use in MediaFLO systems in terrestrial mobile
multimedia multicast networks. WG 11 (MPEG) is planning to reply to it.
5
Scalable video coding
5.1 CE 1 & related docs: SVC FGS simplification
5.1.1.1.1 JVT-W090 ( Prop 2.2/3.1) [H. Kirchhoffer, H. Schwarz, T. Wiegand] CE1:
Simplified FGS
This contribution describes a modification of the transform coefficient level coding of non-PR-slices in SVC. A range of scan positions is specified in the slice header that defines which of the
16 transform coefficient level scan positions of each block (in zig-zag-scan order) is encoded in
this slice. In this way, it is possible to divide the transform coefficient levels of an arbitrary non-PR-slice into multiple additional MGS slices and to achieve fine granular SNR scalability. The
complexity increase depends on the number of additional MGS layers used and is thus
controllable by the encoder.
Idea is enhancing non-PR slices to achieve FGS functionality using MGS. Send a start index and
an end index for coefficient frequencies in a slice. Suggest control of complexity by profile/level
constraints.
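The scan-range idea can be sketched as follows. This is only an illustration of partitioning one block's 16 zig-zag-ordered levels by start/end index and of accumulating the parts at the receiver; the function names and the inclusive `(start, end)` convention are assumptions, and entropy coding and the actual slice header syntax are not modelled.

```python
def split_by_scan_range(levels, ranges):
    """Split one block's 16 levels (zig-zag order) across several slices.

    `ranges` is a list of inclusive (start, end) scan-position pairs, one
    per slice (hypothetical convention). Positions outside a slice's range
    are carried as zero in that slice.
    """
    parts = []
    for start, end in ranges:
        part = [lvl if start <= pos <= end else 0
                for pos, lvl in enumerate(levels)]
        parts.append(part)
    return parts


def accumulate(parts):
    """Add the levels of each received slice to those previously received."""
    total = [0] * 16
    for part in parts:
        total = [t + p for t, p in zip(total, part)]
    return total
```

Dropping the last slices simply leaves the higher scan positions at zero, which is what gives the graceful quality reduction; receiving all slices restores the original levels.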
Experiments did not alter encoding rules.
No spatial intra modes in enhancement layer – some issues in current software. Interaction with
notion of not using spatial domain for SNR scalability.
Currently available software doesn’t yet implement transform-domain prediction.
Inter-layer prediction needs clarification regarding intra prediction processing – concept is
workable but decoder is complex. Some options:
– Disallow spatially-predicted Intra in enhancement layers (seldom selected anyway –
typically IntraBL is used)
– Treat IntraBL neighbor as Inter (not available) for purposes of constrained intra prediction.
JVT decision: 2nd approach is adopted.
How to finalize other issue from last meeting: “Number of base layer macroblocks that need to be
decoded in order to form an IntraBL predictor should be limited. [T. Wiegand, details TBD]” –
some details may need finishing. Addressed in JVT-W070 – intra MBs in base layer that are
required for decoding the enhancement layer shall not exceed the number of IntraBL
macroblocks in the base layer times 1.5. JVT decision: Agreed.
Used QP difference of 6, two MGS layers. Coefficients that are received are added to the ones
previously received.
Some mismatch using current software, but basic concept seems understood and verified.
Without encoder-decoder mismatch problem, there would be no difference in the PSNR of the
high bit rate point – only a difference in bit rate.
Complexity? This is a small change to decoding parsing process – very small impact.
JVT decision: Adopted.
5.1.1.1.2 JVT-W115-QV (Late Info) [A. Segall] CE1: Verif JVT-W090 simplified FGS
This document reports a verification of JVT-W090. The proponents provided Sharp with source
code and simulations results. Sharp inspected the source code and reported that it confirmed that
it matched the proposal. Additionally, Sharp compiled the source code, re-generated the results
reported in JVT-W090, and randomly checked data points between results generated at Sharp and
provided by the proponent. All checks matched, and the results in JVT-W090 were reported to
have been verified.
Verified using provided source code. All checked sequences matched.
5.1.1.1.3 JVT-W111 ( Prop 2.2) [M. Karczewicz, S. Park, H. Chung] CE1: Report on
FGS simplif
This contribution proposes changes to the FGS joint significant and refinement coefficient coding
method described in JVT-V077. The results reportedly indicate that the joint significant and
refinement coefficient coding does not degrade the performance – the average improvement on
all tested CIF sequences is reported to be 0.46% and 4CIF sequences to be 0.7%.
JVT-V077/JVT-W121 with simplified sign coding.
Remark: Suggestion to have proponents of JVT-W111 and JVT-W121 confer and report back.
Further discussion then held on Thursday. Merged proposal presented as JVT-W121r1. Merged
proposal (upload as rev of 121 doc). No penalty at first FGS layer, average penalty goes up to
0.4% for higher FGS layers. JVT decision: Adopted (to FGS part of JSVM).
5.1.1.1.4 JVT-W124-QV (Late Info) [J. Ridge] CE1: Verif JVT-W111 FGS simplif
The results presented in JVT-W111 were verified and found to be correct.
5.1.1.1.5 JVT-W121 ( Prop 2.2.1/3.1) [J. Ridge, X. Wang] CE1: FGS refinement pass
simplif
This contribution proposes to include FGS refinement pass coefficients in the run-length codes
previously only associated with significance pass coefficients. While the distinction between
“significance” and “refinement” coefficients would remain, there would no longer be a distinct
“significance pass” and “refinement pass”. Sign bits for non-zero refinement values for a block
would be grouped and transmitted after the end-of-block is reached. It is claimed that this
proposal would simplify the FGS VLC algorithm, both in terms of specification and
implementation, because there would be no need for two different coding algorithms for
significance and refinement passes and because coefficients would be decoded in sequence. An
average coding penalty of 0.4% bit rate is reported to be associated with this proposal for QCIF
and CIF sequences.
Same as prior JVT-V077. See notes in section on JVT-W111.
5.1.1.1.6 JVT-W119 ( Prop 2.0/3.1) [Y. Bao, M. Karczewicz, X. Wang, J. Ridge, Y. Ye,
W. J. Han, S. Y. Kim] CE1: FGS simplif
This contribution reports results of CE1 on FGS simplification to address the concerns on FGS
complexity. This contribution proposes to align FGS layer coding with a H.264/AVC baseline
base layer and make decoding process with Cycle Aligned Fragment mandatory to reduce the
computation complexity and simplify the FGS specification. These two changes along with other
simplifications reportedly make it possible to reduce the FGS text to around 30 pages.
Text editing and other simplification of FGS. Several changes discussed and evaluated. Some
doubt expressed about AR-FGS aspect – results not yet available to confirm the simplification.
Aspects seem generally agreed.
JVT decision: Adopted (into FGS JSVM “Annex A”, which is an ongoing study item; 90 pages
→ 43 pages, which still includes about 10 or 11 pages of duplicated stuff for context).
5.1.1.1.7 JVT-W120 ( Info) [P. Yin] CE1: Verif JVT-W119 FGS simplif
This contribution reports cross-check result for the proposal by Qualcomm as described in
document JVT-W119 “CE1 report: FGS simplification”. The source code and configuration files
were provided by Qualcomm. The provided source code was compiled and the encoder and
decoder executables were run with the provided configuration files. All results in terms of R-D
were reportedly the same as those provided by Qualcomm. The decoder crashed for Crew 4CIF
at one point.
Verifies JVT-W119.
5.2 CE 2 & related docs: SVC ESS improvement
5.2.1.1.1 JVT-W030 ( Prop 2.2.1) [X. Wang, J. Ridge] CE2: Improvement of MB
mode pred in ESS
This proposal is a CE report on JVT-V108 with more results provided. In the current JSVM,
inter-layer prediction on macroblock mode in ESS is based on partition information derived from
base layer. More exactly, only if two blocks in an enhancement layer macroblock share the same
partition from base layer, these two blocks can be merged into one. Such a method is asserted to
tend to unnecessarily create smaller macroblock partitions and sub-partitions, which would in
turn reportedly incur more interpolation complexity in motion compensation. JVT-V108
proposes a method in which two blocks may be merged into one as long as they share the same
reference frame index and have similar motion vectors from the base layer. Further results
provided in the report assert that the proposed method can effectively solve the alleged problem
with essentially the same coding efficiency.
Try to combine base layer blocks into larger partitions for mode prediction (when reference index
is the same and MVs are close in value). No change to coding efficiency reported. Significant
reduction in use of small block sizes.
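The merge condition described above can be sketched as a simple predicate. The dictionary layout, the function name, and the `mv_threshold` value (in motion vector units) are assumptions for illustration; the report says only that the motion vectors must be "similar" and does not give the threshold.

```python
def can_merge(block_a, block_b, mv_threshold=1):
    """Return True if two base-layer-derived blocks may be merged.

    Each block is a dict with 'ref_idx' (reference frame index) and
    'mv' (an (x, y) motion vector). `mv_threshold` is a hypothetical
    per-component tolerance, not a value from the proposal.
    """
    if block_a["ref_idx"] != block_b["ref_idx"]:
        return False
    dx = abs(block_a["mv"][0] - block_b["mv"][0])
    dy = abs(block_a["mv"][1] - block_b["mv"][1])
    return dx <= mv_threshold and dy <= mv_threshold
```

With a threshold of zero this reduces to requiring identical motion, which is the case where the remark below notes that a merge changes nothing for blocks of 8x8 or larger.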
Remark: A “merge” has no effect if the motion vectors are equal and the block sizes are 8x8 or
larger, so no text change is needed for that case.
JVT decision: Adopted (even where it may make no difference).
5.2.1.1.2 JVT-W058 ( Info) [E. Francois] CE2: Cross-check of JVT-W030 on ESS
mode pred improvement
This document reports cross-check results of proposal JVT-W030 entitled ‘CE2 report:
Improvement of macroblock mode prediction in ESS’ from Nokia. As a verification task, textual
specification and corresponding JSVM software implementation were reported to have been
verified and coding and decoding performance check was reported to have been carried out. The
results presented in JVT-W030 were reported to be confirmed and the implementation within the
JSVM software was confirmed to match with the proposed textual specification.
Text was checked against software, software was available last time, software was tested. No
problem reported.
5.2.1.1.3 JVT-W117 ( Prop 2.2/3.1) [Y. Ye, Y. Bao] CE2: Improved resid upsampling
for ESS
JVT-V115 proposed a change to the residual upsampling process in ESS. In JVT-V115, the
residual upsampling scheme makes the decision about whether to use bilinear interpolation or
nearest neighbor copying based on the relative block alignment between the base layer and
enhancement layer transform blocks. The scheme proposed in JVT-V115 was reported to
(slightly) improve coding performance for commonly used ESS scaling ratios, and to improve
visual quality. The contribution proposes a modified scheme that is mostly based on JVT-V115,
along with another decision making based on the base layer block type (intended to further
reduce blocking artifacts). The proposed scheme was reported to achieve small but consistent
coding performance improvement over the reference JSVM_7_13 for all testing conditions
specified in CE2. Reconstructed video quality was reportedly examined and reported to also show
visible improvement.
Changes when base layer residual edge (either 4x4 or 8x8) without an 8x8 enhancement layer
edge – to use bilinear rather than nearest-neighbor – except when the edge is intra/inter.
Remark: Predictor (e.g., motion vector) may be different across that edge – predicting the
residual across that edge seems questionable.
Addresses prior question of bilinear all the time negatively.
Basically no difference in PSNR measure quality – perceptual argument.
Significant visual improvement (subjectively) reported.
“Cherry picking” of results to report? Perhaps some.
See notes in section on JVT-W105.
5.2.1.1.4 JVT-W106-QV (Late Info) [X. Wang] CE2: Verif Qualcomm JVT-W117
improved resid upsamp for ESS
The purpose of this document is to verify results in JVT-W117 from Qualcomm.
Compiled and compared PSNR results. Did not check for subjective improvement.
Question re: “Submission of final software and results [to CE partners]: next meeting – 2 weeks”
– was this followed?
Proposal changed somewhat since last meeting – due to some artifacts discovered relating to
intra/inter switch boundary.
See notes in section on JVT-W105.
5.2.1.1.5 JVT-W105 ( Prop 2.0) [X. Wang, J. Ridge] Study on residual upsampling
without block boundary check under ESS
This proposal provides study results on the topic raised in JVT-V115. In JVT-V115, a method
was proposed so that under ESS bilinear interpolation is performed across a base layer block
edge if the edge falls within an enhancement layer transform block. By doing so two things may
reportedly be achieved: 1) visual improvement (picture less blocky); 2) slight coding gain (about
1%). The contribution asserts that, for the case of ESS, doing bilinear interpolation without
block boundary check can achieve essentially the same results claimed in JVT-V115. Visual
quality is asserted to not show a visible difference from JVT-V115.
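The difference between interpolating with and without a block boundary check can be illustrated in one dimension. This is a simplified floating-point sketch under stated assumptions: the sample-position mapping, the 4-sample block size, and the function name are chosen for the example, and the sketch does not model the enhancement layer transform block test of JVT-V115 or the integer arithmetic of the draft text.

```python
import math


def upsample_residual_1d(base, n_out, block_size=4, boundary_check=True):
    """Upsample a 1-D base-layer residual row to n_out samples.

    With boundary_check=True, bilinear interpolation is used only when both
    neighbouring base samples lie in the same base-layer block; across a
    block edge the nearest sample is copied. With boundary_check=False,
    bilinear interpolation is always used.
    """
    n_in = len(base)
    out = []
    for i in range(n_out):
        # Map the centre of output sample i back onto the base grid.
        x = (i + 0.5) * n_in / n_out - 0.5
        x = min(max(x, 0.0), n_in - 1.0)
        left = int(math.floor(x))
        right = min(left + 1, n_in - 1)
        frac = x - left
        crosses_edge = (left // block_size) != (right // block_size)
        if boundary_check and crosses_edge:
            out.append(float(base[left] if frac < 0.5 else base[right]))
        else:
            out.append(base[left] + frac * (base[right] - base[left]))
    return out
```

For a residual row that steps from 0 to 8 at a block edge, the checked variant keeps the step sharp, while the unchecked variant produces intermediate values across the edge; this is the smoothing that the proposals argue over.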
Residual prediction concept fails when predictor is substantially different across a base layer
block edge.
Suggests that an encoder can detect situations where such a failure to create an adequate predictor
may occur (e.g., MV discontinuity). In the reported test, intra macroblocks were conceptually
assigned a zero motion vector value.
Remark: Testing for block boundaries, and making the upsampling process depend on that, seems
to create a decoder burden. On the other hand, sometimes the “all the time” technique will
require extra lower-layer residual blocks.
Remark: What about dyadic case? Response: The proposal considers only ESS. Remark:
Treating dyadic as a special case seems undesirable from a design perspective.
Question: How about intra/inter switch boundary?
Remark: How about the Crew sequence? Response: Haven’t checked – the problem report had
focused on Foreman.
Remark: The dyadic case should not be changed – there has been a lot of experience with that,
and it creates more cases where extra residual block reconstruction can be avoided.
Remark: This (particular) encoder design does not avoid all artifacts – some failure cases remain.
Remark: It’s not clear whether the JVT-W117 method will avoid all artifacts either.
Remark: Bad failure cases should not be very difficult for an encoder to detect.
Suggestion: Failure cases are likely to remain, no matter what. Encoders will ideally need some
kind of detection and avoidance. Primarily consider two factors: Decoder implementation
friendliness, and stability and consistency of design.
Neither proposal, as proposed, changes the dyadic case.
Three main options considered:
– Do nothing
– As proposed in JVT-W117
– As proposed in JVT-W105
Opinions expressed were evenly divided between the three – no consensus for change.
No action taken on decoder text.
JVT decision: Adopt non-normative JVT-W106 encoder problem detection trick into JSVM.
5.2.1.1.6 JVT-W109-LV (Late Info) [E. Francois] Verif JVT-W105 on residual
upsampling without block boundary check under ESS
This document reports cross-check results of proposals JVT-W105 and JVT-W123 that both
relate to residual upsampling in ESS. Both contributions propose solutions for reducing visual
artifacts caused by residual upsampling. As a verification task, a coding and decoding
performance check was reportedly carried out. The results presented in JVT-W105 and JVT-W123 were reported to have been confirmed.
The software implementing the proposals had been provided. Binaries were reportedly
regenerated from these versions and used for generating the cross-check results, both for the
original version and the modified ones.
The verification reportedly consisted of encoding and decoding, and checking that the provided
figures of JVT-W105 and JVT-W123 fit with the results obtained.
Results data have been verified for the following configurations considered in document JVT-W105 and JVT-W123:
– ratio 3/2: verification on bus, mobile, foreman and football sequences.
– ratio 4/3: verification on crew and soccer sequences.
– 3 layers: verification on crew sequence.
For all the performed verification tests, the encoder and decoder were reported to match perfectly.
The decoded results were also reported to perfectly match the results provided in JVT-W105 and
JVT-W123.
See notes in section on JVT-W105.
5.2.1.1.7 JVT-W123 ( Prop NN2.2.1) [X. Wang, J. Ridge] Analysis of visual artifacts
in ESS residual pred
This contribution is a non-normative proposal that aims to address the issue of possible visual
artifacts in ESS reported in JVT-V115. Detailed analysis of those areas with artifacts is asserted
to reveal that the artifacts were caused by residual prediction with non-matching residuals. In
this proposal, during encoding process such areas are identified so that a different R-D measure
may be applied to prevent those visual artifacts. Results are asserted to show that with such a
method the visual artifacts can be prevented while coding efficiency is preserved.
See notes on JVT-W105 and JVT-W117.
5.3 CE 3 & related docs: SVC subband coding
5.3.1.1.1 JVT-W097 ( Prop 2.2/3.1) [S.-T. Hsiang] CE3: Intra-frame dyadic spatial
SVC based on subband/wavelet filter banks framework
This contribution reports CE3 results based on the previous contributions U133 and V084 that
attempt to integrate the subband coding framework with the current JSVM for improved dyadic
spatial scalable coding. Further simulation results are provided for Intra-coding under the
CAVLC entropy coding mode. It also reports the results for dyadic spatial scalable coding under
the long delay test condition utilizing the proposed algorithm for coding Intra frames only.
For intra-only coding average bitrate saving around 8% for QCIF-CIF, for uniform subband
quantization in 4CIF 20% as compared to JSVM. For inter coding with long delay (where only I
frame is wavelet coded) small loss as compared to JSVM.
Questionable whether for the intra case (where the lower layer uses a different reference) PSNR
comparison is valid.
Only works for intra, dyadic, progressive. Not clear if it can be combined with bit-depth
scalability.
This will not have a home in the current development. Very questionable where this would go in
any future profile. Useful only for intra-only case, which is already covered in profile B intra.
JVT decision: Adopt to JSVM, but not with the automatic assumption that this will go into a draft
by next meeting. Further evidence requested what it is good for; otherwise it may be removed.
5.3.1.1.2 JVT-W122-QV (Late Info) [J. Ridge] CE3: Verif JVT-W097 wavelet-based
intra dyadic spatial SVC
The results presented in JVT-W097 for the intra-only case were reportedly verified through
compilation of source code provided by the proponent. The wavelet coder results reportedly
appeared to match precisely.
There was reportedly a slight difference in the reference results, which reportedly appears to have
been due to a difference in the JSVM software version used by the proponent and the verifier.
However, this small discrepancy does not reportedly appear to have materially affected the
conclusions.
Results for the long-delay case were not fully verified, reportedly due to time constraints.
Visual results at the highest layer reportedly correlate with the PSNR results. At the lower layer,
the wavelet results reportedly are naturally sharper due to the difference in filter.
5.4 CE 4 & related docs: SVC bit-depth scalability
See also the closely-related ad hoc group report JVT-W010.
5.4.1.1.1 JVT-W102 ( Prop 2.2/3.1) [Y. Gao, Y. Wu] CE4: SVC bit-depth scalability
simulation results
This contribution presents simulation results of bit depth scalability with the technique proposed
in JVT-V061. This technical solution to bit depth scalability is asserted to be compliant with the current
SVC standard. The contribution indicates that there is no new syntax element needed to support
bit depth scalability. Only a process of inter-layer bit depth prediction using a fixed left shift is
invoked during the decoding process. The software integration and the test conditions in the
performance test are subject to JVT-V304 and the conclusions from the Ad hoc Group of bit
depth and chroma format scalability. Simulations were reportedly performed with eleven video
sequences that covered a variety of bit depth/tone mapping approaches to create the 8-bit and/or
10-bit version from the same source video content. Detailed experimental results were also
provided.
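Inter-layer prediction by a fixed left shift can be sketched as below, assuming an 8-bit base layer and a 10-bit enhancement layer. The function names and the list-of-samples representation are assumptions for the example; how the prediction is integrated into the decoding process is not shown.

```python
def predict_from_base(base_samples, base_bits=8, enh_bits=10):
    """Predict enhancement-layer samples by left-shifting base-layer samples."""
    shift = enh_bits - base_bits
    return [s << shift for s in base_samples]


def inter_layer_residual(enh_samples, base_samples, base_bits=8, enh_bits=10):
    """Residual left for the enhancement layer to code after prediction."""
    pred = predict_from_base(base_samples, base_bits, enh_bits)
    return [e - p for e, p in zip(enh_samples, pred)]
```

Note that with a shift of 2 the largest 8-bit value, 255, maps to 1020 rather than 1023, so the top of the 10-bit range can only be reached through the coded residual.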
Remark: No actual energy in some of the higher bit depth video.
Remark: Seems like the most obvious way to do bit depth scalability. Related remark: This
requires 10 bit motion comp in the enhancement layer (although still single-loop).
Remark: There was a competing proposal JVT-V078 at last meeting. That proposal used 8 bit
motion comp and had multiple mappings to the enhancement layer, including the one in this
contribution as one of them. It was planned to also be evaluated in the same core experiment, as
reported in JVT-V304.
Remark: Test conditions and test material were available late, so the JVT-V078 proponents did
not have adequate time for preparation of experiment results. JVT rules refer to a need for
availability of necessary material by three weeks prior to the meeting and availability of final
software and results by 2 weeks prior to the meeting. It was remarked that these deadlines were
not fully met, such that some material needed for the experiments was only available at the last
minute before the JVT ordinary contribution deadline.
Question: Is the software part of the contribution? Response: It can be provided.
The contribution appears to show that the proposed method is an effective way of achieving bit
depth scalability with a substantial advantage over simulcast.
We tentatively agree to accept the relative performance reported in this contribution, relative to
single layer, as representative of the capability of the technique (in the absence of further
evidence).
Continue CE – suggestion to crop pictures to 4CIF this time.
5.4.1.1.2 JVT-W116 ( Info) [A. Segall] CE4: Verif JVT-W102 (Thomson prop)
This document reports the verification status of JVT-W102, which is titled "CE4: Bit-depth
Scalability Simulation Results". The proponents provided Sharp with source code and
simulation results. As of April 17, 2007, the verification is ongoing.
Experiments finished so far have been successfully verified; more are ongoing.
5.4.1.1.3 JVT-W113 ( Prop 2.2) [A. Segall, Y. Su] System for bit-depth scalable
coding
A system for the scalable coding of higher bit-depth and/or larger dynamic range video sequences
is reported. The approach is reportedly motivated by applications that do not utilize linear
scaling to generate a lower bit-depth image from the higher bit-depth sequence. Examples
include gamma correction, color correction, dynamic range limiting or other forms of tone
mapping. The proposed design employs a modified inter-layer prediction scheme that consists of
a series of shifts and adds (signaled in the bitstream like intra prediction modes) and addresses
relationships between luma and chroma. The proposed process is spatially varying, and it is
signaled in a manner similar to intra-prediction modes within AVC/SVC.
Suggests a reportedly efficient way of doing inverse tone mapping. Approach seems worth
studying.
Intra-only results were shown. Benefit reported for inter-layer prediction mapping scheme.
Asserts that either JVT-W102 approach or JVT-V078 approach can use this technique. Requests
inclusion in CE.
It may be beneficial to test this concept in either scheme (JVT-W102 or JVT-V078).
5.4.1.1.4 JVT-W076 ( Prop 2.2) [J. Jia, H. K. Kim, H. C. Choi, J. J. Yoo] SVC
chroma format scalability
From an investigation on the current SVC draft for chroma format scalability, it was reported that
current SVC design works for most of the cases when the chroma format scalability is combined
with the spatial, temporal or/and quality scalability – where both luma and chroma components in
the enhancement layer are encoded. However, there is one case where the current standard draft
was reported to not work well in terms of coding efficiency performance. That is when only
chroma format scalability is applied to an enhancement layer. The current draft specification is
reported to have been designed to code all the information regarding the luma and chroma parts
together for an enhancement layer while in a chroma-only scalability case, luma related
information would not be required because that information is already coded in the lower layer.
Question: How much bit rate is saved by customizing for this case? Not reported. Group
suggestion: provide results. Contribution noted.
5.5 SVC high-level syntax
5.5.1.1.1 JVT-W020 ( Prop 2.0) [Z. G. Li, S. Rahardja, S. L. Xie and W. Yao]
Hypothetical reference decoder for video coding
This document provides two different methods for the conformance of a coded scalable bitstream
to hypothetical reference decoder (HRD).
When digital video is compressed, the coded bit rate may vary significantly over time. The
bitstream is sometimes transmitted over a reliable channel at a constant bit rate (CBR). While
there is no packet loss in such a scenario, some jitter may occur among the packets. The buffer
size of encoder picture buffer (EPB) associated with an encoding process and that of coded
picture buffer (CPB) associated with the corresponding decoding process are finite. Hence the
encoder must constrain the bit-rate variation such that a hypothetical reference decoder (HRD)
with a predefined buffer size can decode the bitstream without resulting in any overflow or
underflow (in non-low-delay operation). In the classical constant-delay mode, the coded data can
be removed at the computed removal time while the decoding and display times preserve the
output (possibly fixed) frame rate.
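The buffer constraint described above can be illustrated with a simplified leaky-bucket check. This is a sketch under stated assumptions, not the normative HRD: bits are assumed to arrive at a constant rate with no jitter, each picture is removed instantaneously at its removal time, and time is counted in whole frame intervals to keep the arithmetic exact. All names and numbers are hypothetical.

```python
def check_cpb(frame_bits, bits_per_interval, cpb_size, initial_delay_intervals):
    """Check a CBR delivery schedule against a coded picture buffer of finite size.

    frame_bits: coded size of each picture, in bits, in decoding order.
    bits_per_interval: bits delivered per frame interval (bit rate / frame rate).
    cpb_size: buffer capacity in bits.
    initial_delay_intervals: frame intervals between first bit arrival and first removal.
    Returns ("ok", None), ("overflow", n) or ("underflow", n) for picture index n.
    """
    total_bits = sum(frame_bits)
    removed = 0
    for n, size in enumerate(frame_bits):
        # Bits delivered by the removal time of picture n (delivery stops at end of stream).
        arrived = min(bits_per_interval * (initial_delay_intervals + n), total_bits)
        fullness = arrived - removed  # buffer occupancy just before removal
        if fullness > cpb_size:
            return ("overflow", n)
        if fullness < size:
            return ("underflow", n)  # picture n has not fully arrived yet
        removed += size
    return ("ok", None)


frames = [300, 50, 50, 100]
print(check_cpb(frames, 100, 400, 3))  # ('ok', None)
print(check_cpb(frames, 100, 400, 2))  # ('underflow', 0)
print(check_cpb(frames, 100, 250, 3))  # ('overflow', 0)
```

The example shows the trade-off the contribution is concerned with: a shorter initial delay leads to underflow for the large first picture, while a smaller buffer overflows before that picture can be removed.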
This contribution addresses the constant delay mode for both non-scalable video coding and
scalable video coding (SVC). It asserts that the sending rate can be greater than the coding rate
and there may be jitter, and the dynamics of EPB and CPB can be nonlinear because of the
possible saturation, and the EPB and CPB may not be complementary. Iterative algorithms are
designed for the HRD by taking both the jitter and the total size of coded bitstream into
consideration. This is reportedly necessary to minimize the values of buffer size and initial buffer
delay when there is saturation on the dynamics of EPB and CPB. An interpolation algorithm is
also presented such that the coding rate and the sending rate are decoupled as in the prior design.
SVC is composed of a base layer and (possibly several) enhancement layers, and each
enhancement layer has its “base layer”. The conformance of each layer is proposed to be checked
by defining the corresponding constraint for each layer by the proposed method. The base layer
has two transmitted values: the buffer size and the delay between storing a picture in the buffer
and starting the decoding of that picture. Each enhancement layer is proposed to transmit two
values: the difference between the buffer size in the layer and its “base layer” and the difference
between the delay in the layer and its “base layer” (using a coded difference to enable efficient
representation). Two different methods are proposed for the HRD of SVC. In the first method,
the sizes of all frames from the base layer to the current enhancement layer are used to compute
the buffer size and the delay in the layer. In the second method, only the sizes of the current
enhancement layer data are used. It is reportedly observed that the values obtained by the first
method are usually smaller while those by the second one are more scalable.
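The differential signaling proposed above can be sketched as follows. This is only an illustration of coding per-layer differences against each layer's own "base layer"; the data layout, the layer numbering (a reference layer is assumed to have a smaller identifier than the layer that depends on it) and the values are assumptions, not the syntax proposed in JVT-W020.

```python
def code_layer_parameters(params, ref_layer):
    """Replace absolute (buffer_size, delay) pairs by differences to the reference layer.

    params: {layer_id: (buffer_size, delay)} with absolute values.
    ref_layer: {layer_id: reference layer id, or None for the base layer}.
    """
    coded = {}
    for layer_id, (size, delay) in params.items():
        ref = ref_layer[layer_id]
        if ref is None:
            coded[layer_id] = (size, delay)  # base layer: absolute values
        else:
            ref_size, ref_delay = params[ref]
            coded[layer_id] = (size - ref_size, delay - ref_delay)
    return coded


def reconstruct_layer_parameters(coded, ref_layer):
    """Inverse of code_layer_parameters; assumes reference ids are smaller than dependents."""
    params = {}
    for layer_id in sorted(coded):
        ref = ref_layer[layer_id]
        if ref is None:
            params[layer_id] = coded[layer_id]
        else:
            params[layer_id] = (params[ref][0] + coded[layer_id][0],
                                params[ref][1] + coded[layer_id][1])
    return params


# Hypothetical values: buffer size in bits, delay in 90 kHz clock ticks.
absolute = {0: (100000, 90), 1: (150000, 120), 2: (260000, 150)}
references = {0: None, 1: 0, 2: 1}
coded = code_layer_parameters(absolute, references)
print(coded)  # {0: (100000, 90), 1: (50000, 30), 2: (110000, 30)}
print(reconstruct_layer_parameters(coded, references) == absolute)  # True
```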
Constant delay mode in case of channel jitter. EPB and CPB may no longer be symmetric then.
Proposes two algorithms considering initial buffer fullness. For the base layer, buffer size and
delay are transmitted; for enhancement layers, the differences in buffer size and delay relative to
the corresponding base layers are transmitted.
Only focuses on CBR case. Claim that main idea can still be used for VBR. In principle no
problem with the current approach. General opinion that the amount of bits potentially saved by
differential coding is not worthwhile to consider this.
HRD design is critical to the standard.
Focuses on CBR case.
Remark: What about the notion of multiple schedules?
Remark: SVC is designed as a single-loop syntax with data partitioning. Current syntax and
HRD design conceptually can apply – refer to meeting report of last meeting (“submitted as JVT-V068r4. Other parts of JVT-V068 adopted (separate HRD parameters for each, and include
temporal level in scalable nesting SEI).”)
Remark: Per meeting report of last meeting, current design seems conceptually similar to first
proposed variant. Proposal suggests to code differences rather than totals, saving some bits at
SPS level.
Remark: Differences are relative to what (considering multiple schedules)?
Remark: Amount of bits this would save is not a problem of a magnitude worth fixing.
Contribution noted.
5.5.1.1.2 JVT-W046 ( Prop 2.2.1/3.1) [M. M. Hannuksela, Y.-K. Wang] Support for
SVC header rewriting to AVC
It is asserted that there are two structures for coded video sequences that would allow lightweight
SVC-to-AVC rewriting by removal of certain NAL units and NAL unit header SVC extensions
as well as conversion of SVC VCL NAL unit types to the corresponding AVC ones. In the first
structure, a temporal enhancement is provided as an enhancement layer to a Baseline profile base
layer. In the second structure, more than one AVC stream is encapsulated within an SVC stream.
This contribution first proposes a change to the sequence parameter set SVC extension syntax
and slice header syntax to enable the lightweight SVC-to-AVC rewriting. It is further proposed
that syntax structure for the SVC-to-AVC conversion in the scalability information SEI message
is appended with an indicator of the conversion operation, such that interoperability information
of lightweight rewritten bitstreams can be signaled.
Proposal elements:
1) Flag in SPS extension for “trivial” rewriting ability
2) Alignment of SVC slice syntax with AVC slice syntax (seems OK, but let’s make sure)
3) Scalability info SEI appended with “conversion type” info
4) slice_type values, adding “all conceptually the same kind” indication
JVT decision: Adopted.
Question: Effect on deblocking filter of slice skip or other rewriting tricks?
Answer: If set base_mode_flag equal to 1 and residual_prediction_flag to 1 without sending
coeffs, should inherit the CBP and QP and transform_size_8x8_flag from the base layer for
deblocking purposes. Also deblock IntraBL as Intra. Also follow this spirit if we notice similar
issues, conditioning on simple rewriting flag if it would be inappropriate not to. JVT decision:
Agreed.
5.5.1.1.3 JVT-W047 ( Prop 2.2.1/3.1) [M. M. Hannuksela, Y.-K. Wang] Pictures not
for output in SVC
It is asserted in this contribution that there are two sources of needs for indicating whether
decoded pictures are to be output. First, it is assumed to be a desirable feature that the layer with
the highest dependency_id may be coded with a lower temporal resolution than its base layer. In
such a coded stream, certain enhancement layer slices are coded as “skipped” and should not be
output. Second, thinning of a scalable bitstream may result in a decoded sequence that is argued
to be of insufficient quality for output in the presented coding schemes, logo insertion and
discardable data adaptation. It is proposed that an output_flag is included in the SVC NAL unit
header and controls whether the decoded picture is marked as “needed for output” or “not needed
for output” in the decoded picture buffering process. It is additionally proposed that a syntax
element layer_output_flag[ i ] is included in the scalability information SEI message to indicate
which layers or operation points are not intended for output and hence should not be output.
JVT decision: Adopted.
5.5.1.1.4 JVT-W048 ( Prop 2.2.1/3.1) [M. M. Hannuksela, Y.-K. Wang, Y. Chen] On
SVC high-level syntax
This contribution proposes 1) a change to the semantics of sub-sequence information SEI
message to align with the latest definition of IDR picture, 2) a couple of constraints to the
semantics of store_base_rep_flag and idr_flag, and 3) some syntax changes regarding presence of
the syntax structure dec_ref_pic_marking_base( ).
1) adopted
2) 2.1 adopted, 2.2 (removing constraint on base layer IDR needing enhancement IDR) adopted.
3) Depends on use_base_representation, further studied during meeting, and adopted.
JVT decision: Adopted.
5.5.1.1.5 JVT-W051 ( Prop 2.2.1/3.1) [Y.-K. Wang, Y. Chen, M. M. Hannuksela] On
SVC scalability information related SEI messages
This contribution first proposes two technical changes to scalability information SEI message
among some editorial changes. The first technical change is inclusion of signaling for maximum
number of buffered decoded frames and maximum number of frames reordered for output for each
scalable layer. This signaling enables a decoder to allocate minimum decoded picture buffer size
for decoding a subset of the bitstream, and to start to output and display as soon as possible with
the minimum initial delay. The second technical change is inclusion of signaling of profile, level,
and bitrate information for quality layers. Furthermore, slight changes to some other SEI
messages are proposed to enable using common SEI messages for both SVC and MVC.
JVT decision: Adopt signaling of profile and bit rate indication for quality layers. Do not adopt
the “unification” with MVC (because it is far from clear at current point what MVC will need,
and does not make sense to complicate SVC for that). For further issues, see under JVT-W064
below.
5.5.1.1.6 JVT-W052 ( Prop 2.2.1/3.1) [Y.-K. Wang, M. M. Hannuksela] SVC
feedback based coding
It is assumed in this contribution that an encoder could avoid the use of the base representations
for inter prediction, if it has received feedback from the far-end decoder indicating whether all
the quality layers of the corresponding access unit were correctly decoded. It is asserted, however,
that decoders have no means to conclude whether all the quality layers of a particular access unit
have been received completely and decoded without mismatch. A quality layer integrity check
SEI message is proposed for enabling the presented feedback-driven usage of the base
representation in inter prediction. The message includes a cyclic redundancy check (CRC) code
calculated over the NAL units for which quality_id is greater than 0. A change regarding the
presence of the syntax element store_base_rep_flag is also proposed to enable the feedback based
coding.
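The integrity check proposed above can be sketched as a checksum computed over only those NAL units whose quality_id is greater than 0. This is a hedged illustration: the dictionary representation of a NAL unit is hypothetical, and zlib's CRC-32 is used as a stand-in because the contribution summary does not state which CRC polynomial or width is proposed.

```python
import zlib


def quality_layer_crc(nal_units):
    """CRC over the payloads of all NAL units with quality_id > 0, in order."""
    crc = 0
    for nal in nal_units:
        if nal["quality_id"] > 0:
            crc = zlib.crc32(nal["payload"], crc)  # running CRC across NAL units
    return crc


# Hypothetical access unit: one base-quality NAL unit and two quality refinements.
sent = [
    {"quality_id": 0, "payload": b"base-quality-data"},
    {"quality_id": 1, "payload": b"q1-data"},
    {"quality_id": 2, "payload": b"q2-data"},
]
signaled_crc = quality_layer_crc(sent)  # value the encoder would place in the SEI message

received_all = list(sent)
received_partial = sent[:2]  # the quality_id 2 NAL unit was lost in transit

print(quality_layer_crc(received_all) == signaled_crc)      # True
print(quality_layer_crc(received_partial) == signaled_crc)  # False
```

A decoder that finds a mismatch could report it in its feedback, so that the encoder keeps using the base representation for inter prediction.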
Remark: In case of MGS the decoder could potentially know the completeness when scalability
info SEI messages completely received. In case of FGS it would not be possible. Alternatively,
for MGS other more simple methods than CRC would be viable (e.g. signaling the maximum
quality layer).
JVT decision: Adopted.
For a given quality ID, should macroblock data be required to be present for the entire picture?
JVT decision: Only for quality ID = 0.
5.5.1.1.7 JVT-W137-B (BoG) [I. Amonou, N. Cammas, S. Kervadec, S. Pateux] Revised
syntax for quality layer SEI message
Reports modifications of quality layer SEI syntax needed for removal of FGS.
JVT decision: Adopt - Editors are also given discretion to “clean out” any further-identified
remnants of removed features.
5.5.1.1.8 JVT-W053 ( Prop 2.0/3.1) [M. M. Hannuksela, Y.-K. Wang, D. Singer, T.
Rathgen] SVC priority_id value setting method indication
SVC file format allows conveyance of multiple sets of alternative priority_id values for one SVC
bitstream. A server may re-label the priority_id values for all the NAL units with one set of
alternative priority_id values before sending the bitstream, thus to allow customized priority
based adaptation. For each set of alternative priority_id values, a field
priority_assignment_method_id is included to identify the method used to calculate the
priority_id values. This contribution proposes to include the indication of the priority calculation
method for the default set of priority_id values contained in the NAL unit headers in the
scalability information SEI message.
In principle useful. JVT decision: Adopt after revision of nt-string (per r2 of document).
5.5.1.1.9 JVT-W064 ( Prop 2.2/3.1) [J. Luo, L. Zhu, P. Yin, C. Gomila] VUI updates
for SVC
This contribution proposes to modify the H.264/MPEG-4 AVC Video Usability Information
(VUI) for the Scalable Video Coding (SVC) standard. The bitstream restriction information in
VUI is independent for each interoperability point (IOP). This contribution aims at modifying the
VUI to transmit bitstream restriction information for multiple IOPs. It is also considered how to
use SEI messages to convey bitstream restriction information for an H.264/MPEG-4 AVC
compatible layer.
Related to JVT-W051. Difference is putting bitstream restriction in VUI vs SEI. After offline
clarification with proponents of JVT-W051.
Seven bitstream restrictions proposed – identical to those already in VUI for entire bitstream.
Proposed to specify them per-layer. Issue of how to handle base-layer temporal subsequences
and SDP syntax.
Put in both places? No. Put them (all seven, with presence indicators as in current VUI) in
scalability info SEI. JVT decision: Agreed.
5.5.1.1.10 JVT-W091 ( Prop 2.2/3.1) [L. Cieplinski] HRD parameters for SVC
bitstream rewriting
An earlier contribution proposed extending the Hypothetical Reference Decoder for SVC to
include parameters to support bitstream rewriting for CGS. This contribution proposes an
alternative way of incorporating the additional parameters, which is claimed to result in less
significant changes to the specification.
Concern is raised about changing the picture timing SEI message. In principle, the HRD
parameters could also be determined when re-writing is done and need not be transmitted
beforehand. Application example / showcase about usefulness of the proposal needs to be
provided.
Question: Is the rewriting process fully specified? Unless we can fully and clearly specify the
rewriting process, how can we know what HRD parameters they will conform to? Don’t know
which pictures the translator will choose to pass onward, which enhancement layers it will
choose to include, etc. The same decoding process outcome may have multiple patterns of
expression on AVC syntax.
Contribution noted. Intriguing, but unable to accept in this form – ideas would need more
maturation.
5.5.1.1.11 JVT-W114 ( Prop 2.2) [A. Segall, J. Zhao] Showcase for transcoding
scalability info SEI
In the Marrakech meeting, JVT-V036r1 was adopted to add AVC bit-rate information to the SVC
Scalability Information SEI message. This contribution provides the required showcase of the
SEI modifications.
Audience is satisfied with showcase.
5.5.1.1.12 JVT-W125 ( Prop 2.2) [G. J. Sullivan] On SVC high-level syntax and
HRD
As SVC is designed as an extension of the AVC (ITU-T Rec. H.264 | ISO/IEC 14496-10)
specification, it is important to consider the relationship between future SVC bitstreams and
existing AVC decoders, and the relationship between different SVC decoders that are operating
in the same system environment. It is also important to establish appropriate buffering and
timing constraints to establish bitstream conformance, particularly including proper specification
of an SVC HRD. This contribution proposes several high-level syntax modifications and an
HRD design to address these issues. As an additional “clean-up” remark, the contribution also
suggests a modification to the definition of arbitrary slice order.
Issue 1 (SPS/PPS/SEI): Other proposals to address this were discussed during last meeting.
Remark: Similar reasoning applies to access unit delimiter NAL units – these could also be
subsumed into an SVC NAL unit type (and assigned a D,T,Q).
Clarified offline with Miska and discussed further. Suggestion: Use prefix NAL unit to assign
D,T,Q for SPS/PPS/SEI/filler (not AUD). Remark: Various implications discussed. No action.
Issue 2 (prefix NAL unit): Do not retain suffix NAL units. Type 14 prefix NAL units should
always be used instead. JVT decision: Adopted.
Issue 3 (filler data): JVT decision: Adopted using prefix NAL unit to assign D,T,Q.
Issue 4: Make sure that the NAL header bytes cannot cause start code emulation. JVT decision:
Adopted (exact form of header syntax to be determined).
Issue 5 (HRD): Possibility to add an informative clause about bitstream extraction (similar in
spirit to figure under 3), but it must be guaranteed that the extracted subset is still a conforming
bitstream.
Definition of VCL NAL units – should not have changed what an AVC non-scalable decoder will
do with NAL unit type 20 the HRD. JVT decision: Agreed.
Issue 5 (ASO): JVT decision: Adopted.
Remark: Removal of SVC SEI messages rather than NAL units? No. Remove SEI NAL units
not associated with any VCL NAL unit in the access unit - using prefix or content, which need to
be consistent. JVT decision: Agreed.
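For Issue 4 above, a candidate header layout can be screened for start code emulation with a simple byte scan. The sketch assumes the AVC convention that the byte-aligned patterns 00 00 00, 00 00 01 and 00 00 02 must not occur inside a NAL unit and that 00 00 03 is reserved for emulation prevention; the example header bytes are hypothetical and do not represent the adopted syntax, whose exact form was left open.

```python
def has_start_code_emulation(data: bytes) -> bool:
    """Return True if two zero bytes are followed by a byte in the range 0x00..0x03."""
    for i in range(len(data) - 2):
        if data[i] == 0 and data[i + 1] == 0 and data[i + 2] <= 3:
            return True
    return False


# Hypothetical header byte sequences (first byte is the one-byte AVC NAL unit header).
risky_header = bytes([0x6E, 0x00, 0x00, 0x01])  # extension bytes happen to form 00 00 01
safe_header = bytes([0x6E, 0x80, 0x00, 0x01])   # a bit forced to one breaks the zero run

print(has_start_code_emulation(risky_header))  # True
print(has_start_code_emulation(safe_header))   # False
```

One way a header syntax can guarantee this property is to fix at least one bit to one in a suitable position, so that two consecutive zero bytes cannot arise from header fields alone.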
Error resilience: 49, 50, 54, 62
5.5.1.1.13 JVT-W049 ( Prop 2.2.1/3.1) [C. He, H. Liu, H. Li, Y.-K. Wang, M. M.
Hannuksela] Redundant pictures in SVC
Redundant picture support is one of the error resilient tools in H.264/AVC for enhancing the
robustness to packet loss. Currently it is open whether to support the redundant picture feature for
SVC enhancement layers. This document provides simulation results comparing different coding
cases with or without coding of redundant pictures. It is proposed that the redundant picture
feature is supported for SVC enhancement layers and included into the Scalable Baseline profile.
Furthermore, an SEI message is proposed to contain redundant picture properties, based on which
a decoder can determine whether the redundant picture can be used for inter-layer prediction
when the corresponding primary picture is lost.
IDR in random access example can be realized by MGS enhancement picture?
Provides tests that seem to demonstrate usefulness of redundant pictures.
JVT decision: Adopt SEI message contingent on adoption of redundant coded pictures in a
profile.
Profile aspect open. Proposed to add to scalable baseline profile enhancement layers.
JVT decision: Adopted.
5.5.1.1.14 JVT-W050 ( Prop 2.2.1/3.1) [Y.-K. Wang, M. M. Hannuksela] On
tl0_pic_idx in SVC
Document JVT-V140 proposed to remove tl0_pic_idx from the NAL unit header and include it in
the RTP payload instead. In Marrakech, JVT decided to put the syntax element tl0_pic_idx at the
start of the slice header and the prefix/suffix NAL units, and planned to adopt the JVT-V140
approach in San Jose if IETF AVT would take action to adopt the JVT-V140 approach to the
SVC RTP payload format. In the March 2007 IETF AVT meeting, AVT adopted the signaling of
tl0_pic_idx, among others, in the payload content scalability information (PACSI) NAL unit that
can be present in the beginning of RTP packets. PACSI NAL unit is described in Section 6.10 in
the latest Internet-Draft of SVC RTP payload format, available from http://www.ietf.org/internet-drafts/draft-ietf-avt-rtp-svc-01.txt. It is asserted that the latest Internet-Draft of SVC RTP payload
format effectively satisfies the condition for adoption of JVT-V140 approach to SVC. Therefore,
it is proposed that, in the SVC specification, tl0_pic_idx is not included in the NAL unit header
or slice header and is only signaled in the SEI message as presented in JVT-V140 and copied in
this document with minor editorial changes.
See notes below in section on JVT-W062.
5.5.1.1.15 JVT-W062 ( Prop 2.2/3.1) [A. Eleftheriadis, S. Cipolli, J. Lennox]
Improved error resilience using temporal level 0 picture index
This contribution reviews the status and proposes further improvements to the concept of
tl0_pic_idx. The field was introduced in the NAL unit header extension of SVC in JD8
(Hangzhou) to address the behavior of an SVC decoder (and SVC systems in general) in the
presence of packet errors. It was shown that it is a way to use temporal scalability and multiple
reference pictures to implement “zero-delay ARQ”, something that was not possible with earlier
video coding systems. In JD9 (Marrakech), it was re-cast as an element of the slice layer (but in
exactly the same bitstream location), in an attempt to have a fixed-length NAL unit header in
SVC, with further action dependent on incorporation of the feature in the RTP payload format for
SVC. The field was subsequently adopted into the RTP payload format for SVC in the March
2007 IETF meeting in Prague, together with two additional flags (that signal the first and last,
respectively, NAL unit of a picture). This contribution first re-introduces the proposal for adding
these two associated flags that signal the first and last, respectively, NAL unit of a picture, in
order to address the case where the lowest temporal level picture data is transported over multiple
NALs. It is shown that, coupled with RTP sequence number tracking, this design allows
immediate detection of lost data for the lowest temporal level pictures both when no picture data
is received, as well as when partial data is received. This contribution further describes three
syntax designs for the tl0_pic_idx itself: a fixed-length NAL header, a variable-length NAL
header, and a design in which the tl0_pic_idx field is moved to a new SEI message. It is shown
that the fact that SEI messages can only appear at the beginning of an access unit, renders the SEI
solution ineffective if SVC NAL ordering is strictly followed. It is also shown that a further
limitation is the fact that SEI messages do not carry DTQ information in their NAL headers,
whereas the Scalable Nesting SEI message does not provide the needed T information. Finally,
the contribution identifies a bug in the current JD9 with proposed changes document, in that the
tl0_pic_idx is not shown as a payload to a suffix NAL unit, as adopted in the Marrakech meeting.
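The loss detection described above can be sketched as a gap check on consecutive tl0_pic_idx values. The sketch assumes that the index increases by one for each lowest-temporal-level picture and wraps modulo 256; the modulus and the function names are assumptions for illustration, and the additional use of RTP sequence numbers and first/last flags for partially received pictures is not modeled.

```python
def tl0_gap(prev_idx, new_idx, modulus=256):
    """Number of lowest-temporal-level pictures missing between two received indices."""
    step = (new_idx - prev_idx) % modulus
    # step == 0 means another NAL unit of the same picture; step == 1 means no loss.
    return 0 if step == 0 else step - 1


def detect_tl0_losses(received_indices, modulus=256):
    """Return (previous index, new index, pictures lost) for every detected gap."""
    losses = []
    for prev_idx, new_idx in zip(received_indices, received_indices[1:]):
        gap = tl0_gap(prev_idx, new_idx, modulus)
        if gap > 0:
            losses.append((prev_idx, new_idx, gap))
    return losses


print(tl0_gap(255, 0))                          # 0
print(tl0_gap(254, 1))                          # 2
print(detect_tl0_losses([10, 11, 11, 13, 14]))  # [(11, 13, 1)]
```

Because the check needs only the index carried with each lowest-temporal-level NAL unit, a receiver can request a repair as soon as the next such unit arrives, without waiting for a timeout.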
Proposal to put tl0_pic_idx in SEI message.
Offline discussion result is documented in section 3.3 of revision JVT-W062r3.
JVT decision: Adopt section 3.3 of JVT-W062r3.
5.5.1.1.16 JVT-W054 ( Info) [I. Radulovic, Y.-K. Wang, S. Wenger, A. Hallapuro,
M. M. Hannuksela] Multiple description coding using AVC redundant
pictures
Multiple description coding (MDC) reportedly offers a competitive solution for video
transmission over lossy packet networks, with a graceful degradation of the reproduced quality as
the loss rate increases. This paper describes how redundant pictures, an error resilience tool
included in H.264/AVC, can reportedly be employed in conjunction with MDC, in a standard
compliant manner. It is asserted that comparisons with state-of-the-art techniques show a superior
performance of the scheme, both in terms of an average PSNR, and in the smoothness of the
reconstructed video.
Document for information only.
5.5.1.1.17 JVT-W068 ( Prop 2.2/3.1) [C. Tu, S. Srinivasan, S. Regunathan, G.
Sullivan] CE4: 4-tap MC interp for high-res SVC enh layers
This contribution proposed a 4-tap motion compensation interpolation filter for SVC
enhancement layers. It is proposed to shorten the SVC motion compensation interpolation filters
from 6 taps to 4 taps in order to reduce computational complexity. The 4-tap filter can reportedly
be implemented using 16-bit only arithmetic. Coding performance was demonstrated, which is
reportedly comparable (around 0.03 dB better on average) to the current 6-tap filters, and
reportedly outperforming the H.241 RCDO interpolation method for 4CIF sequences. For CIF
sequences, although on average performance penalty was reportedly around 0.2 dB, it was
comparable to the 6-tap filters on some sequences. It was proposed to adopt this 4-tap motion
compensation interpolation filter for luma for SVC high resolution (for example, standard
definition and higher, or 720p and higher) enhancement layers, and to use it as an optional
interpolation filter for low resolution enhancement layers.
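To make the comparison concrete, the following sketch computes one half-sample luma value with the 6-tap AVC filter (1, -5, 20, 20, -5, 1)/32 and with a 4-tap filter. The 4-tap coefficients (-1, 5, 5, -1)/8 are a hypothetical stand-in chosen for illustration; the coefficients actually proposed in JVT-W068 are not given in this report.

```python
AVC_6TAP = (1, -5, 20, 20, -5, 1)  # normalized by 32 (shift of 5)
EXAMPLE_4TAP = (-1, 5, 5, -1)      # hypothetical taps, normalized by 8 (shift of 3)


def clip8(value):
    """Clip to the 8-bit sample range."""
    return max(0, min(255, value))


def half_sample(samples, pos, taps, shift):
    """Interpolate the half-sample position between samples[pos] and samples[pos + 1]."""
    start = pos - (len(taps) // 2 - 1)  # leftmost integer sample covered by the filter
    acc = sum(t * samples[start + k] for k, t in enumerate(taps))
    return clip8((acc + (1 << (shift - 1))) >> shift)  # round, normalize, clip


edge = [10, 10, 10, 200, 200, 200]  # step edge between index 2 and 3
print(half_sample(edge, 2, AVC_6TAP, 5))      # 105
print(half_sample(edge, 2, EXAMPLE_4TAP, 3))  # 105

isolated = [255, 0, 0, 0, 0, 0]  # only the outermost 6-tap sample is non-zero
print(half_sample(isolated, 2, AVC_6TAP, 5))      # 8
print(half_sample(isolated, 2, EXAMPLE_4TAP, 3))  # 0
```

The shorter filter reads four samples instead of six per output value, which is the source of the complexity and memory bandwidth savings claimed for it; the two filters differ only where the outer samples contribute.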
Actual numbers of operations (assuming given distribution of positions) not given. No results
with SNR scalability where the difference between interpolation results of base and enhancement
might be critical. For low resolution (CIF) average loss around 0.1 to 0.2 dB, maximum loss 0.5 dB for
Mobile. For higher resolution loss is almost negligible (on average, varying between -0.1 and
+0.1 dB).
Proponent recommends to make this switchable.
Showed some loss for low resolution video; approximately neutral for high resolution. Spatial
scalability and temporal scalability using B and P hierarchies.
Remark: How about SNR scalability? Not tested in contribution.
Remark: Comparative complexity analysis? Complexity of proposed filter is described in
contribution, but not alongside a comparative analysis relative to the current MC interpolation
method.
Remark: Test set seems limited.
Remark: Not much experience with 4 tap, would not be comfortable with complete replacement
of current method.
Remark: Rewriting feature impact?
It only seems reasonable, considering the above, to consider adding as an additional supported
feature rather than as a replacement.
Key pictures would need two motion compensations.
Not adopted.
Additional information uploaded in revision: Reported a 40% computation reduction and 23%
memory bandwidth reduction for 8x8 block size.
5.5.1.1.18 JVT-W072 ( Info) [H. Schwarz] Results comparing JSVM, 4-tap, and
RCDO MC interp.
In this contribution, simulation results comparing the coding efficiency of different sub-sample
interpolation filters for the luma component in SVC enhancement layers were reported. The
following interpolation filters had reportedly been tested: H.264/AVC interpolation filter as
currently specified in the JD, RCDO luma interpolation filter as specified in H.241, and the 4-tap
interpolation filter as proposed in JVT-V090.
Similar results to JVT-W068, but only reported for low-resolution case. Identical results to JVT-W068 for the overlapping test cases.
SNR scalability – QCIF and CIF.
5.5.1.1.19 JVT-W027 ( Info) [E. Francois, V. Bottreau, J. Vieron] Evaluation of 4
tap motion compensation interp
This document is an information contribution on the evaluation of the 4 tap motion compensation
interpolation proposed for SVC. The obtained results reportedly show that the 4 tap filter gives
equivalent results to the 6 tap one on many sequences but may have some important impact on
some specific sequences (Mobile QCIF/CIF with +12% rate increase, Bus QCIF/CIF with +4%
rate increase), which visually corresponds to less sharp pictures. Consequently the contributors
suggest to keep the 6 tap MC filter at least for SVC Profile B and to possibly consider the use of
the 4 tap MC filter for SVC profile A, rather dedicated to mobile applications and more
concerned by complexity issues.
This (information doc) reports similar results in terms of loss for high resolution (4CIF). Highest
PSNR decrease in case of City. Thomson recommends to keep the 6-tap filter (or make it
switchable) for profile B.
Bitstream rewriting would no longer be supported unless the 6 tap filter was used in the
enhancement layer.
In case of use_base_rep flag, it would also need to be disallowed (otherwise would need 2 MC
operations with the different filters).
Switchability would lead to more complex (in terms of gates) hardware. However, might save
battery lifetime (how much?)
Similar results reported as in JVT-W068 and JVT-W072.
5.5.1.1.20 Discussion of potential rearrangement of NAL unit order
A top-down ordering of SVC NAL units was suggested and discussed. One mentioned issue
relating to it was that an encoder would need to add delay for rearrangement of its bottom-up-generated NAL units into a top-down order. A decoder that receives things in a non-preferred
order within an access unit could, if it wishes, operate by buffering up the access unit to achieve
the processing order that its designer desires to follow. No action.
5.6 SVC applications and profiles
JVT decision: Branch out the software parts relevant for the current standard phase 1 as WD 1 of
reference software. The remaining part of JSVM (with other tools) will be further maintained
after that first step is done.
JVT decision: Editors are given discretion to put in any definition that we forgot to talk about.
5.6.1.1.1 JVT-W075 ( Prop 2.0/3.1) [M. Horowitz, A. Eleftheriadis] Max frame size
for enh layers of SVC profiles <withdrawn>
This contribution presents a mechanism for modifying the constraint imposed by the maximum
frame size (MaxFS) in the H.264 | AVC level specifications to support applications that require a
large range of frame size and frame rate combinations at a particular level (e.g., video
surveillance and video conferencing). For example, for a video surveillance application
compliant with a given level it may be advantageous to decode 720x480 video frames at 15 Hz
and at that same level decode 1920x1080 frames at 2 Hz. For this example, the MaxFS constraint
requires that level 4 (or higher) be specified to decode the larger frame size where level 2.2
would have sufficed for both video streams without the constraint. In the proposal, the MaxFS
column of Table A-1 is replaced with an expression for deriving MaxFS. The values for MaxFS
are derived so that the resource requirements (e.g., MB/s, DPB, etc.) are level-for-level identical
to the existing H.264 | AVC level structure. The context of this proposal is the enhancement
layers of the SVC profiles. That is, it is neither being proposed for the SVC base layer nor for
other existing profiles of H.264 | AVC.
Question: Current content of A.3.1 (similar in A.3.2).
e) PicWidthInMbs * FrameHeightInMbs <= MaxFS, where MaxFS is specified in Table A-1
f) PicWidthInMbs <= Sqrt( MaxFS * 8 )
g) FrameHeightInMbs <= Sqrt( MaxFS * 8 )
For example, Level 3 supports 5 pictures at 4CIF resolution. This proposal would also require
support of one picture with five times that number of macroblocks – a “20CIF” picture, but at a
five times lower maximum frame rate.
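As an illustration of the surveillance example above, the following sketch checks constraints e) to g) of A.3.1, plus the macroblock rate, for the two video formats mentioned. The MaxMBPS and MaxFS figures are quoted from memory of Table A-1 and should be checked against the published table; the function and dictionary names are hypothetical.

```python
# Assumed excerpt of Table A-1: level -> (MaxMBPS in MB/s, MaxFS in MBs).
# Values are quoted from memory and should be verified against the standard.
LEVEL_LIMITS = {
    "2.2": (20250, 1620),
    "3": (40500, 1620),
    "4": (245760, 8192),
}


def size_in_mbs(width, height):
    """Return (PicWidthInMbs, FrameHeightInMbs) for a frame of luma samples."""
    return (width + 15) // 16, (height + 15) // 16


def check_level(level, width, height, fps):
    """Evaluate constraints e), f), g) and the MB rate limit for one format."""
    max_mbps, max_fs = LEVEL_LIMITS[level]
    w, h = size_in_mbs(width, height)
    return {
        "e_frame_size": w * h <= max_fs,
        # f) and g) are compared in squared form to avoid floating point.
        "f_width": w * w <= 8 * max_fs,
        "g_height": h * h <= 8 * max_fs,
        "mb_rate": w * h * fps <= max_mbps,
    }


if __name__ == "__main__":
    for fmt in ((720, 480, 15), (1920, 1080, 2)):
        for level in ("2.2", "4"):
            print(fmt, "level", level, check_level(level, *fmt))
```

Under these assumed table values, the 1920x1080 at 2 Hz stream stays within the level 2.2 macroblock rate but fails the frame size and width checks, which is the situation the proposal was addressing.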
H.241, RFC 3984, and 3GPP documents have something related to this (although not exactly the
same).
Remark: But how can we do this for enhancement layers while keeping the base layer
constrained by the original spec?
Enhancement layer might have a lower “level” than the base layer?
After further consideration, proposal withdrawn.
5.6.1.1.2 JVT-W093 ( Prop 2.2.1/3.1) [H. Chung, M. Karczewicz, J. Ridge, X. Wang,
W. Han, S. Kim] SVC FGS profile
This document provides additional results to compare FGS with MGS in the so-called Profile C
(not an actual currently-planned profile, but a further study topic) for SVC. It is claimed that AR-FGS offers the ability to respond to forced bit rate adaptation in a more graceful manner than
MGS in a low-delay environment. Creation of a profile including FGS scalability is proposed.
Remark: Considering timing of work schedule and recent modification of MGS to improve its
granularity properties, further analysis of FGS requirements should be postponed until the next
meeting.
Remark: Current software does not support slice-structured coding. It may be difficult to
maintain the current FGS and AR-FGS functionality in the software while working on proper
support of “phase 1” features. It was suggested to branch the software and allow FGS and AR-FGS and other non-“phase 1” features to be removed or to cease to function properly in the
“phase 1” branch. JVT decision: Agreed.
Contribution focuses on small frame variations due to characteristics of next-gen networks –
reason: low-delay applications with some types of (e.g., CBR) characteristics. Particular interest
was expressed by the contributor in AR-FGS. Comparative data shown reporting AR-FGS
advantage in some cases.
Assertion is that our “phase 1” approach cannot be used to achieve nearly-constant frame size
with low delay. Hierarchical P picture approach is asserted to be inappropriate due to variation in
frame sizes.
End-to-end delay analysis? How much can delay be reduced and how much will remain?
Proponent estimates 200 ms end-to-end delay.
Bit rates? Frame rates?
Potential for feedback usage.
Potential for taking enhancement picture into account for key pictures.
Appropriate content? (Is the Bus sequence really relevant?)
Set up an AHG on identification of application requirements for FGS and simplification of FGS
design.
5.6.1.1.3 Profiles definition changes
SVC Profiles tools table
AVC base layer (dependency_id equal to 0
and quality_level equal to 0) Profile
Impacting AVC base layer tools
SVC tools
Scalable
Baseline
Scalable
High
Scalable
High Intra
a.k.a.
a.k.a.
a.k.a.
SVC A
Baseline
SVC B
High
SVC B
Intra
High
slice_type
deblocking filter
constrained_intra_pred_flag in base
layer
num_slice_groups > 1
slice_group_map_type
direct_spatial_mv_pred_flag
arbitrary slice order
redundant slices
slices
I, P
Y
1
I, P, B
Y
1
I
n/a
1
N
N
n/a
N
N
I, P, EI, EP
N
n/a
n/a
N
N
I, EI
smoothed ref inter pred
PR slice motion refinement
AR-PR slices
fgs_coding_mode
interlace
CAVLC
CABAC
deblocking filter
deblocking filter (upsampling)
constrained_intra_pred_flag
below the top layer
arbitrary slice order (within slice
group)
num_slice_groups > 1
slice_group_map_type
resolution factors 2, 1.5
ESS (any factor)
ESS aligned crop window
ESS non-aligned crop window
EIDR
IROI
fragmented PR slice
CGS with varying quality levels
(MGS)
weighted prediction
use_base_representation_flag
8x8 transform block size
quant scaling matrices
num temporal levels
num dependency id
max num decoded dependency id
(using inter-layer prediction)
num quality levels
color_bit_depth, color format
N
N
N
N
N
Y
Y*
Y
Y
1
N
n/a
1
N
N
I, P, B, EI,
EP, EB
Y
N
N
N
Y
Y
Y
Y
Y
1
N
N
N
Y
2
Y
N
Y
N
Y
N
N
Y
N
Y
Y
Y
Y
Y
N
N
Y
N
Y
Y
Y
Y
n/a
N
N
Y
Y
Y
Y*
Y
8
8
3
Y
Y
Y
Y
8
8
3
Y
Y
Y
Y
8
8
3
16
4:2:0/8
16
4:2:0/8
16
4:2:0/8
Y
N
N
N
Y
Y
Y
n/a
Y
1
*: activation of the CABAC and 8x8 transform block size tools is subject to the level definitions
(Level 2.1 (2CIF) and above)
Max NAL unit size (NumBytesInNALunit)? No.
Smoothed reference prediction (see JVT-W026, JVT-W118, JVT-W126).
B pictures in scalable baseline enhancement layers? MinLumaBiPredSize? See below.
For both SVC A and B, when PicSizeInMbs is greater than 1620, the number of macroblocks in
any coded slice shall not exceed MaxFS / 4, where MaxFS is specified in Table A-1 (or SVC
equivalent). JVT decision: Agreed.
For both SVC A and B, cpbBrVclFactor = 1250 and cpbBrNalFactor = 1500.
JVT decision: Agreed.
Scalable High: Same level limits as High. JVT decision: Agreed.
Scalable Baseline:
– Levels 2.1 and 2.2 SliceRate = 22 (which slices count? the slices for the layers that are
“necessary” for decoding, as can be determined from high-level syntax)
– Allow B pictures (direct_8x8_inference_flag = 1 always, MinLumaBiPredSize = 8x8
always).
– Define MaxSubMbRecSize 576 up to level 3, 1152 level 3.1 and 3.2, 1440 levels 4 to 4.2, no
limit for level 5 and 5.1 (limit for base layer too, and also enhancement layer).
JVT decision: Agreed.
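The slice size cap and the bit rate factors agreed above can be sketched as follows. This is a minimal sketch, assuming the factors scale the level's MaxBR in the same way as in the existing AVC level definitions (maximum bit rate = factor * MaxBR); the function names and the MaxBR value used in the example are illustrative assumptions.

```python
CPB_BR_VCL_FACTOR = 1250  # agreed for both SVC A and SVC B
CPB_BR_NAL_FACTOR = 1500  # agreed for both SVC A and SVC B


def max_bit_rates(max_br):
    """Return (VCL, NAL) maximum bit rates in bits/s for a level's MaxBR.

    Assumes the AVC convention that the limit is factor * MaxBR.
    """
    return CPB_BR_VCL_FACTOR * max_br, CPB_BR_NAL_FACTOR * max_br


def slice_size_ok(pic_size_in_mbs, mbs_in_slice, max_fs):
    """Check the agreed cap: for pictures larger than 1620 MBs,
    no coded slice may contain more than MaxFS / 4 macroblocks."""
    if pic_size_in_mbs <= 1620:
        return True  # the cap does not apply to smaller pictures
    return 4 * mbs_in_slice <= max_fs


if __name__ == "__main__":
    # MaxBR = 10000 is an illustrative value, not taken from this report.
    print(max_bit_rates(10000))
    print(slice_size_ok(3600, 2048, 8192), slice_size_ok(3600, 2049, 8192))
```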
5.7 SVC other normative design proposals
5.7.1 SVC restrictions on interlaced coding
5.7.1.1.1 JVT-W025 ( Prop 2.0/3.1) [E. Francois, V. Bottreau, J. Vieron] Restrictions
on interlaced coding in SVC
This document relates to SVC interlaced video coding. The initial design of interlaced coding in
SVC permits any field / frame picture configuration for the different layers. This proposal aims at
introducing some constraints on this set of possible configurations, in order to ease the
implementation of interlaced coding in SVC.
Three elements:
1) Suggests forcing field_pic_flag and bottom_field_flag to be equal across layers (while
allowing frame_mbs_only_flag to be different across layers).
2) Suggests that base_frame_and_bottom_field_coincided_flag and
base_bottom_field_coincided_flag be identical or be removed (enforcing alignment of
the top of the frame with the top field of the other layer).
3) Corrects a clear error in some position calculation equations.
JVT decision: Adopted.
5.7.2 SVC smoothed reference prediction
5.7.2.1.1 JVT-W026 ( Prop 2.0/3.1) [E. Francois, V. Bottreau, J. Vieron] Profile SVC
B: Evaluation of smoothed ref pred
This document relates to the evaluation of smoothed reference prediction (SRP) on a number of
various configurations, addressing both low delay and long delay coding applications. SRP is
currently only considered for profile SVC B, mainly oriented toward broadcast applications (it is
not included in profile SVC A that rather relates to mobile and real-time applications). The
obtained results reportedly show a slight bit rate increase from removing SRP on most sequences
(for long delay configurations: average 0.70%, worst case 2.32%; for low delay configurations:
average 1.94%, worst case 5.99%), which is not noticeable in terms of visual quality. The less
favorable results for SRP are observed for long delay configurations, which correspond more closely to
profile SVC B applications. Consequently, the contributors recommend removing this tool from
profile SVC B.
Reports that there is some PSNR benefit, mostly at high bit rates, but that perceptually the SRP
has some undesirable excess smoothing effect.
Question: Were the frames selected to show the perceptual issue “cherry picked”? Response:
Basically, yes – as far as typical behavior with motion video running at full speed, basically see
no difference in quality – not asserting that there is typically any significant difference in quality
that way. But when viewing individual pictures, author asserts that some loss of resolution is
observed.
Dyadic two-layer hierarchical prediction for “low delay” (2) – why? Just didn’t have time to try
other cases.
Proposes not to use SRP in Profile B (the only profile it is currently in).
Author says that the tool requires some complexity to support. Remark: There are comments
about that in another contribution.
5.7.2.1.2 JVT-W118 ( Prop 2.2) [Y. Ye, Y. Bao, W. J. Han, S. Y. Kim] Perf and
complexity of smoothed ref pred
In the current SVC baseline profile, smoothed reference prediction is not supported (assertedly
due to concerns over its complexity and performance). Further experiments have been carried out
within the coding framework of SVC baseline profile. It is asserted that smoothed reference
prediction not only offers notable performance gain that effectively reduces the gap between
multi-layer SVC coding and single-layer AVC coding, but also reduces system complexity, both
in terms of reducing computational complexity and reducing memory bandwidth requirement.
Furthermore, it is asserted that smoothed reference prediction provides much better visual quality
in the reconstructed video. Additional implementation cost is asserted to be low compared to the
benefits it offers. In addition to supporting smoothed reference prediction in the scalable high
profile, it is proposed to also enable smoothed reference prediction in scalable baseline profile.
Test reported: Dyadic spatial scalability, CAVLC (asserted to be a pessimistic scenario for SRP),
three-layer hierarchical P.
Best case was 0.5 dB+ for Harbor, worst is City with basically no difference in fidelity measured.
Average roughly in 0.2 to 0.3 dB range – better for CIF to 4CIF than for QCIF to CIF.
Reported conclusion is that when SRP is available, BLSkip is used more often rather than using
other prediction modes that are more complex than it is, to the extent that the overall complexity
is reduced around 15-20% as a percentage of inter prediction generation for the luma component.
About 7% memory bandwidth reduction also estimated.
Visual example shown where extra detail is enabled when SRP is present. Another visual
example of “noisiness” when not using SRP.
SRP is basically switched as a submode of BLSkip with residual prediction – syntax at the MB
level when that case is encountered (and can be disabled at the slice level).
(Bitstream rewriting flags are also at slice level.)
There were prior contributions at the two previous meetings saying similar things.
It is currently in one profile but not the other. At the moment we don’t have a strong consensus
that the tool should be put everywhere or removed from the standard.
Question: Two issues with IPR statement in contribution:
1) Contribution is a proposal for a technology that was previously reported with a 2.2 IPR
statement, but proposal has a 2.0 IPR statement? Response: Probably the proposal should have
had a 2.2 IPR statement – will revise.
2) Contribution is a two-company proposal, but only one company is listed in the IPR statement?
Response: Second company will be contacted to clarify.
Revision uploaded Tuesday with Qualcomm 2.2 and Samsung 2.2/3.1 statements.
Further results later presented, asserting that if the encoder biases its decision-making to favor the
selection of smoothed reference prediction, complexity is further reduced with no apparent
impact on coding efficiency.
Has the tool been tried with interlace? Has interlaced support been stable in S/W?
What is the decoder impact if SRP is turned on for all macroblocks? The complexity impact does not
seem large.
Remark: Consider complexity of needing to support switching between two inter-prediction
modes – Response: Consider that switching of MC interp process now needs to happen at a finer
granularity to support smaller block sizes.
Overall coding efficiency benefit of SRP is small – concentrated at high bit rates. Visual excess
blur sometimes reported at such high rates.
Visual benefits shown for particular still-frame cases – it did look good in those examples.
Encoder complexity increases – but not by much since applies to testing only one case (BLSkip
with residual prediction).
More benefit seems to be in low delay scenarios.
In terms of implementation effort, optimization effort, testing effort, and quantity of text, SRP
adds a burden. However, in terms of processing cycles and other such measures, it may be
somewhat statistically beneficial.
Remark: SRP helps reduce gap between SVC and single layer.
Remark: Consider multi-layer optimization – e.g., per JVT-W071.
Tested GOP size = 4. Why not others? Just testing effort.
Remark: Did not find gain with larger GOP size. Larger GOP sizes are expected.
Remark: Internal testing by another company has led to a negative opinion of the feature.
Remark: Adds a difference relative to Base design.
Remark: This feature is an extra implementation burden for supporting scalability. We are not
designing the base layer here. Design consistency is desired, and implementers of AVC decoders
should not be burdened with a need to implement extra features to support scalability.
Available data for assessing usefulness of feature is limited.
SRP is not in Profile A and it seems clear that there is no consensus to add it.
Upon further discussion, SRP should be removed from Profile B. JVT decision: Agreed.
SRP can be considered as a Phase II investigative tool.
5.7.2.1.3 JVT-W126 ( Info) [Z. He] Verif JVT-W118 perf and complexity of
smoothed ref pred
This report is to verify the document JVT-V118 “Performance and complexity of smoothed
reference prediction in SVC profile A” from Qualcomm. Verification was performed based on
the source code and configuration provided by Qualcomm, and the simulation results were
confirmed for all the eight CIF sequences. 4CIF was partially verified (verified for the cases that
were tested).
The source code from Qualcomm was used because the exact status of JSVM work was not
known and control of SRP off/on was wanted – the software is available on the ftp site as
part of JVT-W118.
5.7.2.1.4 JVT-W112-L (Late Prop 2.2) [A. Segall] Clarification of base_mode_flag
<withdrawn>
Contribution JVT-W112 was submitted late, but was withdrawn as moot after some discussion,
in consideration of action taken in response to other contributions, as is also noted elsewhere in
this report.
A change is requested for the case that the base_mode_flag is one and adaptive_prediction_flag is
zero. The fix enables the smoothed reference prediction process when it is enabled in the
base layer.
Current syntax seems to allow “weird” multi-layer combinations of motion vectors and the
associated interpolation process with respect to smoothed reference prediction.
Proposes to infer the smoothed reference flag from the base layer when base_mode_flag = 1 and
adaptive_residual_prediction_flag = 0.
Discussed offline. Proposal withdrawn, considering removal of SRP from Phase I.
5.7.3 SVC deblocking
5.7.3.1.1 JVT-W061 ( Prop 2.2/3.1) [D. Hong, A. Eleftheriadis, O. Shapiro] Modified
deblocking filter process in scalable extension
This contribution introduces a modified deblocking filter process in scalable extension (subclause
G.8.14 in JD9). The current process is mainly derived from the AVC deblocking filter process
with modifications proposed by contributions JVT-O067 and JVT-P013. These previous
contributions adjust the original AVC deblocking filter process to change the handling of the
cases where base layer residue or sample values are used to derive current layer samples. The
present contribution further adjusts the deblocking filter process by modifying the qPav
derivation method so that the base layer QP is used for the deblocking when the enhancement
layer blocks have no transform data, the residual of the blocks is predicted from the base layer,
and (in the case of inter blocks) the enhancement layer blocks have similar motion vectors with
the same ref_idx. This adjustment of the qPav derivation method was introduced in JVT-V089
where the arithmetic mean of the base and enhancement layer QPs was used, rather than just the
enhancement layer QP. This contribution further considers the effect of applying the proposed
qPav derivation method for various GOP sizes (2, 8, 32) for both hierarchical P and B structures.
This contribution also tests and compares using several different types of weighted average
combination of the base and enhancement layer QPs instead of taking just the simple arithmetic
mean. Using just the base layer QP (an extreme case of the averaging where the base layer QP is
weighted by 1 and the enhancement layer QP is weighted by 0), the modified qPav derivation
method provides experimental results under the JVT common conditions that range from the
maximum benefit of +0.654 dB PSNR to the maximum penalty of -0.003 dB.
Like JVT-V089, but using base layer QP instead of the average.
A significant PSNR benefit was reportedly shown when enhancement layer has much larger step
size (QP increase by 15) than the base layer.
Remark: How about just turning down the deblocking filter strength? Reply: Can do that, but
increases overall blockiness.
Fixed QP. Remark: Realistic?
Remark: Adding more conditions to deblocking filter. Response: Similar conditions to what the
encoder is already using in the DF process.
Proponent asserts that this is in response to an issue that arose in an actual real-time
implementation with rate control.
Question: How often does this issue arise?
Visual effect shown for a difference of 15 in QP. Data for a smaller QP difference requested.
No verification contributed. Text and software and bitstreams are (or soon will be) available.
Other experts were asked to study the proposed technique during the meeting.
Remark on somewhat related topic: What about RCDO deblocking? Has been put into software
but not studied. Has not yet been shipped in products. Suggestion that some adjustments for
SVC might not be appropriately made as-it-is.
Remark: Inheriting QP from base layer can help “rewriting” – suggest that using the base layer
QP value when MB is not coded makes sense.
Remark: Experimented with it using QP+15 – helped in some areas and did not help in others.
Also found that when the QP difference is large, upsampling the base layer can sometimes look
better than adding a very coarsely quantized enhancement layer. Overall impression was
negative.
Idea: Send a “gamma” weight fraction at slice level, like alpha and beta (in units of one
eighth), to determine the weighting of QP between the base and enhancement layer:
( QP1 * f + QP2 * ( 8 - f ) + 4 ) >> 3. Default behavior is what is in current JD. Don’t send when
doing rewriting (use enhancement layer QP in that case).
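The weighting expression in the idea above can be written out as a short sketch. The function name is hypothetical, and since the notes do not say which of QP1 and QP2 is the base layer QP, the names are kept as given.

```python
def weighted_qp(qp1, qp2, f):
    """Blend two QP values with weight f/8 on qp1 and (8 - f)/8 on qp2.

    The +4 term rounds to nearest before the right shift by 3 (divide by 8).
    """
    if not 0 <= f <= 8:
        raise ValueError("f is a fraction in units of one eighth (0..8)")
    return (qp1 * f + qp2 * (8 - f) + 4) >> 3


if __name__ == "__main__":
    # Sweep the weight for a QP difference of 15, as in the visual test shown.
    for f in range(9):
        print(f, weighted_qp(20, 35, f))
```

With f = 8 the result is QP1, with f = 0 it is QP2, and f = 4 gives the rounded arithmetic mean, so the same expression covers the existing behavior, the JVT-V089 averaging, and the present proposal.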
Try to find a method, such as the above, to adjust the effective QP that can capture the benefit of
the current and proposed methods.
Remark: Goal of the weighting idea was to capture the benefits of each approach. Suggestion is
for encoder to use the existing adjustment controls of deblocking filter process. This is asserted
to suffice without a need for the further adjustability.
No action taken.
5.7.3.1.2 JVT-W063 ( Prop 2.0/3.1 Layered Media, then 2.2 from Polycom) [D. Hong,
A. Eleftheriadis, O. Shapiro] Deblocking filter for SVC to support multithreading with slice boundary
This contribution proposes to modify the current SVC deblocking filter process to support multithreading, without having to turn off slice boundary deblocking. With the current process, a
picture has to be sliced and deblocking across slice boundaries must be turned off in order to run
deblocking of each slice in parallel. This creates an annoying “blockiness” artifact across slice
boundaries in decoded images.
The desire is multi-threaded deblocking, which is difficult in the current design. Proposes to
change the order of edge processing, so that right-to-left and top-to-bottom edge ordering is used.
Remark: May change access pattern of some hardware designs that are highly-customized to the
current design.
Remark: Can switch the order of the interior edges again, reducing the number of stages further –
from 4 to only 2 stages (horizontally and vertically).
Remark: Any perceptual effects? Proponent has seen no subjective difference – can provide
sequences.
Remark: Effect on “bitstream rewriting” capability? Suggestion: Disable for rewriting-oriented
coding.
Idea from Polycom: New value of deblocking_disable_idc that indicates applying filtering inside
of slice first (without changing edge ordering), followed by filtering across the slice boundaries.
Do not use this value when rewriting is enabled.
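The two-stage ordering in the Polycom idea can be sketched as a partition of macroblock edges. This is only an illustration of the ordering, not the filter itself: the slice map representation and the function name are assumptions, and only the left and top macroblock edges are listed (edges inside a macroblock always lie within one slice).

```python
def split_edges(slice_map):
    """Partition macroblock edges into per-slice interior edges and
    cross-slice boundary edges.

    slice_map is a 2-D list giving the slice id of each macroblock.
    An edge is (mb_row, mb_col, "left" or "top").
    """
    interior = {}  # slice id -> edges filterable independently per slice
    boundary = []  # edges to filter in a second pass, after all slices
    for y, row in enumerate(slice_map):
        for x, sid in enumerate(row):
            for ny, nx, kind in ((y, x - 1, "left"), (y - 1, x, "top")):
                if ny < 0 or nx < 0:
                    continue  # picture border: nothing to filter
                if slice_map[ny][nx] == sid:
                    interior.setdefault(sid, []).append((y, x, kind))
                else:
                    boundary.append((y, x, kind))
    return interior, boundary


if __name__ == "__main__":
    interior, boundary = split_edges([[0, 0], [1, 1]])
    print("pass 1 (per slice, parallel):", interior)
    print("pass 2 (across slice boundaries):", boundary)
```

Each slice's interior list can be handed to a separate thread in the first pass; the boundary list is processed once all slices are done.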
Revised contribution uploaded to reflect that (with a 2.2 patent statement from Polycom).
Remark: That’s OK, because it enables parallelizable encoding – focus at the moment is not on
the decoding.
JVT decision: Adopt the idea from Polycom documented in revised (JVT-W063r1) contribution.
5.7.3.1.3 JVT-W069 ( Prop 2.2/3.1) [Z. He] Simplified H.264/AVC deblocking filter
for SVC enh layer
This contribution proposes a reported simplification of the existing deblocking filtering for SVC
enhancement layers while reportedly maintaining the same data and control flow as used for base
layer. In reported results for four CIF@30fps common sequences, the proposed simplified
algorithm reportedly shows a reduction in data access and computational complexity of 60%
on average compared to the original deblocking algorithm, with luma PSNR degradation of about
0.03 dB (maximum 0.05 dB). In comparison, the RCDO deblocking reportedly has ~45%
reduction with luma PSNR degradation of 0.1 dB (maximum 0.18 dB). Since the proposed
deblocking reportedly has the same data- and control-flow as the existing H.264/AVC deblocking
filter, the deblocking design can reportedly be shared in SVC base- and enhancement-layers.
Modifications:
1) Only use BS = 0 or 1
2) Only use edge detection for one of the four rows
3) Include an offset into the edge detector (specified in text or by encoder-sent syntax)
Very limited testing (only CIF).
There is not sufficient information available to make such substantial changes to the deblocking
filter.
Contribution noted.
5.7.3.1.4 JVT-W128-QV (Late Info) [Y. Ye] Verif of JVT-W069: Simplified deblocking
for SVC enh layer
This document reports verification results for the proposal by Freescale as reported in document
JVT-W069 “Simplified deblocking for SVC enhancement layer”. The results reported in JVT-W069 were reportedly confirmed.
The simulation results reported in JVT-W069 were reportedly verified. Out of the four sequences,
Bus, Mobile and Foreman were reported fully verified; the results and the verification results
reportedly matched exactly for these three sequences. For Football, the reported results and the
verification results showed a very small difference of up to 0.05 kbps, reportedly probably
due to different platforms being used in simulations (Unix vs. Windows XP).
5.7.4 SVC spatial scalability resampling
Ad hoc group finished its evaluation, and did not find evidence of a need for additional
upsampling filters.
5.7.4.1.1 JVT-W028 ( Info) [E. Francois, V. Bottreau, J. Vieron] Evaluation of
flexible 4-tap upsampling filters
This document is an information contribution on the evaluation of the adaptive upsampling filters
proposed in JVT-V074. The results were asserted to show that on the tested sequences, the
flexible upsampling filters do not provide significant improvements compared to the current non-adaptive solution.
Ran the software that supports alternative filter selection. Found no significant benefit.
Remark: The software does not include an encoder method for selecting which filter will be used,
so no benefit would be anticipated from running the test that way.
5.7.4.1.2 JVT-W022 ( Prop 2.2/3.1) [T. Tran, L. Liu, P. Topiwala] Dyadic spatial
down- and up-sampling filters for SVC
This proposal presents updated results for FIR low-pass filters that can be employed as dyadic
down-sampling and up-sampling filters in SVC. The proposed filters reportedly have their roots
in wavelet and spline interpolation theory, which is asserted to be long established
as having stable interpolation characteristics. All of the proposed filters have integer coefficients;
some are asserted to have very low dynamic range and to be suitable for efficient VLSI
implementation. This proposal also asserts that coding efficiency does not necessarily have to be
sacrificed by employing short low-complexity integer-coefficient filters.
Proposes different downsampling and upsampling filters. Contribution supports both dyadic and
ESS, but focuses on dyadic, since most benefit reportedly found there. Focuses on intra.
Results only provided in contribution for one sequence. Some other results shown that were not
previously presented.
Proposal asserted to be the same as JVT-V030 / JVT-V031.
Performance asserted to be more measurable for low QP and intra-only. No gain for high-delay
case. No significant gain for ESS cases – contribution focuses on dyadic.
Modified downsampler (odd-length mirror-symmetric). Proposed 4-tap phases are in JVT-V031.
Visual and PSNR benefits reported – esp. for base layer upsampled. Also some for enhancement
layers – esp. at high bit rates.
Visual demo shown – base layer upsampled using provided filter when used with proposal’s
downsampling filter.
Base layer should perhaps not be watched as-is without such “matched” upsampling. Mixed
opinions in favor of JSVM filter expressed in such experiments in Hangzhou.
Proposes to enable encoder selection of the proposed 4-tap upsampling table as an alternative to
the current table.
Remark: Differences likely primarily due to downsampling change. Upsampling actual tap
values very similar.
Remark: Position calculation for luma different than in reference.
Remark: Some phases (1, 5, and 7) were not tested in AHG activity.
Remark: Conceptually-reversed and significantly off phase positions in current design for linear
ramp phase measure – specifically, one person from Microsoft expressed an opinion that phases 5,
6 and 7 from the prior Microsoft proposal or from this proposal looked better than the ones in our
current draft.
Some question of application need for such upsampled base layer use.
Some confusion over starting phase offset (lack of adjustability in current design for luma).
Phase alignment of luma is not adjustable in the current design. Adjustability would add
requirement for all positions to be supported in the decoder – even when using fixed upsampling
ratios like 3/2.
5.7.4.1.3 JVT-W086 ( Prop 2.2/3.1) [I. Amonou, N. Cammas, S. Kervadec, S. Pateux]
Some consideration on the up-sampling position calculation
The up-sampling operation currently found in Joint Draft for extended spatial scalability (ESS)
uses a particular method of calculating the position and phase information when up-sampling the
low resolution layer. That method relies on the use of an approximate ratio computation. This
contribution presents some implementation issues related to this approximate ratio. For some
practical up-sampling ratios, a non-static phase pattern for up-sampling occurs, which is not a
desired feature for low-complexity implementation. This contribution therefore recommends
specifying use of an actual division operation in the position calculations for ESS rather than the use
of the approximate ratio. Software and an Excel file reporting practical scenarios with identification
of phase differences accompany the contribution for illustration.
Group requested interested parties to confer off-line. Result was reported in JVT-W136.
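The kind of effect described in JVT-W086 can be illustrated with a generic fixed-point sketch. This is not the Joint Draft's position derivation: the 16-bit ratio precision, the 1/16-sample position units, the rounding of the ratio, and all function names are assumptions chosen only to show how a precomputed approximate ratio can drift away from a true division.

```python
SHIFT = 16  # assumed fractional precision of the precomputed ratio


def exact_pos16(x, base_w, enh_w):
    """Reference position in 1/16-sample units using a true division."""
    return (x * base_w * 16) // enh_w


def approx_pos16(x, base_w, enh_w):
    """Same position using a rounded fixed-point ratio and a shift."""
    scale = ((base_w << SHIFT) + enh_w // 2) // enh_w
    return (x * scale) >> (SHIFT - 4)


def first_mismatch(base_w, enh_w, limit):
    """First enhancement-layer position below limit where the two differ,
    or None if they agree everywhere in that range."""
    for x in range(limit):
        if exact_pos16(x, base_w, enh_w) != approx_pos16(x, base_w, enh_w):
            return x
    return None


if __name__ == "__main__":
    # Ratios are given as base:enhancement, e.g. 2:3 for 1.5x up-sampling.
    for base_w, enh_w in ((1, 2), (2, 3)):
        print(base_w, enh_w, first_mismatch(base_w, enh_w, 5000))
```

Under these assumptions the dyadic ratio is exact, while the 2:3 ratio eventually yields a position (and hence a filter phase) that differs from the true division, so the repeating phase pattern is broken at that point.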
5.7.4.1.4 JVT-W136-B (BoG) [G. J. Sullivan, S. Pateux] BoG report on JVT-W086
Summary of BoG overview conclusions regarding JVT-W086.
Presented.
JVT decision: Keep method as in current draft. For levels having picture width or height greater
than 2048, specify scaling the resampling ratio up more by the constant amount that will still
keep all calculations within 32 bits.
5.8 SVC non-normative contributions
5.8.1 SVC editorial input
5.8.1.1.1 JVT-W070 ( Text) [H. Schwarz, M. Wien] Editors input for SVC draft
Draft text from the editors showing the current status of SVC text drafting work.
Shows progress in editing work – should be the basis for the future work. JVT decision: Agreed.
5.8.1.1.2 JVT-W099 ( Info) [J. H. Park, Y. H. Kim, B. H. Choi] Clarification of
mb_qp_delta syntax
This contribution reports a clarification of mb_qp_delta in the macroblock layer syntax of the scalable
extension to eliminate an unnecessary condition check. It also reports that no modification is
needed in the JSVM software.
Remark: Had a 2.2 IPR statement and was marked as a proposal, later revised as an information
document with no attached IPR statement. Appears to be strictly editorial input.
Editors are asked to consider the comment in their drafting work.
5.8.2 SVC tutorial material
5.8.2.1.1 JVT-W132-B (Requested Info) [T. Wiegand] Overview paper and presentation
on SVC
This contribution, submitted at the request of the JVT, provides tutorial information on the SVC
extension design for AVC.
5.8.3 SVC encoder and extractor optimization
5.8.3.1.1 JVT-W071 ( Info) [H. Schwarz, T. Wiegand] Further results for an rd-opt.
multi-loop SVC enc.
The main disadvantage of the JSVM encoder control for multi-layer coding is that the losses
against single-layer coding are unevenly distributed between base and enhancement layer. In
JVT-T080 the basic idea of a joint multi-loop encoder control for spatial and SNR scalable
coding has been described and first simulation results for IPPP have been shown. In this
contribution, further results for hierarchical B pictures and a newer version of the JSVM software
are provided. The simulation results demonstrate that enhancement layer coding efficiency can be
traded off for base layer coding efficiency.
For the cases of spatial and SNR scalability that were tested in this contribution, it was reported
to be possible to adjust the coding efficiency for base and enhancement in a way that the rate
increase relative to single-layer coding is about 10% for both the base and enhancement layer.
Remark: The last sentence above is approximately the same as saying that the goals of the SVC
project have been fulfilled (in PSNR measure terms).
Basic idea is to jointly optimize the base and enhancement layer coding parameters (by an
adjustable amount controlled by a weighting factor).
Shows how to measure the “usage” of the base layer rate in terms of its effect on the
enhancement layer fidelity.
5.8.3.1.2 JVT-W029 ( Info 2.2.1/3.1) [W.-H. Peng] Low-complexity mode decision
algorithm for combined CGS and temporal scalability
This contribution presents a layer-adaptive mode decision algorithm and a motion search scheme
for scalable video coding (SVC) with combined coarse granular scalability (CGS) and temporal
scalability. To speed up the encoder while minimizing the loss in coding efficiency, the
“computational redundancy” between the coding layers is considered. Depending on the
macroblock (MB) coding modes and the quantization parameters (QP) of the reference/base layer,
a look-up table is recursively used to determine the MB modes to be tested at the enhancement
layers. In addition, to avoid exhaustive motion estimation, the reference frame indices of the base
layer are adaptively reused, and according to the MB partition at the enhancement layer, the
initial search point for motion estimation is selected from the motion vector at the base layer or
the motion vector predictor at the enhancement layer. The proposed schemes were tested with
standard sequences in CIF and 4CIF resolutions using 1 base layer, 3 CGS layers, 3 reference
frames, and GOP size of 8 and 16. As compared with the mode decision algorithm in JSVM 8,
the proposed schemes reportedly provide an average of 76% improvement in overall encoding
time with an average increase of bit rate below 1%, and an average Y-PSNR loss below 0.01 dB.
Binary executable offered (not source code, at least not now).
Contribution describes a number of techniques that may be useful in fast encoder design. No
action requested. Further investigation of such techniques, along with source code and
verification, could potentially lead to a good low-complexity mode of JSVM software operation.
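The look-up-table idea in this contribution can be pictured with the following sketch, in which the base-layer MB mode and the QP gap select a reduced set of modes to test at the next layer, and the selected mode then becomes the reference for the layer above. The table entries, the QP threshold, and all names are hypothetical; the actual table of JVT-W029 is not reproduced in this report.

```python
ALL_MODES = ("SKIP", "INTER16x16", "INTER16x8", "INTER8x16", "INTER8x8", "INTRA")

# Hypothetical table: (reference-layer MB mode, QP-gap class) -> modes to test.
CANDIDATE_TABLE = {
    ("SKIP", "small"): ("SKIP", "INTER16x16"),
    ("SKIP", "large"): ("SKIP", "INTER16x16", "INTER16x8", "INTER8x16"),
    ("INTER16x16", "small"): ("SKIP", "INTER16x16"),
    ("INTER16x16", "large"): ("INTER16x16", "INTER16x8", "INTER8x16"),
}


def qp_gap_class(ref_qp, enh_qp, threshold=6):
    """Classify the QP difference between reference and enhancement layer."""
    return "small" if ref_qp - enh_qp <= threshold else "large"


def candidate_modes(ref_mode, ref_qp, enh_qp):
    """Modes to evaluate at the enhancement layer; full search if no entry."""
    key = (ref_mode, qp_gap_class(ref_qp, enh_qp))
    return CANDIDATE_TABLE.get(key, ALL_MODES)


def modes_per_layer(base_mode, layer_qps, pick_best):
    """Apply the table recursively up the layer stack.

    layer_qps: QPs ordered from base layer to highest CGS layer.
    pick_best: callable choosing one mode from the candidates (stands in
               for the encoder's rate-distortion decision).
    """
    ref_mode = base_mode
    tested = []
    for ref_qp, enh_qp in zip(layer_qps, layer_qps[1:]):
        candidates = candidate_modes(ref_mode, ref_qp, enh_qp)
        tested.append(candidates)
        ref_mode = pick_best(candidates)
    return tested
```

Falling back to the full mode set when the table has no entry keeps the sketch safe for combinations that were not anticipated.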
5.8.3.1.3 JVT-W043 ( Prop NN) [A. Leontaris, A. M. Tourapis] Rate Control for the
Joint Scalable Video Model (JSVM)
The consideration of rate control algorithms within video encoding systems is very critical for a
variety of applications where transmission may be constrained due to the channel’s bandwidth.
Nevertheless, the authors have observed that all evaluation of the Joint Scalable Video Model
(JSVM) reference software, and consequently of the Scalable Video Coding (SVC) standard, has
been limited to experiments using fixed, pre-determined quantization parameters (QPs).
Furthermore, very few, if any, experiments were performed to evaluate the impact of rate control
on the scalability features of SVC. To this end, this contribution introduces into the latest JSVM
software the quadratic rate control scheme that has already been adopted within the
H.264/MPEG-4 AVC Joint Model (JM) reference software. This implementation only affects the SVC base
layer, but the scheme could be extended in the future to support scalability layers as well.
Although it could be arguable whether this algorithm can be considered as state of the art, the
provided experimental results demonstrate that its Rate Distortion performance is equivalent
compared to the use of fixed QPs, while achieving the target bit rate. This suggests that this tool
should be a valuable addition within the JSVM software.
Software available – has been uploaded.
Question: Did they try the quality level assigner? No.
JVT decision: Adopted (integration with lower priority than normative things).
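For readers unfamiliar with quadratic rate control, the sketch below shows the commonly used quadratic rate-quantizer model, R = X1 * MAD / Q + X2 * MAD / Q^2, solved for the quantizer step size Q given a bit target. It is a minimal illustration under that assumed model; the parameter names and functions are hypothetical and are not taken from the JM or JSVM source code.

```python
import math


def predicted_bits(qstep, mad, x1, x2):
    """Quadratic model: R = x1 * MAD / Q + x2 * MAD / Q**2."""
    return x1 * mad / qstep + x2 * mad / qstep ** 2


def qstep_for_target(target_bits, mad, x1, x2):
    """Solve the quadratic model for the quantizer step size Q.

    Substituting u = 1 / Q gives (x2 * MAD) * u**2 + (x1 * MAD) * u - R = 0,
    whose positive root is taken.
    """
    if target_bits <= 0 or mad <= 0:
        raise ValueError("target_bits and mad must be positive")
    a = x2 * mad
    b = x1 * mad
    if abs(a) < 1e-12:
        # Model degenerates to the linear case R = x1 * MAD / Q.
        return b / target_bits
    disc = b * b + 4.0 * a * target_bits
    if disc < 0:
        raise ValueError("model has no real solution for this target")
    u = (-b + math.sqrt(disc)) / (2.0 * a)
    if u <= 0:
        raise ValueError("model yields a non-positive step size")
    return 1.0 / u
```

In a complete rate controller, the model parameters would be re-estimated after each coded picture and the step size mapped to a QP; both steps are omitted here.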
5.9 SVC conformance
5.9.1.1.1 JVT-W138-B (BoG) [V. Bottreau] Toward an SVC conformance specification
(Coordinators: Alex Eleftheriadis for Scalable Baseline, Vincent Bottreau for Scalable High and
Scalable High Intra)
Every coding “tool” must have some conformance bitstream(s). Otherwise the tool will be
removed from the specification.
Draft spec in manner similar to AVC conformance spec.
Common SVC features listed.
Profile-dependent SVC features listed.
JVT decision: Plan approved per JVT-W138.
5.10 SVC verification testing
5.10.1.1.1 JVT-W110 ( Info) [E. Francois, V. Bottreau, J. Vieron] SVC verif test
plan: Updated results for SVC High Profile intra
This is an information contribution that presents updated results according to the Draft SVC
Verification Test Plan Version 2.2 (MPEG output document N8903) for supporting SVC Profile
High Intra as defined during the last (Marrakech) JVT Meeting for Professional video
manipulation scenarios.
5.10.1.1.2 JVT-W131-B (Late Info) [D. Hong, A. Eleftheriadis] Verification
bitstreams for SVC Profile A
This information contribution provides verification bitstreams for SVC Profile A, particularly for
videoconferencing.
5.10.1.1.3 JVT-W135-B (BoG) [I. Amonou, N. Cammas, S. Kervadec, S. Pateux] On
SVC verif test plan
Summarizes conclusions from break-out. JVT decision: Plan approved. Refinement by the
editors is invited.
6
Multi-view coding
6.1 CE 5 & related docs: MVC illumination compensation
6.1.1.1.1 JVT-W024 ( Prop 2.2/3.1) [W. S. Shim, M. W. Park, G. H. Park, D. Y. Suh,
H. S. Song, Y. H. Moon, J. B. Choi] CE5 results- joint prop for MVC
deblocking
In this contribution, CE5 results of the joint proposal of MVC deblocking for illumination
compensation are reported. The joint MVC deblocking method (combined with JVT-V033 and
JVT-V051) for diminishing or eliminating blocking artifacts caused by illumination
compensation is asserted to be able to improve the subjective picture quality as well as
maintaining the objective picture quality of the MVC video sequences.
Joint proposal from JVT-V033 and JVT-V051. Control bs by IC_flag and IC_offset to avoid
additional blocking artifacts. Additional decision is included at the end of the bs=0 derivation. Bit
saving average about 0.05%, mostly claim for subjective improvement. Most effects visible in
flat areas (like Race). Subjective viewing (performed by Tobias last meeting) did not conclude
for subjective improvement in cases of ballroom and exit (only 3 sequences were tested).
The group checked with the test group chair if there is subjective improvement for at least one
more sequence (except Race) - the result was positive.
Adopt to JMVM.
6.1.1.1.2 JVT-W023 ( Info) [S.-C. Lim, D.-H. Han, Y.-L. Lee] CE5: Verification of
loop filtering in MVC
This document presents verification results of “CE5: Loop filter” proposal by Samsung and KHU.
The encoder and decoder executables, bitstreams, source code, and configure files were provided
by Samsung and KHU. And the provided source code was compiled and the decoder executable
was run with the provided bitstreams. All of the decoded results were reported to be matched
exactly with the results provided by Samsung and KHU.
Checked with the same source code, results verified.
6.1.1.1.3 JVT-W031 ( Prop 2.2) [J.-H. Yang] CE5: Illumination comp. info.
derivation for MVC
This contribution proposes the modification of the part “2.3.2 ICA MC for Skip and Direct
modes” in JMVM 3.0. Unlike the P_Skip mode, the B_Skip mode in the current JMVM model
requires the transmission of mb_ic_flag and dpcm_of_dvic. The proposed scheme derives the IC
information (mb_ic_flag and DVIC) from the neighboring blocks for the B_Skip mode. Then, it
is asserted that the IC technology with the proposed B_Skip mode requires simple syntax and
becomes in line with the H.264/AVC design. Also, the simulation results are asserted to show
that the proposed scheme gives better coding efficiency.
Revisit of JVT-V063, implementation on the newest JMVM version was done. B_direct mode
case of JVT-V063 not used any more, B_skip mode case is retained. Derivation of IC done
similar to the B_skip mode for motion vectors of AVC. Proposal for syntax modification relative
to JMVM 3: Remove syntax elements from slice header and macroblock prediction syntax. 0.4%
bitrate saving on average.
Performed further study on possible complexity impact in breakout, reported back. Complexity
decreased without penalty in compression.
JVT decision: Adopt to JMVM.
6.1.1.1.4 JVT-W085 ( Info) [Y. S. Ho, K. J. Oh, C. Lee, B. H. Choi, J. H. Park] CE5:
Verification of JVT-W031 illumination comp. info. derivation
In this document, verification results of JVT-W031 are reported. Reports that they received the
source code, configuration files, coded bitstream, experimental results, and documents for
description. LG proposed the derivation scheme for IC information in B_Skip mode. They
verified the implementation, encoding/decoding for the proposed scheme and its results.
Checked with the same source code, results reportedly verified.
6.2 CE 6 & related docs: MVC view interpolation
6.2.1.1.1 JVT-W055 ( Prop 2.2/3.1) [T. Senoh, M. Okui, K. Enami] Experimental
results of camera-rotation-compensated prediction in CE6
Experimental results of view interpolation prediction based on the camera-rotation-compensation
of the reference pictures were reported. For Uli test sequences, very small R-D gain was
reportedly observed. The reasons reportedly seem to include camera location errors, camera gain
errors and many occluded blocks.
(no verification)
Only small gain found (<0.1 dB). No further study envisaged currently, but might be combined
with block slant distortion compensation. Contribution noted.
6.2.1.1.2 JVT-W059 ( Prop 2.2/3.1) [S. Yea, A. Vetro] CE6: View synthesis prediction
This contribution reports progress of CE6 on view synthesis prediction for multiview coding. A
method to determine an appropriate depth search range and step size has been explored. It is
asserted that these parameters have a substantial effect on the coding gains. Improved coding
results are shown for one test sequence, however overall gains for other sequences are negligible.
The report suggests that further study is needed to determine the appropriate depth maps for each
test sequence.
(no verification)
Finding depth range and step size by using KLT tracker. Correction vectors used in addition to
depth. Adaptive strategies aiming to reduce the coding cost, also depth range may change
temporally over the sequence. Breakdancer bitrate saving up to 8% (only at low bit rates as high),
not much gain for other sequences currently. Plan further study in particular for improved depth
search and adaptive coding, and alternative representations of depth. Contribution noted.
6.2.1.1.3 JVT-W084 ( Info 2.2/3.1) [Y. S. Ho, K. J. Oh, C. Lee, B. H. Choi, J. H.
Park] Observations of multi-view test sequences
This document introduces information obtained by observing the multi-view test sequences. The
observations are related to vertical and horizontal displacement caused by inaccurate camera
arrangements, illumination changes, synchronization of multi-view sequences, and focusing.
Future multi-view video sequences should solve these problems for efficient multi-view coding
and real applications.
Reported as introduction before JVT-W083. Reports problems: Vertical displacements,
illumination changes, synchronism, camera arrangement (in case of Rena sequence).
Contribution noted (may be difficult to get better test sequences).
6.2.1.1.4 JVT-W083 ( Prop 2.2/3.1) [Y. S. Ho, C. Lee, K. J. Oh, B. H. Choi, J. H.
Park] CE6: View interp pred for MVC
This contribution describes a ‘VIP P-picture’ coding which uses the synthesized image as the
additional reference frame. The proposed view interpolation method can make an intermediate
image by using initial disparity estimation, variable block-based disparity estimation, and pixel-level disparity estimation based on the adjusted search range. In addition, motion vector
prediction scheme is modified and vertical displacement is compensated to maximize the
efficiency of ‘VIP P-picture’ coding.
Try to compensate the problems reported in JVT-W084 (in particular vertical displacement
compensated before disparity estimation). Modified motion vector prediction in cases where
neighboring blocks are mixtures of VIP and V/T frames. For “dense sequences” (Akko&Kayo
and Rena) average gains 0.2 dB overall, 0.66 dB for B-views, for other sequences marginal
(Breakdancers) or no gains. (In general, the rate for B-views is not too high anyway.)
No action taken.
6.2.1.1.5 JVT-W103 ( Info) [J.-H. Yang, S.-H. Lee] CE6: Verif GIST MVC
contribution JVT-W083 MVC view interp pred
This document reports verification results of JVT-W083 from GIST. The author received
decoding executables, coded bitstreams for B-views, reconstructed yuv files and experimental
results. They verified the decoding and its results for the proposed scheme.
Checked with the same source code (source or executable? document says executable), results
verified for 3 sequences where gain was observed.
6.2.1.1.6 JVT-W096 ( Prop 2.2/3.1) [S. Naito, A. Koike] CE6: Results on MVC
In this contribution, recent progress for CE6 on view interpolation prediction for multiview video
coding is described. In order to improve the coding efficiency by view interpolation and disparity
compensation, an efficient encoding scheme for depth and disparity vectors is proposed.
Experimental results for anchor frames are provided under the common test conditions. The
proposed scheme is asserted to be effective especially for a sequence with an arc camera
arrangement.
(no verification)
Introduce a coding scheme for disparity vector and depth. Basic idea to allow conversion
between depth and disparity vector, in order to use them mutually for prediction from
neighboring blocks. Depth is derived using camera parameters plus disparity information (on
block basis). Maximum gain reported for Breakdancer (0.1..0.2), almost nothing for other
sequences. In Breakdancers, difference between depth and disparity vector is apparently most
significant.
No action taken.
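The conversion between depth and disparity mentioned in these notes can be illustrated for the simplest camera geometry. The sketch below assumes rectified, parallel cameras with a focal length expressed in pixels, so that disparity = focal length * baseline / depth. This is an assumption for illustration only; JVT-W096 targets more general setups, including arc arrangements, and its actual conversion is not described in this report.

```python
def depth_to_disparity(depth, focal_px, baseline):
    """Disparity in pixels for a point at the given depth.

    Assumes rectified parallel cameras: d = f * B / Z, with depth and
    baseline in the same length unit.
    """
    if depth <= 0:
        raise ValueError("depth must be positive")
    return focal_px * baseline / depth


def disparity_to_depth(disparity, focal_px, baseline):
    """Inverse mapping Z = f * B / d under the same assumptions."""
    if disparity <= 0:
        raise ValueError("disparity must be positive")
    return focal_px * baseline / disparity
```

Because the two mappings are inverses, a block coded with a disparity vector can serve as a predictor for a neighboring block coded with depth, and vice versa, which is the mutual prediction idea noted above.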
6.2.1.1.7 JVT-W087 ( Prop 2.2/3.1) [S. Shimizu, H. Kimata] New view synthesis pred
framework using resid pred
This contribution proposes a view synthesis prediction framework for multiview video coding
using residual prediction. In this framework, only one depth map is encoded at every instant in
order to perform view synthesis prediction with fewer bits as a whole. It is asserted that the most
important technique proposed in this contribution is spatial/temporal residual prediction on view
synthesis prediction residual signals. The preliminary experimental result for the sequence “rena”
was reported as -7.39% or 0.34 dB in Bjontegaard measure. Note that this experiment was
conducted on the special prediction structure for low delay.
(new contribution)
Focus on issues: How to reduce bits for depth information, how to deal with inaccuracies in depth
estimation. Depth is encoded duplicated (e.g. as disparity for different pictures), in fact being
redundant due to same physical meaning. Goal to encode only one overall depth map (e.g. on
base view) and derive all other reference information out of it. Residual signal between original
and depth-synthesis prediction is encoded, but may have spatial and temporal correlation
(because the wrong camera parameters do not change over time). Therefore, they can be encoded
like conventional video after depth-based synthesis prediction from the base view. Average gain
for Rena (no other sequences tested) 0.34 dB average. No temporal prediction applied for the
non-base views.
Looks interesting (also some relation with depth-based projection in 23002-3).
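Several results in this section are quoted "in Bjontegaard measure". As a reminder of what that figure expresses, the sketch below computes an average bit-rate difference between two rate-distortion curves by fitting a cubic through four (PSNR, log rate) points per curve and integrating over the common PSNR range. It assumes exactly four points with distinct PSNR values per curve; the function names are illustrative and this is not the spreadsheet or script used by the proponents.

```python
import math


def _solve(a, b):
    """Solve a small linear system by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def _cubic_through(xs, ys):
    """Coefficients c0..c3 of the cubic passing through four points."""
    return _solve([[x ** k for k in range(4)] for x in xs], ys)


def _integral(coeffs, lo, hi):
    """Definite integral of the polynomial with the given coefficients."""
    def anti(x):
        return sum(c * x ** (k + 1) / (k + 1) for k, c in enumerate(coeffs))
    return anti(hi) - anti(lo)


def bd_rate(rates_ref, psnr_ref, rates_test, psnr_test):
    """Average bit-rate difference in percent (negative means a saving)."""
    lo = max(min(psnr_ref), min(psnr_test))
    hi = min(max(psnr_ref), max(psnr_test))
    if hi <= lo:
        raise ValueError("PSNR ranges do not overlap")
    # Shift PSNR by lo to keep the polynomial fit well conditioned.
    p_ref = _cubic_through([p - lo for p in psnr_ref],
                           [math.log10(r) for r in rates_ref])
    p_test = _cubic_through([p - lo for p in psnr_test],
                            [math.log10(r) for r in rates_test])
    span = hi - lo
    avg_diff = (_integral(p_test, 0.0, span) - _integral(p_ref, 0.0, span)) / span
    return (10.0 ** avg_diff - 1.0) * 100.0
```

A curve whose rates are uniformly 10% lower at the same PSNR values yields a result of about -10 with this sketch.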
Current conclusion from CE: Breakout to identify the most promising directions in view
interpolation and start more collaborative effort.
Currently, most gains are reported for Rena, Akko&Kayo and Breakdancers. For other sequences,
due to the physical structure the ranges of depth are much too high to be compensated by the
global camera parameters.
Schemes that estimate depth at decoder are not followed any more currently. Would only work
for dense sequences.
6.2.1.1.8 Anthony Vetro presents new CE6 work plan.
Two different paths so far: Block-based depth, pixel based depth (mostly from global camera
parameters) – latter does not work for sequences with highly varying depth
– do not further follow decoder-side depth estimation
– concentrate on approaches video plus depth
– Two approaches: Directly coded residual, predictively coded residual
Issues:
– What is resolution, range and precision of depth maps
– Study global depth, try to minimize rate for depth maps
Currently aiming for improved coding efficiency, but would be interesting to study the
relationship with the video plus depth approaches that were presented (for view synthesis).
Tradeoff: The latter one would require more precise depth maps which might penalize the
compression performance. Needs to be further studied. Continue CE. Discussed combination of
different approaches available so far in joint software framework. Uploaded slides as JVT-W133.
6.2.1.1.9 JVT-W133-B (BoG) [A. Vetro] BoG report on MVC view interpolation pred
Summary of BoG Discussion on View Interpolation Prediction.
6.3 MVC high-level syntax
6.3.1.1.1 JVT-W035 ( Prop 2.2.1/3.1) [Y. Chen, Y.-K. Wang, M. M. Hannuksela] On
MVC JD 2.0
This contribution presents some comments and proposals on the following topics: 1) single-path
adaptations based on priority and temporal level, 2) view_level and indication of suffix NAL unit,
3) IDR picture, IDR access unit, changing and activation of sequence parameter sets, 4) implicit
removal of decoded non-reference pictures that belong to the not-output views, and 5) scalable
nesting SEI message.
Issue 1 (Priority ID): JVT decision: Proposal adopted
Issue 2 (view_level): align with the SVC design (decided not to have suffix NAL unit, see JVT-W125).
View level cannot directly be compared with temporal level, because there are many
more different configurations. In principle, no semantics is associated. Offline clarification
resulted in a recommendation to remove view_level. JVT decision: Agreed.
Issue 3 (IDR, IDR access, SPS)
a) IDR, can other pictures in same AU be non V-IDR? JVT decision: Proposal adopted.
b) Shall IDR access unit have all pictures IDR or V-IDR (but we may need a name for this
case - editorial)
c) When can SPS change? Only in IDR – JVT decision: Agreed.
d) Shall SPS MVC extension be same for all SPSs? Comment: View dependency should be
retained the same, otherwise start with new IDR. JVT decision: Adopted
e) What happens if certain views are stripped off, but SPS is unchanged? Should there be an
identifier for discardable views in SPS? Other solution could be to signal this by SEI
message (as in SVC) – think about in future. No action.
Issue 4 (implicit removal of decoded pictures) editorial – clarify offline.
Issue 5 (re-use scalability nesting SEI message in backward-compatible manner) JVT decision:
Agreed, but may need re-consideration / extension when the views shall be differently scaled
temporally, spatially.
6.3.1.1.2 JVT-W036 ( Prop 2.2.1/3.1) [Y.-K. Wang, M. M. Hannuksela, Y. Chen]
MVC output related conformance
MVC supports a large range of views, but the number of views that the decoder processes can be
constrained to a relatively small value to meet the rendering capabilities, for example. According
to the current MVC draft, it cannot be known from the bitstream which views are to be outputted.
It is claimed in this contribution that the information which views are to be outputted is required
in the picture output and removal processes of the hypothetical reference decoder as well as in the
derivation of the minimum decoded picture buffer requirement. While it is possible for a decoder
to get the information through a systems means that is out of the scope of the MVC specification,
it is asserted that containing the information within the bitstream is helpful in at least two aspects.
First, like AVC or SVC, the decoding process can be independent of external information.
Second, when parts of the bitstream have not been received due to any reason, the receiver knows
how to handle, e.g. to conceal a lost picture or to omit decoding a non-required picture. This
contribution proposes the signaling of the to-be-outputted views within MVC bitstreams.
One possibility is to leave this unspecified. However, if there is a mechanism to specify this, it may
even be possible not to decode these views at all. Sounds like a very special case, where e.g. the
server or proxy must be aware of the type of display that is available at the receiver end.
Discussed further after offline discussion of more showcase details with Anthony V.
JVT decision: Adopted into SEI. Showcase to be made by next meeting.
Remark: Using SEI for this seems odd, since it governs normative decoder behavior.
6.3.1.1.3 JVT-W037 ( Prop 2.2.1/3.1) [Y. Chen, Y.-K. Wang, M. M. Hannuksela]
View scalable SEI message for MVC
View scalability information SEI message is proposed in this contribution. The SEI message is
used to signal a number of operation points and information of each operation point, including
definition of the operation point, maximum priority_id and temporal level, profile and level
compatibility information, bitrate information, frame rate information, and initial parameter sets
information.
Similar to scalability information SEI message in SVC. JVT decision: Adopt.
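To make the notion of an operation point concrete, the sketch below filters NAL units by a set of views together with a maximum temporal level and a maximum priority_id, which are the quantities the proposed SEI message is said to carry. The class and field names are hypothetical and do not reproduce the syntax element names of the MVC draft.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NalUnit:
    """Minimal stand-in for the header fields used during extraction."""
    view_id: int
    temporal_level: int
    priority_id: int


@dataclass(frozen=True)
class OperationPoint:
    """Hypothetical operation point: views plus upper bounds on two IDs."""
    view_ids: frozenset
    max_temporal_level: int
    max_priority_id: int


def extract(nal_units, op):
    """Keep only the NAL units that belong to the operation point."""
    return [
        n for n in nal_units
        if n.view_id in op.view_ids
        and n.temporal_level <= op.max_temporal_level
        and n.priority_id <= op.max_priority_id
    ]


stream = [
    NalUnit(view_id=0, temporal_level=0, priority_id=0),
    NalUnit(view_id=0, temporal_level=2, priority_id=1),
    NalUnit(view_id=1, temporal_level=0, priority_id=1),
    NalUnit(view_id=2, temporal_level=0, priority_id=2),
]
```

A server or gateway could apply such a filter without parsing slice data, which is the usual motivation for advertising operation points in an SEI message.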
6.3.1.1.4 JVT-W038 ( Prop 2.2.1/3.1) [Y. Chen, Y.-K. Wang, M. M. Hannuksela]
Operation point and view dependency changes SEI messages for MVC
View scalability information SEI message is proposed in this contribution. The SEI message is
used to signal a number of operation points and information of each operation point, including
definition of the operation point, maximum priority_id and temporal level, profile and level
compatibility information, bitrate information, frame rate information, and initial parameter sets
information.
View dependency and scalability operation point changes cover very specific case – keep this for
further study, no adoption in the current early phase of the project.
6.3.1.1.5 JVT-W039 ( Prop 2.2.1/3.1) [Y.-K. Wang, Y. Chen, M. M. Hannuksela]
Non-required pictures SEI message for MVC
A new SEI message for indication of non-required pictures is proposed in this contribution. With
the proposed SEI message, a communication system using MVC can avoid transmitting,
decoding and buffering of the non-required pictures. A non-required picture refers to such a
picture in a certain view in an access unit that is not used for inter-view prediction while listed as
an inter-view prediction picture in the sequence parameter set. Furthermore, a non-required
picture does not affect the decoding process of the current and future pictures in the current view
and other target output views.
See notes in section on JVT-W056.
6.3.1.1.6 JVT-W056 ( Prop 2.2) [J. B. Choi, W. S. Shim, H. S. Song, Y. H. Moon]
Inter-view prediction reference picture marking
This document proposes an additional nal_ref_idc_view syntax element for the marking process of
inter-view prediction reference pictures and a modified initialization process for the reference picture
list for inter-view prediction pictures. When a prediction structure is used in which some pictures of a
view are used for inter-view prediction reference and others in the same view are not, it is asserted
that the current inter-view prediction reference picture marking system has some problems. Firstly, the
current marking system could mark a picture as used for inter-view prediction reference when the picture
is actually not used for that purpose, because the marking only uses view dependency information from
the SPS. Secondly, the current initialization process for inter-view prediction reference pictures could
insert a picture that is not used for inter-view prediction reference into the reference list, because
it considers only the view dependency information and PicOrderCnt(). The
proposed nal_ref_idc_view indicates whether a picture is used as an inter-view reference picture,
and the modified initialization process considers the view dependency information,
PicOrderCnt() and the proposed nal_ref_idc_view.
Case 1: Some pictures in same view are not used for inter-view prediction, case 2: In case of
temporal resolution of some views is different. Similar method was proposed in U103. Necessary
information can be derived from view dependency id.
Consensus in the group that JVT-W039 and JVT-W056 cover something useful. Breakout group
to elaborate on unified solution (considering pro’s and con’s of doing it in NAL or as SEI, also
relevance in terms of complexity saving) and report back.
After consideration – JVT decision: Adopt JVT-W056 (not JVT-W039 at this point).
6.3.1.1.7 JVT-W066 ( Prop 2.2/3.1) [P. Pandit, P. Yin, C. Gomila] Ref pic list
reordering for MVC
In the current JD, new Reference Picture List Reordering (RPLR) commands were added to
support reordering of inter-view reference pictures. This document proposes to change the
equations used to derive the view index prediction value in order to allow for
duplicating/repeating the inter-view reference pictures in the list.
Resolves problem that currently exists for the first RPLR command. JVT decision: Adopt.
Remark: There are surely other mechanisms to fix the problem, but the proposed method is
similarly simple as those would be.
6.3.1.1.8 JVT-W067 ( Prop 2.2/3.1) [P. Pandit, P. Yin, C. Gomila] H.264/AVC
extension for MVC using SEI message
This document proposes a new supplemental enhancement information (SEI) message for
signaling of multiview information in a H.264/MPEG-4 AVC compatible bitstream where each
picture contains sub-pictures for each particular view. This SEI message is intended for easy and
convenient display of multiview video streams on 3D monitors which may use such a framework.
Reports a method that would enable packing several views into an AVC compatible bitstream.
(using tiling of views).
There would be other possible approaches to achieve this (e.g. temporal interleaving). Could also
be seen as extension of stereoscopic SEI. Anyway, existing AVC decoders would be unaware of
such a new SEI message, which could only be defined in a new amendment. Set up AHG on
“Study of MVC solutions using existing AVC decoders”, chair P. Pandit.
6.3.1.1.9 JVT-W074 ( Prop 2.2) [H. S. Song, W. S. Shim, Y. H. Moon, J. B. Choi]
Comments on view dependency info
This contribution consists of two sections. Section 1 concerns additional syntax in the
sequence parameter set for flexibility of the inter-view prediction structure. Since the added
syntax indicates which pictures, by temporal level, are predicted with inter-view prediction,
the proposed scheme is asserted to be efficient for random access, memory
management, etc. Also, the proposed scheme can reportedly be used in environments with
restricted memory size or a low-complexity requirement.
Section 2 concerns a modified representation method for efficient representation of view
dependency information. The modified method represents view dependency by a basic unit
of the repeated pattern and the number of views in the basic unit. It is reported
to be useful and efficient for representing prediction structures in which the view dependency is
repeated in a uniform pattern.
Comment on first part: Seems not to be very significant in terms of saving memory and
complexity. No real support in the group.
Comment on second part: Amount of bits saved is negligible and not worth the additional
complexity.
No action taken.
6.3.1.1.10 JVT-W080 ( Prop 2.2.1) [K. Ugur, H. Liu, Y.-K. Wang] Showcase for
parallel decoding info SEI message for MVC
At the Marrakech meeting, the Parallel Decoding Information SEI message (JVT-V098) was
adopted to JMVM to facilitate parallel encoding/decoding of different views. This contribution
presents a showcase for this SEI message. In addition, some allegedly-minor issues were
identified with the syntax and semantics of the SEI message after the Marrakech meeting. This
contribution also proposes the changes to syntax and semantics to address these issues.
Group is satisfied with showcase. JVT decision: Agree with syntax adjustments as presented.
6.3.1.1.11 JVT-W088 ( Prop 2.2) [S. Lin, S. Gao, Y. Liu, L. Xiong] H.264/AVC
SEI extensions for MVC
This contribution proposes modifications to extend two H.264/AVC SEI messages for MVC, one
is spare picture SEI message, the other is decoded reference picture marking repetition SEI
message. Both of these SEI messages were introduced in H.264/AVC to implement error
concealment.
Remark: Can achieve same functionality with JVT-W035.
Response from proponent: That does not enable inter-view spare picture selection. Suggests that
spare picture usage in the view direction would be useful.
Question: Any example pictures identified where this would be useful?
Remark: Marking process only affects the temporal direction. There is no marking that operates
in the view direction.
Aspects of this contribution beyond what can be achieved by JVT-W035 are for further study.
6.4 MVC other normative technical inputs
6.4.1 MVC motion/disparity vector coding
6.4.1.1.1 JVT-W081 ( Prop 2.2) [H. S. Koo, Y. J. Jeon, B. M. Jeon] MVC motion skip
mode
This document proposes a motion skip mode for MVC, which originates from the idea that
motion is similar between two neighboring views. In the proposed
method, the motion information is inferred from the corresponding macroblock in the frame with
the same temporal index of the neighboring view. To compensate the inter-view difference
generated by camera geometry, disparity vector is applied to find the corresponding macroblock
in the neighboring view. The maximum gain obtained with the proposed method is up to 0.54 dB.
Uses global disparity vector, for non-anchor pictures this is derived from the anchor pictures.
Introduces the global disparity in the slice header syntax. Introduce motion_skip_flag in MB
layer syntax. 0.54 dB gain for Rena, 0.38 Akko&Kayo, Race 0.25, Flamenco 0.1, negligible for
other sequences. 0.18 dB on average.
Comment: Proposal uses inter-view reference for motion information in view level 1, which
would require to define the picture as reference picture and store it in DPB. Not clear if this is
possible.
Clarified buffer management issue offline and reported back. After review of breakout:
Proponent was to produce concrete description text for potential inclusion in JMVM –
participants of breakout also were asked to check this against the JMVM software code.
Break-out group discussion held with results recorded in JVT-W139.
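The inference step of the motion skip mode can be pictured as follows: the current macroblock position is shifted by the global disparity vector, and the motion information stored for the resulting macroblock in the neighboring view is reused. The sketch assumes the disparity is expressed in whole macroblock units and that motion data is held in a dictionary keyed by macroblock position; both are illustrative assumptions and not the text recorded in JVT-W139.

```python
def infer_motion(neighbor_motion, mb_x, mb_y, global_disparity, mb_cols, mb_rows):
    """Return the motion info of the corresponding MB in the neighboring view.

    neighbor_motion: dict mapping (mb_x, mb_y) to a motion record, for the
                     picture of the neighboring view at the same time instant.
    global_disparity: (dx, dy) in macroblock units (assumed granularity).
    Returns None when the shifted position falls outside the picture or when
    no motion is stored there (e.g. an intra macroblock), in which case the
    mode cannot be used for this macroblock.
    """
    dx, dy = global_disparity
    cx, cy = mb_x + dx, mb_y + dy
    if not (0 <= cx < mb_cols and 0 <= cy < mb_rows):
        return None
    return neighbor_motion.get((cx, cy))


# Hypothetical motion records: (partition mode, reference index, motion vector).
neighbor = {
    (3, 1): ("INTER16x16", 0, (4, -2)),
    (4, 1): ("INTER16x16", 1, (0, 0)),
}
```

An encoder would signal motion_skip_flag only when this look-up succeeds and the inferred motion gives a lower rate-distortion cost than the regular modes.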
6.4.1.1.2 JVT-W139-B (BoG) [LG, Thomson] Break-out conclusions on JVT-W081
Report of break-out discussion on JVT-W081.
JVT decision: Adopt (into JMVM) as recorded in JVT-W139.
6.4.1.1.3 JVT-W073 ( Info) [K. Sohn, J. Seo] Verification of JVT-W081 LGE MVC
motion skip contrib.
This document reports the cross-check results of JVT-W081 “MVC motion skip mode” by LGE.
The source code, configuration files and coded bitstreams were provided. The verification was
performed by decoding the bitstreams provided by LGE. The simulation results of JVT-W081 are
confirmed.
Check made on basis of compiled source code, results verified.
6.4.1.1.4 JVT-W101 ( Prop 2.2) [H. Yan, J. Huo, Y. Chang, S. Lin, S. Gao, L. Xiong]
MV/DV prediction based on RDV
This document is a response to JVT documents JVT-V071, JVT-V072 and JVT-V073. Several
changes of original techniques have been made, and coding performance of proposed mv/dv
prediction method is investigated.
Only small or no gains. Possibly still bugs in implementation. Contribution noted.
6.4.1.1.5 JVT-W104 ( Prop 2.2) [S.-H. Lee, S.-H. Lee, N.-I. Cho, J.-H. Yang] MVC
disparity vector pred
This contribution is a response to the ad hoc group work on disparity and motion vector coding.
We propose a modified motion vector prediction scheme, which distinguishes neighboring
motion vectors as temporal motion vectors and disparity vectors. Each kind of motion vector is
used exclusively in motion vector prediction phase by reference picture types. Disparity vectors
are derived from temporal matching blocks when they are not available from neighboring blocks.
Proposed algorithm shows 0.0 dB~0.04 dB PSNR gain and 0.2%~1.2% bit reduction with
Bjontegaard measure for all views and all frames. And 0.01 dB~0.074 dB PSNR gain and 0.34 %
~ 2.76% bit reduction for selected views which have an inter-view dependency in non-anchor
frame.
Average gain without RPLR on: 0.042 dB, with RPLR on: 0.006 dB. In some cases worse results
than JMVM 3.0.2. Contribution noted.
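As a rough illustration of the type-separated prediction proposed in JVT-W104, the sketch below picks a predictor only from neighbouring vectors of the same kind as the current reference (temporal or inter-view). The tuple layout, the function name and the use of a component-wise median are assumptions made for illustration. The fallback that derives disparity vectors from temporal matching blocks is not modelled and is represented by returning None.

```python
from statistics import median_low

def predict_vector(neighbors, current_is_inter_view):
    """Return a (vx, vy) predictor from neighbours of the same vector kind.

    neighbors: list of (vx, vy, is_inter_view) tuples (hypothetical layout).
    current_is_inter_view: True if the current block references another view.
    """
    # Keep only neighbours whose vector kind matches the current reference type.
    same_kind = [(vx, vy) for vx, vy, is_iv in neighbors
                 if is_iv == current_is_inter_view]
    if not same_kind:
        # No usable neighbour: a real codec would derive a candidate elsewhere.
        return None
    # Component-wise median (assumed here; median_low keeps integer values).
    return (median_low([v[0] for v in same_kind]),
            median_low([v[1] for v in same_kind]))
```

With neighbours [(4, 0, False), (6, 2, False), (-30, 1, True)], a temporally predicted block would use only the first two vectors, so the large disparity vector does not distort the predictor.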
6.4.1.1.6 JVT-W107 ( Info) [K. Sohn, J. Seo] Verif JVT-W104 MVC disparity vector
pred
This document reports the cross-check results of JVT-W104 “MVC disparity vector prediction”
by SNU/LGE. The binary files, coded bitstreams were provided. The verification was performed
by decoding the bitstreams provided by SNU/LGE. The simulation results of JVT-W104 are
confirmed.
Cross-check based on compiled source code.
6.4.2 MVC weighted prediction
6.4.2.1.1 JVT-W040 ( Prop 2.2.1/3.1) [S. Liu, Y. Chen, Y.-K. Wang, M. M.
Hannuksela] Constraints on temporal direct mode and weighted prediction in
MVC
When an inter-view prediction picture is a co-located picture, it is reportedly not specified how
the temporal direct mode and implicit weighted prediction should be applied. It was studied
whether the temporal direct mode suits the inter-view prediction pictures by using view_id
instead of PicOrderCnt to calculate the scaling factors. Judging from the simulation results, the
modified temporal direct is reported to provide no efficiency gain and sometimes to even bring
coding efficiency loss. It is therefore proposed that when the co-located reference picture belongs
to inter-view reference pictures, temporal direct mode shall not be used. Furthermore, methods on
how to support weighted prediction when there are inter-view references in the reference lists are
discussed.
Disable temporal direct mode in case of inter-view prediction. JVT decision: Adopt. Problem
with scaling in case of implicit weighted prediction. JVT decision: Adopt solution to disable
implicit weighting prediction (which seems to be the best possible fix for the time being).
6.4.2.1.2 JVT-W098 ( Prop 2.2) [J. H. Park, Y. H. Kim, J. W. Kim, B. H. Choi]
Weighted prediction for MVC
This contribution suggests re-use of base view weighting parameters when all view sequences
have quite similar tendency of weighting parameters. It reports that introducing one bit syntax
same_weighted_prediction_flag in SPS in SVC MVC extension gives a way to avoid redundant
process when all views have the same weighting value. Also reported is that introducing
use_base_view_prediction_flag in slice header gives flexibility.
Proposal to re-use the weighting parameters from base view for enhancement view. Coding gain
negligible, but would need change of existing slice header syntax and decoding process.
Contribution noted.
6.4.3 MVC downsampled reference etc.
6.4.3.1.1 JVT-W079 ( Prop 2.2/3.1) [H. Kimata, S. Shimizu, K. Kamikura, Y.
Yashima] Inter-view prediction with downsampled reference pictures
This document proposes a new inter-view prediction method for the case in which each view has a
different spatial resolution in a multi-view sequence. The above case is asserted to be beneficial
to reduce both the total bit rate for multi-view sequence and the complexity of encoding and
decoding multi-view sequences, because the number of samples in some views could be
decreased when spatial resolution of them is decreased. The contribution discusses the efficiency
in terms of coding efficiency and complexity of the proposed coding in the document. And in the
proposed method, it is asserted that we could have an optional post processing in which decoded
images of such low resolution pictures could be up-sampled. It also discusses briefly the
effectiveness of such post processing.
Idea to reduce the spatial resolution in some of the views to achieve better compression
performance. This could also include the case where some of the cameras produce less resolution.
Preferably, these are encoded as B views. Requires downsampling in the prediction process for
the B views. Proposal to add the respective syntax in the SPS. Currently, it would only be
possible to encode the downsampled views independently from the higher-resolution views.
Results indicate that significant compression gain can be achieved for the low-resolution views if
prediction from the higher-resolution views is enabled.
Comments:
– In practice (for certain types of displays) also inter-view methods would be used to generate
the up-sampling
– Complexity reduction is very interesting aspect
– Overall rate saving needs still to be investigated (currently, only saving on B views was
reported)
– Would also require subjective evaluation
Further work encouraged (see below under JVT-W092)
6.4.3.1.2 JVT-W092 ( Prop 2.2/3.1) [P. Pandit, P. Yin, C. Gomila] Reduced resolution
update for MVC
This document presents the extension of Reduced Resolution Update (RRU) mode for multi-view
video coding (MVC). RRU is currently supported by H.263 (Annex U). This mode is asserted to
provide the opportunity to increase the coding picture rate while maintaining sufficient subjective
quality. This is done by encoding an image at a reduced resolution, while performing prediction
using a high resolution reference. This reportedly allows the final image to be reconstructed at
full resolution and with good quality, although the bit rate required for encoding the image has
been reduced considerably. It is asserted that the results using JM 10.1 show performance
improvements of about 0.3 dB over not using RRU.
Request for more flexibility in having views of different resolutions, similar to JVT-W076.
Support for spatial scalability. Preliminary results with RRU (based on JM, not JSVM), gain of
0.2 … 0.6 dB for the views that were processed (no overall gain reported). Compared to the
previous proposal, it is here necessary to specify a normative upsampling filter in the loop.
Saving in complexity not as high as 076 (only for parsing process it is less, but all MC/DC is
done at full resolution).
Further study of this and JVT-W076, establish AHG, still need to be checked whether there is
relationship with other issues such as view interpolation.
6.4.3.1.3 JVT-W094 ( Info 2.2) [W. J. Tam] Image and depth quality of
asymmetrically coded stereoscopic video for 3D-TV
This contribution provides experimental results of subjective evaluation of stereoscopic images in
which the image quality of the left-eye and the right-eye views are different. Subjective ratings
of binocular image quality are biased towards the input with the higher image quality when the
source of image degradation is Gaussian filtering (blur). When the source of image degradation
is from quantization, such as from block-based coding, the binocular image quality is reported to
be approximately the average of the image quality inputted to the two eyes. In contrast to image
quality, depth quality ratings were reportedly only slightly affected by asymmetrical image
degradation arising from either blur or blockiness. The main findings were replicated for a wide
range of asymmetrical quality between the two eyes, using a variety of test sequences, and for
different groups of viewers.
Finding that lowpass-filtered image for one eye gives subjectively same quality perception in
stereo if resolution for the second eye is kept high. Asymmetrical coding is viable method for
bandwidth saving. Cross-switching at scene cuts is uncritical and might help to circumvent
problems of large asymmetries.
Would need more investigation how it works for multi-view; potentially alternating quality (low-high-low-high) could be viable. In general, this is more an issue of encoder decisions.
Contribution gives valuable hints, but difficult to quantify in absolute numbers (most probably
sequence dependent)
6.4.4 MVC modes and other coding efficiency topics
6.4.4.1.1 JVT-W078 ( Prop 2.2/3.1) [H. Kimata, S. Shimizu, K. Kamikura, Y.
Yashima] Co-located block condition for inter-view prediction
This is a follow-up of JVT-U134, in which a simplified coding method of direct mode for inter-view prediction of MVC was proposed to reduce memory usage for disparity information. In this
document, all coding results are presented and complexity reduction is discussed.
Main issue is reducing memory bandwidth for disparity vectors. Usage of co-located block
meaningless for inter-view prediction, shown that there is no loss in compression performance.
Not clear how large the complexity reduction really is, most probably it is minor. Therefore, it
seems to be better to keep it as it is (as in AVC) instead of imposing additional constraints that
are specific for MVC.
6.4.4.1.2 JVT-W082 ( Prop 2.2) [Y. J. Jeon, H. S. Koo, B. M. Jeon] Modified spatial
direct mode in MVC
The application of the colZeroFlag derivation process in spatial direct mode is not proper in some
cases where the motion properties of current block and co-located block are different (i.e. one is
mv, and the other is dv, or vice versa). This contribution introduces a solution to keep this
derivation process meaningful. Before the derivation of this flag, a validation check process is
invoked to check whether the motion properties of two blocks are identical. If the motion
properties of two blocks are identical, the existing colZeroFlag derivation process is invoked. If
not identical, colZeroFlag is set to 0 without any further investigation. By the proposed method,
the colZeroFlag derivation process in H.264/AVC can be carried out in proper way.
Zero gain in compression, no reduction in complexity, keep it as it is in AVC. Contribution noted.
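The validation check described for JVT-W082 can be sketched as follows. The AVC condition is paraphrased here in simplified form (short-term reference, reference index 0, both vector components within -1..1); the function and parameter names are hypothetical and the sketch is not taken from the contribution or from the JMVM software.

```python
def col_zero_flag(cur_is_disparity, col_is_disparity,
                  col_ref_idx, col_mv, ref_is_short_term=True):
    """Sketch of a colZeroFlag derivation with a motion-property check.

    cur_is_disparity / col_is_disparity: True if the respective block uses
    a disparity vector (inter-view), False if it uses a temporal vector.
    col_mv: (x, y) vector of the co-located block.
    """
    # Proposed validation: differing motion properties give colZeroFlag = 0.
    if cur_is_disparity != col_is_disparity:
        return 0
    # Simplified paraphrase of the existing AVC-style condition (assumption).
    small = all(-1 <= c <= 1 for c in col_mv)
    return int(ref_is_short_term and col_ref_idx == 0 and small)
```

For example, a temporally predicted current block with a co-located block that carries a near-zero disparity vector yields 0 here, whereas the unmodified check would have returned 1.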
6.4.4.1.3 JVT-W065 ( Prop 2.2/3.1) [P. L. Lai, A. Ortega, P. Pandit, P. Yin, C.
Gomila] Adaptive reference filtering for MVC
This document considers the problem of coding multi-view video that exhibits mismatches in
frames from different views. Such mismatches could be caused by heterogeneous cameras and/or
different shooting positions of the cameras. In particular, it considers focus mismatches across
views, i.e., such that different portions of a video frame can undergo different
blurriness/sharpness changes with respect to the corresponding areas in frames from the other
views. It proposes an adaptive filtering approach for inter-view prediction in multi-view video
coding. Preliminary results, on anchor only coding (IPPPP), are asserted to show gains ranging
from 0.06 dB to 0.8 dB over the current method. The asserted gain is larger for sequences with
stronger focus mismatches.
Gains depending on number of reference pictures that are used. For one sequence (Flamenco 2)
gain of > 1 dB and almost 20% bitrate savings are reported (for selected views). Decoder
complexity increased by a 5x5 2D filter. Average gain 0.45 dB for case of 1 reference picture,
0.14 for 3 reference pictures, 0.06 for 5 reference pictures. Further study in AHG, in particular
consider complexity at pixel level, possibly combination with subpel interpolation filters (has some
relation with VCEG AHG that exists for studying adaptive MC interpolation filters and also with
previous proposals (Wedi) for adaptive Wiener loop filters).
6.4.5 MVC depth-based methods & displays
6.4.5.1.1 JVT-W095 ( Info 2.2) [W. J. Tam, L. Zhang] Depth map preproc and
minimal content for 3D-TV using depth-based rendering
This contribution provides experimental results of subjective evaluation of stereoscopic images
consisting of an original image and a rendered image from depth-image-based rendering (DIBR).
Experimental results show the beneficial effect of smoothing of depth maps before DIBR on
image quality. Furthermore, results are shown for asymmetrical smoothing in which the extent of
smoothing is larger in the vertical than in the horizontal direction to reduce geometric distortions.
Finally, consistent with the findings that depth maps do not have to contain "full resolution,"
subjective assessment results from a different set of studies indicate that enhanced depth
sensation, compared to reference monoscopic images, can be obtained using "surrogate" depth
maps. That is, depth maps that contain sparse "depth" information located mainly at edges and
object boundaries. The overall findings indicate that depth information for DIBR, just as for
colour information, does not have to be of full spatial resolution for the generation of useful images
for autostereoscopic multiview displays and other stereoscopic displays to produce enhanced
sensation of depth.
In general indication that stronger smoothing of depth maps provides improved subjective quality.
Depth of boundary location may be sufficient. Right view generated by depth-based projection
from the left. Subjective tests performed with minimum of 10 subjects. Shutter-eye glasses used.
6.4.5.1.2 JVT-W100 ( Prop 2.0/3.1) [A. Smolic, K. Mueller, P. Merkle, N. Atzpadin, C.
Fehn, M. Mueller, O. Schreer, R. Tanger, P. Kauff, T. Wiegand, T. Balogh, Z.
Megyesi, A. Barsi] Multi-view video plus depth (MVD) format for advanced
3D video systems
The contribution proposes to initiate a study on how to support multi-view video plus depth
(MVD) data efficiently by a coding standard. It illustrates advanced 3D video and free viewpoint
video systems, and argues that these are not efficiently supported by available and emerging
specifications, such as MPEG-C Part 3 and MVC. The central requirement of such technology is
said to be an input data format that allows rendering a wide range continuum of views at the
decoder. MVD is introduced and illustrated in some detail, being multi-view video with multiple
associated per sample depth maps. It is claimed that MVD fulfills the above requirement and is
therefore a suitable candidate for a basic format for advanced future 3DV and FVV systems.
Finally, an initial work plan for the proposed investigation is presented.
Relationship with both MVC and MPEG-C part 3. Required input format that allows rendering of
continuous views. Occlusions can be handled by smoothed depth maps. Single video plus depth
has limitations, artifacts when wide range is required. Possible solution is multiple videos plus
depth, another is layered depth video: One video, one depth map, one background layer for
occluded pixels. Proposal to start new work plan on this. Firstly, this is about new functionality.
The relationship with compression would also be interesting to be investigated (e.g. using depth
maps for generation of prediction references). Report that it was found that compression of depth
maps for multiple views is not simple (if high quality view generation is required).
Look into issues of
– Compression of depth maps
– Relationship with MPEG-C part 3
– Relationship depth-based rendering and view compression (CE 6)
Was further discussed in the context of requirements (FTV).
Discussion (Tue morning): Is it necessary to define normative rendering? Most raise objections
against that. Displays are so specific that it needs to be left to the manufacturer how to perform
the interpolation. It must be specified what the “conformant” output views are (may not be equal
to the views that are actually captured). Definition of data representation that allows to generate a
certain number (in principle up to arbitrary) of views. Boundary between decoding and rendering
may be floating, depending on whether a method for rendering would be worthwhile to be
considered as a compression tool (e.g. producing a better prediction of intermediate views).
Under discussion:
– Format allowing generation of arbitrary (up to continuous) views would be useful,
supporting many types of displays (consensus on this)
– Would require (in addition to what is currently investigated) to have information about 3D
scene structure as necessary for rendering (one example would be depth maps)
– Rendering/display/interpolation (see note below) is non-normative, but an example method
shall be given, and would be needed anyway for the development
– Needs to be investigated whether relationship between depth information and picture
information helps to develop a better compression
Note: There is some internal dispute on what “rendering” means. Interpolation may also include
spatial upsampling in cases where some of the views have lower resolution.
Further discussed with Requirements (Wed 14:00), also whether this will be added into current
MVC development or another activity with extended timeline.
How to evaluate effectiveness of depth maps as a view coding feature?
Question raised by JVT-W100: Should we specify normative interpolative rendering? Without it,
how does an encoder know how to optimize its encoding decisions?
Remark: Leave that non-normative.
Remark: Would like at least some decoders to be required only to extract and decode exact (non-interpolated) view(s).
See also related notes in section on JVT-W127.
Further study to be held in CE on view interpolation prediction.
6.4.5.1.3 JVT-W060 ( Prop 2.2/3.1) [A. Vetro, S. Yea, W. Matusik, H. Pfister, M.
Zwicker] Anti-aliasing for 3D displays
This contribution describes an anti-aliasing technique for improved rendering of multiview video
on 3D displays. View interpolation techniques are utilized to achieve an oversampling of the
multiview signal in the view dimension. The oversampled signal is then filtered to suppress high
frequency portions of the signal that contribute to aliasing, and finally sub-sampled to match the
display characteristics. This contribution examines ways to minimize receiver resources in this
framework. Two distinct needs for MVC are highlighted, including the need to code and transmit
depth maps along with the multiview video, as well as the need for spatial scalability. An SEI
message that signals acquisition and scene attributes is also proposed.
Danger of alias: Relationship between spatial resolution and number of views (including
scalability of both). Spatial resolution of each view affects the spectrum of input signal. One
effect is ghosting artifacts which can be prevented by pre-filtering. Depth maps could be part of
access unit and managed together with the primary picture set (which would not be possible
when using MPEG-C part 3).
Proposal for maximum disparity and camera parameters as SEI messages. The group further
discussed the proposed SEI message issues in the context of HL syntax. 1) For camera
parameters, participants were asked to clarify relationship with previous proposals, precision etc.,
2) for max. disparity showcase is needed.
JVT decision: Adopt camera parameters SEI and max disparity SEI (showcase expected at next
meeting).
Min disparity may also be useful – for further study.
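The oversample, filter and sub-sample chain described in the JVT-W060 abstract can be sketched for a single sample position along the view dimension. This is only a toy model: linear interpolation stands in for real view interpolation, the 3-tap kernel is an arbitrary low-pass choice, and all function names are hypothetical.

```python
def interpolate_views(views, factor):
    """Oversample along the view axis by linear interpolation (stand-in)."""
    dense = []
    for a, b in zip(views, views[1:]):
        for k in range(factor):
            t = k / factor
            dense.append((1 - t) * a + t * b)
    dense.append(views[-1])
    return dense

def low_pass(signal, kernel=(0.25, 0.5, 0.25)):
    """Suppress high view-frequencies; borders are handled by clamping."""
    half = len(kernel) // 2
    n = len(signal)
    out = []
    for i in range(n):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = min(max(i + j - half, 0), n - 1)
            acc += w * signal[idx]
        out.append(acc)
    return out

def antialias(views, factor, step):
    """Oversample, filter, then sub-sample to the display's view count."""
    return low_pass(interpolate_views(views, factor))[::step]
```

Here views is the list of values that one pixel position takes across the coded views; choosing step equal to factor returns as many views as were supplied, while a larger step models a display with fewer views.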
6.4.6 MVC view parallel processing
6.4.6.1.1 JVT-W077 ( Prop 2.2) [P. Yang, X. Xu, G. Zhu, Y. He] View parallel
processing on MVC
A view parallel coding architecture is presented in this proposal. In MVC inter-view references
are used to exploit the dependency of different views. Consequently, the parallel processing
ability is deteriorated. When parallel processing is required, a simple structure would be used.
However, the coding efficiency for this scheme is therefore compromised. The proposed method
in this document restrains the prediction between pictures at the same time slot in different views,
but allows the other kinds of inter-view prediction, thus all of the views can be processed in
parallel. Therefore, the proposed method has a similar parallel mechanism as the simple structure,
while achieving some coding efficiency gain over it. An average PSNR gain (for all encoded
pictures) is reported to be 0.14 dB when encoding the common test conditions and 0.22 dB for all
non-key frames. Also, the proposed method would favor sequences with large motions. An
overall 0.34 dB gain and 0.47 dB gain for non-key frames are achieved.
Implementation done in JSVM (JMVM only supports view-first). 0.14 dB gain on average as
compared to “simple” structure (which does not allow inter-view prediction for non-key pictures).
Concept may have implications on access unit definitions and buffer management. Encoding of
frame t0 would cause initial delay which can never be made up again. Identify relationship
with JVC proposal for parallel processing made 2 meetings ago.
JMVM implementation would be needed.
No report given Tuesday morning, apparently no offline discussion happened prior to that time.
Cross-check JVT-W108 was still not available at that time.
Remark: Complicates management of reference pictures. Would like to see text on how this
would be solved. We have cross-view and cross-time dependency referencing – this proposes
new diagonal dependency directions.
Remark: Implications on MMCO and ref pic list construction may be major.
Remark: Consider JVT-V132 structure. IPPP cross-view from that document is suggested as a
better reference. Commenter asserts that non-hierarchical structure in view direction will provide
better results than hierarchical structure.
For further study.
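The restriction proposed in JVT-W077 can be expressed as a simple rule on (view, time) pairs: references within the same view and "diagonal" references to another view at a different time are allowed, while references to another view at the same time slot are not. The sketch below only encodes that rule; coding order, picture availability and reference list construction are not modelled, and the function names are hypothetical.

```python
def is_allowed_reference(cur, ref):
    """cur and ref are (view, time) pairs; returns True if ref may be used."""
    cur_view, cur_time = cur
    ref_view, ref_time = ref
    if cur == ref:
        return False  # a picture cannot reference itself
    if ref_view != cur_view and ref_time == cur_time:
        return False  # same time slot, different view: disallowed
    return True       # temporal or diagonal inter-view reference

def parallel_at_time(views, t):
    """True if no picture at time t depends on another picture at time t."""
    return all(not is_allowed_reference((a, t), (b, t))
               for a in views for b in views)
```

Because no picture at a given time slot may reference another picture at that time slot, all views of one time instant can in principle be processed concurrently, which is the property the contribution aims for.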
6.4.6.1.2 JVT-W108-QV (Late Info) [Q. Chen, Z. Chen] Verif JVT-W077 view parallel
proc on MVC
This document verifies the results of JVT-W077: “View parallel processing on MVC” from
Tsinghua Univ.
Based on the executable files (encoder and decoder) and configure files provided, bit streams
were reportedly generated for verification.
In the first (late) uploaded version, only the sequences race and exit were reported finished. The
finished results were reportedly identical to JVT-W077r1.xls and can be found in JVT-W108.xls.
The bit streams could reportedly be decoded correctly.
Remark: This “verification” does not seem to fulfill the spirit of such efforts – the algorithm was
not investigated, and the contribution refers to just using executable files provided by the
proponent.
6.5 MVC reference software, common conditions, encoder optimization
No contributions noted (other than AHG input).
7 AVC base specification and related topics
7.1.1.1.1 JVT-W041 ( Prop NN) [A. M. Tourapis, K. Suehring, G. J. Sullivan, A.
Leontaris] H.264/MPEG-4 AVC reference software (JM) manual
Revision of the H.264/MPEG-4 AVC Reference Software Manual. JVT decision: Adopt.
Further presented on the last day of the meeting. Participants were encouraged to provide further
input to improve the software and its associated manual and algorithm description.
7.1.1.1.2 JVT-W042 ( Prop NN) [A. Leontaris, A. M. Tourapis] Rate Control
reorganization in the JM reference software
Rate control is an important component of a video compression system as it allows generating
compressed bit streams that satisfy bandwidth and buffering constraints. The Joint Model (JM)
reference software includes a basic rate control, which, even though not strictly optimal in a rate-distortion sense, allows researchers to evaluate the standard for practical compression scenarios.
However, it has been determined that several of the coding tools that are currently included in the
JM reference software were not properly supported by the existing rate control algorithm. Other
important coding tools, such as hierarchical B-coded pictures, while indirectly supported, were
being penalized because the rate control algorithms were never updated to properly consider and
take advantage of these tools. On the other hand, the rate control contained severe bugs that were
affecting the performance of the software or resulted in invalid bitstreams.
This contribution describes the reorganization of the original rate control algorithm, which
was contributed to the Joint Model (JM) 12.0 reference software. This contribution resolved
several standing problems that affected the rate control in previous JM versions, but also
introduced several new features and support for new tools such as coding of hierarchical
structures. More specifically, a number of new rate control modes were introduced to address
specific encoding situations, such as intra-only encoding and hierarchical B-coded pictures,
without however modifying the essence and basic operation of the original scheme. Instead, the
software enhancements have improved the readability and expandability of the original rate
control source code, as it was rewritten to adopt an object-oriented structure. The authors note
that the presence of broken coding tools in the JM may cause misinterpretation of the actual
capabilities of the coding tools.
JVT decision: Adopt.
7.1.1.1.3 JVT-W044 ( Info) [A. M. Tourapis, A. Leontaris, K. Suehring] New JM
reference software enhancements
The H.264/MPEG-4 AVC standard has been at times criticized due to its high complexity in
terms of both encoding and decoding. Unfortunately, and even though the standard is
considerably more complex than previous standards such as H.263 and MPEG2, evaluations on
its complexity are sometimes based on the implementation of the AVC Joint Model (JM)
reference software. Unlike commercial implementations however, this software was implemented
without any complexity considerations. Instead, it was designed mainly with flexibility of
implementation in mind since such was required for the proper evolution and development of the
standard. The JM codec, and obviously the standard as well, was developed in a relatively
significant amount of time and required the involvement of engineers from several companies
and institutes with varying levels of programming knowledge/skills. Although this has helped
in the finalization of the standard, its complexity of both the encoder and the decoder has been
rather poor compared to almost all commercial or publicly available implementations.
To this purpose, the coordinators of the reference software have undertaken a slow and at times
time-consuming effort to reorganize the software, improve its efficiency and coding performance,
and at the same time reduce its complexity. One such effort involved the reorganization of most
motion compensation and estimation processes within the encoder. This document presents
additional enhancements that were introduced to the latest reference software (version 12.2), and
which result in considerable complexity reductions at the decoder. The coordinators are still
undertaking several other optimizations within the software which may be released in future
versions. This contribution would discuss the primary enhancements that were introduced in the
JM software.
Some additional needs: Encoder conformance assurance (transform dynamic range, MV area
constraint in Baseline), decoder conformance checks (transform dynamic range).
Current “official” version is 12.2.
Observation: At very high bit rates, CAVLC works better than CABAC. Why?
Question: Coordination with VCEG KTA software? Remark: Karsten has broad discretion to
coordinate the work on our software – and that presumably includes discretion to coordinate with
VCEG efforts.
JVT decision: Adopt.
7.1.1.1.4 JVT-W057 (Late Info) [K. P. Lim] Improved JM text algorithm description
Reference software and descriptions of reference encoding methods and non-normative reference
decoding error concealment methods are useful in aiding users of a video coding standard to
establish and test conformance and interoperability, and to educate users and demonstrate the
capabilities of the standard. This document specifies non-normative reference encoding methods
and methods of concealing errors and losses in decoders for video data conforming to ITU-T
Recommendation H.264 | ISO/IEC International Standard ISO/IEC 14496-10 advanced video
coding.
JVT decision: Adopt.
7.1.1.1.5 JVT-W140-B (BoG) [T. Suzuki] Toward a professional profiles conformance
specification
Report of preliminary work toward a professional profiles conformance specification.
JVT decision: Endorsed.
8 Video annotation (jointly discussed with MPEG requirements 3:30 pm Wednesday 25 April)
8.1.1.1.1 JVT-W032 ( Info) [Q. Chen, C. Louis, Z. Chen] Requirements of video
annotation in video coding
This document presents some asserted requirements on adding video annotation support into a
video coding standard. Some application scenarios were listed which can reportedly benefit from
this practice, and these were placed into three categories: text annotation, visual characteristics,
and video structure. The contribution recommended that the JVT work out some methods to
support the target applications.
Presentation and discussion were held together for JVT-W032, JVT-W033, and JVT-W034.
Asserts that MP4 file format metadata tracks and MPEG EPG tracks do not fully address current
needs.
Suggests supporting carriage of annotation data in both the system level and video bitstream level.
Support within video is asserted to be useful due to the ability to carry the metadata regardless of
the system environment.
Asserts that MPEG-7 has too many things in it – to the degree that people don’t know which ones
to use. Potential approaches to this asserted problem include profiles of MPEG-7 or non-normative guidance about which MPEG-7 data types to use.
Remark: Putting such data into the video layer means that you need to touch the video layer just
to manipulate the metadata, and that you may need to search through very high bit rate
information to locate metadata of interest.
Remark: How does this connect with compression work?
It was suggested that metadata should be based on MPEG-7 as much as possible, and that
duplications of effort and text and inconsistencies of design should be avoided. Work should be
kept coordinated across the organizational boundaries.
Where to carry? Systems layer or video layer (SEI)? New metadata types? Re-use MPEG-7?
With modified structure?
Arguments for doing it in video: Persistent regardless of type of systems and FF, can be
generated as part of encoding, good to have it as part of raw video stream.
Elaborate pros and cons of doing it here or there. It may also be the case that for certain cases one
or the other is better. Even then, the metadata should be compatible (same subset of MPEG-7
etc.).
Explore relation between metadata and coding. Metadata should be MPEG-7.
AHG to be established in MPEG on the topic.
8.1.1.1.2 JVT-W033 ( Prop 2.2/3.1) [Q. Chen, Z. Chen, X. Gu] Video annotation SEI
message
This document proposes to add video annotation SEI messages into AVC bit streams to add
capabilities for video searching, browsing, and other applications. A couple of related issues are
discussed and finally a particular approach is proposed to the JVT.
See notes relating to JVT-W032.
8.1.1.1.3 JVT-W034 ( Prop 2.2/3.1) [C. Louis, O. Lionel, L. Frederic, Z. Chen, Q.
Chen] Fingerprint and video structure for video annotation SEI message
This document proposes to add “video fingerprint” and video structure support in SEI messages
for video annotation. These are combined into a proposed video annotation SEI message. The
applications are reported to be fast video copy detection, fast video browsing, etc.
See notes relating to JVT-W032.
9 AVC errata and clarification issues
9.1.1.1.1 JVT-W134-Q (Late Prop 2.2) [S. Narasimhan] Splicing issues and some
suggested changes
This contribution was subject to lateness penalties as recorded elsewhere in this report.
Splicing is currently used in U.S. cable networks for digital ad-insertion based on the MPEG-2
video standard and there are plans to migrate these applications to AVC in the near future.
In these applications, the splicing equipment (or function) combines two independently encoded
AVC streams and is expected to produce an AVC ‘conformant’ output for receiving equipment.
This contribution outlines issues related to generating an AVC conformant output by such
splicing equipment and suggests some reportedly-minor changes to the AVC standard to
reportedly assist these applications.
Remark: What about SMPTE RP 312M on seamless splicing? Basically, that is not being used,
and is reportedly expected to be withdrawn.
ITU-T J.181 is relevant (developed by SCTE and brought to ITU-T SG 9 or 11 with further
involvement by Japan ITU-T members).
Discusses local ad insertion, other types of splicing, and associated difficulties.
Output document JVT-W210 to be produced incorporating issues noted herein and others
identified by the editor of the output document, Gary J. Sullivan.
10 Requirements joint discussions with WG 11
Joint discussions were held with WG 11 requirements and video subgroups at 2pm on Wednesday
25 April. Some issues raised in WG 11 documents were discussed.
Also see notes relating to video annotation and profiles and applications.
10.1.1.1.1 M14452 WG 11 input [T. Murakami, K. Asai, Y. Yamada] Requirement of
full-color video coding for consumer applications
This WG 11 contribution suggested considering consumer-device support for the following.
– 4:4:4 chroma
– More than 8 bit dynamic range
– Picture formats from QVGA to HDTV.
Request for 4:4:4 in consumer applications. Camcorders and big displays would reportedly
benefit. Real big formats (8K) expected to happen after 2010. Requirements: frame rate up to 60
Hz, progressive scanning, up to 10 bits, enhanced coding efficiency for 4:4:4. Comment: Could
be achieved with 4:4:4 predictive profile. What would be needed that is different than that? Are
bit rate ranges of the proposal useful? Perform a study about exact requirements and identify
whether the professional profiles available would require any change.
Remark: Below certain bit rates/fidelities, having 4:4:4 and high bit depth hurts rather than helps
(some skepticism about that remark was expressed). Remark: There were old contributions from
Dolby and recent contributions in VCEG that may help clarify these issues.
Having a better understanding of those issues is needed, and an understanding of what is needed
that would be different than what the High 4:4:4 Predictive profile provides.
10.1.1.1.2 M14360 [USNB to WG 11] Issues relating to expiring patents
The USNB to WG 11 noted 1) That some number of core patents in media coding have expired
or will be expiring soon; 2) That there also exist un-patented technologies in media coding; 3)
That for many years the combination of CPU power, bandwidth, and compression efficiency was
not sufficient to give acceptable quality in many environments, and improved compression
efficiency was the driving factor in developing new standards, but for at least some environments
this has now changed – indeed, the USNB asserts that there are striking examples where not all
the CPU power available is used, or not all the bandwidth is used; there are also environments
where the strongest compression is not the dominant selection criterion; 4) That it has been
argued that a royalty-free standard would detrimentally affect the uptake of existing MPEG
standards – however, if it is technically possible to develop a standard which does this, the USNB
prefers that it be done in WG 11 where there is expertise in doing it well, and where such a
putative standard could be made a 'family member' with other MPEG standards (with an upgrade
path, for example, or related technical ‘roots’ etc.); 5) That the 'terms of engagement' of a study
on developing a process for royalty-free standards, and the results and follow-on for such work,
should be made more clear before more discussion is held at WG 11.
M14360: “No explicit request. Main purpose to create a discussion. No request to take specific
action.” Question raised: Does WG 11 have the expertise to find out whether a standard is
royalty-free?
No explicit request for action was made in these comments, and it was asserted that there would
need to be a more clear understanding of a process for developing royalty-free standards prior to
proceeding with such work.
10.1.1.1.3 JVT-W127 (Req) [M. Tanimoto, T. Fujii, H. Kimata, S. Sakazawa]
Requirements for FTV (MPEG M14417)
Proposed requirements for FTV (free viewpoint television) are provided in this document. The
content of this document is the same as in the MPEG document M14417.
FTV is anticipated to be a “ray-based system” as opposed to the “pixel-based system” approach
largely taken in today’s video coding standards. FTV should be able to be viewed on a wide
variety of displays, including both 2D and 3D displays, on a wide variety of platforms (from
mobile phones to fixed large room-based displays). Considers need for view generation,
including depth map determination and interpolation for display.
Note relevance of ISO/IEC 23002-3 and JVT-W100.
Proposed standardization action items for FTV
1) FTV data format
2) Compression
3) Rendering
4) Transmission data format and protocol (ITU-T SG 9 working on this)
There was some discussion of the distinction between “data format” and “compression”.
There was some discussion of the scope of standardization – and particularly regarding whether
“rendering” should be standardized, and where the compression decoding process ends and
rendering begins.
FTV must support many types of displays. Function of view generation should be simple. FTV
requires depth search and interpolation. 3 possibilities: Both at sender (ray space), both at
receiver (MVC plus postprocessing), or separated (search at sender, interpolation at receiver)
(MVC plus depth). Supported format is view plus depth, but information about reflection also
might need to be included in FTV data. Standardisation issues: Format, compression, rendering,
transmission. Proposal: Determine FTV format together with rendering; compression format can
be extension of MVC.
Data format would consist of all the information needed to perform a good rendering (image,
depth, illumination). Compression for most compact representation of data with a certain quality.
Question: To what extent should the method of creating additional views be specified? Unclear
– rendering would definitely need to be standardized when it is used as part of the
(de)compression. Otherwise, how can it be known which renderer is required, given that the
renderer is display specific?
Concern (against normative rendering) that freedom in the design of display is given up (in terms
of quality, complexity, …). However, testing with rendering (maybe different) will be needed.
See also related notes in section on JVT-W100.
Fernando will draft an MPEG requirements document on application requirements of FTV.
Can very well be seen as extension of current MVC. However, exploiting relationships between
image, depth and reflection could lead to better compression than separate handling.
Changes relative to prior MPEG MVC Requirements document have been agreed:
– Spatial scalability as “shall”
– Variation of spatial and temporal resolution across views as “shall”
11 JVT internal operating rules
JVT decision: The following clarifications/adjustments of JVT operating rules have been adopted.
The JVT decided that participants shall refrain from long (more than 4 minutes) presentations
of their proposal if the results of their coding efficiency experiments have shown less than 2%
bit-rate savings on average (or equivalently less than 0.1 dB gain on average).
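The threshold in this rule can be illustrated with a small check. This is only a sketch under stated assumptions: the function and constant names are hypothetical, the 2% figure is read as average bit-rate savings, and proposals that report no coding-efficiency results are assumed to fall outside the rule. Averages are taken over all reported results, in line with the request to avoid "cherry picking".

```python
from statistics import mean

# Thresholds taken from the rule; the names are illustrative only.
MIN_AVG_BITRATE_SAVINGS_PERCENT = 2.0
MIN_AVG_PSNR_GAIN_DB = 0.1


def long_presentation_allowed(bitrate_savings_percent=(), psnr_gains_db=()):
    """Return True if a presentation longer than 4 minutes is acceptable.

    Both arguments are sequences of per-test results; the averages are
    computed over every reported value, not over a selected subset.
    """
    if not bitrate_savings_percent and not psnr_gains_db:
        # Assumption: the rule only targets proposals that report
        # coding-efficiency results.
        return True
    if (bitrate_savings_percent
            and mean(bitrate_savings_percent) >= MIN_AVG_BITRATE_SAVINGS_PERCENT):
        return True
    if psnr_gains_db and mean(psnr_gains_db) >= MIN_AVG_PSNR_GAIN_DB:
        return True
    return False
```

For example, per-sequence savings of 5.0%, 0.0% and 0.5% average below 2%, so such a proposal would be limited to a short presentation even though its best single result looks strong.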
Presentations should also not use "cherry picking" of results for summary reporting in abstracts
and presentations. Summary reports must be true summaries – not highlights of best results
while ignoring worst results.
Regarding late contributions: Due to our difficulties with a large quantity of late-submitted
contributions at this and other recent meetings, the JVT has agreed that for its next meeting, no
late-uploaded (non-AHG-report, non-liaison, non-verification) contribution will be presented
without having a minimum of 4 JVT participants (working for organizations other than that of the
primary contribution author) recorded by name as supporting the allowance of such a
presentation, in addition to a consensus of the general JVT membership to allow the presentation.
Such support to allow a presentation is to be understood to not necessarily imply support of the
adoption of the content of the late contribution, but only as a positive expression that the
document should be allowed to be presented. Additionally, the provider of such a presented late
contribution shall send an email apology to the JVT email reflector. This rule does not apply to
material requested by the JVT at the meeting (e.g., reports of JVT-authorized side activities).
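The late-contribution rule can be summarized as a simple eligibility check. The following is a sketch only: the function name, the category labels, and the data layout are hypothetical and are not part of the JVT rules; the numeric threshold and the exemptions are taken from the text above.

```python
# Contribution kinds exempt from the rule, per the text above
# (the label strings themselves are illustrative).
EXEMPT_KINDS = {"ahg_report", "liaison", "verification", "jvt_requested"}
MIN_INDEPENDENT_SUPPORTERS = 4


def may_present_late(kind, author_org, supporters, general_consensus):
    """Return True if a late-uploaded contribution may be presented.

    supporters is an iterable of (name, organization) pairs for the
    participants recorded by name as supporting the presentation.
    """
    if kind in EXEMPT_KINDS:
        return True
    # Only supporters from organizations other than that of the primary
    # contribution author count toward the minimum.
    independent = {name for name, org in supporters if org != author_org}
    return general_consensus and len(independent) >= MIN_INDEPENDENT_SUPPORTERS
```

Passing this check only allows the presentation; it does not imply support for adoption, and the contributor is still expected to send the email apology to the JVT reflector.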
For all contributions that have presentation material that is used to present them to the group (e.g.,
PowerPoint presentations), the presentation material should be provided along with the written
contribution (within the same zip container file). PDF is preferred over PPT for presentations
when the PPT filesize is large and there is no need for the slide deck to be editable by others.
All submissions must be made in JVT-Wxxx.zip format, with the Word documents, Excel sheets and
other information being in the zip container. The document must contain an abstract and be
accompanied by an e-mail notification containing title, authors and abstract (identical to the one
in the document). The abstract must be no longer than 200 words and be written in the 3rd person
in a manner that does not express endorsement of the content of the document.
On filenames inside of .zip containers – use a filename so that if you take the files out of the zip
container, you'll still know what contribution they came from. Every file in the .zip container for
document JVT-Wxxx should start with JVT-Wxxx. Example: JVT-Wxxx.doc (main document),
JVT-Wxxx_presentation.pdf, JVT-Wxxx_results1.xls, etc.
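The filename convention can be checked mechanically before upload. The following is a minimal sketch using the Python standard library; the function name is hypothetical, and "JVT-Wxxx" is the placeholder document number used in the text above, not a real contribution.

```python
import io
import zipfile


def misnamed_members(zip_file, doc_id):
    """Return the archive members whose filename does not start with doc_id."""
    with zipfile.ZipFile(zip_file) as archive:
        return [
            name
            for name in archive.namelist()
            if not name.endswith("/")  # skip directory entries
            and not name.rsplit("/", 1)[-1].startswith(doc_id)
        ]


# Build a small in-memory archive to demonstrate the check.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr("JVT-Wxxx.doc", "main document")
    archive.writestr("JVT-Wxxx_presentation.pdf", "slides")
    archive.writestr("results1.xls", "spreadsheet")  # violates the convention
buffer.seek(0)

print(misnamed_members(buffer, "JVT-Wxxx"))  # prints ['results1.xls']
```

An empty result means every file would still be identifiable after being taken out of the zip container.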
When providing additional or revised files, do not include copies of files that were already
included in the prior .zip archive for the same contribution and do not re-use the same filenames
without adding revision numbers (r1, r2, etc.) – this saves us needing to worry about whether the
files we get with the same filenames are the same or different.
Independent verification (necessary for adoption of a proposal) is provided either through
a) independent implementation by 1 or more companies other than the proponent, based on
the textual description (after adoption, both decoder source code versions must be made
publicly available and one encoder version), or
b) providing source code to all CE participants prior to the meeting (CEs can only be joined
at the meeting, when the CE is created. CEs are created at each meeting and last until the
next meeting.)
Simply running binary executables provided by a proponent is not ordinarily considered
independent verification. Source code should be provided and used, and the verifying party
should invest a proper degree of effort to ensure that the “verification” they perform is a
meaningful and professional study with significant depth rather than just a perfunctory procedural
formality.
For every SEI message and every syntax element that are currently in the SVC/MVC draft, a
showcase has to be provided in order to retain it in the JSVM/JMVM/JD. If such a showcase is
not provided at the next meeting for an SEI message or parts of it, the SEI message or the
respective parts will be removed from the JSVM/JMVM/JD. The source code and executables for
the showcase must be made available.
A first CE description should be available on the last day of the meeting. Changes to the CE
description are only allowed until 3 weeks prior to the next meeting. These changes must be
evolutionary in nature relative to the input documents on which the CE is based and must be
agreed by those who contributed the respective input document(s) or be added as an option.
Contributions that are proposals of new technology that was not what was described as being
tested in a CE (even if related to the tested technology) should not indicate that they are CE
documents in their title and abstract.
12 List of adoptions
This section of the report lists adoption actions by the JVT at this meeting in condensed form.
All items noted in this section should be redundant with actions noted elsewhere in this report.
Where listed, the person listed in brackets is responsible for provision of text and software
integration.
12.1 SVC normative adoptions into JD
Adoption actions are listed as follows:
– FGS: JVT-W090.
– IntraBL treated as inter for constrained intra pred (see notes on JVT-W090).
– Intra MBs in base layer not exceeding IntraBL by more than 1.5 (see notes on JVT-W090).
– ESS improvement: JVT-W030.
– Interlaced restrictions: JVT-W025.
– Remove SRP (see notes on JVT-W026 and JVT-W118).
– De-blocking JVT-W063r1.
– Header re-writing JVT-W046.
– Inheritance of deblocking control (see notes on JVT-W046).
– Pictures not for output JVT-W047.
– Various items JVT-W048.
– Profile & bit rate indicators (subset of JVT-W051).
– Integrity check JVT-W052.
– MBs required for picture only for QID = 0 (see notes JVT-W052).
– Quality layer SEI syntax JVT-W137.
– Priority ID JVT-W053r2.
– Seven restriction indicators in scalability info SEI (see notes on JVT-W064).
– Various HL syntax issues (see notes on JVT-W125).
– Redundant pictures into profile A & SEI messages (JVT-W049).
– SEI message tl0_pic_idx (sec. 3.3 of JVT-W062r3).
– Profile changes as recorded in profiles section.
– Change to scaling in position calc for large pictures (see notes on JVT-W136).
12.2 SVC normative adoptions into JSVM
Adoption actions are listed as follows:
– FGS modifications JVT-W119.
– FGS modifications JVT-W121r1 (which combines elements of JVT-W111 and JVT-W121).
– Dyadic subband coding method JVT-W097.
12.3 SVC non-normative adoptions
Adoption actions are listed as follows:
– Encoder problem detection trick from JVT-W105.
– Rate control JVT-W043
12.4 SVC software adoptions
No particular adoption actions noted.
12.5 MVC normative JD adoptions
Adoption actions are listed as follows:
– Various high-level syntax changes JVT-W035.
– Signal views to be output JVT-W036.
– View scalable SEI JVT-W037.
– nal_ref_idc_view JVT-W056.
– Reference picture list reordering bug fix JVT-W066.
– Parallel decoding SEI syntax modifications (as presented in JVT-W080).
– Camera parameters & max disparity JVT-W060.
– Restriction of temporal direct and weighted prediction (see notes on JVT-W040).
12.6 MVC JMVM adoptions
Adoption actions are listed as follows:
– Deblocking filter control JVT-W024.
– Illumination compensation info derivation JVT-W031.
– MVC Motion skip mode JVT-W081 as recorded in JVT-W139.
12.7 MVC non-normative adoptions
No particular adoption actions noted.
12.8 JM non-normative adoptions
Adoption actions are listed as follows:
– JM manual JVT-W041
– JM rate control JVT-W042
– JM software cleanup JVT-W044
– JM algorithm text description JVT-W057
12.9 Other normative adoptions
No particular adoption actions noted.
12.10 Other non-normative adoptions
No particular adoption actions noted.
13 Software integration plan
Delegated to the software coordinators.
14 SVC conformance work plan
The Hangzhou meeting report recorded the following: “The following companies each announce
to provide at least 10 conformance bitstreams for SVC: HHI, Sharp, Thomson, RWTH (maybe),
Nokia (potentially), Orange, Microsoft, Qualcomm.”
These parties were not present on Tuesday morning. It was asserted that a conformance
workplan working draft (WD) needed urgently to be set up by end of week. Progress was later
reported from a break-out group activity as recorded in JVT-W138.
15 SVC verification test plan
Action items noted during the meeting:
– Viewing of available material
– Clarify situation about the completeness of the test material for SVC compression
performance
– demonstrating potential prototype applications of SVC (e.g. showing advantage of scalability
in streaming)
Report of breakout work was presented:
– Review of JVT-W131: Bitrate SNR may be a bit too high; ratio 3:1 Enh:Bas better than 2:1,
eventually include up to HD
– Bandwidth fluctuation scenario may eventually not be too useful without FGS (or would
need implementation of concealment which might be difficult)
– Profile B: Broadcast SD & HD
– Scalability between 1080i and 1080p might also be a convincing scenario
16 List of AHGs established
The following JVT “ad hoc groups” (AHGs) were established to progress work on identified
topics until the next meeting of the JVT.
16.1 JVT project management and errata reporting
Discussion: jvt-experts@lists.rwth-aachen.de
Chair: Gary Sullivan, Jens Rainer Ohm, Ajay Luthra, and Thomas Wiegand
Mandates:
– Collect errata reports on standards under management of JVT
– Coordinate overall interim JVT progress
– Prepare status information for JVT status reporting
16.2 JM Text, reference software, bitstream exchange and conformance
Discussion: jvt-experts@lists.rwth-aachen.de
Chair: Thomas Wiegand, Karsten Sühring, Alexis Tourapis, Teruhiko Suzuki, Keng Pang Lim
Mandates:
– Maintain and update JM algorithm description text
– Maintain and update JM reference software and its usage manual
– Facilitate exchange of test bitstreams to aid interoperability testing
– Collect bitstreams for inclusion in Conformance specifications
– Identify and correct problems in Conformance specifications and associated bitstreams
16.3 AVC professional applications
Discussion: jvt-experts@lists.rwth-aachen.de
Chair: Teruhiko Suzuki
Mandates:
– Finalize software for new professional profiles
– Collect bitstreams for Conformance specification update for new prof profiles
16.4 SVC JD and JSVM text, software and conformance
Discussion: jvt-svc@lists.rwth-aachen.de
Chair: Heiko Schwarz, Jérome Vieron, Thomas Wiegand, Mathias Wien, Alex Eleftheriadis,
Vincent Bottreau
Mandates:
– Edit and deliver JD and JSVM text
– Coordinate JSVM software integration
– Coordinate bug-fixing process for the JSVM software
– Maintain JSVM software manual
– Plan, edit, and collect bitstreams for SVC conformance specification
16.5 SVC bit depth and chroma format scalability
Discussion: jvt-svc@lists.rwth-aachen.de
Chair: Yongying Gao, Andrew Segall, Thomas Wiegand
Mandates:
– Identify applications
– Work out suggestions for detailed needs
– Find/create test material
– Study bit-depth reduction techniques, e.g., tone-mapping tools
– Study color space and/or gamma conversion requirements
– Study combined spatial and bit depth scalability
– Define experiments and test conditions
– Investigate software and text modification needs
– Identify complexity issues
16.6 SVC FGS applications and design simplification
Discussion: jvt-svc@lists.rwth-aachen.de
Chair: Justin Ridge, Marta Karczewicz
Mandates:
– Identify applications for FGS and their characteristics
– Define experiments and test conditions relating to FGS technology
– Explore simplification of FGS design
16.7 MVC high-level syntax and buffer management
Discussion: jvt-mvc@lists.rwth-aachen.de
Chair: Anthony Vetro, Purvin Pandit
Mandates:
– Discuss high-level syntax for MVC including NAL unit type, NAL unit header extension,
SPS extensions, slice layer and integration with SVC syntax.
– Discuss reference picture management to enable simultaneous picture output of different
views and to facilitate parallel processing.
– Discuss issues related to HRD.
– Propose refined syntax and decoding processes for JMVM.
16.8 MVC JD and JMVM text and software
Discussion: jvt-mvc@lists.rwth-aachen.de
Chair: Hideaki Kimata, Aljoscha Smolic, Purvin Pandit, Anthony Vetro, Chen Ying
Mandates:
– Collect comments on draft, perform necessary editing and delivery.
– Maintain JMVM and JD document and collect comments on the text.
– Coordinate JMVM software integration
– Coordinate bug-fixing process for the JMVM software
– Maintain JMVM software manual
16.9 MVC experimental framework and testing conditions
Discussion: jvt-mvc@lists.rwth-aachen.de
Chair: Hideaki Kimata, Aljoscha Smolic
Mandates:
– Evaluate application needs in MVC framework
– Discuss testing conditions to evaluate specific application needs
– Consider needs for new tools to be evaluated
16.10 MVC solutions using existing AVC decoders
Discussion: jvt-mvc@lists.rwth-aachen.de
Chair: Purvin Pandit
Mandates:
– Collect comments on methods for enabling AVC decoding of multiview video
(spatial/temporal/others)
– Study the complexity of such methods
– Investigate the applications enabled
16.11 MVC reduced resolution update, downsampled reference and adaptive
reference filtering
Discussion: jvt-mvc@lists.rwth-aachen.de
Chair: Purvin Pandit, Hideaki Kimata
Mandates:
– Investigate approaches for enhancing MVC coding efficiency using spatial downsampling
– Evaluate the complexity of such methods
– Investigate the relationship between downsampling approaches and view interpolation
– Evaluate subjective quality associated with methods
– Study the complexity associated with adaptive reference filtering
– Evaluate performance of adaptive reference filtering under JMVM common conditions
17 Resolutions conveyed to MPEG parent body
The JVT approved the following resolutions for conveyance to its MPEG (WG 11) parent body.
17.1 Resolutions relating to ISO/IEC 14496-4
17.1.1 The JVT and the video subgroup recommend to approve the following documents
No.              Title                                                   TBP  Available
                 14496-4 Conformance testing
8954             Request for ISO/IEC 14496-4:2004/Amd.30                 No   07/04/27
8955 [JVT-W204]  Working Draft 1 of ISO/IEC 14496-4:2004/Amd.30          No   07/06/29
                 Conformance testing for new profiles for professional
                 applications
8956             Request for ISO/IEC 14496-4:2004/Amd.31                 No   07/04/27
8957 [JVT-W205]  Working Draft 1 of ISO/IEC 14496-4:2004/Amd.31          No   07/06/29
                 Conformance testing for SVC profiles
17.1.2 The JVT and the video subgroup thank the following companies for their
commitment to provide conformance testing streams for ISO/IEC
14496-4:2004/Amd.30: Mitsubishi Electric Corp., Panasonic, Sejong University,
Thomson.
17.1.3 The JVT and the video subgroup thank the following companies for their
commitment to provide conformance testing streams for ISO/IEC
14496-4:2004/Amd.31: ETRI, FhG-HHI, France Telecom/Orange, Layered Media,
Sharp, Thomson.
17.2 Resolutions relating to ISO/IEC 14496-5
17.2.1 The JVT and the video subgroup recommend to approve the following documents
No.              Title                                                   TBP  Available
                 14496-5 Reference Software
8958             Request for ISO/IEC 14496-5:2001/Amd.18                 No   07/04/27
8959 [JVT-W206]  Working Draft 1 of ISO/IEC 14496-5:2001/Amd.18          No   07/06/29
                 Reference software for new profiles for professional
                 applications
8960             Request for ISO/IEC 14496-5:2001/Amd.19                 No   07/04/27
8961 [JVT-W211]  Working Draft 1 of ISO/IEC 14496-5:2001/Amd.19          No   07/06/29
                 Reference software for SVC
17.3 Resolutions relating to ISO/IEC 14496-10
17.3.1 The JVT and the video subgroup recommend to approve the following documents
No.              Title                                                   TBP  Available
                 14496-10 Advanced Video Coding
8962 [JVT-W201]  Study Text (version 3) of ISO/IEC 14496-10:2005/FPDAM3  No   07/05/31
                 Scalable video coding
8963 [JVT-W202]  Joint scalable video model (JSVM) 10                    No   07/05/31
8964 [JVT-W203]  JSVM 10 software                                        No   07/06/29
8965 [JVT-W212]  Draft SVC verification test plan version 3.0            No   07/05/18
8966 [JVT-W209]  Working Draft 3 of ISO/IEC 14496-10:2005/Amd.4          No   07/05/18
                 Multiview video coding
8967 [JVT-W207]  Joint multiview video model (JMVM) 4                    No   07/05/18
8968 [JVT-W208]  JMVM 4 software                                         No   07/05/31
17.3.2 The JVT and the video subgroup request WG 11 National Bodies to kindly
consider the SVC Study Document N8962 [JVT-W201] and if necessary provide
additional comments by the July 2007 meeting.
17.4 Resolutions relating to future meeting scheduling
17.4.1 The JVT chairmen propose to hold a JVT meeting during June 29 through July 6,
2007 under the auspices of the meeting of ITU-T SG 16 in Geneva, CH. Further
meetings are proposed to be held during October 19-26, 2007 under WG 11
auspices in Shenzhen, CN, and during January 11-18, 2008 under WG 11 auspices
in Antalya, TR.
17.5 Resolutions relating to ad hoc group activities
17.5.1 The JVT provides the following list of JVT ad hoc groups appointed to progress
work in the interim period until the next JVT meeting:
Title and Email Reflector                            Chairs                             Mtg
JVT project management and errata reporting          Gary Sullivan, Jens Rainer Ohm,    N
(jvt-experts@lists.rwth-aachen.de)                   Ajay Luthra, and Thomas Wiegand
JM Text, reference software, bitstream exchange and  Thomas Wiegand, Karsten Sühring,   N
conformance                                          Alexis Tourapis, Teruhiko Suzuki,
(jvt-experts@lists.rwth-aachen.de)                   Keng Pang Lim
AVC professional applications                        Teruhiko Suzuki                    N
(jvt-experts@lists.rwth-aachen.de)
SVC JD and JSVM text, software and conformance       Heiko Schwarz, Jérome Vieron,      N
(jvt-svc@lists.rwth-aachen.de)                       Thomas Wiegand, Mathias Wien,
                                                     Alex Eleftheriadis, Vincent Bottreau
SVC bit depth and chroma format scalability          Yongying Gao, Andrew Segall,       N
(jvt-svc@lists.rwth-aachen.de)                       Thomas Wiegand
SVC FGS applications and design simplification       Justin Ridge, Marta Karczewicz     N
(jvt-svc@lists.rwth-aachen.de)
MVC high-level syntax and buffer management          Anthony Vetro, Purvin Pandit       N
(jvt-mvc@lists.rwth-aachen.de)
MVC JD and JMVM text and software                    Hideaki Kimata, Aljoscha Smolic,   N
(jvt-mvc@lists.rwth-aachen.de)                       Purvin Pandit, Anthony Vetro,
                                                     Chen Ying
MVC experimental framework and testing conditions    Hideaki Kimata, Aljoscha Smolic    N
(jvt-mvc@lists.rwth-aachen.de)
MVC solutions using existing AVC decoders            Purvin Pandit                      N
(jvt-mvc@lists.rwth-aachen.de)
MVC reduced resolution update, downsampled           Purvin Pandit, Hideaki Kimata      N
reference and adaptive reference filtering
(jvt-mvc@lists.rwth-aachen.de)
18 Attendance
Persons registered to attend the meeting, as recorded by a sign-in sheet circulated during the
meeting, were the following (185 listed participants):
1) Alvarez, José Roberto (Mobilygen)
2) Amon, Peter (Siemens AG)
3) Bandoh, Yukihiro (NTT)
4) Bao, Yiliang (Qualcomm)
5) Baik, Sung Uk (Oniontech)
6) Bivolarski, Lazar (Brightscale)
7) Bjøntegaard, Gisle (Tandberg)
8) Borgwardt, Peter (Motorola)
9) Bottreau, Vincent (Thomson R&D France)
10) Bourge, Arnaud (Philips / NXP)
11) Branguolo, Sebastien (SSM)
12) Bruls, Fons (Philips)
13) Cammas, Nathalie (Orange - France Telecom.)
14) Chen, Lulin (Omneon Video Networks USA)
15) Chen, Quqing (Thomson)
16) Chen, Weizhong (Huawei Tech.)
17) Chen, Ying (Tampere Univ. Tech.)
18) Cheong, Hye-Yeon (Univ. Southern California)
19) Chiu, Yi-Jen (Intel)
20) Choi, Byeongho (KETI)
21) Choi, Hae-Chul (ETRI)
22) Choi, Jongbum (Samsung)
23) Choi, Woongil (Samsung AIT)
24) Chujoh, Takeshi (Toshiba)
25) Chung, Hyukjune (Qualcomm)
26) Cieplinski, Leszek (Mitsubishi Electric)
27) Civanlar, M. Reha (DoCoMo Labs USA)
28) Cock, Jan De (Ghent Univ.)
29) Cornog, Katie (Avid)
30) Coté, Guy (Mobilygen)
31) Divorra, Òscar (Thomson)
32) Eleftheriadis, Alex (Layered Media)
33) Fröjdh, Per (Ericsson)
34) Fujii, Toshiaki (Nagoya Univ.)
35) Gao, Yongying (Thomson)
36) Gallant, Michael (LSI Logic, Canada)
37) Goh, Kwong Hueng (Inst. for Infocomm Research)
38) Guleryuz, Onur (Docomo USA Labs)
39) Han, Woo-Jin (Samsung)
40) Hannuksela, Miska (Nokia)
41) Harmani, Oztan (DoCoMo USA Labs)
42) Haskell, Barry (Apple)
43) He, Jones (Freescale)
44) Hinds, Arianne (IBM)
45) Ho, Yo-Sung (GIST)
46) Hong, Danny (Layered Media)
47) Hsiang, Shih-Ta (Motorola)
48) Huang, Wei-Hung (MediaTek)
49) Huang, Yu-Wen (MediaTek)
50) Huo, Junyan (Xidian Univ.)
51) Ishtiaq, Faisal (Motorola)
52) Itoh, Takashi (Fujitsu Labs)
53) Jeon, Byeong-Moon (LG Electronics)
54) Jeon, Byeungwoo (SKKU)
55) Jeon, Yongjoon (LG Electronics)
56) Jia, Jie (Sejong Univ.)
57) Jung, Bongsoo (SKKU)
58) Jung, Joël (France Telecom R&D)
59) Kang, Jung Won (ETRI)
60) Kanumuri, Sandeep (NTT DoCoMo USA Labs)
61) Karczewicz, Marta (Qualcomm)
62) Kim, Dongkyun (Sejong Univ.)
63) Kim, Hae Kwang (Sejong Univ.)
64) Kim, Hyun Mun (Samsung AIT)
65) Kim, Jae Hoon (Univ. Southern California)
66) Kim, Jinwoong (ETRI)
67) Kim, Jong Lak (DSP Group)
68) Kim, So Young (Samsung Electronics)
69) Kim, Yong-Hwan (KETI)
70) Kimata, Hideaki (NTT)
71) Kimoto, Takahiro (NEC)
72) Koo, Han-Suh (LG Electronics)
73) Kopansky, Arkady (Sarnoff)
74) Lainema, Jani (Nokia)
75) Lee, Sang-Heon (Seoul Natl. Univ.)
76) Lee, Sang-Houn (DSP Group)
77) Lee, Yung Ki (Sejong Univ.)
78) Lee, Yung-Lyul (Sejong Univ.)
79) Lei, Shawmin (Sharp Labs USA --> MediaTek)
80) Leontaris, Athanasios (Dolby)
81) Li, Zhengguo (I2R)
82) Lim, Chong Soon (Panasonic)
83) Lim, Sung Chang (Sejong Univ.)
84) Lin, Sixin (Huawei)
85) Lu, Ning (Intel)
86) Luo, Jiancong (Thomson)
87) Luthra, Ajay (Motorola)
88) Masashi, Takahashi (Hitachi)
89) Matsubara, Akio (Ricoh)
90) McCartley, Sean (Modulus Video)
91) Meany, James (Boeing)
92) Müller, Karsten (Fraunhofer HHI)
93) Naito, Sei (KDDI)
94) Nakamura, Hiroya (JVC)
95) Narasimhan, Sam (Motorola)
96) Ndili, Obianuju (Santa Clara Univ.)
97) Nilsson, Mike (BT)
98) Nishi, Takashi (Oki Electric Industry)
99) Ogunfunmi, Tokunbo (Santa Clara Univ.)
100) Oh, Kwan-Jung (GIST)
101) Ohm, Jens-Rainer (RWTH Aachen Univ.)
102) Onno, Patrice (Canon France)
103) Pandit, Purvin (Thomson)
104) Park, Ji Ho (KETI)
105) Park, Min-woo (Kyung Hee Univ.)
106) Park, Seanae (Kwangwoon Univ.)
107) Park, Seung-Wook (LG Electronics)
108) Pateux, Stephane (Orange - France Telecom)
109) Peng, Wen Hsiao (Samsung AIT)
110) Pereira, Fernando (IST)
111) Prieto, Yolanda (Freescale)
112) Ransburg, Michael (Klagenfurt Univ.)
113) Rathgen, Thomas (Ilmenau Univ.)
114) Regunathan, Shankar (Microsoft)
115) Reznik, Yuriy (Qualcomm)
116) Ridge, Justin (Nokia)
117) Rault, Patrick (Quartics)
118) Rodriguez, Arturo (Scientific Atlanta / Cisco)
119) Sakazume, Satoru (JVC)
120) Sampedro, Jesus (Polycom)
121) Sato, Kazushi (Sony)
122) Schwarz, Heiko (Fraunhofer HHI)
123) Schierl, Thomas (Fraunhofer HHI)
124) Segall, Andrew (Sharp Labs USA)
125) Sekiguchi, Shun-ichi (Mitsubishi)
126) Senoh, Takanori (Univ. Tokyo)
127) Seo, Chang-Won (Sejong Univ.)
128) Seo, Juheon (Sejong Univ.)
129) Seo, Jungdong (Yonsei Univ.)
130) Shi, Xiaojin (Apple)
131) Shim, Woo-Sung (Samsung Electronics)
132) Shimizu, Shinya (NTT)
133) Shiodera, Taichiro (Toshiba)
134) Sim, Donggyu (Kwangwoon Univ.)
135) Sjöberg, Rickard (Ericsson)
136) Smolić, Aljoscha (Fraunhofer HHI)
137) Su, Yeping (Thomson USA --> Sharp USA)
138) Suh, Doug Young (KHU)
139) Suh, Jong-Yeul (LG Electronics)
140) Sullivan, Gary (Microsoft Corp.)
141) Sun, Huifang (Mitsubishi)
142) Suzuki, Teruhiko (Sony)
143) Takamura, Seishi (NTT)
144) Tam, James (CRC, Canada)
145) Tan, Thiow Keng (NTT DoCoMo)
146) Tanimoto, Masayuki (Nagoya Univ.)
147) Tanizawa, Akiyuki (Toshiba)
148) Thoma, Herbert (Fraunhofer IIS)
149) Tian, Dong (Thomson)
150) Timmerer, Christian (Klagenfurt Univ.)
151) Topiwala, Pankaj (FastVDO)
152) Tourapis, Alexandros (Dolby Labs)
153) Tung, Yi-Shin (Setabox Tech. Corp.)
154) Ugur, Kemal (Nokia)
155) Van de Walle, Rik (Ghent Univ.)
156) Vetro, Anthony (Mitsubishi Electric)
157) Vieron, Jerome (Thomson R&D France)
158) Viscito, Eric (eV Consulting)
159) Wan, Wade (Broadcom)
160) Wang, Haohong (Marwell)
161) Wang, Xianglin (Nokia)
162) Wang, Yong (Motorola)
163) Watanabe, Hitoshi (Qpixel)
164) Wedi, Thomas (Panasonic)
165) Wiegand, Thomas (Fraunhofer HHI)
166) Wien, Mathias (RWTH Aachen Univ.)
167) Wittmann, Steffen (Panasonic)
168) Wu, Ping (Tandberg Television)
169) Wus, John (Panasonic)
170) Xiong, Lianhuan (Huawei)
171) Xu, Xiaozhong (Tsinghua Univ.)
172) Yagasaki, Yoichi (Sony)
173) Yamamoto, Tomoyuki (Sharp)
174) Yamasaki, Takahiro (Oki Electric Industry)
175) Yang, Haitao (Xidian Univ.)
176) Yang, Jeong-Hyu (LG Electronics)
177) Yang, Jungyoup (SKKU)
178) Yang, Ping (Tsinghua Univ.)
179) Yao, Wei (I2R)
180) Ye, Yan (Qualcomm)
181) Yoo, Jeong-Ju (ETRI)
182) Yu, Haoping (Thomson)
183) Yu, Lu (Zhejiang Univ.)
184) Zhang, Liang (CRC, Canada)
185) Zheng, Jianhua (Huawei)
Annex J – Audio report
Source: Schuyler Quackenbush, Chair, Audio Subgroup
1 Opening of the meeting
2 Administrative matters
2.1 Approval of previous meeting report
2.2 Approval of agenda and allocation of contributions
2.3 Task Groups
2.4 Communications from the Chair
2.5 Joint meetings
2.6 Received National Body Comments and Liaison matters
3 Record of AhG meetings
4 Audio plenary, joint meeting and task group activities
4.1 Review of AHG reports
4.2 Received national body comments and liaison matters
4.3 Joint Meetings
4.3.1 Systems at Audio on MP4FF and Sampling Rate
4.4 Task Group discussions
4.4.1 MPEG Surround
4.4.2 SAOC
4.4.3 MPEG-4 ELD
4.4.4 Speech and Audio Exploration
4.4.5 Symbolic Music Representation
4.4.6 MPEG-1, MPEG-2 and MPEG-4 audio, conformance, reference software
5 Meeting deliverables
5.1 Recommendations for final plenary
5.2 Establishment of Ad-hoc Groups
5.3 Approval of output documents
5.4 Responses to Liaison and NB comments
5.5 Press statement
6 Future activities
6.1 Schedule of future meetings
6.2 Agenda for next meeting
6.3 All other business
6.4 Closing of the meeting
Annex A Participants
Annex B Audio Contributions and Schedule
Annex C Task Groups
Annex D Output Documents
Annex E Agenda for the 81st MPEG Audio Meeting
1 Opening of the meeting
The MPEG Audio Subgroup meeting was held during the 80th meeting of WG11, April 23-27,
2007, San Jose, CA, USA. The list of participants is given in Annex A.
2 Administrative matters
2.1 Approval of previous meeting report
The 79th Audio Subgroup meeting report was registered as a contribution, and was approved.
2.2 Approval of agenda and allocation of contributions
The agenda and schedule for the meeting were discussed, edited and approved. The agenda shows the
documents contributed to this meeting and presented to the Audio Subgroup, either in the task
groups or in Audio plenary. The Chair brought relevant documents from Requirements, Systems
and MDS to the attention of the group. The agenda was revised in the course of the week to reflect the
progress of the meeting, and the final version is shown in Annex B.
2.3 Task Groups
Task groups were convened for the duration of the MPEG meeting, as shown in Annex C.
Results of task group activities are reported below.
2.4 Communications from the Chair
The Chair summarised the issues raised at the Sunday evening Chair’s meeting, proposed task groups for the week, and proposed agenda items for
discussion in Audio plenary.
2.5 Joint meetings
The joint meetings with Audio over the course of the week are listed here and are reported on
below.
Groups                    What                                        Where         Day  Time
Systems, Audio            14529, MP4 file format considerations       Audio         Wed  1130-1200
                          for high sample-rate audio
Requirements, MDS, Audio  14411, WD Professional Archival MAF         Requirements  Tue  1400-1800
                          14430, Comments on Prof. Archival MAF
                          Other topics in MAF under consideration.
2.6 Received National Body Comments and Liaison matters
The NB Comments and Liaison documents for the meeting that require a response are as shown
below.
No.    Title                                                      Response by
14313  IEC CDV 61937-3 [SC 29 N 8263]                             None required.
14331  Liaison Statement from ETSI TC DECT to ITU-T SG 12         S. Quackenbush
       and ETSI TC STQ
14354  Liaison Statement from ITU-T SG 16 [SC 29 N 8324]          None required.
3 Record of AhG meetings
There were no AhG meetings prior to the 80th MPEG meeting.
4 Audio plenary, joint meeting and task group activities
4.1 Review of AHG reports
There were no requests to review any of the AHG reports.
4.2 Received national body comments and liaison matters
Liaison documents were reviewed and the drafting of the responses was delegated.
4.3 Joint Meetings
4.3.1 Systems at Audio on MP4FF and Sampling Rate
David Singer, Apple, presented m14529, MP4 file format considerations for high sample-rate
audio. After some discussion and further investigation done via email, it appears that all items
with sampling rates greater than 2^16 -1 (65535) are written with the target value modulo 2^16.
This error will be discussed during the AhG period.
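The wrap-around described above is what one would expect if the sampling rate is written into a 16-bit unsigned field. The following is a minimal sketch under that assumption; the function names and the example rates are illustrative and are not taken from m14529 or from the file format specification.

```python
# Illustrative only: models a sampling rate written into a 16-bit unsigned
# field, which keeps just the value modulo 2^16. The names below are
# hypothetical and do not come from the MP4 file format specification.
FIELD_MODULUS = 2 ** 16  # 65536


def stored_sampling_rate(rate_hz: int) -> int:
    """Return the value that survives when rate_hz is truncated to 16 bits."""
    return rate_hz % FIELD_MODULUS


def is_representable(rate_hz: int) -> bool:
    """True if rate_hz fits in the 16-bit field without wrapping."""
    return 0 <= rate_hz < FIELD_MODULUS


for rate in (44100, 48000, 88200, 96000, 192000):
    print(rate, stored_sampling_rate(rate), is_representable(rate))
```

Under this assumption, rates up to 65535 Hz are stored unchanged, while 96000 Hz would be written as 30464 and 88200 Hz as 22664.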
4.4 Task Group discussions
4.4.1 MPEG Surround
Kristofer Kjörling, Coding Technologies, presented
14453  Kristofer Kjörling, Jonas Rödén, Jeroen Koppens, Erik Schuijers, Jeroen Breebaart
       Proposed draft corrigendum for MPEG Surround
This contribution presented corrections that make up a proposed corrigendum. The corrections fall into
two categories: proposed changes to the Enhanced Matrix Mode that result in a change in
the decoded output, and changes that have no impact on the decoded output.
Enhanced Matrix Mode
It is proposed to change the specification so that the parameters derived from the downmix in
EMM are quantized, thus permitting further processing via table look up.
Other
There are a number of editorial corrections and corrections in which the text must be changed to
agree with the implementation software. The Chair noted that some of these changes do affect the
bitstream syntax, but Audio Subgroup experts felt very strongly that there was no risk to fielded
devices. Another correction relates to HRTF processing, which is exposed when non-symmetric
HRTFs are used. The change is technically well-motivated.
It is proposed that these changes be issued as a “Proposed Changes to MPEG Surround,” and
possibly be issued as a DCOR at the next meeting. It was the consensus of the Audio Subgroup to
incorporate all of these changes into the output document.
Heiko Purnhagen, Coding Technologies, presented
14499  Johannes Hilpert, Sascha Disch, Heiko Purnhagen, Werner Oomen
       Proposed MPEG Surround Level Enhancement
This contribution proposes the new capability of decoding of a 7.1 channel bitstream that uses a
7-2-7 structure to a 5.1 channel output, and also proposes a new level to explicitly support this
case in the MPEG Surround Profile. The proposal requires some changes in the decoding
specification in addition to new text describing the profile.
The Audio Chair confirmed that this has been implemented in source code and there have been
informal listening tests to check the implementation.
It was the consensus of the Audio Subgroup to incorporate the MPEG-D changes into the
“Proposed Changes to MPEG Surround” document, and the definition of additional values of
MPEG Audio profile and level into an open amendment (either 8 or 9) to MPEG-4 Audio.
Heiko Purnhagen, Coding Technologies, presented
14504  Heiko Purnhagen, Andreas Schneider, Frans de Bont, Karsten Linzmeier, Ralph Sperschneider
       Proposed Updates for MPEG Surround Conformance
This contribution presents a new version of the MPEG Surround conformance document that
contains the following changes and new information:
• Editorial changes that account for the fact that Conformance is an amendment to MPEG
  Surround and not a new part.
• Specification of bitstream syntax restrictions
• Specification of decoder conformance procedure
• Definition of sequences. It was noted that these sequences exist. The Chair urged the
  authors to make these sequences available on an FTP site whose
  ftp/username/password could be publicized in an MPEG document.
4.4.2 SAOC
Hee-Suk Pang, LGE, presented
14422  Hyun-Kook Lee, Hee-Suk Pang, Dong Soo Kim, Sung-Yong Yoon, Henney Oh, Yang-Won Jung
       Report on the SAOC test material provided by LGE
This contribution described three proposed test items that might be used for test c). Their
characteristics are summarized here:
Item  Number of Objects  Number of Rendering Matrices
1     10                 6
2     9                  4
3     13                 4
In every case both a mono and stereo downmix are provided.
Oliver Hellmuth, FhG, presented
14441  Oliver Hellmuth, Juergen Herre, Thorsten Kastner
       Proposed SAOC test items provided by Fraunhofer IIS
This contribution proposes items for SAOC tests a) and c) and for the Stream Combination test,
but not for b). Specifically:
• 5 items for each of tests a) and c)
• items for Stream Combination
• downmix matrices
• rendering matrices
Jeroen Breebaart, Philips, presented
14464  Jeroen Breebaart, Werner Oomen
       Proposed SAOC test items provided by Philips
This contribution proposes three items for SAOC binaural test b). They are “inside,” “telco” and
“pop.” Each uses the KEMAR HRTFs and can be rendered in a very flexible way using a Matlab
script. This permits setting level and position parameters, and these factors can also change
dynamically. Two “scenes” for each test item are also provided, consisting of specific downmix
and rendering matrices.
Heiko Purnhagen, Coding Technologies, presented
14488  Jonas Engdegård, Barbara Resch
       Description of SAOC test items provided by Coding Technologies
This contribution describes four sets of objects which may apply to the listening tests as shown
here:
Item nr.  Test item       Playback configuration and   Rendering cases
                          downmix specification
1         Black Coffee    a) R:5.1 / D:Stereo          III. Complex (5 cases)
2         HammerOrgan     a) R:5.1 / D:Stereo          I. Att./Ampl (2 cases)
3         HammerOrgan     a) R:5.1 / D:Stereo          I. Att./Ampl (2 cases)
4         VoiceOverMusic  c) R: Stereo / D:Stereo      I. Att./Ampl
5         VoiceOverMusic  c) R: Stereo / D:Stereo      I. Att./Ampl
6         Karaoke         c) R: Stereo / D:Stereo      I. Att./Ampl
7         Karaoke         c) R: Stereo / D:Stereo      I. Att./Ampl
Jeongil Seo, ETRI, presented
14540  Seungkwon Beack, Jeongil Seo, Taejin Lee, Kyungok Kang
       Information on SAOC test items by ETRI
This contribution describes 2 candidate test items. Each has a 5.1 channel background scene
object and a monaural vocal object. The items can be applied to test a) subtests II) and III) and test
b) subtests II) and III).
Schuyler Quackenbush, Audio Research Labs, presented
14315  Schuyler Quackenbush
       Spatial Audio Object Coding Evaluation Procedures and Criterion
This contribution is an output of the AhG on SAOC Call for Proposals. It has extensive editorial
changes that improve English language usage and general organization and presentation of
information. However it has yellow highlighted “to be discussed” text in several locations. These
were reviewed and will be discussed later in the week.
SAOC Material Selection Task
On Tuesday afternoon, interested experts attended a listening task group at Apple. After a
preliminary selection that day, and later in the week further listening for the binaural items via
headphones, a final selection was made as shown in the table found in the following document:
9099 Final Spatial Audio Object Coding Evaluation Procedures and Criterion
4.4.3 MPEG-4 ELD
Block Switching CE
All of these contributions assessed the performance of the following systems:
FhG AAC-ELD no block switching coded at 32 kb/s
FT AAC-ELD no block switching coded at 32 kb/s
FT AAC-ELD BS with block switching coded at 32 kb/s
The tests were done for two sets of signals, the first set containing transient material and the
second having no transients (such that AAC-ELD BS never triggered a block switch).
Werner Oomen, Philips, presented
14465  Erik Schuijers, Werner Oomen
       Crosscheck FT enhanced LD AAC core experiment
The listening test showed that the performance of the systems under test was not different at the
95% level of significance.
Markus Schnell, FhG, presented
14515  Markus Schmidt, Ralf Geiger, Markus Schnell
       Cross-check report on Proposed FT Core Experiment for AAC-ELD
In both tests the performance of the systems under test was not different at the 95% level of
significance.
Henney Oh, LGE, presented
14530  Henney Oh, Yang-Won Jung, Hyo Jin Kim, Chang-Heon Lee, Hong-Goo Kang
       Cross-check report on proposed FT Core Experiment for AAC-ELD
The performance of the systems under test was not different at the 95% level of significance.
Pierrick Philippe, France Telecom, presented
14519  Catherine Colomes, Pierrick Philippe, David Virette
       Listening test results on instantaneous block switching CE for AAC ELD
This contribution presented the listening test using first the 7 items that invoke block switching. It
reports that for one item, si02, FT AAC-ELD BS (with block switching) had statistically better
performance at the 95% level of significance.
Pierrick Philippe, France Telecom, presented
14520  Pierrick Philippe, David Virette
       Updated description for AAC ELD instantaneous block switching CE
The contribution provided additional technical details on the operation of AAC-ELD with block
switching. It explained how the aliasing cancellation is obtained in the context of the AAC-ELD
architecture, that is, both MDCT and QMF filters. The block switching introduces some slight
increase in complexity, but anecdotal evidence suggests that block switching reduces the activity
of TNS.
Additional information was supplied, namely the listening test results pooled across all test
sites doing cross-checks in this CE. For the 7 items for which block switching was active, the
mean performance of FT AAC-ELD BS was higher than that of FT AAC-ELD and FhG AAC-ELD, but not at the 95% level of significance.
A t-test on the difference in score between FT AAC-ELD BS and each of the two systems without block switching (i.e. FT AAC-ELD BS - FhG AAC-ELD and FT AAC-ELD BS - FT AAC-ELD) over the 7 items showed that
this statistic was greater than zero at the 95% level of significance.
As such, the t-test revealed statistically significant improvement both on average for the 7 items
under consideration and also for 4 individual items. The proposed technology has statistically
similar performance for the 3 remaining items.
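The report does not state exactly how the t-test was computed. One common way to test whether a set of per-item score differences is greater than zero is a one-sided paired t-test, sketched below. The difference values are invented purely for illustration and are not the results of this CE; the critical value is the standard table value for 6 degrees of freedom.

```python
import math
import statistics

# Hypothetical per-item MUSHRA score differences (system A minus system B)
# for 7 items. These numbers are invented for illustration; they are NOT
# the results reported in the core experiment.
differences = [3.1, 1.8, 4.0, 0.5, 2.2, 2.9, 1.2]


def paired_t_statistic(diffs):
    """t statistic for H0: the mean of the per-item differences is zero."""
    n = len(diffs)
    mean = statistics.mean(diffs)
    std_err = statistics.stdev(diffs) / math.sqrt(n)  # sample std dev / sqrt(n)
    return mean / std_err


# One-sided critical value of Student's t for 6 degrees of freedom (7 items)
# at the 95% level, taken from a standard t table.
T_CRITICAL_DF6_95 = 1.943

t_stat = paired_t_statistic(differences)
significant = t_stat > T_CRITICAL_DF6_95
print(f"t = {t_stat:.2f}, greater than zero at the 95% level: {significant}")
```

Because the test works on differences within each item, it can detect a small but consistent improvement that overlapping confidence intervals on the mean scores would not show.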
Ralf Geiger, FhG, presented
14516  Ralf Geiger, Markus Schnell, Jürgen Herre, Kristofer Kjörling
       Utilizing AAC-ELD for delayless mixing in frequency domain
This contribution discussed the requirements for a Mixing Control Unit (MCU), particularly
focussing on the requirements of low complexity and low delay. It noted that mixing in the
frequency domain significantly reduces the delay through the MCU, and at the same time reduces
the complexity of the mixing operation. When including the SBR filterbank, as in AAC-ELD, it
is required that the SBR parameters be “merged” for the downmix signal, which is possible.
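Mixing in the frequency domain works because the transform is linear: a weighted sum of spectral coefficients equals the transform of the weighted sum of the time signals, so the mixer can skip the synthesis and re-analysis filterbanks. The sketch below checks this numerically. It uses a plain, unwindowed MDCT as a stand-in for the codec filterbank, which is an assumption; it ignores quantization and SBR, and it presumes all streams use the same frame length and transform. All names and signals are hypothetical.

```python
import math


def mdct(frame):
    """Naive MDCT of a frame of 2N samples, returning N coefficients."""
    two_n = len(frame)
    n_half = two_n // 2
    return [
        sum(
            frame[n] * math.cos(math.pi / n_half * (n + 0.5 + n_half / 2) * (k + 0.5))
            for n in range(two_n)
        )
        for k in range(n_half)
    ]


def mix(a, b, gain_a=0.5, gain_b=0.5):
    """Weighted element-wise sum of two equal-length sequences."""
    return [gain_a * x + gain_b * y for x, y in zip(a, b)]


# Two hypothetical talker frames of 16 samples each.
talker_a = [math.sin(2 * math.pi * 3 * n / 16) for n in range(16)]
talker_b = [math.cos(2 * math.pi * 5 * n / 16) for n in range(16)]

# Mixing in the time domain and then transforming ...
mixed_then_transformed = mdct(mix(talker_a, talker_b))
# ... matches transforming each stream and mixing the coefficients.
transformed_then_mixed = mix(mdct(talker_a), mdct(talker_b))

max_error = max(
    abs(p - q) for p, q in zip(mixed_then_transformed, transformed_then_mixed)
)
print(max_error)
```

The equivalence holds only when every stream uses the same transform for the frame, which is one reason a closed system might constrain encoders to a fixed set of tools.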
Discussion
Pierrick Philippe noted that having this tool in the standard does not prevent mixing in the frequency domain. In
closed systems, encoders can be forced to use a given set of parameters e.g. sampling frequency, or a specific subset
of tools e.g. to not use block switching.
Bernhard Grill, FhG, noted that using block switching is a “headache” for implementation, both
in terms of source code for the target functionality and also in terms of encoder tuning. Therefore,
he cautioned that incorporating block switching for possibly a limited quality advantage could
have significant impact on coder implementation.
Kristofer Kjörling noted that there is limited evidence of quality improvement, and this is
balanced against concerns on complexity of implementation and use.
Pierrick Philippe noted that it is very difficult to achieve statistically significant improvement for
transients using the MUSHRA test methodology if they occur for only a few frames in a
waveform, but that the t-test reveals such improvements.
Later in the week Pierrick Philippe presented additional information, namely a t-test analysis for
each of the four cross-check sites. Statistical improvement with the proposed technology was
revealed at the 4 test sites, and no degradation was noticed for any of the items. It is Pierrick
Philippe's strong opinion that this CE brings significant improvement.
After considerable discussion, the Audio Chair called for a show of hands from those having
strong positions on this matter. The tally was as follows:
For: 1 person from 1 company
Against: 10 persons from 4 companies
A lack of consensus for this CE was due to differing perspectives on the degree of quality
improvement and the characterization of the numerous dimensions of complexity of the proposed
technology (e.g. storage, computation and also implementation and coder tuning) and its
applicability to identified applications.
The Audio Subgroup will discuss the complexity information presented for the Block Switching
CE at the 81st MPEG meeting, consulting MPEG experts from the Implementation Study Group,
and agree upon metrics for balancing complexity against demonstrated quality improvement.
Further Evaluation of Performance for Speech
Per Frojdh, Ericsson, presented
14501  Anisse Taleb
       Report on the Evaluation of MPEG-4 Enhanced Low Delay AAC on Speech Content
This contribution showed evidence on the performance of AAC-ELD on a new test set that is
more representative of speech applications. The test results showed that AAC-LD at 48 kb/s had
better performance than AAC-ELD at both 38 kb/s and 32 kb/s at the 95% level of
significance. Furthermore, AAC-ELD at 38 kb/s was not different from AAC-ELD at 32 kb/s at
the 95% level of significance. This result agrees with the outcome of previous listening test
results from France Telecom.
Ralf Geiger, FhG, presented
14518  Markus Schmidt, Ralf Geiger, Markus Schnell
       Additional information on quality of AAC-ELD
This contribution showed evidence on the performance of AAC-ELD on both the MPEG-4 test
set and the new test set as used in contribution m14501. The systems under test were:
Codec              Rate (kb/s)  Delay (ms)
AAC-LD             32           43
AAC-ELD            32           44
G.722.1-C          32           40
G.722.2 (AMR-WB)   23.85        25
For the speech test set, AAC-ELD was better than AAC-LD at the 95% level of significance. For
the MPEG-4 test set, AAC-ELD was better than AAC-LD at the 95% level of significance.
Discussion
It was noted that, for the speech items, the FhG report test site scored AAC-ELD at 32 kb/s in the
“70” range, while the Ericsson report test site scored AAC-ELD at 32 kb/s in the “80” range.
This might suggest a reason for the differences.
Pierrick Philippe, France Telecom, volunteered to cross-check the FhG listening test result. This
effort will be supported by a workplan.
Ericsson noted that AAC-ELD is focussed on low delay or conversational applications where
speech signals are most important, but concluded that the evidence of advantage of AAC-ELD
has not been confirmed by cross-check. The Chair noted that the mandate of MPEG-4 AAC-LD
is coding of audio with low delay. AAC-ELD retains low delay while providing greater
compression than AAC-LD for generic audio signals. Bernhard Grill, FhG, noted that in
independent cross-checks using the speech items, the MUSHRA score of AAC-LD at 48 kb/s
was 90 and 83, or a quality of “excellent” and the MUSHRA score of AAC-ELD at 38 kb/s was
81 and 77 or a quality at the lower range of “excellent” or upper range of “good,” and AAC-ELD
at 32 kb/s was 76 and 67 or a quality of “good.”
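The quality labels quoted above follow the five 20-point bands of the MUSHRA scale (ITU-R BS.1534). The short sketch below maps the scores cited in this discussion onto those bands. Assigning a score that falls exactly on a boundary to the upper band is an assumption of this sketch, and the function and variable names are illustrative.

```python
# MUSHRA quality bands on the 0-100 scale (ITU-R BS.1534). Treating a score
# that sits exactly on a boundary as the higher band is an assumption here.
MUSHRA_BANDS = [
    (80, "excellent"),
    (60, "good"),
    (40, "fair"),
    (20, "poor"),
    (0, "bad"),
]


def mushra_label(score):
    """Return the quality label for a MUSHRA score between 0 and 100."""
    if not 0 <= score <= 100:
        raise ValueError("MUSHRA scores lie between 0 and 100")
    for lower_bound, label in MUSHRA_BANDS:
        if score >= lower_bound:
            return label


# Scores quoted in the discussion above (two cross-check sites per condition).
quoted_scores = {
    "AAC-LD at 48 kb/s": (90, 83),
    "AAC-ELD at 38 kb/s": (81, 77),
    "AAC-ELD at 32 kb/s": (76, 67),
}

for condition, scores in quoted_scores.items():
    print(condition, [mushra_label(s) for s in scores])
```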
CE on low-delay SBR filterbank
Ralf Geiger, FhG, presented
14517  Markus Schnell, Jürgen Herre, Ralf Geiger, Markus Schmidt, Markus Multrus
       Proposed Core Experiment on AAC-ELD
This contribution proposes to use a new prototype filter for the SBR filterbank that reduces the
analysis/synthesis filterbank delay to 64 samples (1.3 ms) from the current SBR filterbank delay
of 576 samples (12 ms). This permits the entire system one-way delay to be reduced from 42 ms
to 31 ms. It presents listening test results for AAC-ELD with current SBR filterbank and with
lower-delay SBR filterbank, for both high quality and low power operating modes. It was noted
that there is the tendency (but not significant at the 95% level of significance) for the new
filterbank to provide better performance than the original filterbank. This may be due to the
asymmetry of the prototype filter which would cause little to no “pre-echo” effect.
The contribution also presented filterbank frequency selectivity and filterbank computational
complexity.
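The delay figures quoted for the two SBR filterbanks can be checked with simple arithmetic. The sketch below assumes a 48 kHz sampling rate, which is inferred from the statement that 576 samples correspond to 12 ms and is not stated explicitly in the report; the variable names are illustrative.

```python
# Assumption: 48 kHz sampling rate, inferred from "576 samples (12 ms)".
SAMPLE_RATE_HZ = 48_000


def samples_to_ms(samples, sample_rate_hz=SAMPLE_RATE_HZ):
    """Convert a delay in samples to milliseconds."""
    return 1000.0 * samples / sample_rate_hz


current_filterbank_ms = samples_to_ms(576)   # current SBR filterbank
proposed_filterbank_ms = samples_to_ms(64)   # proposed low-delay filterbank
saving_ms = current_filterbank_ms - proposed_filterbank_ms

# One-way system delay quoted in the report before the change.
CURRENT_SYSTEM_DELAY_MS = 42
new_system_delay_ms = CURRENT_SYSTEM_DELAY_MS - saving_ms

print(f"current filterbank delay:  {current_filterbank_ms:.1f} ms")
print(f"proposed filterbank delay: {proposed_filterbank_ms:.1f} ms")
print(f"system delay after change: {new_system_delay_ms:.0f} ms")
```

Under this assumption the filterbank delay saving is a little under 11 ms, which is consistent with the reported reduction of the one-way system delay from 42 ms to 31 ms.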
Kristofer Kjörling, Coding Technologies, presented
14492  Fredrik Henn
       Cross check of FhG Core Experiment on LD-SBR filterbank for AAC-ELD
This contribution presented the results of a cross-check listening test. The results were very
similar to the FhG listening test.
It was the consensus of the Audio Subgroup to accept this technology into the FPDAM text.
The Audio Chair presented the following two ballot comment documents relating to ISO/IEC
14496-3:2005/PDAM 9 (AAC-ELD).
14286  SC 29 Secretariat
       Summary of Response to Proposal of Minor Enhancement: 14496-3/Amd.9 [SC 29 N 8179]
14288  SC 29 Secretariat
       Summary of Voting on ISO/IEC 14496-3:2005/PDAM 9 [SC 29 N 8180]
Concerning the first contribution, the Chair noted that a single no vote (as in this ballot) is not
sufficient to delay progression of a standard.
Concerning the second contribution, the Chair noted that the Finnish NB and the French NB
ballot comments relate to objectives and performance of AAC-ELD, and will be further discussed.
Markus Schnell, FhG, presented
14514  Markus Schnell, Ralf Geiger
       Proposed FPDAM of AAC-ELD
Heiko Purnhagen, Coding Technologies, endorsed an even simpler signalling method than what
is proposed here. The Chair suggested that a small break-out group discuss this and report back to the group.
It was the consensus of the Audio Subgroup to incorporate the technology for the low-delay SBR
filterbank into the FPDAM text.
4.4.4 Speech and Audio Exploration
Schuyler Quackenbush, Audio Research Labs, presented
14317  Schuyler Quackenbush
       Proposed Workplan for Speech and Audio Exploration
This contribution proposed that a listening test be used to characterize the candidate test items.
There was considerable disagreement as to whether this is the appropriate means to assess the test
set. After some discussion, it was decided that what was most important was that the test items
represent significant application areas, for example streaming music, talk radio or IPTV. The
Chair noted that it is of paramount importance to expand the current test set. Experts will listen to
all contributed items and pick new or replacement items for the test set, which will be reviewed
by the Audio Subgroup.
Eunmi Oh, Samsung, presented
14455  Eunmi Oh
       Evaluation of speech and audio coding scheme
This contribution suggested guidelines for listening tests associated with assessing signals that
are mixed signals, e.g. both speech and audio. Specifically, that participants listen to stimuli three
times: once to assess e.g. speech coding artefacts, once to assess music coding artefacts and once
to assess how the two categories of impairment could be combined to form an overall judgement.
It was also noted that items of duration of not more than 15 seconds would be best considering
that listeners should listen to them numerous times.
Additionally, the contribution described three new mixed-signal items that Samsung has
contributed to the set of candidate items.
4.4.5 Symbolic Music Representation
Pierfrancesco Bellini, UNIFI, presented
14364  Pierfrancesco Bellini, Paolo Nesi, Maurizio Campanai, Giorgio Zoia
       Editors study on ISO/IEC 14496-23/FCD
The contribution is candidate text for the FDIS text to be produced at this meeting. All changes
are in response to ballot comments from the UK, Italian and Korean NBs. The Chair noted that
the SMR editors should consider some demonstration or publicity vehicle that might play the role
of a verification test and hence serve to demonstrate to MPEG and the larger community of
potential customers the range of functionalities supported by SMR. As a minimum, this could
include a technology demonstration at the closing MPEG plenary at the Lausanne meeting.
The SMR task group members:
• integrated the comments received for the Korean ornaments definition
• integrated the changes proposed in contribution m14364
• prepared the DoC
4.4.6 MPEG-1, MPEG-2 and MPEG-4 audio, conformance, reference software
Werner Oomen, Philips, presented
14536  Frans de Bont, Werner Oomen
       Cor to 14496-3:2005 subpart 10, DST (lossless oversampled audio)
The contribution proposes corrections to the DST specification. The error occurs in two
places, and correcting it permits the specification to support a greater number of channels. The
correction will be issued as a DCOR from this meeting.
Kelvin Lee, I2R, presented
14414  Kelvin Lee, Te Li, Haibin Huang
       Proposed Corrigenda to 14496-3:2005/AMD 3 (SLS)
This contribution corrects an error that appears in a number of places in the text relating to the sign
of the residual. It also corrects values in a number of tables.
Mauri Vaananen, Nokia, presented
14522  Juha Ojanperä, Miikka Vilermo (miikka.vilermo@nokia.com)
       On AAC LTP conformance
There was considerable discussion on the issues that make conformance testing of LTP a difficult
problem. The Chair encouraged Nokia experts to maintain their momentum in this effort and to
propose at the next meeting:
• A conformance procedure
• Conformance bitstreams
• Informative text on encoder operation strategies that would produce bitstreams that, when
  decoded, always meet the conformance criteria.
Noboru Harada, NTT, presented
14410  Noboru Harada, Takehiro Moriya, Yutaka Kamamoto
       Proposed revision for ISO/IEC 14496-3, AMD8: MP4FF box for original audio file information
This text will have some additional edits.
Ralph Sperschneider, FhG, discussed
14355  Ralph Sperschneider
       WD on MPEG-4 Audio Fourth Edition
The Chair urged all experts to review this text. It will be output as WD from this meeting as we
wish to incorporate AMD 9 (BSAC and SBR) into this edition, and the final ballot for AMD 9
has not yet closed.
Tilman Liebchen, LGE, presented
14428  Tilman Liebchen
       Proposed Text of ISO/IEC 14496-4:2004/FDAM 19, Audio Lossless Coding (ALS) Conformance
       Tilman Liebchen
       Updated Status of ALS Conformance
These two contributions are revised text for Conformance FDAM 19 and also an update on the
status of ALS conformance. Currently all bitstreams are defined, available and cross-checked.
Kelvin Lee, I2R, presented
14407
Kelvin Lee
Status of SLS reference software update
14429
This contribution reports that the “stand-alone” SLS reference software now supports MP4FF,
and that AAC LC BSAC can be used as a core coder in mono, stereo and multichannel and that
SLS operates in non-core mode in mono, stereo and multichannel.
The Chair suggested that FhG and I2R work together during the next AhG to define an API such
that the stand-alone code could be linked with the MP4VM so as to be part of the unified
framework, and that they report back at the next MPEG meeting as to whether this integration
method is feasible.
5 Meeting deliverables
5.1 Recommendations for final plenary
The Audio recommendations were presented and approved.
5.2 Establishment of Ad-hoc Groups
The following ad-hoc groups were established by the Audio subgroup:
No.   Title                                            Mtg
9097  AHG on Audio Standards Maintenance               No
9098  AHG on SAOC CfP, Speech and Audio and AAC-ELD    Yes
5.3 Approval of output documents
All output documents, shown in Annex D, were presented in Audio plenary and were approved.
5.4 Responses to Liaison and NB comments
The responses to Liaison and NB comments were prepared and approved.
5.5 Press statement
The Audio part of the press statement was prepared and approved.
6 Future activities
6.1 Schedule of future meetings
Ad Hoc group meetings are indicated in Section 5.2. Unless otherwise indicated, Ad Hoc group
meetings will be held at the location of the next MPEG meeting on the weekend preceding that
meeting.
6.2 Agenda for next meeting
The agenda for the next MPEG meeting is shown in Annex E.
6.3 All other business
There was none.
6.4 Closing of the meeting
The 80th Audio Subgroup meeting was adjourned Friday at 14:00.
Annex A Participants
First Name     Last Name      Country  Affiliation
Pierfrancesco  Bellini        Italy    DSI-UNIFI
Jeroen         Breebaart      NL       Philips
Kok Seng       Chong          SG       Panasonic
Matt           Fellers        USA      Dolby
Ralf           Geiger         DE       Fraunhofer IIS
Matthias       Gruhne         DE       FhG IIS AEMT
Noboru         Harada         JP       NTT
Oliver         Hellmuth       DE       Fraunhofer IIS
Jürgen         Herre          DE       Fraunhofer IIS
Haibin         Huang          SG       I2R
Yang-Won       Jung           KR       LG Electronics
Dong Soo       Kim            KR       LG Electronics
Kristofer      Kjörling       S        Coding Technologies
Kelvin         Lee            SG       I2R
Te             Li             SG       I2R
Tilman         Liebchen       DE       LG Electronics
Takehiro       Moriya         JP       NTT
Markus         Multrus        DE       Fraunhofer IIS
Sua Hong       Neo            SG       Panasonic
Toshiyuki      Nomura         JP       NEC
Takeshi        Norimatsu      JP       Panasonic
Eunmi          Oh             KR       Samsung
Henney         Oh             KR       LG Electronics
Werner         Oomen          NL       Philips Applied Technologies
Hee-Suk        Pang           KR       LG Electronics
Pierrick       Philippe       FR       France Telecom R&D
Heiko          Purnhagen      SE       Coding Technologies
Schuyler       Quackenbush    USA      ARL
Susanto        Rahardja       SG       I2R
Jonas          Rödén          SE       Coding Technologies
Juergen        Schmidt        DE       Thomson
Andreas        Schneider      DE       Coding Technologies
Markus         Schnell        DE       Fraunhofer IIS
Jeongil        Seo            KR       ETRI
Osamu          Shimada        JP       NEC
Ralph          Sperschneider  DE       Fraunhofer IIS
Mauri          Vaananen       FIN      Nokia Res. Center
Jyri           Huopaniemi     FIN      Nokia Research Center
Do-Hyung       Kim            KR       Samsung AIT
Annex B Audio Contributions and Schedule
Monday
0900-1200
MPEG Plenary
1200-1400
Lunch
1400-1800
Audio Plenary
Welcome
Approval of previous meeting report
14316
Schuyler Quackenbush
79th MPEG Audio Report
AhG Reports
14281
R. Sperschneider
AHG on Audio Standards Maintenance
14282
S. Quackenbush
AHG on SAOC CfP and AAC-ELD
Liaison
14313
IEC TC 100 via SC 29
Secretariat
IEC CDV 61937-3 [SC 29 N 8263]
14331
ETSI TC DECT via SC 29
Secretariat
Liaison Statement from ETSI TC DECT to ITU-T SG 12 and ETSI TC STQ
14354
ITU-T SG 16 via SC 29
Secretariat
Liaison Statement from ITU-T SG 16 [SC 29 N
8324]
Ballot comments
14286
SC 29 Secretariat
Summary of Response to Proposal of Minor
Enhancement: 14496-3/Amd.9 [SC 29 N 8179]
14287
SC 29 Secretariat
Summary of Voting on ISO/IEC TR 11172-5:1998/DCOR 1 [SC 29 N 8178]
14288
SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-3:2005/PDAM 9 [SC 29 N 8180]
14292
ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 14496-3:2005/FDAM 1 [SC 29 N 8207]
14319
SC 29 Secretariat
Summary of Voting on ISO/IEC 13818-7:2006/FPDAM 1 [SC 29 N 8268]
14320
SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 18 [SC 29 N 8269]
14321
SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 19 [SC 29 N 8270]
14327
SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 14 [SC 29 N 8276]
14328
SC 29 Secretariat
Summary of Voting on ISO/IEC FCD 14496-23
[SC 29 N 8277]
14344
SC 29 Secretariat
Summary of Voting on ISO/IEC 23003-1/PDAM
1 [SC 29 N 8307]
14345
SC 29 Secretariat
Summary of Voting on ISO/IEC 23003-1/PDAM
2 [SC 29 N 8308]
14384
SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-3:2005/PDAM 8
MPEG Surround
14453
Kristofer Kjörling
Jonas Rödén
Jeroen Koppens
Erik Schuijers
Jeroen Breebaart
Proposed draft corrigendum for MPEG Surround
14499
Johannes Hilpert
Sascha Disch
Heiko Purnhagen
Werner Oomen
Proposed MPEG Surround Level Enhancement
14504
Heiko Purnhagen
Andreas Schneider
Frans de Bont
Karsten Linzmeier
Ralph Sperschneider
Proposed Updates for MPEG Surround
Conformance
SAOC
14422
Hyun-Kook Lee
Hee-Suk Pang
Dong Soo Kim
Sung-Yong Yoon
Henney Oh
Yang-Won Jung
Report on the SAOC test material provided by
LGE
14441
Oliver Hellmuth
Juergen Herre
Thorsten Kastner
Proposed SAOC test items provided by
Fraunhofer IIS
14464
Jeroen Breebaart
Werner Oomen
Proposed SAOC test items provided by Philips
14488
Jonas Engdegård
Barbara Resch
Description of SAOC test items provided by
Coding Technologies
14540
Seungkwon Beack
Jeongil Seo
Taejin Lee
Kyungok Kang
Information on SAOC test items by ETRI
14315
Schuyler Quackenbush
Spatial Audio Object Coding Evaluation
Procedures and Criterion
Tuesday
0900-1300
AAC-ELD
14465
Erik Schuijers
Werner Oomen
Crosscheck FT enhanced LD AAC core
experiment
14515
Markus Schmidt
Ralf Geiger
Markus Schnell
Cross-check report on Proposed FT Core
Experiment for AAC-ELD
14530
Henney Oh
Yang-Won Jung
Hyo Jin Kim
Chang-Heon Lee
Hong-Goo Kang
Cross-check report on proposed FT Core
Experiment for AAC-ELD
14519
Catherine Colomes
Pierrick Philippe
David Virette
Listening test results on instantaneous block
switching CE for AAC ELD
14520
Pierrick Philippe
David Virette
Updated description for AAC ELD instantaneous
block switching CE
14516
Ralf Geiger
Markus Schnell
Jürgen Herre
Kristofer Kjörling
Utilizing AAC-ELD for delayless mixing in
frequency domain
14501
Anisse Taleb
Report on the Evaluation of MPEG-4 Enhanced
Low Delay AAC on Speech Content
14518
Markus Schmidt
Ralf Geiger
Markus Schnell
Additional information on quality of AAC-ELD
1300-1400
Lunch
1400-1600
SAOC Material Selection (at Apple)
1400-1800
Joint meeting with Requirements, MDS,
Audio at Requirements
14411,WD Professional Archival MAF
14430, Comments on Prof. Archival MAF
Other topics in MAF under consideration.
1800-1900
Liaison Meeting
Response to 14331, ETSI TC DECT
1900-
Chairs Meeting
Wednesday
0900-1100
MPEG Plenary
1130-1200
Joint with Systems at Audio
14529
David Singer
MP4 file format considerations for high sample-rate audio
Discuss Ballot Comments on MP4FF box
1200-1300
Speech and Audio Exploration
14317
Schuyler Quackenbush
Proposed Workplan for Speech and Audio
Exploration
14455
Eunmi Oh
Evaluation of speech and audio coding scheme
1300-1400
Lunch
1400-1500
AAC-ELD
14492
Fredrik Henn
Cross check of FhG Core Experiment on LD-SBR filterbank for AAC-ELD
14517
Markus Schnell
Jürgen Herre
Ralf Geiger
Markus Schmidt
Markus Multrus
Proposed Core Experiment on AAC-ELD
14514
Markus Schnell
Ralf Geiger
Proposed FPDAM of AAC-ELD
1500-1530
SMR
14364
Pierfrancesco Bellini
Paolo Nesi
Maurizio Campanai
Giorgio Zoia
Editors' study on ISO/IEC 14496-23/FCD
1530-1730
MPEG-4
14536
Frans de Bont
Werner Oomen
Cor to 14496-3:2005 subpart 10, DST (lossless
oversampled audio)
14414
Kelvin Lee
Te Li
Haibin Huang
Proposed Corrigenda to 14496-3:2005/AMD 3
(SLS)
14522
Juha Ojanperä
Miikka Vilermo
On AAC LTP conformance
1730-
Social
Thursday
0900-1000
14410
Noboru Harada
Takehiro Moriya
Yutaka Kamamoto
Proposed revision for ISO/IEC 14496-3, AMD8:
MP4FF box for original audio file information
14355
Ralph Sperschneider
WD on MPEG-4 Audio Fourth Edition
14428
Tilman Liebchen
Proposed Text of ISO/IEC 14496-4:2004/FDAM
19, Audio Lossless Coding (ALS) Conformance
14429
Tilman Liebchen
Updated Status of ALS Conformance
14407
Kelvin Lee
Status of SLS reference software update
1030-1300
Break-out Task Group Activity
SAOC Evaluation
SAOC Binaural material selection
Speech and Audio material selection
1300-1400
Lunch
1400-
SAOC Evaluation Document
1730-1800
Approve Liaison Responses
1800-
Chairs Meeting
Friday
Audio plenary
0900-1300
Recommendations for final plenary
Establishment of new Ad-hoc groups
AhG Mandates
Get document numbers
1000
Approve Responses to NB comments
1030
Approval of output documents
Review of Audio presentation to MPEG plenary
Agenda for next meeting
A.O.B.
Closing of the Audio meeting
1300-1400
Lunch (optional!)
1400-
MPEG Plenary
Annex C Task Groups
1. MPEG-D MPS
2. MPEG-D SAOC
3. MPEG-4 AAC-ELD
4. Speech and Audio
5. MPEG-1 reference software
6. MPEG-2 audio
7. MPEG-4 audio, conformance, reference software
Annex D Output Documents
No. | Title | TBP | Available
11172-5 Software simulation
9064 | DoC on ISO/IEC 11172-5:199x/DCOR 1 | No | 07-04-27
9065 | ISO/IEC 11172-5:199x/Cor. 1 | No | 07-04-27
13818-7 Advanced Audio Coding
9066 | DoC ISO/IEC 13818-7:2006/FPDAM 1 | No | 07-04-27
9067 | ISO/IEC 13818-7:2006/FDAM 1, Transport of MPEG Surround data in AAC | No | 07-04-27
14496-3 Audio
9068 | ISO/IEC 14496-3:2005/DCOR 5 (DST and MP3on4) | No | 07-04-27
9069 | ISO/IEC 14496-3:2005/DCOR 6 (SLS) | No | 07-06-08
9070 | DoC on ISO/IEC 14496-3/PDAM 8 | No | 07-04-27
9071 | ISO/IEC 14496-3/FPDAM 8, MP4FF Box for Original Audio File Information | No | 07-04-27
9072 | DoC on ISO/IEC 14496-3:2005/PDAM 9 Request for Amendment | No | 07-04-27
9073 | DoC on ISO/IEC 14496-3:2005/PDAM 9 | No | 07-04-27
9074 | ISO/IEC 14496-3:2005/FPDAM 9, AAC-ELD | No | 07-06-08
9075 | WD on MPEG-4 Audio Fourth Edition | No | 07-06-08
14496-4 Conformance testing
9076 | DoC on ISO/IEC 14496-4:2004/FPDAM 14 | No | 07-04-27
9077 | ISO/IEC 14496-4:2004/FDAM 14, BSAC Extensions Conformance | No | 07-04-27
9078 | DoC ISO/IEC 14496-4:2004/FPDAM 18 | No | 07-04-27
9079 | ISO/IEC 14496-4:2004/FDAM 18, MPEG-1 and -2 on MPEG-4 Conformance | No | 07-04-27
9080 | DoC ISO/IEC 14496-4:2004/FPDAM 19 | No | 07-04-27
9081 | ISO/IEC 14496-4:2004/FDAM 19, ALS Conformance | No | 07-04-27
9082 | Study on ISO/IEC 14496-4:2004/FPDAM 20, SLS Conformance | No | 07-04-27
9083 | Status of MPEG-4 Audio Conformance | No | 07-04-27
9084 | Status of MPEG-4 SLS Conformance | No | 07-04-27
14496-5 Reference Software
9085 | ISO/IEC 14496-5:2001/AMD 10:2007/DCOR 1, BSAC and SLS | No | 07-04-27
9086 | Request for Amendment, MPEG-1/2 on MPEG-4 Ref. Software | No | 07-04-27
9087 | ISO/IEC 14496-5:2001/AMD 20, MPEG-1/2 on MPEG-4 Ref. Software | No | 07-04-27
14496-23 Symbolic Music Representation
9088 | DoC ISO/IEC FCD 14496-23 | No | 07-05-11
9089 | ISO/IEC FDIS 14496-23:200x, Symbolic Music Representation | No | 07-05-11
23003-1 MPEG Surround
9099 | Final Spatial Audio Object Coding Evaluation Procedures and Criterion | No | 07-04-27
9090 | DoC ISO/IEC 23003-1:2007/PDAM 1 | No | 07-04-27
9091 | ISO/IEC 23003-1:2007/FPDAM 1, MPEG Surround Conformance | No | 07-06-08
9092 | DoC ISO/IEC 23003-1:2007/PDAM 2 | No | 07-04-27
9093 | ISO/IEC 23003-1:2007/FPDAM 2, MPEG Surround Reference Software | No | 07-06-08
9094 | Defect Report of ISO/IEC 23003-1:2007 | No | 07-04-27
Audio and speech coding
9095 | Framework for Exploration of Speech and Audio Coding | No | 07-04-27
9096 | Workplan for Exploration of Speech and Audio Coding | No | 07-04-27
Annex E Agenda for the 81st MPEG Audio Meeting
Agenda Item
1. Opening of the meeting
2. Administrative matters
2.1. Approval of agenda and allocation of contributions
2.2. Communications from the Chair
2.3. Joint meetings
2.4. Review of task groups and mandates
2.5. Approval of previous meeting report
2.6. Review of AhG reports
2.7. Received national body comments and liaison matters
3. Plenary issues
4. Task group activities
4.1. MPEG Maintenance, including MPEG-1, MPEG-2, MPEG-4, SMR and MPEG
Surround issues
4.2. AAC-ELD
4.3. Spatial Audio Object Coding Call for Proposals Evaluation
4.4. Speech and Audio Exploration
5. Discussion of unallocated contributions
6. Meeting deliverables
6.1. Recommendations for final plenary
6.2. Establishment of new Ad-hoc groups
6.3. Approval of output documents
6.4. Responses to NB comments
6.5. Responses to Liaison statements
6.6. Press statement
7. Future activities
8. Agenda for next meeting
9. A.O.B
10. Closing of the meeting
Annex K – 3DG report
Source: MPEG 3D Graphics Compression
Title: 3D Graphics San Jose meeting report
Authors: Marius Preda (INT)
Status: Draft (To be added to Nxxxx)
3DG meeting report
San Jose, April 23-28, 2007
1
Opening of the Meeting
1.1
Approval of the agenda
1.2
Goals for the week
The goals of this week are:
• Review FAMC results and edit the WD
• Review on-going AFX experiments
• Promote the 3DGC profiles
• Issue FDAM of GFX reference software
• Issue FDAM of GFX conformance
• Issue FPDAM of Geometry and Shadow reference software
• Issue FPDAM of Geometry and Shadow conformance
• Review Liaisons to MPEG 3DG
• Review and promote 3DG related demonstrations
• Investigate future developments of MPEG 3D Graphics
The output documents related to 3D Graphics Compression are:
No. | Title | TBP | Available | Editor
14496-4 Conformance testing
9132 | Text of ISO/IEC 14496-4:2001/FDAM16 (MPEG-J GFX Conformance) | No | 07/05/12 | Mark Callow
9146 | DoC on ISO/IEC 14496-4:2001/FDAM16 (MPEG-J GFX Conformance) | No | 07/05/12 | Marius Preda
9133 | Text of ISO/IEC 14496-4:2001/FPDAM21 (Geometry and Shadow Conformance) | No | 07/04/27 | Jeong-Hwan Ahn
9147 | DoC on ISO/IEC 14496-4:2001/FPDAM21 (Geometry and Shadow Conformance) | No | 07/05/04 | Marius Preda
14496-5 Reference Software
9134 | Text of ISO/IEC 14496-5:2001/FDAM11 (MPEG-J GFX RefSoft) | N | 07/05/12 | Mark Callow
PB: Mark has to clean up the code
9148 | DoC on ISO/IEC 14496-5:2001/FDAM11 (MPEG-J GFX RefSoft) | N | 07/05/12 | Marius Preda
9135 | Text of ISO/IEC 14496-5:2001/FPDAM13 (Geometry and Shadow RefSoft) | N | 07/05/04 | Patrick Gioia
PB: Patrick has to send me the software from the CVS
9149 | DoC on ISO/IEC 14496-5:2001/FPDAM13 (Geometry and Shadow RefSoft) | N | 07/05/04 | Marius Preda
14496-16 Animation Framework eXtension (AFX)
9136 | WD 2.0 of ISO/IEC 14496-16:2006/AMD2 (Frame-based Animated Mesh Compression) | N | 07/04/27 | Marius Preda, Titus Zaharia
9137 | WD 1.0 of ISO/IEC 14496-16:2006/AMD3 (3D MultiResolution Profile) | N | 07/04/27 | Patrick Gioia
9150 | Request for ISO/IEC 14496-16:2006/AMD3 (3D MultiResolution Profile) | N | 07/04/27 | Marius Preda
9138 | 3D Graphics Core Experiments Description | N | 07/04/27 | Khaled Mamou
9139 | 3D Graphics Compression FAQ 19.0 | Y | 07/05/12 | Pierre Davy
14496-21 MPEG-J GFX
9140 | Text of ISO/IEC 14496-21:2006/COR1 | N | 07/04/27 | Mark Callow
14496-25 3D Graphics Compression Model
9141 | Request for Subdivision of ISO/IEC 14496: Part 25 3D Graphics Compression Model | N | 07/04/27 | Marius Preda
9142 | WD 1.0 for ISO/IEC 14496-25 | Y | 07/04/27 | Marius Preda
1.3
Standards from 3DG
Std
Pt
Edit.
Project Description
CfP
4
4
2004
4
4
2004
4
5
2001
4
5
2001
4 16
2006
4 16
2006
Amd.16 MPEG-J GFX
conformance
Amd.21 Geometry and
Shadow conformance
Amd.11 MPEG-J GFX
reference software
Amd.13 Geometry and
Shadow reference
software
Amd.1 Geometry and
Shadow
Amd.2 Frame-based
WD
CD
FCD
FDIS
PDAM FPDAM FDAM
DCOR
COR
06/04
06/10
07/04
06/07
06/10
07/04
07/10
06/01
06/04
06/07
07/04
06/07
06/10
07/04
07/10
05/04
06/04
06/07
07/01
07/01
07/07
07/10
08/04
Animated Mesh
Compression
4 21
4 25
2006
200x
Cor.1
3D Graphics
Compression Model
07/01
07/10
08/01
07/04
08/07
1.4
Room allocation
3DG : Santa Clara
1.5
Allocation of contributions
N°
D1
Title
Schedule
D1
D1
09:00~11:30
D1
11:30~13:00
Monday
MPEG Plenary
3DG Plenary
14269
Roll call, Agenda, Goals, FAQ,
etc.
Marius Preda
Report of AHG on 3DGC
documents, experiments and
software maintenance
Francisco
Morán
Jeong-Hwan
Ahn
Mark Callow
Web Site
Conformance bitstream for
Geometry & Shadow
D1
14:15~14:30
Jeong-Hwan
Ahn
D1
14:30~14:45
Reference Software
Clarify the status on node
templates and Stream Code
all
D1
14:45~15:00
GFX
Report on Reference Software and
Corrigendum status (latest
Mark Callow
developments, demo)
D1
15:00~15:30
New issues
14545
A scene graph node designed to
define haptic properties
3DG General
all
Conformance
14396
MPEG General
D1
13:00~14:00
D1
14:00~14:15
Lunch Break
Clarify the status on www.mpeg3dgc.org maintenance
Activity
Pierre Davy
Nadia
Magnenat-Thalmann
Conformance
N°
Title
Schedule
Activity
15:30~16:00
D1
16:00~16:30
Coffee Break
New issues
Proposal for Future developments Marius Preda
in MPEG 3D Graphics
D1
17:00~17:50
Requirements
14467
D2
Proposal for 3D Compression
Profile
Patrick Gioia
Olivier
Aubault
Preliminary
Discussion
D2
D2 09:00~9:45 CE2
Tuesday
Core Experiments
Patrick Gioia
Anne Le Bris
14466 Report on CE2: Space Partitioning
Romain
Cavagna
D2
09:45~10:30
Core Experiments
Nikolce
Scalable Compression of Dynamic Stefanoski
14363
3D Meshes
Jörn
Ostermann
14498 FAMC with streaming support
14491 FAMC bitstream description
Khaled
Mamou
Karsten
Müller
Detlev Marpe
Titus Zaharia
Marius Preda
Francoise
Prêteux
Khaled
Mamou
Titus Zaharia
Marius Preda
Françoise
Prêteux
Khaled
Mamou
Marius Preda
Titus Zaharia
Francoise
Prêteux
CE1
CE1
CE1
D2
12:00~14:00
D2
14:00~14:30
Lunch Break
Core Experiments
14491 FAMC bitstream description
CE1
10:30~11:00
Coffee Break
Frame-based Animated Mesh
14493 Compression : integration of the
CABAC arithmetic encoder
CE1
Khaled
CE1
CE1
N°
Title
Schedule
Activity
Mamou
Marius Preda
Titus Zaharia
Francoise
Prêteux
D2
15:20~15:30
Miscellaneous
14408
3dod.org goes multimedia:
MyMultimediaWorld.com
Marius Preda
Benoit Le
Bonhomme
Son Tran
Françoise
Preteux
D1
15:00~16:00
New issues
Proposal for Future developments Marius Preda
in MPEG 3D Graphics
D2
16:00~17:00
Liaison
Liaison Statements
D3
Liaison
D3
D3
09:00~12:00
D2
12:00~12:30
Wednesday
MPEG Plenary
Joint meeting with Requirements
Proposal for 3D Compression
14467
Profile
D3
12:30~14:00
D3
14:00~17:00
3DG Plenary
D4
3DG General
Jeong-Hwan
Ahn
all
D4
D4
12:00~14:00
D4
14:00~18:00
Thursday
Lunch Break
3DG documents
GFX Output documents review
Core Experiment discussion
CE 1 Review
CE 2 Review
AMD2 3D Multiresolution Profile
D5
Profile
Patrick Gioia
Olivier
Aubault
Lunch Break
WD 2.0 Editing
Conformance bitstream for
14396
Geometry & Shadow (step 2)
Clarify the status on node
templates and Stream Code
MPEG General
Output documents review
Friday
D5
3DG General
N°
Title
Schedule
D4
09:00~12:00
3DG documents
Activity
3DG General
Output documents review
Short Report on the Crosschecking
status
Short report on the FAQ
AMD 3 Profile
AhGs and resolutions
D5
12:00~14:00
D5 14:00~
Lunch Break
MPEG Plenary
MPEG General
1.6
Attendance list
Name | Country | Company | e-mail
Jeong-Hwan Ahn | Korea | Samsung AIT | jeonghwan.ahn @ samsung . com
Marius Preda | France | INT | marius.preda @ int-evry . fr
Françoise Prêteux | France | INT | Francoise.Preteux @ int-evry . fr
Khaled Mamou | France | INT | Khaled.Mamou @ int-evry . fr
Patrick Gioia | France | France Telecom R&D | patrick.gioia @ orange-ftgroup . com
Euee S. Jang | Korea | Hanyang Univ. | esjang @ hanyang . ac . kr
Sunyoung Lee | Korea | Hanyang Univ. | sunnykr @ ihanyang . ac . kr
Sinwook Lee | Korea | Hanyang Univ. | nembi79 @ gmail . com
Jae Bum Jun | Korea | Hanyang Univ. | powerory @ hanyang . ac . kr
Hyungyu Kim | Korea | Hanyang Univ. | cprov @ cpsite . net
Dan Cernea | Belgium | VUB | cdcostin @ etro . vub . ac . be
Mark Callow | Japan | HI Corporation | callow_mark @ hicorp . co . cp
Karsten Muller | Germany | FHG-HHI | kmuller @ hhi . de
Pierre Davy | Switzerland | Miralab | davy @ miralab.unige.ch
Ning Lu | US | Intel Corporation |
Jörn Ostermann | Germany | Institut für Informationsverarbeitung | Ostermann @ tnt.uni-hannover.de
Anne Le Bris | France | France Telecom | anne.lebris @ simecom.fr
2
General issues
2.1
General Discussion
2.1.1 Experiments
Last meeting's resolution
For each new specification development activity, 5 National Bodies should commit resources to that
activity. Contributions should be made at each meeting from those NBs until that activity is finalized.
3DGC will no longer have Exploration Experiments.
3DGC will only have Core Experiments for any official experiments.
The condition for the CE is to have at least 2 active participants (companies or universities having
support from companies on that experiment) dedicating resources to do the work and making
contributions at each meeting.
If a participant does not make any contribution at a meeting, then that participant will not be
considered as active.
The activity in the CE does not necessarily imply adoption into the standard.
xxx
Clarify the status on www.mpeg3dgc.org maintenance
all
Samsung can maintain the web site up to end of 2007 only. Potential solutions: FT and UPM.
Patrick Gioia will be the maintainer of the new web site (once transferred). FT will investigate on
transferring the web site and on finding open source solutions for data protection.
14396
Conformance bitstream for Geometry & Jeong-Hwan
Shadow
Ahn
A table with the responsible person for cross-checking was created.
Some of the files are not yet provided (Multiresolution FootPrint). It is possible that the same files
demonstrate functionalities listed in both tables. Jerome (FT) will check and, if not, will provide new
files. The issue will be discussed again during the week.
xxx
Clarify the status on node templates and
Stream Code
all
Stream Code problem was solved by correcting the Geometry and Shadow spec and updating the
RefSoft accordingly.
Jerome (FT) will provide a new version of templates8.txt document.
xxx
Report on Reference Software and
Corrigendum status (latest developments, Mark Callow
demo)
The Reference Software is in good shape but still needs some clean-up (this will be done in the editing
period). A demonstration of the Java MIDP environment with a GFX API implementation
was shown. The demonstration material is provided as RefSoft and Conformance.
14545
A scene graph node designed to define
haptic properties
Pierre Davy
Nadia Magnenat-Thalmann
Proposal of a new node in the scene graph.
Examples of using haptic devices: games, touching virtual objects, medical training, interfaces for
content production.
The problem to be solved is how to compute the force to be sent to the device based on the
haptic properties of the virtual 3D graphics object. Software solutions exist: direct communication
with the device, haptic geometry, and extraction of the geometry from 3D models. The contribution
proposes the parameters to be attached to the graphics object.
Before a decision can be taken to start a CE, evidence of support from industrial partners has to be
provided. The requirement for handling this kind of data in MPEG also has to be established.
Resolution: Proponents are asked to provide more evidence that such a tool is currently required by
the industry.
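The force computation mentioned above can be sketched with a simple penalty (spring) model. This is only an illustration under stated assumptions: the report does not list the parameters proposed in M14545, so the `stiffness` property, the spherical object and the function name `contact_force` are all hypothetical.

```python
import math

def contact_force(probe, center, radius, stiffness):
    """Penalty (spring) force pushing a haptic probe out of a sphere.

    `stiffness` is a hypothetical haptic property attached to the object.
    """
    d = [probe[i] - center[i] for i in range(3)]
    dist = math.sqrt(sum(x * x for x in d))
    if dist >= radius or dist == 0.0:
        # No contact, or direction undefined exactly at the centre.
        return (0.0, 0.0, 0.0)
    depth = radius - dist  # how far the probe has penetrated the surface
    # Force acts along the outward direction, proportional to penetration.
    return tuple(stiffness * depth * x / dist for x in d)

# Probe 0.5 units inside a unit sphere, along the z axis.
print(contact_force((0.0, 0.0, 0.5), (0.0, 0.0, 0.0), 1.0, 200.0))  # (0.0, 0.0, 100.0)
```

A real haptic renderer would run such a computation at a high update rate and on arbitrary meshes rather than a sphere.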
14408
Marius Preda
Benoit Le
Bonhomme
Son Tran
Françoise
Preteux
3dod.org goes multimedia:
MyMultimediaWorld.com
This contribution presents the latest developments of the web site 3dod.org (now called
MyMultimediaWorld.com) showcasing AFX tools.
xxx
Proposal for Future developments in
MPEG 3D Graphics
Marius Preda
A new architecture for 3DGC tools was presented. It is based on a three-layer structure:
• XML-based representation of the scene graph
• Generic binarization of XML content
• Specific compression tools for 3D graphics primitives
The group acknowledged the advantages of such an approach in promoting the AFX tools to the
industry.
14467
Proposal for 3D Compression Profile
Patrick Gioia
Olivier Aubault
Preliminary
Discussion
The contribution presented an improved version of the Multiresolution Profile (profile under
consideration from the last meeting).
Issue: whether to include all the tools in the profile and control them through levels, or to select only
the tools that are really needed.
The levels should be specified for each tool. This issue will be revisited during the week.
14466
Patrick Gioia
Anne Le Bris
Romain Cavagna
Report on CE2: Space Partitioning
This contribution presents the results of the exploratory phase. The goal was to specify a sound
framework for space partitioning that may work for all tools.
An initial representation for PVS and Cells and Portals is presented. A more compact form should be
provided.
The next step of the CE is the competitiveness phase: design an efficient data representation. Participants
are ENST and FT.
14363
Scalable Compression of Dynamic 3D
Meshes
Nikolce
Stefanoski
Jörn Ostermann
CE1
The contribution presents a method for scalable representation of the geometry and animation for all
layers. The compression results with respect to FAMC are presented.
14493
Khaled Mamou
Karsten Müller
Detlev Marpe
Titus Zaharia
Marius Preda
Francoise
Prêteux
Frame-based Animated Mesh
Compression : integration of the
CABAC arithmetic encoder
CE1
The contribution presents the adaptation of CABAC, as it is used in video coding, to FAMC.
The new results show an improvement of 15%.
14498
Khaled Mamou
Titus Zaharia
Marius Preda
Françoise
Prêteux
FAMC with streaming support
CE1
The contribution presents the partition of the FAMC stream for enabling animation streaming. The
skinning model may be computed for each segment. For some examples, doing so improves the bitrate.
14491
FAMC bitstream description
Khaled Mamou
CE1
Marius Preda
Titus Zaharia
Francoise
Prêteux
This contribution presents the bitstream syntax of the FAMC. It includes the new development for
streaming and CABAC integration.
14467
Proposal for 3D Compression Profile
Patrick Gioia
Olivier Aubault
Final Discussion
This contribution is presented in the joint meeting 3DGC-Requirements.
Presentation of the compression tools to be supported in the profile. Accepted as a new AMD of
ISO/IEC 14496-16.
14396
Conformance bitstream for Geometry &
Shadow (step 2)
Jeong-Hwan
Ahn
All the bitstreams are available in www.mpeg-3dgc.org databank.
xxx
Clarify the status on node templates and
Stream Code
all
NodeTemplatev8.txt is updated and available on CVS.
WD 2.0 Editing
14:00-18:00
Technical review was performed. Pictures have to be updated.
GFX Output documents review
ISO/IEC 14496-21:2006/COR1 was updated: a new method for binding textures was added and the
name of a class was changed.
The ISO/IEC 14496-5: FDAM 11 was updated.
The ISO/IEC 14496-4: FDAM 16 was updated.
3
AFX (14496-16) activities
3.1
Core Experiments
3.1.1 CE1. Mesh Animation Compression
Last meeting resolution
Continue CE1 with the next steps (representation, compression of other attributes and considering
static and animated data together).
Issue a working draft with the currently proposed technology.
Issue a request for new amendment document with the title “Frame-based Animated Mesh
Compression”
3.1.1.1 M14493 –Frame-based Animated Mesh Compression : integration of the CABAC
arithmetic encoder
This proposal describes an approach for integrating CABAC into the FAMC technology. Experimental
results show that the proposed enhancement of FAMC yields average bit-rate savings of around 16% compared to the
current WD. At the same time, replacing the N-ary adaptive arithmetic coder of the current WD with the fast
multiplication-free M coder, an integral part of CABAC, reduces computational complexity.
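As a rough illustration of why an adaptive binary model can save bits, the sketch below estimates the ideal code length of a bit sequence under a probability estimate that adapts as symbols are seen. It is not the CABAC engine or the M coder described in the proposal: the function name `adaptive_cost_bits` and the count-based estimator are assumptions made for this example.

```python
import math

def adaptive_cost_bits(bits):
    """Ideal code length (in bits) of a binary sequence under a
    count-based adaptive probability model (illustrative only)."""
    counts = [1, 1]  # smoothed counts of zeros and ones seen so far
    total = 0.0
    for b in bits:
        p = counts[b] / (counts[0] + counts[1])  # current estimate of P(b)
        total += -math.log2(p)                   # ideal arithmetic-coding cost
        counts[b] += 1                           # adapt the model to the data
    return total

skewed = [0] * 90 + [1] * 10   # heavily biased binary source
print(adaptive_cost_bits(skewed) < len(skewed))  # True: below 1 bit per symbol
```

The more skewed the binary decisions are, the further the cost falls below one bit per symbol, which is the effect a context-adaptive coder exploits.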
3.1.1.2 M14498 – FAMC with streaming support
This proposal describes a data packetization mechanism that adds streaming functionality to the FAMC
technique. The proposed approach makes it possible to associate multiple skinning models with a single animation
sequence and therefore to optimize the motion model for each data segment. The experimental results, obtained on the
3DGC test data set, established that streaming can be performed efficiently, with a marginal loss in terms of
compression efficiency.
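The segmentation idea can be sketched as follows. This is a minimal example assuming fixed-length segments; the report does not say how M14498 chooses segment boundaries, and the function name `split_into_segments` is hypothetical.

```python
def split_into_segments(num_frames, segment_length):
    """Return (first_frame, last_frame) index pairs covering the sequence.

    Each pair could then be encoded with its own skinning model.
    """
    return [(start, min(start + segment_length, num_frames) - 1)
            for start in range(0, num_frames, segment_length)]

print(split_into_segments(10, 4))  # [(0, 3), (4, 7), (8, 9)]
```

Shorter segments reduce start-up delay for streaming, at the cost of transmitting more per-segment model data.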
3.1.1.3 M14491 – FAMC bitstream description
This proposal describes the new bitstream description for FAMC, including the changes from the
previous two contributions.
3.1.1.4 M14363 – Scalable Compression of Dynamic 3D Meshes (SCD3DM)
This proposal describes a method for predictive compression of time-consistent 3D mesh sequences
supporting and exploiting scalability. The applied method decomposes each frame of a mesh
sequence in layers, which provides a time-consistent multi-resolution representation. Following the
predictive coding paradigm, local temporal and spatial dependencies between layers and frames are
exploited for compression. Prediction is performed vertex-wise from coarse to fine layers exploiting
the motion of already encoded neighboring vertices for prediction of the current vertex location. It is
shown that a predictive exploitation of the proposed layered configuration of vertices can improve
the compression performance in domains relevant for applications.
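The vertex-wise prediction step can be sketched as below. This is a simplified illustration, not the SCD3DM predictor itself: it assumes the motion of a vertex is predicted as the mean motion of neighbours that were already coded in coarser layers, and the names `predict_residual`, `prev` and `curr` are hypothetical.

```python
def predict_residual(prev, curr, vertex, coded_neighbors):
    """Residual of `vertex` in the current frame after predicting its
    motion as the mean motion of already coded neighbours."""
    if coded_neighbors:
        n = len(coded_neighbors)
        motion = tuple(
            sum(curr[v][k] - prev[v][k] for v in coded_neighbors) / n
            for k in range(3)
        )
    else:
        motion = (0.0, 0.0, 0.0)  # nothing coded yet: assume no motion
    predicted = tuple(prev[vertex][k] + motion[k] for k in range(3))
    # Only this residual would need to be quantized and entropy coded.
    return tuple(curr[vertex][k] - predicted[k] for k in range(3))

prev = {0: (0.0, 0.0, 0.0), 1: (1.0, 0.0, 0.0), 2: (0.5, 1.0, 0.0)}
curr = {0: (0.0, 0.0, 1.0), 1: (1.0, 0.0, 1.0), 2: (0.5, 1.0, 1.0)}
# Vertices 0 and 1 form the coarse layer; vertex 2 belongs to a finer layer.
print(predict_residual(prev, curr, 2, [0, 1]))  # (0.0, 0.0, 0.0)
```

For a rigid translation the residual vanishes. In a real codec the decoder would use reconstructed neighbour positions rather than the original ones, so that encoder and decoder stay in sync.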
Discussion on CE1
The compression results presented in M14363 (SCD3DM) are generally comparable with those of
FAMC. However, FAMC performs better at low bitrates. SCD3DM introduces animation at
different geometry resolutions.
Resolution for CE1
In the next phase of the CE it will be investigated how the skinning model will be combined with the
scalable approach. A switch may be used to choose between DCT, Wavelet compression of the
errors and the scalable approach.
Issue a new version of the working draft with the currently proposed technology (including
streaming and CABAC).
3.1.2 CE2. Space Partitioning
Last meeting resolution
Perform exploratory stage with the proposed work plan. (details can be found in the CE description)
3.1.2.1 M14466 – Report on CE2: Space Partitioning
This contribution presents the results of the exploratory phase. The initial goal of specifying a sound
framework for space partitioning that may work for all tools (PVS, BSP, Cell and Portal) was
achieved. A reference implementation and test data were provided.
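To make the cell-and-portal idea concrete, the sketch below treats cells as graph nodes and portals as edges, and collects the cells reachable through a limited number of portals. It is a crude stand-in for a potentially visible set (PVS) and not the representation studied in CE2; the function name `potentially_visible` and the room names are hypothetical.

```python
from collections import deque

def potentially_visible(portals, start, max_portals):
    """Cells reachable from `start` through at most `max_portals` portals.

    A real PVS computation would also test portal-to-portal visibility.
    """
    depth = {start: 0}
    queue = deque([start])
    while queue:
        cell = queue.popleft()
        if depth[cell] == max_portals:
            continue  # portal budget exhausted along this path
        for nxt in portals.get(cell, ()):
            if nxt not in depth:
                depth[nxt] = depth[cell] + 1
                queue.append(nxt)
    return set(depth)

portals = {"hall": ["kitchen", "lounge"], "kitchen": ["hall", "pantry"],
           "lounge": ["hall"], "pantry": ["kitchen"]}
print(sorted(potentially_visible(portals, "hall", 1)))  # ['hall', 'kitchen', 'lounge']
```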
Discussion on CE2
The next step of the CE is the competitiveness phase, with the goal of designing an efficient data
representation. Participants are ENST and FT.
Resolution for CE2
Perform competitiveness stage with the proposed work plan. (details can be found in the CE
description)
3.2
Profiles
3.2.1 M14467 – Proposal for 3D Navigation Profile
After a two-stage review (internal and joint with Requirements), it was acknowledged that the
proposal is mature enough to start the publishing stage.
Resolution
Request a new AMD and prepare the first draft.
3.3
Promotions
3.3.1 M14408 – 3dod.org goes multimedia: MyMultimediaWorld.com
The goal of this contribution is to present the evolution of 3dod.org into a multimedia
repository showcasing MPEG-4 technology for representing and delivering content. It supports
on-line visualization of 3D graphics, video, image and sound content, categories and user
management, content upload and conversion, and content adaptation.
4
GFX (14496-21) activities
4.1
Reference Software & Conformance
Last meeting resolution
The proposed restructuring of the reference software is approved.
The video files shall be replaced by the next meeting.
In order to synchronize the reference software with conformance schedule, this document will be
promoted to FDAM at the next (80th) meeting.
Study document of the DoC and the Text will be provided at this meeting.
4.1.1 M14091 – Summary of Voting on ISO/IEC 14496-5:2001/FPDAM 11 (from last
meeting)
The JNB voted to disapprove, with two comments.
The first comment is to restructure the reference software to support J2ME. Although it is classified
as a technical comment, it is only an implementation issue and is thus considered an editorial fix by
the group. However, this will affect the conformance work, which is scheduled to be promoted to the
FDAM stage at this meeting. Therefore, the group approves the comment but recommends
synchronizing the work with the conformance schedule.
The second comment is to remove the video files that are used to demonstrate the reference software
because of a licensing issue. However, since another movie file should be provided, the group approves
replacing (not removing) the video files.
Resolution
The proposed restructuring of the reference software is approved.
The document was promoted to FDAM.
DoC and the Text were provided at this meeting.
5
3D Graphics Compression Model (14496-25) activities
A new architecture for 3DGC tools was presented. It is based on a three-layer structure:
• XML-based representation of the scene graph
• Generic binarization of XML content
• Specific compression tools for 3D graphics primitives
The group acknowledged the advantages of such an approach in promoting the AFX tools to the
industry.
Resolution
Request for subdivision of MPEG-4.
Issue the first version of the WD.
6
Resolutions of 3DG
6.1
Output documents
6.1.1
The 3DG subgroup recommends approving the following documents
No. | Title | TBP | Available | Editor
14496-4 Conformance testing
9132 | Text of ISO/IEC 14496-4:2001/FDAM16 (MPEG-J GFX Conformance) | No | 07/05/12 | Mark Callow
9146 | DoC on ISO/IEC 14496-4:2001/FDAM16 (MPEG-J GFX Conformance) | No | 07/05/12 | Marius Preda
9133 | Text of ISO/IEC 14496-4:2001/FPDAM21 (Geometry and Shadow Conformance) | No | 07/04/27 | Jeong-Hwan Ahn
9147 | DoC on ISO/IEC 14496-4:2001/FPDAM21 (Geometry and Shadow Conformance) | No | 07/05/04 | Marius Preda
14496-5 Reference Software
9134 | Text of ISO/IEC 14496-5:2001/FDAM11 (MPEG-J GFX RefSoft) | N | 07/05/12 | Mark Callow
PB: Mark has to clean up the code
9148 | DoC on ISO/IEC 14496-5:2001/FDAM11 (MPEG-J GFX RefSoft) | N | 07/05/12 | Marius Preda
9135 | Text of ISO/IEC 14496-5:2001/FPDAM13 (Geometry and Shadow RefSoft) | N | 07/05/04 | Patrick Gioia
PB: Patrick has to send me the software from the CVS
9149 | DoC on ISO/IEC 14496-5:2001/FPDAM13 (Geometry and Shadow RefSoft) | N | 07/05/04 | Marius Preda
14496-16 Animation Framework eXtension (AFX)
9136 | WD 2.0 of ISO/IEC 14496-16:2006/AMD2 (Frame-based Animated Mesh Compression) | N | 07/04/27 | Marius Preda, Titus Zaharia
9137 | WD 1.0 of ISO/IEC 14496-16:2006/AMD3 (3D MultiResolution Profile) | N | 07/04/27 | Patrick Gioia
9150 | Request for ISO/IEC 14496-16:2006/AMD3 (3D MultiResolution Profile) | N | 07/04/27 | Marius Preda
9138 | 3D Graphics Core Experiments Description | N | 07/04/27 | Khaled Mamou
9139 | 3D Graphics Compression FAQ 19.0 | Y | 07/05/12 | Pierre Davy
14496-21 MPEG-J GFX
9140 | Text of ISO/IEC 14496-21:2006/COR1 | N | 07/04/27 | Mark Callow
14496-25 3D Graphics Compression Model
9141 | Request for Subdivision of ISO/IEC 14496: Part 25 3D Graphics Compression Model | N | 07/04/27 | Marius Preda
9142 | WD 1.0 for ISO/IEC 14496-25 | Y | 07/04/27 | Marius Preda
6.2
Resolutions
• The 3DG subgroup recommends appointing Patrick Gioia (France Telecom) as the editor of ISO/IEC 14496-16:2006/AMD3 and thanks him for taking responsibility for this project.
• The 3DG subgroup initiates a new activity on applying MPEG 3D Graphics compression tools to third-party
solutions for scene graph and graphics primitive representation and encourages external bodies to participate in
this activity.
• The 3DG subgroup recommends appointing Marius Preda (INT), Mark Callow (HI Corporation) and Jeong-Hwan
Ahn (Samsung AIT) as the editors of ISO/IEC 14496-25 and thanks them for taking responsibility for this project.
6.3
Establishment of 3DG Ad-Hoc Groups
N9143 AHG on 3DG documents, experiments and software maintenance
Mandate:
1. Maintain and edit 3DG documents
2. Coordinate 3DG CE activity
3. Coordinate 3DG related conformance and reference software
Chairmen: Jeong-Hwan Ahn (Samsung AIT), Ning Lu (Intel Corporation)
Duration: Until 81st Meeting
Meetings: Sunday before 81st meeting
Reflector: mpeg-3dgc AT gti. ssr. upm. Es
Subscribe: http://www.gti.ssr.upm.es/mailman/listinfo/mpeg-3dgc
7
Closing of the Meeting
See you in Lausanne.
Annex L – Test report
Source: Tobias Oelbaum, Chair
Report of Test meeting for the 80th MPEG meeting in San Jose, USA
8
Opening of the Meeting
Goals for the week
The goals of this week are:
• Refine the draft verification test plan for SVC, especially regarding test sequences and bit
rates for the test
• Provide input to JVT to the discussion on the MVC Deblocking Filter
9
Test Activities
Scalable Video Coding - Verification Tests
The Draft SVC Verification Test Plan has been updated. This especially includes the refinement of
bit rates and test sequences that should be used for the test and a refinement of the single test
scenarios. The test plan currently includes 4 scenarios for profile A and profile B and 2 scenarios for
profile B Intra.
Based on viewing sessions performed prior to the meeting at TUM in Munich, bit rates for the
proposed profile B were selected (related JVT contribution: JVT-V102). Sequences from JVT-W110 (related to profile B Intra) were viewed at the meeting and at TUM. It was proposed to
increase the bit rate for the test compared to the bit rates used in this contribution.
Using the input from JVT-V102 and JVT-W110 four test sequences were identified that could be
used for this test.
Two late contributions to JVT (JVT-W131 and JVT-W135) were reviewed and related changes were
made in the verification test plan.
Two new sequences that were proposed by Layered Media for the use in the verification test were
viewed. It was proposed to search for more challenging sequences. Layered Media will bring more
sequences from the field of video conferencing to the next meeting.
An AHG for preparing the verification tests has been set up.
Multi-view Video Coding – Deblocking Filter
A short visual evaluation of JVT-W024 was conducted. The question was if the proposed extension
of the deblocking filter for MVC would result in better subjective quality. Results of this evaluation
(the subjective quality could be improved by this extension of the deblocking filter) were reported
back to JVT.
10 Test Resolutions
Output Documents
8965 Draft SVC Verification Test Plan Version 3.0
AdHoc Groups
The following AHG was set up:
N8993 AHG on SVC Verification Test
Mandate:
1. To discuss test setups based on applications scenarios of the SVC profiles
2. To refine the verification test document
3. To prepare the verification test
Chairman: Tobias Oelbaum (TU München, oelbaum@tum.de)
Associate Chairs: Mathias Wien (RWTH Aachen, wien@ient.rwth-aachen.de),
Vincent Bottreau,
Nathalie Cammas,
Alex Eleftheriadis,
Justin Ridge
Duration: Until 81st Meeting
Meetings: Yes (Sunday before the 81st Meeting)
Reflector: mpeg-svt@lists.rwth-aachen.de
Subscribe: To subscribe or unsubscribe, go to
http://mailman.rwth-aachen.de/mailman/listinfo/mpeg-svt
Annex M – ISG report
Source: ISG Chair, Marco Mattavelli (EPFL)
1
Overview
The main work items of the Implementation Studies Subgroup in San Jose were:
1. Contributions to the Reconfigurable Video Coding (RVC) activity, jointly with the
video group: contribution review, review of results for the ongoing core experiments,
and editing of the RVC WD documents.
2. The review of the final core experiment results aiming at improving the finite precision
DCT/IDCT specification selected at the Hangzhou meeting, considering possible further
performance improvement and complexity reduction.
3. MPEG-4 Part 9 Reference HW description:
• The editing of the Study of the Third Edition of the TR
• The review of the new HDL module and associated documentation submitted for
integration in Part 9.
Input contributions to ISG group w.r.t. the above items are summarized according to the following
table:
Input Contributions to ISG subgroup
M14276 AHG report on MPEG-4 Part 9 Reference Hardware Description Phase 1 and 2
Authors: Robert Turney (Xilinx), Marco Mattavelli (EPFL)
M14434 Wildcard Platform Vs ML310
Authors: Julien Dubois, Barthelemy Heyrman, Johel Miteran et al.
2
Detailed Report
2.1
The contribution to the activity on Reconfigurable Video Coding (RVC).
Most of the ISG time in San Jose was spent in joint meetings with Video for the RVC subgroup
work. The main issues of discussion were the evaluation of the results of the ongoing core
experiments concerning the evolution and progress of the technology currently described in the WD.
Major results reported are:
• limitations and bugs of the implementation of the MPEG-4 SP in terms of CAL FUs,
• the implementation of almost all FUs in CAL for AVC baseline,
• new results of compression of DDL for complete decoders,
• no results were reported for the implementation of the flexible decoder based on BSDL
bitstream descriptions and transformations to CALML and CAL,
• first proposals of methodologies for the conformance testing of RVC FUs,
• studies and proposals for the efficient partitioning of FUs for B-pictures, multiple reference
frames, intra prediction and for SVC,
• description of the RVC framework tool support and definition of future tool support.
All reviewed contributions are reported in the list below.
Number  Category       Title
14301   MPEG-C         RVC Functional Units naming process proposal
14340   MPEG-B         Compression of the RVC DDL Decoder Description with BiM (results of Core Experiment 1.3 in RVC)
14374   MPEG-C         Functional units of inter-prediction under reasonable system partition for RVC framework
14375   MPEG-C         Conformance test tools of RVC functional units
14416   MPEG-C         Implementation of B frame support in RVC CAL Model
14445   MPEG-B         Core Experiment Result on CDDL
14446   MPEG-B         Proposed Text of RVC CE
14447   MPEG-B         Study on RVC Framework and Its Requirements
14448   MPEG-C         Proposed text of the RVC FUs for MPEG-4 AVC (Results of CE 2.2)
14454   MPEG-C         Implementation of multiple reference frame support in RVC CAL model
14457   MPEG-C         A scheme for implementing MPEG-4 SP codec in the RVC framework
14463   MPEG-C         Evolutions of RVC so as to handle SVC decoding
14473   MPEG-B         Extension to support non-MPEG standards (ICT/ZJU) (Results of CE 1.6)
14474   MPEG-B         Exploration experiments of AVS decoder description in RVC framework
14480   MPEG-C         Implementation of MPEG-4 AVC Deblocking Filter in RVC CAL model
14490   MPEG-B         Reconfigurability potential of the MPEG-4 SP decoder (results of CE 1.1)
14510   MPEG-C         Proposal for adding ISO/IEC 23002-2 in RVC tool library
14542   MPEG-B/MPEG-C  Liaison Statement to MPEG on RVC
14546   MPEG-B/MPEG-C  Description of tools for the RVC framework: editors simulator software and HDL code generators
2.2
Contributions on the specification of a finite precision IDCT
Several contributions have been received concerning cross-check of core experiments and validation
of results for finite precision IDCT performance and complexity. The main comments and major
points of each contribution are reported in the table below.
The most relevant results are reported by contribution M14506, which shows how a variant of
the current CD algorithm (called Za) can achieve a further reduction of the implementation
complexity for a negligible decrease of drift performance. This represents ~10% complexity
savings compared to the previous implementation (it saves 2 shifts and 4 negations). The drift test shows
negligible differences between the two.
Another algorithm, called L1m9, might be convenient for implementations because it can reuse
blocks (it can use 8 multipliers, 26 additions and 8 shifts), but it does not pass the linearity test.
The decision of the group was to move to the Za algorithm and include it in the CD.
Core experiment reports:
14506 Summary of core experiments on fixed point IDCT/DCT
Authors: Yuriy Reznik
Report of precision results for 3 variants of the CD algorithm (Z0). Moving to Z0a could save 2 shifts. Drift test
shows negligible differences between the two.
Another algorithm L1m9 might be convenient to reuse blocks for implementation (it can use 8 multiplier 26
additions and 8 shifts, but it does not pass the linearity test).
14485 IDCT Core Experiment Results
Authors: Zhibo Ni, Lu Yu
Experiments on variations of CD and other candidate results. They are done with MPEG-2 and MPEG-4 including
quarter-pel interpolation. Results do not show evidence for changing current CD algorithm.
14469 Cross-check of IDCT core experiments
Authors: Honggang Qi, Wen Gao, Debin Zhao, Siwei Ma
Results of 14485 have been cross-checked.
Summary: A variety of variations of the fixed-point IDCT specified in the CD have been
successfully identified, with various trade-offs in regard to dynamic range, operation counts,
operation types, etc. Drift analyses were performed for these IDCTs in H.263, MPEG-2, and MPEG-4 (with ½- and ¼-pel accurate MC).
Testbed updates:
14346 Updated 23002-1 IDCT precision testbed
Authors: Yuriy Reznik
Testbed update
14347 Updated H.263-based IDCT testbed
Authors: Yuriy Reznik, Arianne Hinds
Row column implementation according to previous standards.
14348 Updated MPEG-4 IDCT Testbed
Authors: Arianne T. Hinds
Updates including MPEG-4 row-column implementation.
14379 Updated T.83 testbed for IDCTs
Authors: Arianne T. Hinds
Conformance test for JPEG update for row-column implementation
14380 Updated MPEG-2 IDCT Testbed
Authors: Zhibo Ni
Inclusion of row first implementation and
14403 Updated TM5 MPEG-2 Testbed
Authors: Arianne T. Hinds
Addition of H.263 and TM5 with row first implementation
Summary for all contributions: all testbeds have been updated with all modifications included in
the approved CD, including row-column implementations. Testbeds have also been updated to
include existing fixed point IDCT algorithms from MPEG-2 TM5, H.263 and XVID.
Editing reports:
14310 Study Text of ISO/IEC 23002 CD (editors input)
Authors: Yuriy A. Reznik, Gary Sullivan, Arianne T. Hinds
Change of title adding implementation and taking out “transform”. Definition of the transform is changed. Editing
according to NB comments received at Marrakech meeting.
14311 Study Text of ISO/IEC 23002-1/PDAM1 (editors input)
Authors: Yuriy Reznik
Amendments of reference SW. Software overview. Mainly cleanup of previous text without any relevant change.
Summary: A variety of editorial issues were identified with the current CD text and improvements
were proposed to address them.
Conformance tests:
14531 Fixed-Point IDCT Conformance Tests
Authors: Arianne T. Hinds
Report of conformance tests for the CD algorithm. Also other algorithms pass conformance tests.
14509 Cross-check of IDCT conformance tests
Authors: Yuriy Reznik
Cross check of the results is confirming the results of 14531.
Summary: A testbed was provided for verification of CE IDCTs using the methodology of MPEG-2 video conformance testing. The results were provided and cross-checked.
Drift phenomena analysis and studies:
14544 On the Problem of Quarter Pixel Motion Compensation
Authors: Zhibo Ni, Lu Yu
Results showing severe drift in the case of quarter-pel interpolation for MPEG-4 ASP. An analysis of the reasons
for such drift problems is provided. The contribution presents striking evidence of the need for a bit-exact match
between encoder and decoder IDCT implementations.
Summary: Analysis of drift propagation with ¼-pel MC in MPEG-4 P2. This analysis explains
empirically observed phenomena of drift propagation with ¼-pel MC.
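The need for a bit-exact match can be illustrated with a deliberately simplified sketch. It is not the analysis of contribution 14544: it models a single pixel, uses no motion compensation or interpolation, and stands in for the two IDCT implementations with two different rounding rules. The function name simulate_drift, the starting value 100 and the residual 0.6 are all hypothetical choices made for illustration.

```python
import math

def simulate_drift(num_frames, encoder_idct, decoder_idct, residual=0.6):
    """Toy model of prediction drift for a single pixel.

    encoder_idct / decoder_idct are stand-ins (assumptions) for two IDCT
    implementations that differ only in how they round the same residual.
    Each predicted frame adds the reconstructed residual to the previous
    reconstruction, so any mismatch is carried into the next frame.
    """
    enc_ref = 100  # encoder-side reconstruction (hypothetical start value)
    dec_ref = 100  # decoder-side reconstruction
    mismatch = []
    for _ in range(num_frames):
        enc_ref += encoder_idct(residual)
        dec_ref += decoder_idct(residual)
        mismatch.append(abs(enc_ref - dec_ref))
    return mismatch

# Identical rounding on both sides: no drift.
matched = simulate_drift(5, round, round)
# Encoder rounds to nearest, decoder truncates: the error grows every frame.
mismatched = simulate_drift(5, round, math.floor)
print(matched)
print(mismatched)
```

In this toy model the mismatch grows by one level per predicted frame until an intra-coded frame would reset both references; real drift depends on the actual transforms and on the interpolation filter.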
Contributions on IDCT design:
14359 Consider Row-Transform-First IDCT in 23002-2 and the Fixes to 23002-2 CD
Authors: Yi-Shin Tung, Chung Hsuan Kuo, Ming Chung Hsu, Ja-Ling Wu
The contribution presents the implementation efficiency reasons for which implementing 2-D IDCT where 1-D row
IDCT are processed first and then columns are processed after is advantageous. This suggestion has already been
accepted and included in the study text of the FCD.
Summary: Arguments provided in support of implementing 2D IDCTs with 1D row-processing
first, followed by the column-processing.
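The row-column decomposition referred to above can be sketched as follows. This is a minimal floating-point illustration of a separable 8x8 inverse DCT, not the fixed-point algorithm specified in ISO/IEC 23002-2; the function names are hypothetical. In exact arithmetic the two pass orders give the same result, whereas a fixed-point design with intermediate rounding can differ between them, which is why the order has to be agreed.

```python
import math

N = 8

def idct_1d(coeffs):
    """Floating-point reference 1-D inverse DCT with orthonormal scaling."""
    out = []
    for x in range(N):
        total = 0.0
        for u in range(N):
            scale = math.sqrt(1.0 / N) if u == 0 else math.sqrt(2.0 / N)
            total += scale * coeffs[u] * math.cos((2 * x + 1) * u * math.pi / (2 * N))
        out.append(total)
    return out

def transpose(block):
    """Swap rows and columns of a square block given as a list of lists."""
    return [list(col) for col in zip(*block)]

def idct_2d_rows_first(block):
    """Apply the 1-D IDCT to every row, then to every column."""
    rows_done = [idct_1d(row) for row in block]
    cols_done = [idct_1d(col) for col in transpose(rows_done)]
    return transpose(cols_done)

def idct_2d_cols_first(block):
    """Apply the 1-D IDCT to every column, then to every row."""
    cols_done = [idct_1d(col) for col in transpose(block)]
    return [idct_1d(row) for row in transpose(cols_done)]

# A block with only a DC coefficient decodes to a flat block.
dc_block = [[0.0] * N for _ in range(N)]
dc_block[0][0] = 64.0
flat = idct_2d_rows_first(dc_block)

# An arbitrary block, to compare the two pass orders.
test_block = [[float((3 * r + 5 * c) % 7 - 3) for c in range(N)] for r in range(N)]
a = idct_2d_rows_first(test_block)
b = idct_2d_cols_first(test_block)
max_diff = max(abs(a[r][c] - b[r][c]) for r in range(N) for c in range(N))
```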
2.3
The progress in the development of the MPEG-4 “Part 9 Reference Hardware
Description”
The ISG activity at the San Jose meeting has mainly been devoted to
• the review of the received contribution (M14434),
• the editorial work for the third edition of the technical report.
3
Resolutions
The above activities have led to the following resolutions and output document approval.
4
Resolutions related to MPEG-4
Part 9
Reference Hardware Description
The ISG subgroup recommends to approve the following documents
No.   Title                                                TBP  Available
      14496-9 Reference Hardware Description
8994  Status of HDL submissions and commitments for MPEG   No   07/04/27
8995  Study of ISO/IEC DTR 14496-9                         No   07/04/27
5
Resolutions related to MPEG-B
Part 4 Codec Configuration Representation
The video subgroup and the ISG recommend to approve the following documents
No.   Title                                        TBP  Available
      23001-4 Codec Configuration Representation
8979  WD 4 of ISO/IEC 23001-4                      No   07/05/04
MPEG notes that the RVC project is about developing a full collection of individual coding
tools organized in the video tool library and a generic framework that can be used to
make an implementation of any MPEG video coding standard. Further MPEG
recognises the benefit of having the framework be capable of additionally supporting
the implementation of video coding standards from other organizations with which a
collaboration can be established. As part of this project, an identification mechanism
will be developed whereby MPEG video coding tools will be identified by MPEG and
video coding tools from other organizations can be identified via a registration
authority.
The video subgroup thanks AVS for their liaison and for providing the specification and
reference software of their standard as needed for the development of the capability of
ISO/IEC 23001-4 to support non-MPEG toolboxes.
MPEG invites organisations who would like to collaborate in the development of the
framework to join MPEG in making the framework support all widely deployed video
codecs.
6
Resolutions related to MPEG-C
Part 2
Fixed point 8x8 DCT/IDCT
The ISG and the video subgroups recommend changing the title of 23002-2 to “Fixed-point
8x8 IDCT and DCT”
The ISG and the video subgroups recommend to approve the following documents
No.   Title                                                       TBP  Available
      23002-2 Fixed point 8x8 DCT/IDCT
8982  Disposition of Comments on ISO/IEC CD 23002-2               No   07/04/27
8983  Text of ISO/IEC FCD 23002-2 Fixed-point 8x8 IDCT and DCT    No   07/05/04
The video subgroup thanks the National Bodies of Germany and US for their valuable ballot
comments on ISO/IEC CD 23002-2.
Part 4 Video Tool Library
The ISG and the video subgroups recommend to approve the following documents
No.   Title                                                            TBP  Available
      23002-4 Video Tool Library
8984  WD 4 of ISO/IEC 23002-4                                          No   07/05/25
8985  Description of Core Experiments in RVC                           No   07/05/04
8986  RVC Simulation Model (RSM) V4.0                                  No   07/05/25
8987  RVC Work Plan                                                    No   07/05/04
8988  RVC Conformance Testing Working Draft 1.0                        No   07/05/14
8989  Description of Exploration Experiments for Toolbox Extensions    No   07/05/14
Annex N – Liaison report
Source: Kate Grant, Chair
The Liaison group received the following input documents and discussed them at their meeting
on Tuesday April 24th:
No.
Title
Liaison Statements
14285 Liaison Statement from W3C (MMSEM)
Information on current W3C MMSEM work: in particular links to 2 documents: Image
Annotation on the Semantic Web and Multimedia Semantics on the Web: Vocabularies
14297 Liaison Statement from 3GPP
Input on LASeR from 3GPP SA4 group
14300 Liaison Statement from ITU-T FG IPTV
Enclose FG IPTV-R-0021: Report of the 3rd Focus Group on IP Television (IPTV) meeting
14305 Liaison Statement from DVD Forum
Concern regarding backward compatibility problems with N 8859 MPEG-2 Systems DCOR
14313 Liaison Statement from IEC TC100
Text of CDV of Edition 2 of IEC 61937-3 (currently under ballot) for information
14314 Liaison Statement from IEC TC100
Text of CDV of IEC 61966-2-5 (opRGB) (currently under ballot) for information
14331 Liaison Statement from ETSI
Update on issues regarding proposed optional use of MPEG-4 ER AAC-LD for NG-DECT
superwideband conversational applications.
14342 Liaison Statement from CEA
CEA IPTV Roadmap and Phase 2 Report provided for comment before 15th June
14349 Liaison Statement from SMPTE
Concern regarding backward compatibility problems with N 8859 MPEG-2 Systems DCOR
14353 Liaison Statement from ATIS IIF
IPTV Interoperability Specification for the IIF Default Scrambling Algorithm (ATIS-0800006) provided for information and comment
14354 Liaison Statement from ITU-T SG16 (Q10/16)
Selected a reference codec for ITU-T G.722.1 fullband extension standardization that is
publicly available (LAME MP3, http://lame.sourceforge.net).
14362 Liaison Statement from DVB
Request MPEG-7 schemas made available online for automatic retrieval
14413 Liaison Statement from TTA
Information on growth of market in Korea and need for rapid progression of DMB MAF
14533 Liaison Statement from 3D Consortium
Information about consortium and requirement for FTV standardisation
14534 Liaison Statement from TC46/SC9/WG7
Nominating liaison representative and providing background information
14535 Liaison Statement from JCP
Information that comments from 79th meeting reflected in current version of JSR-287
14541 Liaison Statement from AVS
Providing AVS specification and reference software to assist collaboration between MPEG
and AVS on RVC and work on identifying general-purpose common elements
14547 Liaison Statement from AES
Project AES-X159, Carriage of PCM with MPEG Surround data over AES3 initiated in SC-02-02
14548 Liaison Statement from FLOForum
Information on use of AVC in MediaFlo and that work on Rich Media is ongoing
The Liaison group prepared the following output documents:
No.
Title
Liaison Statements
8919 Liaison Statement to WG1
Provide information on MPEG-7 Query Format work and CD text for comment
8920 Liaison Statement to IETF
Provide information on new mime type
8921 Liaison Statement to Khronos
Provide information on new work on 3D Graphics Compression Model, and invite input
8922 Liaison Statement to ISO TC184 SC4
Provide information on new work on 3D Graphics Compression Model, and invite input
8923 Liaison Statement to 3GPP
Provide detailed information relating to LASeR
8924 Liaison Statement to W3C
Provide information on Photo Player, an implementation for digital photo libraries
8925 Liaison Statement to ITU-T FG/IPTV concerning M3W
Update on status of M3W standardisation
8926 Liaison Statement to ITU-T FG IPTV
Studying documents in work on identifying IPTV requirements. Provide information on
MAFs and attach FCD of Media Streaming Player
8927 Liaison Statement to SMPTE
Text of revised DCOR on MPEG-2 systems (which addresses their concerns) for comment
8928 Liaison Statement to DVD Forum
Text of revised DCOR on MPEG-2 systems (which addresses their concerns) for comment
8929 Liaison Statement to ETSI
Response to incoming liaison, offering to provide further information if required
8930 Liaison Statement to SMPTE re file format
Provide document on TuC for ISO base media file format for comment
8931 Liaison Statement to DVB
MPEG-7 schemas to be made available on line at ITTF web site
8932 Liaison Statement to JCP
Appreciation that updated JSR-287 specification includes comments from 79th meeting
8933 Liaison Statement to CEA
Information on MPEG specifications which relate to the issues being studied (DRM, QoS
etc) and information on MAFs
8934 Liaison Statement to ATIS
Information on MPEG specifications which relate to the issues being studied
8935 Liaison Statement to SMPTE re RVC
Invite experts to participate in development of RVC. Information on AVS collaboration.
8936 Liaison Statement to 3D Consortium
Thank them for information, inform them of start of work on FTV
8937 Liaison Statement to FLOForum
Thank them for information, update them on progress of SVC standardisation
8938 Liaison Statement to TC46/SC9/WG7
Welcome liaison representative, send them MPEG document on URNs for comment
8939 Liaison Statement to AVS
Thank them for providing AVS specification and reference software for RVC development
and welcome collaboration on development of RVC framework.
Other Documents
8940 Response to National Bodies
Responses to USNB and Italian NB
8941 List of Organisations with which MPEG entertains liaisons (as of April 2007)
Updated with latest information