40
Page 1 of 40 EE5359: MULTIMEDIA PROCESSING A FINAL REPORT ON THE PROJECT COMPARISON AND ANALYSIS OF INTRA PREDICTION EFFICIENCY IN HEVC, H.264, VP9 AND AVS CHINA PART 2 SPRING 2015 BY SWETHAA ALLIYALAMANGALAM JAYARAMAN STUDENT ID: 1001053849 [email protected] UNDER THE GUIDANCE OF DR. K.R.RAO ELECTRICAL ENGINEERING DEPARTMENT THE UNIVERSITY OF TEXAS AT ARLINGTON Submission Date: 5/11/2015

EE5359: MULTIMEDIA PROCESSING - The University … 2015... · ee5359: multimedia processing a final report on the project comparison and analysis of intra ... swethaa alliyalamangalam

  • Upload
    lamthuy

  • View
    238

  • Download
    0

Embed Size (px)

Citation preview

Page 1 of 40

EE5359: MULTIMEDIA PROCESSING

A FINAL REPORT ON THE PROJECT

COMPARISON AND ANALYSIS OF INTRA

PREDICTION EFFICIENCY IN HEVC, H.264, VP9 AND

AVS CHINA PART 2

SPRING 2015

BY

SWETHAA ALLIYALAMANGALAM JAYARAMAN

STUDENT ID: 1001053849

[email protected]

UNDER THE GUIDANCE OF

DR. K.R.RAO

ELECTRICAL ENGINEERING DEPARTMENT

THE UNIVERSITY OF TEXAS AT ARLINGTON

Submission Date: 5/11/2015

Page 2 of 40

TABLE OF CONTENTS

INDEX TITLE PAGE NO.

ACRONYMS 3

1 Abstract 5

2 Introduction 6

3 What is Intra Prediction? 8

4 Overview of Video Coding Standards to be compared 9

a) AVS China PART 2 9

b) VP9 15

c) H.264/AVC 18

d) HEVC 20

5 Performance Comparison Metrics 23

a) MSE and PSNR 23

b) SSIM 23

c) Bjøntegaard-Delta Bit-Rate Measurements 23

d) RD- Plots 23

e) Computational Complexity 23

6 Test Sequences 24

7 Test Platform 26

8 Test Results 30

9 Plots 32

10 Project Progress 36

REFERENCES 37

Page 3 of 40

ACRONYMS

AVC Advanced Video Coding

AVS Audio Video Standard

ADST Asymmetric Discrete Sine Transform

AU Access Unit

BBC British Broadcasting Corporation

BD-BR Bjøntegaard-Delta Bit-Rate

BD-PSNR Bjøntegaard-Delta Peak Signal-to-Noise Ratio

CABAC Context-adaptive binary arithmetic coding

CTU Coding Tree Unit

CU Coding Unit

DBF De-Blocking Filter

DC Direct Current

DCT Discrete Cosine Transform

DFT Discrete Fourier Transform

DST Discrete Sine Transform

EBU European Broadcasting Union

HD High Definition

HDTV High Definition Television

HEVC High Efficiency Video Coding

ISO International Organization for Standardization

ITU-T International Telecommunication Union (Telecommunication Standardization Sector)

JPEG Joint Photographic Experts Group

JVT Joint Video Team

MB Macroblock

MPEG Moving Picture Experts Group

MSE Mean Square Error

NAL Network Adaptation Layer

NGOV Next Generation Open Video

OBMC overlapped block-based motion compensation

PSNR Peak Signal-to-Noise Ratio

PU Prediction Unit

QCIF Quarter Common Intermediate Format

RD Rate Distortion

RDO Rate Distortion Optimization

SAO Sample Adaptive Offset

SDTI Serial Data Transport Interface

SMPTE Society of Motion Picture and Television Engineers

SSIM Structural Similarity Index

TM True Motion

TU Transform Unit

UVLC Universal Variable Length Code

Page 4 of 40

VC Video Coding

VLC Variable Length Coding

Page 5 of 40

1. ABSTRACT:

In the Present Era of Blooming Technology, demand for higher resolution and better quality videos

has escalated exponentially. This has caused a tremendous requirement for proper storage and

transmission of videos across various channels and networks which has in turn led into the

development of better video compression techniques [1]. Video compression is the process of

lessening the amount of data needed for representation of the videos by removing redundant

data [1]. Video decompression is the inverse process of Video compression. Video compression

and decompression are also called as video Encoding (Coding) and video decoding. The device or

software used for both of them is called encoder and decoder respectively [1] [3]. The need for

video coding led to the evolution of Video Coding Standards [1].

The first video coding standard developed was H.120 in 1984 by ITU-T (International

Telecommunication Union-Telecommunications standardization sector) [3]. Over the years,

numerous video coding standards have been developed and some of them got standardized [3].

This project aims at comparing various video coding standards such as HEVC (High Efficiency Video

Coding), H.264/AVC, VP9 and AVS (Audio Video Standard) China part 2 based upon Intra

prediction Efficiency. The comparison is carried out with the help of performance comparison

metrics such as PSNR [5], SSIM [45], MSE [5], and BD – PSNR [4], BD BR [4], Computational

Complexity and RD-plots have been plotted. Intra prediction is one of the key feature which helps

in determination of compression efficiency of the whole codec [11].

The tests will be carried using The HM Test Model 16.3 [8], JM Software 18.6 [9], The WebM

Project’s Encoder [7] and AVS China Reference Software [34] for HEVC, H.264/AVC, VP9 and AVS

China PART 2 respectively.

Page 6 of 40

2. INTRODUCTION:

From Figures 1 and 2, the generalized working of the Codec could be understood. The recent codec’s are

Block-Based. Herein, the input video frame is initially partitioned into blocks of the same size called

macroblocks and within each one of them coding and decoding process works. Further to perform

prediction, a macroblock is sub-partitioned into smaller blocks.

Figure 1: Generalized Video Encoder with intra prediction and other improved features [13]

Page 7 of 40

Figure 2: Generalized Video Decoder with intra prediction and other improved features [13]

Intra prediction works within a current video frame and is based upon the already available encoded

and decoded data for the block being predicted. Inter-prediction is used for motion compensation: a

similar region on previously coded frames close to the current block is used for prediction. The main

focus of the prediction process is to reduce redundancy of data and henceforth, avoid the storage of

excessive information in the encoded bit stream [14].

Page 8 of 40

3. WHAT IS INTRA PREDICTION?

Intra Frame coding is the process in which the spatial redundancies present within an image or video are

exploited by coding the original blocks through transform, quantization and entropy coding, independent

of the surrounding frames [35].

Intra prediction is carried in the current video frame and makes prediction for the current block based

upon the available encoded and decoded data [14]. When using intra frame coding, intra prediction

attempts to predict the current block from the neighboring pixels in the adjacent blocks in a defined set

of directions. [35]. Intra Prediction plays a key role in the determination of compression efficiency of the

whole codec [11]. It was initially proposed in 1952 and then it saw its application in transform domain

such as H.261 and H.263 [12]. Telenor Satellite Services proposed 3 modes [15] for intra prediction,

including DC mode plus vertical mode and horizontal mode in the spatial domain in 1997 [16].

Through this project, Intra Prediction modes among various video coding standards will be studied.

Table 1 shows the Intra Prediction modes among various video coding standards at a glance.

VIDEO CODING STANDARDS BLOCK SIZE NUMBER OF PREDICTION MODES

HEVC 16x16 CTU, 32x32 CTU, 64x64 CTU

35 (0-34)

H.264/AVC 4x4 Spatial, 16x16 Spatial, I-PCM

9 or 4

VP9 64x64, 32x32, 32x16, 16x16, 8x16, 8x8 and 4x4 (rectangular intra prediction possible)

10

AVS PART 2 8x8 block based Intra Prediction 5 (0-4)

Table 1: Intra Prediction among various video coding standards at a glance

Page 9 of 40

4. OVERVIEW OF VIDEO CODING STANDARDS TO BE COMPARED:

a) AVS (Audio Video Standard) China PART 2:

INTRODUCTION:

The AVS Video Coding standard was developed by the China Audio Video Coding Standard (AVS) working

group. It has been successful in gaining popularity from industries as well as research institutes. AVS-video

is an application driven coding standard. The AVS standards consists of several parts such as system, video,

audio, conformance testing and reference software etc. AVS Part 2 focusses on high-definition digital

video broadcasting and high-density storage media. It is also known as AVS1-P2 in AVS [18]. Figures 3 and

4 represent the encoder and decoder of AVS China Part 2.

Figure 3: AVS China PART 2: Encoder [17]

Page 10 of 40

Figure 4: AVS China PART 2: Decoder [17]

LAYERED DATA STRUCTURE:

Figure 5: AVS: Layered Data Structure [19]

AVS is built on the layered video structure where in the video signals are divided into several frames.

Figure 5 represents the layered structure. Firstly, the input video stream is organized into sequences.

Then, the sequences are divided into frames and are termed as pictures. Then, pictures are divided into

rectangular regions called slices. Furthermore, the slices are further divided into square regions called

macro-blocks and finally, the macro blocks are further divided into each of 8x8 pixels. The sequence,

pictures and slices begin with unique start codes that allows the decoder to identify them in the received

bit stream [19].

Page 11 of 40

CODING TOOLS:

Table 2: AVS China Part 2: Major Coding Tools [19]

INTRA- FRAME PREDICTION MODE:

Herein, the spatial prediction technique is implemented and it is based upon 8x8 block structure. 5

luminous intra prediction technique and 4 chrominance intra prediction technique have been

implemented. Here, the reference pixels are the reconstructed pixels of neighboring block without the

de-blocking filter [19].

Figure 6: AVS China Part 2: Neighboring pixels in intra prediction [20]

Figure 7: AVS China Part 2: Five Luminance intra prediction modes [20]

Page 12 of 40

b) VP9:

INTRODUCTION:

Like DIRAC, it is also an open source and free-license video compression standard developed

by Google [32]. Under development, it was known as NGOV (Next Generation Open Video)

and VP-Next. It is successor to VP8. It also aims at reduced bit rate by 50% compared to its

predecessor with the same video quality [32]. Figures 12 and 13 represent the encoder and

decoder of VP9.

Figure 12: VP9: Encoder [33]

Figure 13: VP9: Decoder [33]

Page 13 of 40

CODING TOOLS:

Prediction Block Sizes

Prediction Modes

Transform and Quantization

Entropy Coding

Post Processing: De-Blocking Filter

Table 4: VP9: Major Coding Tools

PREDICTION BLOCK SIZES:

VP9 introduces superblocks of size 64x64. It also facilitates intra prediction for rectangular

blocks. The rectangular blocks can further be divided into square blocks up to the size of 4x4.

Figure 14: Partitioning of a Super Block in VP9 [33]

INTRA PREDICTION MODES:

VP9 supports a set of 10 prediction modes [32] [33] for block sizes 4x4 as in Figure 15 to

32x32. They are:

DC_PRED (DC prediction)

TM_PRED (True-motion prediction)

H_PRED (Horizontal prediction)

V_PRED (Vertical prediction)

6 oblique directional prediction modes:

D27 (angle 27 degrees)

D45 (angle 45 degrees)

Page 14 of 40

D63 (angle 63 degrees)

D117 (angle 117 degrees)

D135 (angle 135 degrees)

D153 (angle 153 degrees)

Note: Angles are measured in anti-clockwise direction against the horizontal axis.

Figure 15: Angular Intra Prediction Modes for VP9 [14]

Page 15 of 40

c) H.264/AVC:

INTRODUCTION:

THE H.264/AVC is developed by ITU-T Video Coding Experts Group (VCEG) and ISO/JEC MPEG Video Group

named Joint Video Team (JVT) [36]. The high coding efficiency of H.264, gives perceptually equivalent

video quality at much less bitrate compared to traditional video coding standards such as MPEG-2 [37],

provides encouragement to TV and internet.

Main Goals:

Enhance compression performance

Provision of a network-friendly video representation addressing conversational (video telephony)

and non-conversational (storage, broadcast, or streaming) applications [38].

Figure 16: Encoder of H.264/AVC Codec [39]

Figure 17: Decoder of H.264/AVC Codec [39]

Page 16 of 40

INTRA PREDICTION MODES:

Each PU is predicted from neighboring image data in the same picture, using DC prediction (an average

value for the PU), planar prediction (fitting a plane surface to the PU) or directional prediction

(extrapolating from neighboring data) [40].

Figure 18: Intra_4x4 Prediction in H.264/AVC [40]

Intra_4x4 has 9 prediction modes:

Mode 0: Vertical Prediction

Mode 1: Horizontal Prediction

Mode 2: DC Prediction

Mode 3: Diagonal Down-Left Prediction

Mode 4: Diagonal Down-Right Prediction

Mode 5: Vertical Right Prediction

Mode 6: Horizontal Down Prediction

Mode 7: Vertical Left Prediction

Mode 8: Horizontal Up Prediction

Intra_16x16 has 4 Prediction Modes:

Mode 0: Vertical Prediction

Mode 1: Horizontal Prediction

Mode 2: DC Prediction

Mode 3: Plane Prediction

Page 17 of 40

d) HEVC(HIGH EFFICIENCY VIDEO CODING):

INTRODUCTION:

High Efficiency Video Coding (HEVC) is the latest Video Coding format [43]. It challenges the state-of-

the-art H.264/AVC [44] Video Coding standard which is in current use in the industry by being able to

reduce the bit rate by 50% [44] and retaining the same video quality. It came into existence in the

early 2012 although Joint Collaborative Team on Video Coding (JCT-VC) was formed in January 2001

to carry out developments on HEVC, and ever since then a huge range of development has been going

on. On 13 April 2013 [44], HEVC standard also called H.265 was approved by ITU-T. Joint Collaborative

Team on Video Coding (JCTVC), is a group of video coding experts from ITU-T Study Group (VCEG) and

ISO/IEC JTC 1/SC 29/WG 11 (MPEG). Figures 19 and 20 represents the encoder and decoder of HEVC.

Figure 19: Encoder for HEVC [41]

Figure 20: Decoder of HEVC [42]

Page 18 of 40

Prediction block sizes and macro-block concept:

The concept of macroblock in HEVC [14] is represented by the Coding Tree Unit (CTU). CTU size can be

16x16, 32x32 or 64x64, while AVC macroblock size is 16x16. Larger CTU size aims to improve the efficiency

of block partitioning on high resolution video sequence. Larger blocks provoke the introduction of quad-

tree partitioning of a CTU into smaller coding units (CUs). A coding unit is a bottom-level quad-tree syntax

element of CTU splitting. The CU contains a prediction unit (PU) and a transform unit (TU).

Figure 21: Prediction Unit Splitting in HEVC [14]

The CU can contain up to four prediction units. CU splitting on PUs can be 2Nx2N, 2NxN, Nx2N, NxN,

2NxnU, 2NxnD, nLx2N and nRx2N as shown in Figure 21 where 2N is a size of a CU being split. In the intra

prediction mode only 2Nx2N PU splitting is allowed. An NxN PU split is also possible for a bottom level CU

that cannot be further split into sub CUs.

Intra Prediction Mode:

There are a total of 35 intra prediction modes in HEVC: planar (mode 0), DC (mode 1) and 33 angular

modes (modes 2-34 in Figure 19). DC intra prediction is the simplest mode in HEVC. All PU pixels are set

equal to the mean value of all available neighboring pixels. Planar intra prediction is the most

computationally expensive. It is a two- dimensional linear interpolation. Angular intra prediction modes

2-34 are linear interpolations of pixel values in the corresponding directions. Vertical intra prediction

(modes 18- 34) is an up down interpolation of neighboring pixel values. Also, intra prediction can be done

at different block sizes, ranging from 4 X 4 to 64 X 64 (whatever size the PU has) [33].

Figure 22: Prediction Modes in HEVC [14]

Page 19 of 40

Figure 23: Luma Intra Prediction Modes in HEVC [14]

Page 20 of 40

5. PERFORMANCE COMPARISON METRICS:

a) Mean Square Error (MSE) AND Peak Signal to Noise Ratio

(PSNR):

MSE and PSNR [5] for an NxM pixel image are defined in equations 1 and 2 where O is the original image

and R is the reconstructed image. M and N are the width and height of an image and ‘L’ is the maximum

pixel value in the NxM pixel image.

b) Structural Similarity Index (SSIM)

The structural similarity (SSIM) [45] index is a method for measuring the similarity between two images.

SSIM emphasizes that the human visual system is highly adapted to extract structural information from

visual scenes. Therefore, structural similarity measurement should provide a good approximation to

perceptual image quality. SSIM is designed to improve on methods like peak signal-to-noise ratio (PSNR)

and mean squared error (MSE), which have proved to be inconsistent with human eye perception. SSIM

considers image degradation as perceived change in structural information. Structural information is the

idea that the pixels have strong inter-dependencies especially when they are spatially close.

Where x and y correspond to two different signals that need to be compared for similarity, i.e. two

different blocks in two separate images.

c) Bjøntegaard-Delta Bit-Rate Measurements:

As rate-distortion (R-D) performance assessment [4], Bjøntegaard-Delta bit-rate (BD-BR) measurement

method is used for calculating average bit-rate differences between R-D curves for the same objective

quality (e.g., for the same PSNRYUV values), where negative BD-BR values indicate actual bit-rate savings.

Page 21 of 40

6. TEST SEQUENCES[31]

1. Name: Claire_qcif.yuv

Resolution: 176x144

Frame Rate: 15fps

2. Name: Bridge-close_cif.yuv

Resolution: 352x288

Frame Rate: 30fps

3. Name: BQMall_832x480_60.yuv

Resolution: 832x480

Frame Rate: 60fps

Page 22 of 40

4. Name: BasketballDrive_1280x720_50.yuv

Resolution: 1280x720

Frame Rate: 50 fps

5. Name: Kimono_1920x1080_24.yuv

Resolution: 1920x1080

Frame Rate: 24 fps

Page 23 of 40

7. TEST SETUP

1. HEVC Implementation: Using HM 16.4 Software[8]:

After downloading the software and installing it, the solution can be built and run using

Microsoft Visual Studio 2010. Herein, the solution is built in the ‘RELEASE’ mode in

Microsoft Visual Studio. This will generate lencod and ldecod and executable files which

can be located in the ‘bin’ directory.

The encoder or decoder can be run by using the command line parameters in the

command prompt.

This sequence is tested for various quantization parameters. The value of quantization

parameter can be changed in the encoder.cfg file.

HM 16.4[8] Configuration Set Up:

Configuration used: Main All Intra Mode Configuration

IntraPeriod : 1 # Period of I-Frame ( -1 = only first)

GOPSize : 1 # GOP Size (number of B slice = GOPSize-1)

QP : 22 # Quantization parameter(0-51) (22, 27, 32 or 37 is used at a time)

Sample command line parameters:

C:\ HEVC\ bin\vc10\Win32\Release>TAppEncoder.exe -c

C:\HEVC\cfg\encoder_intra_main.cfg -wdt 832 -hgt 480 -fr 60 -f 10 -i

C:\HEVC\test_seq\BQ_Mall 832x480_60.yuv

-Description:

-c: config file to be used

-wdt: width of the yuv video

-hgt: height of the yuv video

-fr: frame rate of the sequence

-f: no.of frames to be encoded

-i: the input sequence path

Page 24 of 40

2. H.264 Implementation: JM 18.3 Configuration Set Up[9]:

After downloading the software and installing it, the solution can be built and run using

Microsoft Visual Studio 2010. Herein, the solution is built in the ‘RELEASE’ mode in

Microsoft Visual Studio. This will generate lencod and ldecod and executable files which

can be located in the ‘bin’ directory.

The encoder or decoder can be run by using the command line parameters in the

command prompt.

This sequence is tested for various quantization parameters. The value of quantization

parameter can be changed in the encoder_main.cfg file.

HM 16.4[8] Configuration Set Up:

Profile used: Main Profile

Sample command line parameters:

C:\ h_264\ bin >lencod.exe -f encoder_ main.cfg –p InputFile=

"C:\HEVC\test_seq\bridge-close_cif.yuv" -p FramesToBeEncoded = 10 -p SourceWidth

= 352 -p SourceHeight = 288 -p -p QPISlice = 32 -p FrameRate = 30.0 -p ProfileIDC= 77 –

p LevelIDC =40 –p Intraperiod = 1

Description:

-f: config file to be used

- SourceWidth: width of the yuv video

- SourceHeight: height of the yuv video

-FrameRate: frame rate of the sequence

-FramesToBeEncoded: no.of frames to be encoded

-InputFile: the input sequence path

3. VP9:The WebM Project Software[10]

This software is supported only in linux environment. The solution can be built by using

following commands in Linux:

mkdir libvpx-pub

cd libvpx-pub

git clone http://git.chromium.org/webm/libvpx.git

cd libvpx

git checkout -b master origin/master

cd build

Page 25 of 40

mkdir linuxbuild

cd linuxbuild

../../configure --target=x86_64-linux-gcc --enable-internal-stats --disable-vp8

make -j 12

Set Up Environment: Ubuntu 14.4

Mode Used: All Intra (Achieved by configuring the key frame parameter)

Sample Command Line Parameter:

vpxenc Kimono_1920x1080_24.yuv -o kimono.webm \--codec=vp9 --i420 --

width=1920 --height=1080 --passes=2 -t 0 \--rt --good --cpu-used=0 --end-usage=q \--

auto-alt-ref=1 --fps=24000/1001 --verbose --psnr \--lag-in-frames=25 --kf-max-dist=1 \-

-min-q=32 --max-q=32

If, y4m video sequence is used then the source height, source width and frame rate

need not be mentioned.

Description:

-o: output file

--codec: codec to be used

-i: represents the chroma format

--width: width of the test sequence

--height: height of the test sequence

--passes: No.of passes (1/2)

-t: maximum no.of threads

--end-usage: cbr/vbr/cq/q

--target-bit-rate: Bit-rate desired

--cq-level: Constrained Quality Level(22,27,32,37)

--auto-alt-ref:

--fps: frame rate

--psnr: to display psnr value

--kf-max-dist: for the intra-frame config (here)

4. AVS China:AVS China Reference Software [37]:

After downloading the software and installing it, the solution can be built and run using

Microsoft Visual Studio 2010. Herein, the solution is built in the ‘RELEASE’ mode in

Page 26 of 40

Microsoft Visual Studio. This will generate ldecod and lencod and executable files which

can be located in the ‘bin’ directory.

The encoder or decoder can be run by using the command line parameters in the

command prompt.

This sequence is tested for various quantization parameters. For All-Intra Configuration

the encoder_ai.cfg is used.

Various parameters can be changed by parsing the parameters onto the config file.

Mode Used: All Intra

Sample Command Line Parameter:

lencod.exe -f encoder_ai.cfg -p InputFile = "C:\HEVC\test_seq\bridge-close_cif.yuv" -p

FramesToBeEncoded = 10 -p SourceWidth = 352 -p SourceHeight = 288 -p TraceFile =

"log_bridge.txt" -p OutputFile = "test_bridge.avs" -p QPIFrame = 32 -p FrameRate = 5 -

p ChromaFormat = 1

-f: config file

-p QPIFrame: QP=22,27,32 or 37

-p FrameRate: 5 (30fps)

Page 27 of 40

8. TEST RESULTS

Page 28 of 40

Page 29 of 40

Page 30 of 40

9. PLOTS

Part 1: R D Plots:

1. For the sequence: Claire_qcif.yuv :

2. For the sequence: bridge-close_cif.yuv :

30

32

34

36

38

40

42

44

46

48

0 100 200 300 400 500 600

PSN

R (

dB

)

Bitrate (kbps)

R-D Plot: claire_qcif.yuv HEVC H.264 VP9 AVS China part 2

30

32

34

36

38

40

42

44

46

48

50

0 1000 2000 3000 4000 5000 6000 7000

PSN

R(d

B)

Bitrate (kbps)

R-D Plot: bridge-close_cif.yuv

HEVC H.264 VP9 AVS China Part 2

Page 31 of 40

3. For the sequence: BQMall_832x480_60.yuv:

4. For the sequence: BasketballDrive_1080x70_50.yuv:

30

32

34

36

38

40

42

44

46

48

50

0 10000 20000 30000 40000

PSN

R (

dB

)

Bitrate (kbps)

R-D Plot: BQMall_832x480_60.yuv

HEVC H.264 VP9 AVS China part 2

30

32

34

36

38

40

42

44

46

48

50

0 5000 10000 15000 20000 25000 30000

PSN

R (

dB

)

Bitrate (kbps)

R-D Plot:BasketballDrive_1080x720_50.yuv

HEVC H.264 VP9 AVS China part 2

Page 32 of 40

5. For the sequence: Kimono_1920x1080_24.yuv:

30

32

34

36

38

40

42

44

46

48

50

0 10000 20000 30000 40000

PSN

R (

dB

)

Bitrate (kbps)

R-D Plot: Kimono_1920x1080_24.yuv HEVC H.264 VP9 AVS China part 2

Page 33 of 40

Part 2: Encoding Time Comparison:

1. For the Sequence: Claire_qcif.yuv:

2. For the Sequence: bridge-close_cif.yuv:

0

1

2

3

4

5

6

7

8

22 27 32 37

Enco

din

g Ti

me

(se

c)

Quantization Parameter

ENCODING TIME COMPARISON: CLAIRE_QCIF.YUV

HEVC H.264 VP9 AVS China Part 2

0

10

20

30

40

50

22 27 32 37

Enco

din

g Ti

me

(se

c)

Quantization Parameter

ENCODING TIME COMPARISON: BRIDGE-CLOSE_CIF.YUV

HEVC H.264 VP9 AVS China Part 2

Page 34 of 40

3. For the Sequence: BQMall_832x480_60.yuv:

4. For the Sequence: BasketballDrill_1280x720_50.yuv:

0

50

100

150

200

250

22 27 32 37

Enco

din

g TI

me

(se

c)

Quantization Parameter

ENCODING TIME COMPARISON: BQMALL_832X480_60.YUV

HEVC H.264 VP9 AVS China Part 2

0

50

100

150

200

250

22 27 32 37

Enco

din

g Ti

me

(se

c)

Quantization Parameter

ENCODING TIME COMPARISON: BASKETBALLDRIVE_1280X720_50.

YUV

HEVC H.264 VP9 AVS China Part 2

Page 35 of 40

5. For the Sequence: Kimono_1920x1080_24.yuv:

0

100

200

300

400

500

600

22 27 32 37

Enco

din

g Ti

me

(se

c)

Quantization Parameter

ENCODING TIME COMPARISON: KIMONO_1920X1080_24.YUV

HEVC H.264 VP9 AVS China Part 2

Page 36 of 40

10. PROGRESS:

Implemented the test sequences for HEVC, H.264,VP9 and AVS China Part 2.

Plotted the R-D Plots.

Compared the Encoding Time among the standards.

Will compute and plot BD-PSNR and BD-BR.

Page 37 of 40

REFERENCES [1] I.E. Richardson, “The H.264 Advanced video Compression Standards”, Wiley, 2010.

[2] HM Software manual: http://hevc.hhi.fraunhofer.de/

[3] K.R. Rao, D.N. Kim and J.J. Hwang, “Video Coding Standards: AVS China, H.264/MPEG-4 Part10, HEVC,

VP6, DIRAC and VC-1”, Springer, 2014

[4] BD-BR and BD-PSNR: G. Bjøntegaard, “Calculation of average PSNR differences between RD-curves”,

ITU-T Q.6/SG16 VCEG 13th Meeting, Document VCEG-M33, Austin, USA, Apr. 2001.

[5] PSNR and MSE:

http://homepages.inf.ed.ac.uk/rbf/CVonline/LOCAL_COPIES/VELDHUIZEN/node18.html

[6] V.Sze, M.Budagavi and G. Sullivan, “High Efficiency Video Coding”, Springer 2014.

[7] The WebM Project’s VP9 Encoder: http://www.webmproject.org/vp9/

[8] The HM Test Model 16.3: http://hevc.hhi.fraunhofer.de/HM-doc/

[9] JM Software 18.6: http://iphome.hhi.de/suehring/tml/

[10] “Introduction to the Issue on Video Coding: HEVC and Beyond”, IEEE Journal of Selected Topics in

Signal Processing, Vol.7, pp.931-1151, Dec. 2013.

[11] Access the website http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html: Project

on: “Intra Prediction Efficiency and Performance Comparison of HEVC and VP9”, S. Sukumaran, 2014.

[12] ITU-T Recommendation H.263, “Video coding for low bit-rate communication”, Feb. 1998

[13] J. Ostermann et al, “Video Coding with H.264/AVC: Tools, performance and complexity”, IEEE -Circuits

and Systems Magazine, vol. 4, pp. 7-28, First Quarter 2004.

[14] M.P. Sharabayko et al, "Intra Compression Efficiency in VP9 and HEVC" Applied Mathematical

Sciences, Vol. 7, no. 137, pp. 6803 – 6824, Hikari Ltd, 2013

[15] G. Bjontegaard, “Coding improvement by using 4x4 blocks for motion vectors and transform”, ITU-

T/Study Group 16/Video Coding Experts Group (Question 15), Eibsee, Bavaria, Germany, document Q15-

C-23, Nov. 1997.

[16] Z. Nan et al, “Spatial Prediction Based Intra-Coding [video-coding]” IEEE International Conference on

Multimedia and Expo (ICME), Vol. 1, pp. 97-100, June 2004.

[17] Access the http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html Project on:

“Video compression standard for high definition video: A comparative study of H.264, Dirac Pro and AVS

part 2”, S. Gangavati, 2012.

[18] W. Gao et al, “AVS Video Coding Standard”, Intelligent Studies in Computational Intelligence Vol.

280, pp. 125-166, 2010.

[19] W. Gao et al, “AVS- the Chinese Next Generation Video Coding Standard”,

http://www.avs.org.cn/reference/AVS%20NAB%20Paper%20Final03.pdf

[20] L. Yu et al, “An Overview of AVS-Video: tools, performance and complexity”, Visual

Communications and Image Processing 2005, Proc. of SPIE, vol. 5960, pp. 596021, July 31, 2006.

[21] AVS China: http://www.avs.org.cn/english/index.asp

[22] M.Wien, “High Efficiency Video Coding: Coding Tools and Specification”, Springer, 2015.

[23] A.Grange, Internet draft from Network Working Group “A VP9 Bitstream Overview” August 2013.

[24] J.Bankoski et al, “Towards a Next Generation Open source Video Codec” SPIE Vol. 8666 Page 2, Dec.

2013.

Page 38 of 40

[25] Access website http://forum.doom9.org/ “How HEVC/H.265 works, technical details & diagrams”

[26] M.Shafique and J.Henkel “Low Power Design of the Next-Generation High Efficiency Video Coding”

IEEE Automation Conference (ASP-DAC), 19th Asia and South Pacific, Pages: 274 – 281, Jan. 2014.

[27] I.E. Richardson, “Coding video: A practical guide to HEVC and beyond,” Wiley, 11 May 2015.

[28] T. Davies, “A modified rate-distortion optimization strategy for hybrid wavelet video coding”, IEEE

International Conference on Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings.

2006, Vol.: 2, pp. II, Publication Date: 14-19 May 2006.

[29] Access the website http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html Project

on: “Performance Analysis of Dirac Pro with H.264 Intra frame coding”, P. Kharwandikar, 2010.

[30] K. Onthriar, K. K. Loo and Z. Xue, “Performance comparison of emerging Dirac video codec with

H.264/AVC”, IEEE International Conference on Digital Telecommunications, 2006, ICDT apos; Vol. 06,

pp.22, Issue: 29-31, Aug. 2006.

[31] Video Test Sequences: http://ultravideo.cs.tut.fi/#testsequences &

http://forum.doom9.org/archive/index.php/t-135034.html

32] "VP-Next Overview and Progress Update" (PDF). WebM Project (Google). Retrieved 2012-12-29.

Available on: http://downloads.webmproject.org/ngov2012/pdf/04-ngovproject-update.pdf

[33] Access the website http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html Project

on: “Comparative study of Intra Frame Coding efficiency in HEVC and VP9”, S. Kodpadi, 2014.

[34] AVS China software: Part 2 (Password protected):

ftp://124.207.250.92/public/codec/video_codec/AVS2_P2/RD10.1.zip

[35] J. Golston (Distinguished Member Technical Staff, Texas Instruments), “Comparing Media Codecs for

Video Content”, Embedded Systems Conference – San Francisco 2004.

[36] ISO/IEC JTC1/SC29/WG1 1 (MPEG), "Coding of audio-visual objects - Part 10: Advanced Video Coding,"

International Standard 14496-10, ISO/IEC, 2004.

[37] ISO/IEC JTC I/SC29/WGI I (MPEG), "Information technology - Generic coding of moving pictures and

associated audio information: Video," International Standard 13818-2, ISO/IEC, 2000.

[38] T. Wiegand et al, “Overview of the H.264/AVC Video Coding Standard”, IEEE Transactions on Circuits

and Systems for Video Technology, Vol. 13, No. 7, pp. 560-576, July 2003.

[39] X. Zhou, E. Q. Li, and Y.-K. Chen, “Implementation of H.264 decoder on general purpose processors

with media instructions”, SPIE Conference on Image and Video Communications and Processing, vol. 5022,

pp.224-235, May 2003.

[40] A. K. Khan and H. Jamal ,”The Intra-Prediction in H.264”, 2008. : https://getinfo.de/app/The-Intra-

Prediction-in-H-264/id/BLCP%3ACN069872406

[41] D. Marpe et al, “The H.264/MPEG4 advanced video coding standard and its applications”, IEEE

Communications Magazine, Vol. 44, pp. 134-143, Aug. 2006.

[42] C. Fogg, “Suggested figures for the HEVC specification”, ITU-T / ISO-IEC Document: JCTVC J0292r1,

July 2012.

[43] G.J. Sullivan et al, "Standardized Extensions of High Efficiency Video Coding (HEVC)", IEEE Journal of

Selected Topics in Signal Processing, vol.7, no.6, pp. 1001-1016, Dec. 2013

[44] T. Wiegand et al, “Overview of the H.264/AVC Video Coding Standard”, IEEE Transactions on Circuits

and Systems for Video Technology, Vol. 13, No. 7, pp. 560-576, July 2003.

Page 39 of 40

[45] Z. Wang, et al., “Image quality assessment: From error visibility to structural similarity”, IEEE

Transactions Image Processing, vol. 13, pp. 600–612, Apr. 2004

[46] J.Ohm et al., ” Comparison of the Coding Efficiency of Video Coding Standards- Including High

Efficiency Video Coding (HEVC)”, IEEE Transactions on Circuits and Systems for Video Technology on

Circuits and Systems for Video Technology, Vol.22, pp. 1669 – 1684, No.12, December 2012.

[47] Multimedia Processing: http://nptel.iitk.ac.in/courses/Webcourse-

contents/IIT%20Kharagpur/Multimedia%20Processing/New_index1.html

[48] I.E. Richardson, “Coding Video: A Practical guide to HEVC and beyond”, Wiley, 11 May 2015.

[49] Access the website http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html Project

on: “Overview and comparison of H.264/MPEG-4 part 10 or AVC video coding standard and audio video

standard (AVS) of China as compression standard for video” , K.V.Dhonsale, Spring 2012.

[50] Access the website http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html Project

on: “Study and implementation of Video compression standards (H.264/AVC and Dirac).” , K.V.Dhonsale,

Spring 2012.

[51] Video Coding: HEVC and Beyond- IEEE Journal of Selected Topics In Signal Processing,vol.7, pp. 93-

1151, Dec. 2013

[52] Access the website http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html Project

on: “Audio Video Standard for Mobile (AVS-M)”, S.Devaraju, Spring 2009.

[53] L.Yu et al, “Overview of AVS coding standards”, Signal Processing: Image Communication. Vol. 24, pp.

247 – 262 , April 2009.

[54] L.Fang et al, “Overview of AVS Standard”, Multimedia and Expo, 2004. ICME '04. 2004 IEEE

International Conference, Vol.1, pp. 423- 426, June 2004 .

[55] S.Ma et al, “Overview of IEEE 1857 video coding standard”, access the link: http://ieeexplore.ieee.org/xpl/abstractAuthors.jsp?reload=true&arnumber=6738308 [56] “P1857 - IEEE Draft Standard for Advanced Audio and Video Coding”, pp.1-190, 26 October 2012.

[57] H. Zhang and Z.Ma , “Fast Intra Mode Decision for HEVC”, IEEE Transactions on CSVT, vol.24 , pp.660-668, April 2014.

[58] B. Bross et al, “High Efficiency Video Coding (HEVC) Text Specification Draft 10”, Document JCTVC-L1003, ITU-T/ISO/IEC Joint Collaborative Team on Video Coding (JCTVC), Mar. 2013 available on http://phenix.itsudparis.eu/jct/doc_end_user/current_document.php?id=7243

[59] N. Ahmed et al, “Discrete CosineTransform”, IEEE Transactions on Computers, Vol.C-23, pp.90-93, January1974.

[60] Special issue on emerging research and standards in next generation video coding, IEEE Transactions on Circuits and Systems for Video Technology (CSVT), vol. 23, pp. 2009-2142, Dec. 2013.

[61] D.Grois et al, “Performance Comparison of H.265/MPEG-HEVC, VP9, and H.264/MPEG-AVC Encoders”, available on: http://iphome.hhi.de/marpe/download/Performance_HEVC_VP9_X264_PCS_2013_preprint.pdf

[62] B. Zeng and J. Fu, “Directional discrete cosine transforms - A new framework for image coding”, IEEE Trans. on Circuits and Systems for Video Technology, vol. 18, no. 3, pp. 305-313, Mar. 2008.

Page 40 of 40

[63] M.T. Pourazad et al, “HEVC:The new gold standard for video compression”, IEEE consumer electronics magazine ,vol.1, no.7, pp.36-46, July 2012.

[64] Y.-K. Chen et al, “Implementation of H.264 encoder and decoder on personal computers,” J. Visual Commun. Image Representation, vol. 17, no. 2, pp. 509 – 532, Apr. 2006.

[65] K. Chen et al, “Efficient SIMD optimization of HEVC encoder over X86 processors,” APSIPA ASC 2012 Asia-Pacific , pp. 1-4, Dec. 2012.

[66] H. S. Malvar et al, “Low-complexity transform and quantization in H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 7, pp. 598 – 603, Jul. 2003.

[67] S. Vasudevan and K.R. Rao, “Combination method of fast HEVC Encoding”, IEEE ECTI-CON-2014, Korat, Thailand, May 2014.

[68] HEVC tutorial by I.E.G. Richardson: http://www.vcodex.com/h265.html

[69] H.264 tutorial by I.E.G. Richardson: https://www.vcodex.com/h264.html

[70] A. Puri et al, “Video coding using the H.264/MPEG-4 AVC compression standard”, Signal Processing: Image Communication, vol. 19, pp. 793-849, Oct. 2004.

[71] M. Khan et al, “An adaptive complexity reduction scheme with fast prediction unit decision for HEVC Intra encoding”, IEEE ICIP, pp. 1578-1582, Sept. 2013.

[72] S. Jeong et al., High efficiency video coding for entertainment quality. ETRI J vol. 33, pp.145–154, 2011.