An Efﬁcient Convolutional Network for Human Pose Estimation · In recent years, human pose...

An Efficient Convolutional Network for Human Pose Estimation

Umer Rafi1

rafi@vision.rwth-aachen.de

Ilya Kostrikov1

ilya.kostrikov@rwth-aachen.de

Juergen Gall2gall@iai.uni-bonn.de

Bastian Leibe1

leibe@vision.rwth-aachen.de

1 Computer Vision Group,RWTH Aachen University,Germany

2 Computer Vision Group,University of Bonn,Germany

In recent years, human pose estimation hasgreatly benefited from deep learning and hugegains in performance have been achieved onpopular benchmarks [1, 3, 4]. The trend to max-imise the accuracy on benchmarks, however, re-sulted in computationally expensive deep net-work architectures that require expensive hard-ware and pre-training on large datasets. In thiswork, we propose an efficient deep network ar-chitecture that can be efficiently trained on mid-range GPUs without the need of any pre-trainingand that is on par with much more complex mod-els on the benchmarks [1, 3, 4].

Our proposed Fully ConvolutionalGoogLeNet (FCGN) network (see Figure 1) isbased on the network architecture from [2]. Wetake the first 17 layers of [2] and add a decon-volution layer to make it fully convolutional. Inaddition, we introduce a skip layer and combinetwo FCGNs with shared weights to obtain amulti-resolution network. Belief maps for eachjoint are then obtained by a deconvolution layerwith large kernel size in combination with asigmoid function for normalisation and spatialdrop out for regularisation.

We compare the performance of the pro-posed architecture against convolutional posemachines [5] on the well-known FLIC, LSP, andMPII benchmarks [1, 3, 4]. Our proposed net-work outperforms most previous approaches andachieves competitive performance to the morecomplex model of [5], while requiring only 3GBof memory and far less training time.

[1] M. Andriluka, L. Pishchulin, P. Gehler, andB. Schiele. 2D Human Pose Estimation: NewBenchMark. In CVPR, 2014.

[2] S. Ioffe and C. Szegedy. Batch normalization: Ac-celerating deep network training by reducing in-ternal covariate shift. 2015.

[3] S. Johnson and M. Everingham. Clustered Pose

Figure 1: (a) Proposed fully convolutionalGoogLeNet (FCGN) (b) The proposed multi-resolution network combines two FCGNs.

Figure 2: Our Qualitative results on FLIC [4],LSP [3] and MPII [1].

and Nonlinear Appearance Models for HumanPose Estimation. In BMVC, 2010.

[4] B. Sapp and B. Taskar. MODEC : Multimodel De-composable Models for Human Pose Estimation.In CVPR, 2013.

[5] S. Wei, V. Ramakrishna, T. Kanade, andY. Sheikh. Convolutional Pose Machines. InCVPR, 2016.

An Efﬁcient Convolutional Network for Human Pose Estimation · In recent years, human pose...

Documents

Pose estimation from one conic correspondence

Pose Estimation in Heavy Clutter Using a Multi-Flash Cameraav21/Documents/pre2011/Pose Estimation... · 2015. 5. 25. · Pose Estimation in Heavy Clutter using a Multi-Flash Camera

Adversarial Semantic Data Augmentation for Human Pose … · 2020. 7. 26. · Keywords: Pose Estimation, Semantic Data Augmentation 1 Introduction Human Pose Estimation (HPE) is the

Pose from Shape: Deep Pose Estimation for Arbitrary 3D

VNect: Real-time 3D Human Pose Estimation with a Single ... · Monocular 2D Pose Estimation Early work mostly on monocular 2D pose estimation Deep learning methods represents the

DeepIM: Deep Iterative Matching for 6D Pose Estimation · RGB based 6D Pose Estimation: Traditionally, pose estimation using RGB im-ages is tackled by matching local features [16,23,4]

Stacked Hourglass Networks for Human Pose Estimation arXiv ... · Stacked Hourglass Networks for Human Pose Estimation 5 hands, a nal pose estimate requires a coherent understanding

Deep Head Pose Estimation from Depth Data for In-car ......3 HEAD POSE ESTIMATION The goal of the system is the estimation of the head pose (i.e., pitch, roll, yaw angles with respect

Crane Pose Estimation Using UWB Real-Time Location Systemusers.encs.concordia.ca/~hammad/papers/Crane Pose...Methodology of Using UWB RTLS for Crane Pose Estimation The present paper

Articulated Pose Estimation by a Graphical Model with ...papers.nips.cc/paper/5291-articulated-pose-estimation-by-a... · Articulated Pose Estimation by a Graphical Model with Image

Real-time Simultaneous Pose and Shape Estimation for ...vis.uky.edu/~gravity/Publications/2014/ArticulatedPoseShape_CVPR14.… · Real-time Simultaneous Pose and Shape Estimation

Why pose estimation?

Head Pose Estimation - Final Report

Skeleton-Based Pose Estimation of Human Figures · 2014-11-07 · Skeleton-Based Pose Estimation of Human Figures ... 5 Skeleton-Based Pose Estimation 28 ... Most algorithms treat

Non-Rigid 2D-3D Pose Estimation and 2D Image Segmentationlccv.ece.gatech.edu/docs/cvpr_Nonrigid_2D-3D_pose...2D-3D pose tracking or pose estimation is concerned with relating the spatial

Pose Machines: Articulated Pose Estimation via Inference Machines

L11: 6D Pose Estimation - GitHub Pages

Real-Time Head Pose Estimation in Low-Resolution Football ... · Real-Time Head Pose Estimation in Low-Resolution Football ... time head pose estimation in low-resolution football

Pose Estimation and Calibration Algorithms for Vision · PDF filePose Estimation and Calibration Algorithms for Vision and Inertial Sensors ... Pose Estimation and Calibration Algorithms

Stacked Hourglass Networks for Human Pose Estimation … · Stacked Hourglass Networks for Human Pose Estimation 5 hands, a nal pose estimate requires a coherent understanding of