24
Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1

Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

  • Upload
    dodan

  • View
    226

  • Download
    4

Embed Size (px)

Citation preview

Page 1: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Learnable pooling with Context Gating for Video Classification

Antoine Miech, Ivan Laptev, Josef Sivic

1

Page 2: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Goal: Multi-modal features pooling2

POOLING MODULE

CLASSIFICATION

Page 3: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Recurrent model (e.g LSTM)3

LSTM

LSTM

Page 4: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Recurrent model (e.g LSTM)4

LSTM

LSTM

LSTM

LSTM

Page 5: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Recurrent model (e.g LSTM)5

LSTM

LSTM

LSTM

LSTM

LSTM

LSTM

Page 6: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Recurrent model (e.g LSTM)6

LSTM

LSTM

LSTM

LSTM

LSTM

LSTM

LSTM

LSTM

Page 7: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Recurrent model (e.g LSTM)7

LSTM

LSTM

LSTM

LSTM

LSTM

LSTM

LSTM

LSTM

Page 8: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Problem ?

▹ Slow for inference/training▹ This is NOT a sequential problem▹ Needs lots of data for training▹ How about very long videos ?

But surprisingly good results !

8

Page 9: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Our approach9

Page 10: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Traditional pooling

▹ Bag-of-visual-words [Sivic and Zisserman, 2003][Csurka et al., 2004]

▹ VLAD [Jégou et al., 2010]

▹ Fisher Vector [Perronnin et al., 2007]

10

Page 11: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Traditional Pipeline11

Unsupervised learning of a dictionary

Learning a supervised classifier

Encoding And unsupervised

dimension reduction

End-to-end Pipeline

Supervised learning of dictionary+

Supervised dimension reduction+

Learning a supervised classifier

VS

Page 12: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Learnable pooling 12

Unsupervised End-to-EndBag-of-visual-Words [Sivic and Zisserman, 2003] Soft-DBoW

VLAD [Jégou et al., 2010] NetVLAD [Arandjelović et al. CVPR 2016]

Fisher Vector [Perronnin et al., 2007] NetFV

Page 13: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Model overview13

Page 14: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Context Gating14

Page 15: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Context Gating15

Page 16: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Context Gating16

Equation:

Page 17: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Model overview17

Page 18: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Results18

Page 19: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Generalization19

Page 20: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

20

Page 21: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

BonusWinning the kaggle competition

21

Page 22: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Effects of ensembling22

Page 23: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

LOUPE Tensorflow toolbox23

GITHUB LOUPE repo:github.com/antoine77340/LOUPE

GITHUB Kaggle code repo:github.com/antoine77340/Youtube-8M-WILLOW

Page 24: Learnable pooling with Context Gating for Video Classification · Learnable pooling with Context Gating for Video Classification Antoine Miech, Ivan Laptev, Josef Sivic 1. 2 Goal:

Questions ?

24