14
Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos Multimedia Understanding Group Information Processing Laboratory Dpt. Electrical and Computer Engineering Aristotle University of Thessaloniki Greece Building parsimonious SVM models for chewing detection and adapting them to the user

Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

Iason KarakostasVasileios PapapanagiotouAnastasios Delopoulos

Multimedia Understanding Group

Information Processing Laboratory

Dpt. Electrical and Computer Engineering

Aristotle University of Thessaloniki

Greece

Building parsimonious SVM models for chewing detection and adapting them to the user

Page 2: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

Introduction

• Automatically monitoring eating activity has received significant attention in the research community

• Most of the proposed systems require proprietary/specialized sensors

We propose

• A chewing detection system that captures audio from a commercial bone conduction microphone

• A method to build efficient and effective SVM models

• A method to adapt the SVM model to user requiring minimal user feedback

Building parsimonious SVM models for chewing detection and adapting them to the user

Page 3: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

Chewing detection hardware

Building parsimonious SVM models for chewing detection and adapting them to the user

Commercial bone-conduction microphone

Android smart-phone

Page 4: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

Audio signal pre-processing and feature extraction

Building parsimonious SVM models for chewing detection and adapting them to the user

• Sampling at 4 kHz• High pass FIR cut-off frequency

at 20 Hz• Hamming filter of 3.72 sec

• Overlapping windows • step = 160 samples, • time domain features windows

(length = 400 samples)• spectral features windows

(length = 800 samples)

A single chew Voice

Page 5: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

Extracted features

• 16 features• 7 time domain and 9 spectral features

• Fractal dimension and log of variance are quite discriminative individually

Building parsimonious SVM models for chewing detection and adapting them to the user

1.2 1.4 1.6 1.8 2Fractal Dimension

0

0.2

0.4

0.6

0.8chewingnon-chewing

-16 -12 -8 -4 0Log of Variance

0

0.1

0.2

0.3

0.4chewingnon-chewing

Page 6: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

Classification and Active Learning

• Classification of feature vectors using the SVM classifier

• Active Learning• A method of improving a classifier’s effectiveness by enhancing the training

set in “rounds” • Apply model on pool of feature vectors• Select feature vectors and request feedback (correct label)

• Active learning is used for: • Parsimonious Active Learning Training (PALT) • Inter-Active Learning Adaptation (IALA)

Building parsimonious SVM models for chewing detection and adapting them to the user

Page 7: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

Parsimonious Active Learning Training (PALT)

Building parsimonious SVM models for chewing detection and adapting them to the user

A method to create SVM models with• much fewer support vectors • without sacrificing much of the model’s discriminative power

Page 8: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

Inter-Active Learning Adaptation (IALA)

• Adaptation of a pre-trained SVM model to a single user based on inter-active feedback requests for ambiguous time intervals

• Can be applied both on directly trained models and on PALT models

• Based on time and SVM score thresholds

Building parsimonious SVM models for chewing detection and adapting them to the user

Page 9: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

Dataset

• Recordings from 8 subjects using Invisio M3h microphone

• Recording protocol • 7 food types• Non-chewing activities• Both silent and noisy setups

• Ground truth labels assigned based on time-stamps and visual inspection of the captured signals

• Total duration: 90 minutes

• Prior probability: 0.45 for chewing class

Building parsimonious SVM models for chewing detection and adapting them to the user

Page 10: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

Experimental evaluation

• Cross-Validation (CV) and Leave-One-Subject-Out (LOSO) experiment setups

• Baseline and PALT performance comparison• Recordings from 8 subjects

• IALA method evaluation using both directly trained and PALT as base models • Recordings from 6 subjects that recorded the protocol twice

Building parsimonious SVM models for chewing detection and adapting them to the user

Page 11: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

PALT Evaluation Results

Precision Recall F1 score Accuracy SVs

CV baseline 0.89 0.89 0.89 0.90 33552

CV PALT@100 0.83 0.89 0.86 0.87 232

CV PALT@800 0.85 0.90 0.87 0.88 1633

Building parsimonious SVM models for chewing detection and adapting them to the user

Precision Recall F1 score Accuracy SVs

LOSO baseline 0.84 0.81 0.81 0.83 31152

LOSO PALT@100 0.82 0.79 0.79 0.83 233

LOSO PALT@800 0.81 0.82 0.8 0.83 1632

k-fold Cross-Validation (k=14)initial train set has 40 feature vectors

LOSO evaluation

Page 12: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

IALA Evaluation Results

Building parsimonious SVM models for chewing detection and adapting them to the user

Precision Recall F1 score Accuracy SVs

LOSO baseline 0.84 0.82 0.81 0.82 25043

LOSO PALT 0.87 0.66 0.72 0.82 1633

LOSO base + IALA 0.84 0.83 0.82 0.83 25038

LOSO PALT + IALA 0.88 0.80 0.83 0.85 1652

LOSO evaluation on 6 subjects that recorded the protocol twice

Page 13: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

Conclusions

• We use active learning techniques for two tasks • Create and deploy classification models with fewer SVs that require reduced

computational resources • Per-user adaptation of the deployed model, requiring minimal user feedback,

and leading to increased accuracy

• User adaptation with IALA has better performance when used with a PALT base model

• Validation on an experimental dataset recorded in lab conditions shows inter-subject accuracy of 0.85 using user-adapted models and parsimonious initial SVM models

• Future work: Evaluation of the proposed system on a larger dataset under free-living conditions

Building parsimonious SVM models for chewing detection and adapting them to the user

Page 14: Building parsimonious SVM models for chewing detection and ...madima.org/wp-content/uploads/2017/11/karakostas... · Iason Karakostas Vasileios Papapanagiotou Anastasios Delopoulos

Multimedia Understanding Group, Aristotle University of Thessaloniki, Greece

Building parsimonious SVM models for chewing detection and adapting them to the user

Thank you