71
Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6, 2015

Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Embed Size (px)

Citation preview

Page 1: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Audio Manipulation Through Gesticulation

Garrett Fosdick, Jair RobinsonJosé Sanchez

Bradley University - Electrical & Computer EngineeringOctober 6, 2015

gfosdick
Done
Page 2: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Overview

- Background- Design Approach- Economic Analysis- Schedule- Division of Labor- Societal and Environmental Impacts

2

Page 3: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Background

3

Page 4: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Problem

• Audio manipulation over a distance

4

• Audio manipulation while multi-tasking

• Interactivity with music

Page 5: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Problem Background• Similar products

- Playstation Eyetoy- Xbox Kinect

• Difference- Audio interactivity- Purely 2D image tracking

5

Microsoft 2014

Sony 2008

Page 6: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Solution

• Kinetis Tower• Visual and audio input• Programming based

- Hand tracking- Dynamic Time Warping- Tie audio effects to motions

6

Freescale 2014

Leopard Imaging

Page 7: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Solution - Audio Manipulation Through Gesticulation

• Audio manipulation over a distance

7

• Audio manipulation while multi-tasking

• Interactivity with music

- Control while several feet away

- Control with movement of a single hand

- Interact through motion- Reduce repetitiveness of songs

Page 8: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Hand Tracking Design

8

Page 9: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Color Matching Through Zeroing

9

Page 10: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Color Matching Through Zeroing

10

Page 11: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Color Matching Through Zeroing

11

Page 12: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Color Matching Through Zeroing

12

Page 13: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Color Thresholding

13

Page 14: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Color Thresholding

14

Page 15: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Color Thresholding

15

Page 16: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Color Thresholding

16

Page 17: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Motion Thresholding

17

Page 18: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Motion Thresholding

18

Page 19: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Motion Thresholding

19

Page 20: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Motion Thresholding

20

Page 21: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Motion Thresholding

21

Page 22: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Color and Motion Thresholding

22

Page 23: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Color and Motion Thresholding

23

Page 24: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Color and Motion Thresholding

24

Page 25: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Color and Motion Thresholding

25

Page 26: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Search Limiting

26

Page 27: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Results

27

Page 28: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

28

Page 29: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Dynamic Time Warping

29

Page 30: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Calculate Difference

- =A B |A-B|

30

Page 31: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Example

359

- =A B |A-B|

359

359

444

222

333

115

137

026

4 2 3359

31

Page 32: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Calculate Cheapest Route To Bottom Right Corner

Difference Cost Search Area

32

Page 33: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Example

|A-B|

115

137

026

Cheapest Cost To Get To That

Square

1

33

Page 34: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Example

|A-B|

115

137

026

Cheapest Cost To Get To That

Square

12

24

34

Page 35: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Example

|A-B|

115

137

026

Cheapest Cost To Get To That

Square

127

249

24

35

Page 36: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Example

|A-B|

115

137

026

Cheapest Cost To Get To That

Square

127

249

2410

36

Page 37: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Example

|A-B|

115

137

026

Cheapest Cost To Get To That

Square

127

249

2410

37

Page 38: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Variations

• Time distortion cost

38

- Non-diagonal movements cost more

• Path killing- Routes over a certain cost are removed

Page 39: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Audio Processing

39

Page 40: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Audio Processing

• Processing time• Analog – digital conversion• 5 audio effects• Finalizing input and output audio

40

Page 41: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Low Pass Filtering• Passing lower Frequencies below cutoff

41Beausievers 2013

Page 42: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

High Pass Filtering• Passing higher frequencies above cutoff

42Beausievers 2013

Page 43: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Distortion• Amplifies audio signal to threshold and clips

43Common Wikimedia 2011

Page 44: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Chorus• Sounds like audio is produced by multiple

sources

44Sound on Sound 2004

Page 45: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Reverb• Sound reflecting in a space

45Practical Musical Production 2012

Page 46: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Development and Testing

46

Page 47: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Development

• Tools- Bradley Computers

- Kinetis Freescale Tower- Leopard Imaging USB Camera

- Kinetis IDE- MATLAB

47

Freescale 2014

Leopard Imaging

Page 48: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Testing• Hand Tracking

- Must have a 80% success rate in the following conditions- Outdoor/Indoor lighting- Different movement speeds

(slow/medium/fast)- At least 3 different hands

- Success is tracking a hand for 30 seconds or more

48

Freescale 2014

Leopard Imaging

Page 49: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Testing

• Dynamic Time Warping- Test against already solved matrixes- Capable of gesture matching

success 90% of the time- Gesture matching will occur in the

same scenarios as the hand tracking

49

Freescale 2014

Leopard Imaging

Page 50: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Testing

• Audio processing- 5 audio effects- Quick processing time- Satisfactory auditory results- No crackling or static- Minimize lag to 100 ms or less

50

Freescale 2014

Leopard Imaging

Page 51: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Economic Analysis

51

Page 52: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Development Costs

52

• Software- Kinetis IDE: $ 0.00- MATLAB: $ 0.00 (Provided By School)

• Hardware- Kinetis Freescale Tower: $294.25- Leopard Imaging USB Camera: $91.94- Total: $386.19

Page 53: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Division of Labor

Decide On Hardware/Order

Parts

Research Video Processes, and Test in

Matlab

Research Audio Processes, and Test in

Matlab

Implement Visual Components on

Hardware

Implement Audio Components on

Hardware

Combine Visual and Audio Components

Test and Improve

Done

Garrett

Jair

Both

Key

53

Page 54: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Schedule

54

Page 55: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Societal and Environmental Impacts

• Is it right to alter an artists music• Liability from damage while gesturing• Liability of harm if used improperly

Disclaimer before use to protect

- Advise users to use caution - Check their surroundings

55

Page 56: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Societal and Environmental Impacts

• For avid music listeners• RoHS compliant

56

Page 57: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Conclusion- Background

- Need for more interactivity with music- Solution provides innovative experience with

personal music- Design Approach

- Dynamic Time Warping and hand recognition for gestures

- Program audio effects tied directly with gestures

57

Page 58: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Conclusion-Feasible project to finish within schedule-Environmentally safe-Socially safe with disclaimer and caution

58

Page 59: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Audio Manipulation Through Gesticulation

Garrett Fosdick, Jair RobinsonJosé Sanchez

Bradley University - Electrical & Computer EngineeringOctober 6, 2015

gfosdick
Done
Page 60: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Extra Slides

60

Page 61: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Division of Labor

61

Page 62: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Test Procedures• Camera Input

- Display Footage On A Monitor - PASS/FAIL

• Hand Tracking- Display Footage At The End Of Each Step – PASS/FAIL- Tracks All Test Hands For A Full 30 Seconds

• Gesture Recognition- Light Up An LED When The Gesture Occurs- Must Be Right 90% Of The Time

62

Page 63: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Test Procedures• Audio Input

- Receiving Without Lag And Distortion- PASS/FAIL

• Audio Output- Audio Is Audible At Normal Hearing Level - PASS/FAIL- No Lag Above 100ms Or Unintended Distortion

• Audio Effects- No Lag from Gesture Trigger Above 100ms- Must Trigger from Correct Gesture 100% of time

63

Page 64: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Preliminary Test Results - Gesture

Matching Partial Random

Success Fail Fail64

Page 65: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Preliminary Test Results - Gesture

65

Page 66: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Block Diagram

66

Page 67: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Glass Block Diagram

67

Page 68: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Gantt Chart

68

Page 69: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Schedule

69

Page 70: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Schedule

70

Page 71: Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Schedule

71