13
Knowledge Acquisition Method Based on Singular Value Decomposition for Human Motion Analysis Yinlai Jiang, Member, IEEE, Isao Hayashi, Member, IEEE, and Shuoyu Wang, Member, IEEE Abstract—The knowledge remembered by the human body and reflected by the dexterity of body motion is called embodied knowledge. In this paper, we propose a new method using singular value decomposition for extracting embodied knowledge from the time-series data of the motion. We compose a matrix from the time-series data and use the left singular vectors of the matrix as the patterns of the motion and the singular values as a scalar, by which each corresponding left singular vector affects the matrix. Two experiments were conducted to validate the method. One is a gesture recognition experiment in which we categorize gesture motions by two kinds of models with indexes of similarity and estimation that use left singular vectors. The proposed method obtained a higher correct categorization ratio than principal component analysis (PCA) and correlation efficiency (CE). The other is an ambulation evaluation experiment in which we distinguished the levels of walking disability. The first singular values derived from the walking acceleration were suggested to be a reliable criterion to evaluate walking disability. Finally we discuss the characteristic and significance of the embodied knowledge extraction using the singular value decomposition proposed in this paper. Index Terms—Singular value decomposition, embodied knowledge, motion analysis, gesture recognition, walking difficulty evaluation Ç 1 INTRODUCTION T HE motions of the human body are being actively studied in the fields of medical care, sports, arts, and so on. The skill ability internalized in the body, which is called embodied knowledge or tacit knowledge, plays an important role in acquiring skills that enable, for exam- ple, athletes to move economically [1], martial artists to make more flexible movements [2], and musicians to per- form with uplifting feeling [3], [4]. Embodied knowledge is a native human endowment that is usually not con- sciously accessible. Embodied knowledge has been studied from the view- point of not only neurophysiology, but also the structure of the body. Miall et al. proposed a forward model to out- put a signal to a controlled object via feedback control using the Smith predictor having no time delay for the control [5], [6]. Kawato proposed a cerebellar computa- tional model, called an internal model, which argued that the inverse model with feedback and feedforward control is useful in modeling motor control [7], [8]. The forward and inverse models are remarkable in constituting an internal model. Instead of arguing about internal model construction, we propose identifying the relationship between sensor input and the output of internal model movement as knowledge. Our approach assumes that skills consist of a hierarchical structure with a mono-func- tional layer to generate the single function result and a meta-functional layer that adapts itself to environmental change, as shown in Fig. 1. We assume that a skill is the knowledge embodied in the human body. The information of motion, such as trajectory, speed, and acceleration, has been measured to extract and ana- lyze embodied knowledge [3], [4], [9], [10], [11], [12], [13], [14], [15]. Time-series data analysis is usually neces- sary to extract features from the measurement data. Vari- ous methods for analyzing physical movement have been proposed. Mitra and Acharya [16] surveyed varied meth- ods for gesture recognition, ranging from mathematical models, such as hidden Markov models (HMMs) and particle filter, to soft computing approaches such as artifi- cial neural network and fuzzy sets. Wilson and Bobick [17] proposed parametric HMMs which outperformed standard HMMs in an experiment to recognize three- dimensional hand gestures. Pylvanainen [18] presented a gesture recognition system based on continuous HMMs to recognize gestures measured by a 3D accelerometer. Lichtenauer et al. [19] recognized sign language using statistical dynamic time warping for time warping and a separate classifier on the warped features. Suk et al. [20] recognized hand gestures in a continuous video stream by a dynamic Bayesian network. Lamar et al. [21] pro- posed a neural network, Temporal-CombNET (T-Comb- NET), and applied it to Japanese-Kana finger spelling recognition. Jerde et al. [22] measured the angles of hand joints for recognizing finger spelling hand shapes, and Y. Jiang is with the Research Institute, Kochi University of Technology, Kami, Kochi 782-8502, Japan. E-mail: [email protected]. I. Hayashi is with the Faculty of Informatics, Kansai University, Takat- suki, Osaka 569-1095, Japan. E-mail: [email protected]. S. Wang is with the School of Systems Engineering, Kochi University of Technology, Kami, Kochi 782-8502, Japan. E-mail: [email protected]. Manuscript received 21 Jan. 2013; revised 20 Jan. 2014; accepted 12 Mar. 2014. Date of publication 9 Apr. 2014; date of current version 29 Oct. 2014. Recommended for acceptance by L. Khan. For information on obtaining reprints of this article, please send e-mail to: [email protected], and reference the Digital Object Identifier below. Digital Object Identifier no. 10.1109/TKDE.2014.2316521 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 26, NO. 12, DECEMBER 2014 1041-4347 ß 2014 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

Knowledge Acquisition Method Based onSingular Value Decomposition for Human

Motion AnalysisYinlai Jiang,Member, IEEE, Isao Hayashi,Member, IEEE, and Shuoyu Wang,Member, IEEE

Abstract—The knowledge remembered by the human body and reflected by the dexterity of body motion is called embodied

knowledge. In this paper, we propose a new method using singular value decomposition for extracting embodied knowledge from the

time-series data of the motion. We compose a matrix from the time-series data and use the left singular vectors of the matrix as the

patterns of the motion and the singular values as a scalar, by which each corresponding left singular vector affects the matrix. Two

experiments were conducted to validate the method. One is a gesture recognition experiment in which we categorize gesture motions

by two kinds of models with indexes of similarity and estimation that use left singular vectors. The proposed method obtained a higher

correct categorization ratio than principal component analysis (PCA) and correlation efficiency (CE). The other is an ambulation

evaluation experiment in which we distinguished the levels of walking disability. The first singular values derived from the walking

acceleration were suggested to be a reliable criterion to evaluate walking disability. Finally we discuss the characteristic and

significance of the embodied knowledge extraction using the singular value decomposition proposed in this paper.

Index Terms—Singular value decomposition, embodied knowledge, motion analysis, gesture recognition, walking difficulty evaluation

Ç

1 INTRODUCTION

THE motions of the human body are being activelystudied in the fields of medical care, sports, arts, and

so on. The skill ability internalized in the body, which iscalled embodied knowledge or tacit knowledge, plays animportant role in acquiring skills that enable, for exam-ple, athletes to move economically [1], martial artists tomake more flexible movements [2], and musicians to per-form with uplifting feeling [3], [4]. Embodied knowledgeis a native human endowment that is usually not con-sciously accessible.

Embodied knowledge has been studied from the view-point of not only neurophysiology, but also the structureof the body. Miall et al. proposed a forward model to out-put a signal to a controlled object via feedback controlusing the Smith predictor having no time delay for thecontrol [5], [6]. Kawato proposed a cerebellar computa-tional model, called an internal model, which argued thatthe inverse model with feedback and feedforward controlis useful in modeling motor control [7], [8]. The forwardand inverse models are remarkable in constituting aninternal model. Instead of arguing about internal model

construction, we propose identifying the relationshipbetween sensor input and the output of internal modelmovement as knowledge. Our approach assumes thatskills consist of a hierarchical structure with a mono-func-tional layer to generate the single function result and ameta-functional layer that adapts itself to environmentalchange, as shown in Fig. 1. We assume that a skill is theknowledge embodied in the human body.

The information of motion, such as trajectory, speed,and acceleration, has been measured to extract and ana-lyze embodied knowledge [3], [4], [9], [10], [11], [12],[13], [14], [15]. Time-series data analysis is usually neces-sary to extract features from the measurement data. Vari-ous methods for analyzing physical movement have beenproposed. Mitra and Acharya [16] surveyed varied meth-ods for gesture recognition, ranging from mathematicalmodels, such as hidden Markov models (HMMs) andparticle filter, to soft computing approaches such as artifi-cial neural network and fuzzy sets. Wilson and Bobick[17] proposed parametric HMMs which outperformedstandard HMMs in an experiment to recognize three-dimensional hand gestures. Pylvanainen [18] presented agesture recognition system based on continuous HMMsto recognize gestures measured by a 3D accelerometer.Lichtenauer et al. [19] recognized sign language usingstatistical dynamic time warping for time warping and aseparate classifier on the warped features. Suk et al. [20]recognized hand gestures in a continuous video streamby a dynamic Bayesian network. Lamar et al. [21] pro-posed a neural network, Temporal-CombNET (T-Comb-NET), and applied it to Japanese-Kana finger spellingrecognition. Jerde et al. [22] measured the angles of handjoints for recognizing finger spelling hand shapes, and

� Y. Jiang is with the Research Institute, Kochi University of Technology,Kami, Kochi 782-8502, Japan. E-mail: [email protected].

� I. Hayashi is with the Faculty of Informatics, Kansai University, Takat-suki, Osaka 569-1095, Japan. E-mail: [email protected].

� S. Wang is with the School of Systems Engineering, Kochi University ofTechnology, Kami, Kochi 782-8502, Japan.E-mail: [email protected].

Manuscript received 21 Jan. 2013; revised 20 Jan. 2014; accepted 12 Mar.2014. Date of publication 9 Apr. 2014; date of current version 29 Oct. 2014.Recommended for acceptance by L. Khan.For information on obtaining reprints of this article, please send e-mail to:[email protected], and reference the Digital Object Identifier below.Digital Object Identifier no. 10.1109/TKDE.2014.2316521

3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 26, NO. 12, DECEMBER 2014

1041-4347 � 2014 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistributionrequires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

Page 2: 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

reduced the dimensions of the hand by discriminantanalysis by Principal Component Analysis (PCA). Gaitanalysis is also a well studied topic. Daffertshofer et al.[23] suggested the effectiveness of PCA for reducinghigh-dimensional time-series data sets to a small numberof modes in the analysis of gait kinematics. Williamsonand Andrews [24] measured the acceleration of the shankand detected the main phases of a normal gait duringwalking by machine learning. Jakobsen et al. [25]assessed the knee joint range of motion during rehabilita-tion based on correlation coefficients and the smallestreal difference. However, the HMM is not effective whenthe number of states is large or the data is discontinuous.Since a neural network is too sensitive in time-series datalength, the accuracy is not very good. PCA reduces thenumber of explanatory variables and is a model for visu-alization with principal component variables. It is possi-ble, however, to lose significant principal componentvariables when the proportion of variance is low and thenumber of data is inadequate. In other words, the accu-racy of PCA declines when the contribution ratio is lowdue to a shortage of data.

In this paper, we propose a new method to extractembodied knowledge of body motion by using SingularValue Decomposition (SVD) [26], [27]. Recently, SVD hasbeen used in time-series data analyses for data mining [29]and motion analyses to study the coordinative structures inhuman behavior [30]. In our method, the left and right sin-gular vectors and the singular values are decomposed froma Hankel matrix defined from the time-series data, whichare measured with sensors [28]. Since the left singular vec-tor represents the characteristics of the Hankel matrix andthe singular value represents the strength of the corre-sponding left singular vector, SVD is used more generallyas a method for extracting characteristics from observedtime-series data.

We applied the proposed method to a hand gesturerecognition experiment and a walking disability evalua-tion experiment. In the hand gesture recognition experi-ment, we distinguish five kinds of gestures according tothe similarity and the estimation by using the left singularvectors. In the walking disability evaluation experiment,we distinguished the levels of walking disability by athree-dimensional hyperplane constructed by singularvalues. The results of the two experiments suggest thatSVD is effective for extracting embodied knowledge fromthe time-series data. The characteristic and significance of

the embodied knowledge extraction using SVD is dis-cussed based on the results of the two experiments.

2 EMBODIED KNOWLEDGE EXTRACTION USING

SVD

Suppose M is an m-by-n matrix, then a factorization of M isM ¼ USV , where U ¼ ðu1; u2; . . . ; um) contains the left sin-gular vectors ofM, V ¼ ðv1; v2; . . . ; vn) contains the right sin-gular vectors of M, and the matrix S is an m-by-n diagonalmatrix with nonnegative real singular values on the diago-nal. The SVD is an important factorization of a rectangularreal or complex matrix with many applications in signalprocessing and statistics. Applications using SVD includecomputing the pseudo inverse, least squares data fitting,matrix approximation, and determining the rank, range,and null space of a matrix [26], [27].

Suppose that we measure w points ðP1; P2; . . . ; PwÞ ofthe body while a person is performing a motion. On pointPi, the measured data series of motion G is denoted asti;G. The data series of ti;G consists of three-dimensionaldata ðXi;G; Y i;G; Zi;GÞ. From this time-series data ti;G ¼ðXi;G; Y i;G; Zi;GÞ, n vectors by m data sampling are

extracted by overlapping and the matrices Mi;GX ;Mi;G

Y , andMi;G

Z are constructed as a collective of the measurementdata on the X; Y , and Z coordinates of the motion, respec-tively. Fig. 2 shows a design for constructing the matrix

Mi;GX . The matrices Mi;G

X ;Mi;GY , and Mi;G

Z are described asfollows:

Mi;GX ¼ �

Xi;G1 ; Xi;G

2 ; . . . ; Xi;Gn

�T(1)

Mi;GY ¼ �

Y i;G1 ; Y i;G

2 ; . . . ; Y i;Gn

�T(2)

Mi;GZ ¼ �

Zi;G1 ; Zi;G

2 ; . . . ; Zi;Gn

�T; (3)

where Xi;Gp ¼ ðxi;G

p;1 ; xi;Gp;2 ; . . . ; x

i;Gp;mÞ; p ¼ 1; 2; . . . ; n, and x is a

datum on the X coordinate. We define Y i;Gp and Zi;G

p in the

same way. Since the matrix is designed by overlapping the

extracted data from the whole time-series data, the method

using SVD is less constrained by the length of the whole

data than those using a neural network. The difference

between PCA and our method is that PCA generally ana-lyzes the matrix composed of the deviation of each datum

from the empirical mean and does not allow overlap of the

measurement data.

Fig. 1. Conceptual framework of the proposed model.Fig. 2. Design of matrixMi;G

X .

JIANG ET AL.: KNOWLEDGE ACQUISITION METHOD BASED ON SINGULAR VALUE DECOMPOSITION FOR HUMAN MOTION ANALYSIS 3039

Page 3: 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

Suppose Mi;Gk ; k ¼ fX;Y; Zg is an m-by-n matrix as the

general format of Mi;GX ;Mi;G

Y ;Mi;GZ . The SVD of the matrix

Mi;Gk is

Mi;Gk ¼ Ui;G

k Si;Gk

�V i;Gk

�T; (4)

where Ui;Gk ¼ ðui;G

1;k ; ui;G2;k ; . . . ; u

i;Gm;kÞ is an m-by-m unitary

matrix, fV i;Gk gT denotes the conjugate transpose of V i;G

k ¼ðvi;G1;k ; vi;G2;k ; . . . ; vi;Gn;kÞ which is an n-by-n unitary matrix, and

the matrix Si;Gk is an m-by-n diagonal matrix. The diagonal

entries of Si;Gk are the singular values of Mi;G

k . The matrix

Ui;Gk contains the left singular vectors ofMi;G

k and the matrix

V i;Gk contains the right singular vectors ofMi;G

k .

Now, take Mi;GX as an example of matrix Mi;G

k to discuss

motion analysis. SVD can decompose the matrix Mi;GX into

a product of Ui;GX , Si;G

X , and V i;GX . Intuitively, the left singu-

lar vectors in Ui;GX form a set of patterns of Mi;G

X and the

diagonal values in matrix Si;GX are the singular values,

which can be considered as scalars, by which each corre-

sponding left singular vectors affect the matrix Mi;GX . Sup-

pose that the number of left singular vectors is l, and the

element number of the jth left singular vector is q. Let us

denote the couples of the singular values and the left sin-

gular vector as ððsi;G1;X; u

i;G1;XÞ; ðsi;G

2;X; ui;G2;XÞ; . . . ; ðsi;G

l;X; ui;Gl;XÞÞ for

ui;Gj;X ¼ ðui;G

1j;X; ui;G2j;X; . . . ; u

i;Ghj;X; . . . ; u

i;Gqj;XÞ in the descending

order of the singular values, where ui;Ghj;X is the hth element

of the jth left singular vector ui;Gj;X. The left singular vector

expresses the characteristic of the whole time-series data

better if its corresponding singular value singular is larger.

That is, the greater the singular value is, the more domi-

nant the corresponding pattern is.

3 GESTURE RECOGNITION WITH LEFT SINGULAR

VECTORS

The left singular vectors ðui;G1;X; u

i;G2;X; . . . ; u

i;Gl;XÞ well represent

the characteristics of motions. We conducted a gesture rec-ognition experiment to demonstrate the effectiveness of fea-ture extraction using the left singular vectors.

3.1 Hand Gesture Measurement

Five kinds of hand gestures, Come here (CH), Go away (GA),Go right (GR), Go left (GL), and Calm down (CD), were per-formed by two subjects, SW and ST, who were males in their20s. These gestures are commonly used in daily life. The ges-tures were performed in a 50 cm� 50 cm� 50 cm cubic space,whose zero point and coordinate system are shown in Fig. 3.The motions of the hand gestures were measured withMovetr/3D and GE60/W (Library, Tokyo, Japan).The sub-jects were instructed to finish the gestures within the samespeed period since we focused on the trajectory, not thespeed of motion. Five markers were measured: P1 on the tipof the thumb, P2 on the tip of the middle finger, P3 on the tipof the little finger, P4 on the thumb side of the wrist and P5

on the little finger side of the wrist.The motions of subject SW are shown in Fig. 4 by nine

frames extracted from the experimental video every 1/6 s.

The time-series data of P2 when subject SW performed thefive kinds of gestures are shown in Fig. 5. In Fig. 5, themovement change for GA, CH, and CD is large in the topand bottom direction (onto the z-axis) and in the front andback direction (onto the y-axis), and for GR and GL, themovement change is big in the right and left direction (ontothe x-axis). One gesture was executed nine times by eachsubject to get a sufficient variety of motion. The differencebetween subjects was not the main subject of this studysince the restriction of the cubic space eliminates some indi-vidual differences. Data of the first five executions wereused as the acquisition of patterns of the gesture. Data ofthe last four times were used to distinguish the gesture.

We proposed two kinds of gesture recognition methodsusing the left singular vectors extracted from the time-seriesdata of gesture motion: gesture recognition based on thesimilarity between gesture distances (SGD) and gesture rec-ognition based on the similarity between gesture vectors(SGV). Our aim is to discuss a method to extract embodiedknowledge from the time-series data rather than to developa method to recognize hand gestures.

3.2 Gesture Recognition Based on SGD

Suppose that the observed data series are divided into

two groups: ti;GTRD as the training data series, and ti;GCHD as

the checking data series. Let us denote the left singular

vector ti;GX;TRD for training data related to the X coordinate

values of the point Pi on the hand for the gesture G

as Ui;GX;TRD ¼ ðui;G

1;X;TRD; ui;G2;X;TRD; . . . ; u

i;Gl;X;TRDÞ for ui;G

j;X;TRD ¼ðui;G

1j;X;TRD; ui;G2j;X;TRD; . . . ; u

i;Ghj;X;TRD; . . . ; u

i;Gqj;X;TRDÞ. We define

the left singular vectors tiX;CHD for checking data as

UiX;CHD ¼ ðui

1;X;CHD; ui2;X;CHD; . . . ; u

il;X;CHDÞ, for ui

j;X;CHD ¼ðui

1j;X;CHD; ui2j;X;CHD; . . . ; u

ihj;X;CHD ; . . . ; ui

qj;X;CHDÞ in the

same way.

To recognize the hand gestures, three kinds of similar-ity criteria are defined on the data ti;G ¼ ðXi;G; Y i;G; Zi;GÞas follows:

Fig. 3. Environment for measurement of gestures.

3040 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 26, NO. 12, DECEMBER 2014

Page 4: 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

S1 : ri�Ui;GTRD; U

iCHD

�¼ 1

3lq

X3k¼1

Xl

j¼1

Xqh¼1

ui;Ghj;k;TRD �

Xqh¼1

uihj;k;CHD

���������� (5)

S2 : ri�Ui;GTRD; U

iCHD

�¼ 1

3lq

X3k¼1

Xl

j¼1

Xqh¼1

��ui;Ghj;k;TRD � ui

hj;k;CHD

�� (6)

S3 : ri�Ui;GTRD; U

iCHD

�¼ 1

3lq

X3k¼1

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiXl

j¼1

Xqh¼1

�ui;Ghj;k;TRD � ui

hj;k;CHD

�2vuut :(7)

The similarity S1 is defined by the absolute differential ofthe total left singular vector between the training data andthe checking data. The similarity S2 is defined by the totalabsolute differential of the left singular vectors between thetraining data and the checking data at the same order. Thesimilarity S3 is defined by the euclidean distance betweenthe left singular vectors of the training data and the check-ing data on the multidimensional space.

Since there are w measurement points ðP1; P2; . . . ; PwÞ,the estimated gesture G� is identified by the following twokinds of estimations:

E1 : G� ¼

�Gf jmax

f

Xwi¼1

n�Gi

f

�;

for Gif ¼

�Gf jmin

fri�U

i;Gf

TRD; UiCHD

�� (8)

Fig. 5. Time-series data of the gestures(P2).

Fig. 4. Motion of gestures(representative frames are extracted every1/6 second).

JIANG ET AL.: KNOWLEDGE ACQUISITION METHOD BASED ON SINGULAR VALUE DECOMPOSITION FOR HUMAN MOTION ANALYSIS 3041

Page 5: 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

E2 : G� ¼

�Gf jmin

f

Xwi¼1

ri�U

i;Gf

TRD; UiCHD

�; (9)

where Gf is the fth gesture among the five hand gestures,and nðGi

fÞ is a counting function, which is nðGifÞ ¼ 1 if the

condition Gif is satisfied at the Pi point on the hand.

The estimation E1 is defined by counting the number ofminimal similarity values on all the markers and the mostcounted gesture is output as the recognition result. The esti-mation E2 outputs the gesture with the minimal total simi-larity values as the recognition result.

Table 1 shows the recognition results of the gesture pat-terns of two subjects based on the three kinds of differentsimilarities, Equations (5), (6), and (7). In the calculation,m ¼ 125; n ¼ 5; q ¼ 125; l ¼ 1; w ¼ 5. The recognition resultssuggest that similarities S2 and S3 lead to relatively highercorrect recognition rates. Especially, the pair of the similar-ity S2 and the estimation E1 leads to a significantly highaccuracy of 90:0 percent. The result suggests that the pair ofS2 and E1 is more feasible for gesture recognition. However,in general it is hard to distinguish between the gesture of GoRight and Go Left, and so the results are understandable.

In the experiment, the positions of the five markers onthe right hand were measured. However, it is possible thatnot all the positions of these markers have a high relevanceto the gestures. We compared the accuracy of each markerfor gesture recognition with similarity S2 and estimation E1

by calculating the left singular vectors at each marker.Table 2 shows the large counting number of minimized sim-ilarity values. The results are shown in the form of a=b,where a is the number of counted times of the most countedgesture and b is the name of the gesture. As a result, the firstmarker P1 was selected as the most important markerbecause the accuracy is the highest, 93:85 percent. Since P1

measures the time series at the tip of the thumb and islargely related to movement of the thumb, it is suggestedthat the motion of the thumb is important in gesture recog-nition. Reducing the number of markers considered in therecognition not only reduces the calculation but alsoimproves the recognition accuracy.

3.3 Gesture Recognition Based on SGV

In the second method, gesture recognition is based onSGV. Now, consider the data series in Mi;G

X of Equation(1). If one of the data series Xi;G

p is replaced by anotherdata series Xi

CHD, the singular values and the left singularvectors of Mi

X;CHD are different from those of Mi;GX .

MiX;CHD ¼ �

Xi;G1 ; . . . ; Xi;G

p�1; XiCHD;X

i;Gpþ1; � � � ; Xi;G

n

�T: (10)

The difference between the left singular vectors of Mi;GX

and MiX;CHD is determined by how Xi

CHD is different fromthe other p� 1 data series. The patterns of Mi

X;CHD changemore when Xi;G

p is replaced by a quite dissimilar data seriesthan when it is replaced by a similar one. Therefore, the dif-ference between the left singular vectors of Mi;G

X and XiCHD

can be considered as a measure of the difference betweenXi

CHD and the other data series. IfXiCHD comes from another

kind of hand gesture, the difference can be used as a crite-rion for judging whether Xi

CHD comes from the same kindof hand gesture as the other data series. In our algorithm,the location of Xi

CHD is fixed at the end of the data series.Therefore, only the n th Xi;G

n is replaced by another dataseriesXi

CHD in Equation (10).To recognize a hand gesture, three kinds of similarity cri-

teria are defined on the data ti;G ¼ ðXi;G; Y i;G; Zi;GÞ asfollows:

S4 : ri�Ui;GTRD; U

iCHD

�¼ 1

3lq

X3k¼1

Xl

j¼1

Xqh¼1

ui;Ghj;k;TRD �

Xqh¼1

uihj;k;CHD

���������� (11)

S5 : ri�Ui;GTRD; U

iCHD

�¼ 1

3lq

X3k¼1

Xl

j¼1

Xqh¼1

��ui;Ghj;k;TRD � ui

hj;k;CHD

�� (12)

S6 : ri�Ui;GTRD; U

iCHD

�¼ 1

3lq

X3k¼1

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiXl

j¼1

Xqh¼1

�ui;Ghj;k;TRD � ui

hj;k;CHD

�2vuut :(13)

The meanings of S4, S5, and S6 are the same as those ofS1, S2, and S3, respectively. Since there are w measurementpoints ðP1; P2; . . . ; PwÞ, the estimated gesture G� is identifiedby the following estimations:

E3 : G� ¼ Gf jmin

f

Xwi¼1

ri�U

i;Gf

TRD; UiCHD

�( ): (14)

TABLE 1Gesture Recognition Accuracy Results (SGD)

TABLE 2Recognition Accuracy Comparison of Markers (SGD)

3042 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 26, NO. 12, DECEMBER 2014

Page 6: 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

In the proposed method, all of the time series must havethe same number of data to compose matrix Mi;G

X , as shownin Equation (1). However, the lengths of the hand gesturesin real measurements are different according to the kinds ofgestures and the subjects who perform them. Therefore,preprocessing is necessary to make the data series have thesame number of data. In this paper, the number of data wasset to be the average number. If a data series contains moredata than the average number, data are deleted from thedata series at the same interval. If a data series containsfewer data than the average number, data are interpolatedin the data series at the same interval. The interpolated dataare calculated using quadric interpolation.

Table 3 shows the recognition results based on the threekinds of similarity definitions of S4, S5, and S6, in Equations(11), (12), and (13), respectively. The recognition results sug-gest that the similarity definitions of S5 and S6 lead to rela-tively higher correct recognition rates, but the correct rate ofrecognition based on S4 was very low. Therefore, S5 and S6

are more feasible in gesture recognition.As in the comparison of markers in the method by SGD,

we compared the accuracy of markers in this method bySGV with similarity S5 and estimation E3. Table 4 showsthe large counting number of minimized similarity values.The first M1 was selected as the most important marker, asin Table 2. We realize that the movement of the thumb ismost related to gesture recognition by the results. The rec-ognition results of the two methods show that the left singu-lar vectors extracted from the time-series data can be usedas knowledge to distinguish gestures. Especially, the totalabsolute differential of the left singular vectors at the sameorder is significantly effective because the left singular vec-tors express a time-dependent weight for identifying thewhole movement. Regarding the incorrect recognitions, themotion data were quite similar, although the motion wasincorrectly recognized as a gesture different from theintended one. Gestures GR and GL, for example, have oppo-site meanings but their motions are very similar in that thehand waves left and right. Their difference between the twogestures is the hand moves faster from left to right in GRwhile it moves faster from the right to left in GL. Even

human beings sometimes do not distinguish correctlybetween them [31]. On the other hand, the differencesbetween subjects were not considered in the experimentdue to the restrictions of the measurement environment.Further study will discuss individual differences by carry-ing out experiments with more subjects.

We compared our motion analysis method using SVDwith the following traditional methods: a PCA clusteringmethod, a PCA distance method, and a correlation effi-ciency (CE) method. We used the gesture distance methodthat distinguishes gesture movements with similarity S2

and estimation E1, and the gesture vector method that dis-tinguishes gesture movements with similarity S5 and esti-mation E3. For the correlation coefficient method, thecombination of similarity S7 and estimation E4 is adoptedto calculate the recognition accuracy. The following combi-nation of similarity S8 and estimation E5 is adopted for thePCA clustering method, and the combination of similarityS9 and estimation E5 is adopted for the PCA distancemethod.

S7 : ri�Ui;GTRD; U

iCHD

� ¼ 1

3

X3k¼1

R�di;GTRD;k; d

iCHD;k

�(15)

S8 : ri�Ui;GTRD; U

iCHD

�¼ minf

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiXHc¼1

�fXYZ!c

�E�di;Gf

TRD

��� fXYZ!c

�E�diCHD

���2vuut

(16)

S9 :ri�Ui;GTRD; U

iCHD

�¼ minf

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiXHc¼1

�fGf!c

�di;Gf

TRD

�� fGf!c

�diCHD

��2vuut (17)

E4 : G� ¼ Gf jmax

f

1

w

Xwi¼1

ri�U

i;Gf

TRD; UiCHD

�( )(18)

TABLE 3Gesture Recognition Accuracy Results (SGV)

TABLE 4Recognition Accuracy Comparison of Markers (SGV)

JIANG ET AL.: KNOWLEDGE ACQUISITION METHOD BASED ON SINGULAR VALUE DECOMPOSITION FOR HUMAN MOTION ANALYSIS 3043

Page 7: 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

E5 : G� ¼ Gf jmax

f

Xwi¼1

n�ri�U

i;Gf

TRD; UiCHD

��( ); (19)

where Rðx; yÞ is the correlation coefficient between x and y.Eð Þ is an average, fA!Bð Þ is a function of the PCA, whichconverts from the variable A to the principal axis B, and His the number of principal axes, where we setH ¼ 3.

The results are shown in Table 5. The average accuracy ofthe two methods we proposed, which are the method basedon SGD and the method based on SGV, are much higherwith values of 90:0 and 80:0 percent when considering allthe markers, and with average values of 87:12 and 80:67 per-cent when considering each marker separately. None of theother three methods obtained an accuracy higher than80:0 percent, regardless of whether all the markers wereconsidered or each marker was considered separately. Thereason for the low accuracy is that PCA and CE dependstrongly on the length of the data. The difference betweenthe lengths of the TRD for training and the CHD for check-ing leads to the low recognition accuracy. In contrast, sinceour methods using SVD can extract a movement character-istic regardless of the length of the time-series data, they donot depend on the data length because of the overlapping ofsubsets of the time-series data, which contributes to thehigh recognition accuracy.

3.4 Knowledge Representation with Fuzzy If-ThenRules

The results of Sections 3.2 and 3.3 illustrate that the featuresof gesture motion are extracted into the left singular vectors.The features can be used to classify the gestures with a highdegree of accuracy by using a proper calculation. In order tounderstand the features better, an intuitive and understand-able way to represent these features are required. In this sec-tion, the features of the left singular vectors are described asfuzzy sets, and fuzzy if-then rules [32] are used to representthe knowledge.

The frequency distribution of the elements in the left sin-gular vectors is shown by histograms. The histograms of the1st–3rd left singular vectors are shown in Fig. 6 as examples.The singular vectors were calculated fromM1;GA

X , that is, thematrix composed from the x coordinate time-series data atpoint 1 of gesture GA. The elements are between �0.4 and0.4, and are counted in four intervals: (�0.4, 0.2), [0.2, 0), [0,0.2), and [0.2, 0.4).

The features of the histograms can be described byquadrilateral fuzzy sets. A quadrilateral fuzzy set isdenoted as Qða1ðd1Þ; a2ðd2Þ; a3ðd3Þ; a4ðd4ÞÞ, where a1 � a4are the lateral coordinates, and d1 � d4 the vertical

coordinates of the vertices of the quadrilateral. d1 � d4are the grade of membership of a1 � a4. a1 � a4 andd1 � d4 are determined to make the quadrilateral fuzzyset to approximate the shape of the histogram with d1and d4 being 0, the maximum of d2 and d3 being 1, a2and a3 being �0.1 and 0.1 respectively. Fig. 7 shows thequadrilateral fuzzy sets of the corresponding histogramsin Fig. 6. The fuzzy sets for the 1st-3rd left singular vec-tors are Qð�0:3ð0Þ;�0:1ð1Þ; 0:1ð0Þ; 0:3ð0ÞÞ, Qð�0:35ð0Þ;�0:1ð0:83Þ; 0:1ð1Þ; 0:32ð0ÞÞ, and Qð�0:35ð0Þ; �0:1ð0:59Þ;0:1ð1Þ; 0:3ð0ÞÞ, respectively.

After describing the features of the left singular vectorsas fuzzy sets, the fuzzy if-then rule to represent the knowl-edge assumes the following form

ifw

i¼1

3

k¼1

l

j¼1

�Qi;G�

l;k is Qi;Gl;k

�then G� is G; (20)

TABLE 5Recognition Accuracy of Methods

Fig. 6. Element histogram of the 1st–3rd left singular vectors (P1 of GA).

Fig. 7. Membership functions of the 1st-3rd left singular vectors(P1 of GA).

3044 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 26, NO. 12, DECEMBER 2014

Page 8: 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

where Q is a quadrilateral fuzzy set to represent the jth sin-gular vector on the kth coordinate at the ith marker. Sincethese fuzzy rules represent the knowledge about the fea-tures of the gestures, gesture recognition can be realized byfuzzy reasoning based on these rules.

4 WALKING DISABILITY EVALUATION USING

SINGULAR VALUES

Accelerometers have been widely used to monitor bodymovements, including gait, sit-to-stand transfers, posturalsway, and falls, for its low cost, less restriction and wear-ability [33], [34]. In clinician, they have been used to mea-sure physical activity levels and to identify and classifymovements performed by subjects [24], [35], to analyze gaitpattern in diseases, such as Parkinson disease [36], [37] andstroke [38], [39], and to monitor patient [40]. Since the singu-lar value indicates the strength of the corresponding left sin-gular vector, which represents the characteristics of theHankel matrix, we used the singular values calculated fromthe walking acceleration data to evaluate the levels of walk-ing disability.

4.1 Acceleration Measurement with WearableWireless Accelerometers

In the experiment, the acceleration during walking wasmeasured by three wearable wireless three-axis accelerome-ters (Motion Recorder MVP-RF8, Microstone Nagano,Japan): P1 on the back of the waist (B. Waist), P2 on the mid-point of the right shank (R. M. Shank), and P3 on the mid-point of the left shank (L. M. Shank), as shown in Fig. 8. Thesampling rate of the sensor was 100 Hz. When the subjectstands upright, the sensors’ x-axis is front/back, the y-axisis up/down, and the z-axis is right/left. However, the coor-dination system changes since the orientation of the sensorschanges during walking. In the experiment, we examinedthe acceleration of walking difficulty simulated by restrict-ing the right leg with knee supporters and weight bands

(Fig. 8). The knee supporter bound around the knee jointdecreases the knee joint range of motion and the weightband bound around the ankle joint can simulate the weak-ness in muscle strength. The simulation is very important intesting our method since it does not endanger the safety ofthe disabled during the development phase of the method.Two levels of walking difficulty were simulated by two lev-els of restraint in the experiment. A weak restraint was sim-ulated with one knee supporter and one weight band (1 kg),and a strong restraint with two knee supporters and twoweight bands (2 kg). In total, three statuses (Normal withoutrestraint, Weak, and Strong) were examined.

Six healthy volunteers (YJ, TK, KT, KS, TF, and RT; 5males and 1 female) aged 21–31 yr (mean 26 yr) partici-pated in the experiment. The subjects were instructed towalk for approximately 4 m along a straight line. Theexperiment was carried out in the status order of Normal,Weak, and Strong. For each status, each subject walkedfour times.

The measurement time-series data in the x coordinateof the three sensors when subject YJ walked in the statusof Weak are shown in Fig. 9. The fluctuation of the accel-eration was significant when the right foot or the leftfoot was landing. Fig. 9a shows five strides. The acceler-ation data of the first stride were extracted and areshown in Fig. 9b. All the acceleration at the B. Waist,R. M. Shank, and L. M. Shank significantly fluctuatedwhen the right foot pushed off from the floor or steppedon the floor. The fluctuation in the acceleration at L. M.Shank was more significant than that at R. M. Shank

Fig. 8. Experimental settings for ambulation.

Fig. 9. Example of ambulation.

JIANG ET AL.: KNOWLEDGE ACQUISITION METHOD BASED ON SINGULAR VALUE DECOMPOSITION FOR HUMAN MOTION ANALYSIS 3045

Page 9: 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

since the right leg was restricted by the knee supporterand the weight band. The fluctuation in the accelerationat B. Waist was the smallest among the three measure-ment points. This showed that the trunk of the body,especially the waist, was kept relatively stable to main-tain the body balance even when the lower limbs weredisabled.

4.2 Relationship between Singular Values andWalking Disability

We focused on the acceleration change around the timewhen the right foot pushed off and stepped on the floor.The acceleration data around the right foot landing wereanalyzed. Matrix M for SVD was designed according tothe local maximum turning points, as shown in Fig. 9. Ittook five strides to walk 4 m in the experiment. The accel-eration data of the middle three strides, which do notinvolve the initiation and termination of the walkingmovement, were extracted for analysis. The data from0.5 s before the maximum turning point to 0.5 s after themaximum turning point were extracted as column vec-tors. For each turning point, nine column vectors (threesensors, each of which has three coordinates) wereextracted. The column vectors from the same accelerationdata series composed matrix M. Therefore, M was amatrix of 100 rows and three columns. In this paper, onlythe first singular value and the first left singular vectorwere considered. Thus parameter l was 1.

The first singular values extracted from the accelerationdata with SVD are listed in Table 6. In spite of the individual

differences in walking, similar changes of the singular val-ues of all six subjects are shown. The singular values of P2

and P3 decreased with the increase of walking difficulty,and especially those of P2 decreased in all three axes, X, Y ,and Z. However, the decrease did not show at P1. The Waistis the center of the body and is always kept balanced duringmovement. Fig. 9 shows that the waist was relatively stableto maintain the body balance. An F-test was performed onthe singular values at P2 to assess the significance of the dif-ference among the three levels. The F-test results in Table 7show that for all three axes, the singular values are signifi-cantly different between the three levels. The first singularvalues of P2, therefore, are suggested to be an effective crite-rion to evaluate walking difficulty. That is, the first singularvalues from the acceleration of the restricted leg decreaseswith the increase of walking difficulty. The larger the firstsingular values at P2 are, the more serious the walkingdisability is.

TABLE 6Singular Values of Ambulation Experiment

TABLE 7Result of F Test on P2

3046 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 26, NO. 12, DECEMBER 2014

Page 10: 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

4.3 R-Plane for Visualization of Walking Disability

The results in Table 6 suggest that the first singular valuescan be used as the criterion for the evaluation of walkingdisability. In practical applications, walking ability assess-ment is important to track recovery and design a properrehabilitation program according to the condition of thepatient during the process of walking rehabilitation. Clini-cal measurements of walking ability are mainly based oninspection by the physical therapist, sometimes with thehelp of muscle strength measurement, X-ray examination,ground reaction force, 3D motion analysis, accelerationanalysis, gait and stride analysis, electromyography(EMG), and so on [39]. Objective measurements and cor-rect interpretation of the measurement results can have asubstantial impact on the recovery. A convenient assess-ment method is desired for those who rehabilitate andexercise at home, because they can know their status any-time by themselves.

To provide a more understandable presentation of the

data for the physical therapists and the patients to evalu-

ate the recovery of walking ability by using the singularvalues calculated from the walking acceleration, we pro-

pose a visualization tool, called a rehabilitation plane (R-

Plane), to assist in the evaluation of walking disability.

The first singular values extracted from the acceleration

data at P2 are plotted in 3D space in Fig. 10. The singular

values of four times of walking are plotted in a different

shape and color according to the status. The average sin-

gular values of the three statuses, ðsNx ; s

Ny ; s

Nz Þ for

Normal, ðsWx ; sW

y ; sWz Þ for Weak, and ðsS

x ; sSy ; s

Sz Þ for

Strong, are connected by black lines. Fig. 10 shows that

the first singular values are clustered according to the sta-

tus. The line connecting the average values can be consid-

ered as the line of severity of walking difficulty. A plane

can be defined by the average singular values of the three

statuses. Given a patient’s singular values of walking, a

perpendicular line is drawn from the point defined by thesingular values to the lines connecting the average singu-

lar values; then, the patient’s walking disability can be

evaluated intuitively by the position of the intersection

point. This plane is called the R-Plane, which can be used

for walking disability evaluation in rehabilitation to

assess the recovery of patients.The conceptual diagram of the R-Plane is shown in

Fig. 11. Suppose the normal vector of the R-Plane is~n ¼ ðnx; ny; nzÞ, then the R-Plane can be described by Equa-tion (21).

nx

�x� sN

x

�þ ny

�y� sN

y

�þ nz

�z� sN

z

� ¼ 0: (21)

Suppose the coordinate of patient A in 3D space isðsX; sY ; sZÞ, whose projection to the R-Plane is ðsR

i ; sRj Þ. The

projection of ðsRi ; s

Rj Þ to the line connecting the average sin-

gular values is ðs�i ; s

�j Þ ¼ ðs�

X; s�Y ; s

�ZÞ. Then the walking dis-

ability and recovery of patient A can be assessed by theposition of ðs�

i ; s�j Þ and the distances of ðsR

i ; sRj Þ to the aver-

age singular values.As an example, we consider subject TK as patient A and

draw the R-Plane based on the average singular values ofthe other five subjects. The equation of the plane is

� 0:810xþ 0:532yþ 0:246zþ 9:284 ¼ 0: (22)

The R-Plane is shown in Fig. 12. Subject TK’s singularvalues in the statuses of Normal, Weak, Strong areðNL;WC;SCÞ, whose projection to the lines in the R-Planeare ðNL�;WC�; SC�Þ. Although status Normal is not veryaccurate, statuses Weak and Strong are correctly located inthe figure. The results show the effectiveness of the R-Planein intuitively presenting the walking disability and provid-ing visual information of the patient’s recovery.

5 DISCUSSION

A newmethod for embodied knowledge extraction from thetime-series data of motion by using singular value decom-position is proposed and evaluated by a hand gesture recog-nition experiment and a walking disability assessmentexperiment. We design a matrix by overlapping the subsetsof the time-series data and use the left singular vectors ofthe matrix as the patterns of the motion and the singularvalue as a scalar, by which the corresponding left singularvector affects the matrix.

Fig. 10. Singular values of the three statuses. Fig. 11. Conceptual diagram of R-Plane.

JIANG ET AL.: KNOWLEDGE ACQUISITION METHOD BASED ON SINGULAR VALUE DECOMPOSITION FOR HUMAN MOTION ANALYSIS 3047

Page 11: 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

Since the method is less affected by the length of thetime-series data, it leads to higher accuracy of hand gesturerecognition than do conventional methods using PCA orCE. Both the recognition based on SGD and the recognitionbased on SGV obtained the best accuracy with the samesimilarity calculation, S2 and S5. The similarities S2 and S5

are defined by the total absolute differential of the left sin-gular vectors between the training data and the checkingdata at the same order. It is suggested that the order of thetime-series data is an important characteristic of the time-series data and the left singular vector is able to extract thatcharacteristic. As for the estimation methods, E1, which isdefined by counting the number of minimal similarity val-ues on all the markers, obtained better accuracy. Motionanalysis usually deals with data from multiple sensors andthe sensors are related to each other; for example, P4 and P5

in the gesture motion measurement, which are on thethumb side of the wrist and on the little finger side of thewrist, respectively, are highly correlated. In this case, a deci-sion by the majority is more effective than averaging orsumming.

The recognition results on each marker show that on P1,the recognition accuracy is the highest with the value of93:85 percent. Fig. 13 is a plot of the results in Tables 2 and 3for the position of the five markers. On average, the markerson the thumb side of the arm obtained higher recognitionaccuracy. This may be because when a hand gesture is per-formed, the thumb side of the arm attracts more attention,since it is in the view of the performer, and thus the motionis more consistent.

The first singular values calculated from the walkingacceleration were suggested to be effective to evaluatethe levels of walking disability. The singular value isconsidered as a scalar, by which the corresponding leftsingular vector affects the original matrix. Walking withdifferent disabilities has a similar pattern or characteris-tic. However, the strength of the most dominant charac-teristic, which is the first singular vector, is significantlydifferent. The singular values can be utilized as a crite-rion for the evaluation of recovery in walking rehabilita-tion. An R-Plane, defined by the singular values fromthe walking acceleration, is proposed as a visualizationtool to assist the evaluation of the walking disability.

The physical therapist and the patient can intuitivelyevaluate the recovery progress with the R-Plane.

We propose to extract embodied knowledge by identify-ing the relationship between the sensor input and the out-put of the internal model. The mechanism underlying theskillful movement which a human being performs uncon-sciously is still not sufficiently elucidated from the view-points of neurophysiology or the structure of the body. Themotion analysis method using SVD we propose is effectiveto extract embodied knowledge from motion by obtainingthe characteristics and their strength. Its effectiveness wasvalidated by the gesture recognition experiment, whichmeasured the position of the measurement points, and thewalking disability experiment, which measured the acceler-ation of the measurement points. Our aim was to discussthe method to extract embodied knowledge from the time-series data rather than to develop methods to recognizehand gestures and to evaluate walking disability, and fairlygood results were obtained with the proposed method.

6 CONCLUSION

A newmethod for embodied knowledge extraction from thetime-series data of motion is proposed by using SVD. Weapplied the method to a gesture recognition experiment anda walking disability evaluation experiment. The effective-ness of extracting the motion pattern with the left singularvectors in gesture recognition was validated by comparisonwith conventional methods PCA and CE. In addition, theeffectiveness of evaluating the strength of the dominantmotion pattern affecting the motion with the singular valueswas shown by the significantly different first singular val-ues among the three levels of walking difficulty, normal,weak, and strong. Motion analysis using SVD is suggestedas a useful method to extract embodied knowledge frommotion measurement data.

ACKNOWLEDGMENTS

This work was supported by the Strategic Project to Supportthe Formation of Research Bases at Private Universities:

Fig. 12. Example of R-Plane.

Fig. 13. Recognition result of each marker.

3048 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 26, NO. 12, DECEMBER 2014

Page 12: 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

Matching Fund Subsidy fromMEXT (Ministry of Education,Culture, Sports, Science and Technology), 2008-2012, andJSPS KAKENHI Grant Numbers 23240088, 22300197, and23700316.

REFERENCES

[1] L. E. Holt and S. L. Beilock, “Expertise and its embodiment: Exam-ining the impact of sensorimotor skill expertise on the representa-tion of action-related text,” Psychonomic Bulletin Rev., vol. 13, no. 4,pp. 694–701, Aug. 2006.

[2] D. C. Spencer, “Habit(us), body techniques and body callusing:An ethnography of mixed martial arts,” Body Soc., vol. 15, no. 4,pp. 119–143, Dec. 2009.

[3] S. Furuya and H. Kinoshita, “Organization of the upper limbmovement for piano key-depression differs between expert pia-nists and novice players,” Exp. Brain Res., vol. 185, no. 4, pp. 581–593, Mar. 2008.

[4] S. Furuya, R. Osu, and H. Kinoshita, “Effective utilization of grav-ity during arm downswing in keystroke by expert pianists,” Neu-rosci., vol. 164, no. 2, pp. 822–831, Dec. 2009.

[5] R. C. Miall, D. J. Weir, D. M. Wolpert, and J. F. Stein, “Is the cere-bellum a Smith predictor?” J. Motor Behavior, vol. 25, pp. 203–216,Sep. 1993.

[6] R. C. Miall and J. K. Jackson, “Adaptation to visual feedbackdelays in manual tracking: Evidence against the Smith Predictormodel of human visually guided action,” Exp. Brain Res., vol. 172,no. 1, pp. 77–84, Jun. 2006.

[7] M. Kawato, “Internal models for motor control and trajectoryplanning,” Current Opinion Neurobiol., vol. 9, no. 6, pp. 718–727,Dec. 1999.

[8] H. Imamizu andM. Kawato, “Cerebellar internal models: Implica-tions for the dexterous use of tools,” Cerebellum, vol. 11, no. 2,pp. 325–335, Jun. 2012.

[9] J. K. Aggarwal and Q. Cai, “Human motion analysis: A review,”Comput. Vis. Image Understanding, vol. 73, no. 3, pp. 428–440, Mar.1999.

[10] J. K. Aggarwal and M. S. Ryoo, “Human activity analysis: Areview,” ACM Comput. Surveys, vol. 43, no. 3, Article 16, p. 43,2011.

[11] R. Balasubramaniam and M. T. Turvey, “Coordination modes inthe multisegmental dynamics of Hula hooping,” Biol. Cybern.,vol. 90, pp. 176–190, Apr. 2004.

[12] M. Hirashima, K. Kudo, K. Watarai, and T. Ohtsuki, “Control of3D limb dynamics in unconstrained overarm throws of differentspeeds performed by skilled baseball players,” J. Neurophysiol.,vol. 97, no. 1, pp. 680–691, Jan. 2007.

[13] Y. Nakayama, K. Kudo, and T. Ohtsuki, “Variability and fluctua-tion in running gait cycle of trained runners and non-runners,”Gait Posture, vol. 31, no. 3, pp. 331–335, Mar. 2010.

[14] K. Furukawa, S. Igarashi, K. Ueno, T. Ozaki, S. Morita, N. Tama-gawa, T. Okuyama, and I. Kobayashi., “(2002.) Modeling humanskill in Bayesian network,” Linkoping Electronic Articles in Com-puter and Information Science, vol. 7, Linkoping Unive. Electron.Press, [Online]. Available: http://www.ida.liu.se/ext/epa/cis/2002/012/tcover.html

[15] K. Furukawa, T. Masuda, and I. Kobayashi, “Abductive reasoningas an integrating framework in skill acquisition,” J. Adv. Comput.Intell. Intell. Informat., vol. 15, no. 8, pp. 954–961, Oct. 2011.

[16] S. Mitra and T. Acharya, “Gesture recognition: A survey,” IEEETrans. Syst., Man, Cybern., Part C, vol. 37, no. 3, pp. 311–324, May2007.

[17] A. D. Wilson and A. F. Bobick, “Parametric hidden Markov mod-els for gesture recognition,” IEEE Trans. Pattern Anal. Mach. Intell.,vol. 21, no. 9, pp. 884–900, Sep. 1999.

[18] T. Pylvanainen, “Accelerometer based gesture recognition usingcontinuous HMMs,” in Proc. 2nd Iberian Conf. Pattern Recognit.Image Anal., Jun. 2005, pp. 639–646.

[19] J. F. Lichtenauer, E. A. Hendriks, and M. J. T. Reinders, “Sign lan-guage recognition by combining statistical DTW and independentclassification,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 30,no. 11, pp. 2040–2046, Nov. 2008.

[20] H. I. Suk, B. K. Sin, and S. W. Lee, “Hand gesture recognitionbased on dynamic Bayesian network framework,” Pattern Recog-nit., vol. 43, no. 9, pp. 3059–3072, Sep. 2010.

[21] M. V. Lamar, M. S. Bhuiyan, and A. Iwata, “Hand gesture recogni-tion using T-CombNET: A new neural network model,” IEICETrans. Inf. Syst., vol. E83-D, no. 11, pp. 1986–1995, Nov. 2000.

[22] T. E. Jerde, J. F. Soechting, andM. Flanders, “Biological constraintssimplify the recognition of hand shapes,” IEEE Trans. Biomed.Eng., vol. 50, no. 2, pp. 265–269, Feb. 2003.

[23] A. Daffertshofer, C. J. Lamoth, O. G. Meijer, and P. J. Beek, “PCAin studying coordination and variability: A tutorial,” Clinical Bio-mechanics, vol. 19, no. 4, pp. 415–428, May 2004.

[24] R. Williamson and B. J. Andrews, “Gait event detection for FESusing accelerometers and supervised machine learning,” IEEETrans. Rehabil. Eng., vol. 8, no. 3, pp. 312–319, Sep. 2000.

[25] T. L. Jakobsen, M. Christensen, S. S. Christensen, M. Olsen,and T. Bandholm, “Reliability of knee joint range of motionand circumference measurements after total knee arthroplasty:Does tester experience matter?” Physiotherapy Res. Int., vol. 15,no. 3, pp. 126–134, Sep. 2010.

[26] M. E. Wall, A. Rechtsteiner, and L. M. Rocha, “Singular valuedecomposition and principal component analysis,” in A PracticalApproach to Microarray Data Analysis, D. P. Berrar, W. Dubitzky,and M. Granzow, eds., Norwell, MA, USA: Kluwer, pp. 91–109.

[27] D. B. Skillicorn, “Understanding Complex Datasets: Data Miningwith Matrix Decompositions,” Boca Raton, FL, USA, CRC Press,May, 2007.

[28] Y. Jiang, I. Hayashi, M. Hara, and S. Wang, “Three-dimensionalmotion analysis for gesture recognition using singular valuedecomposition,” in Proc. IEEE Int. Conf. Inf. Autom., Jun. 2010,vol. 15, no. 8, pp. 805–810.

[29] T. Ide and K. Inoue, “Knowledge discovery from heterogeneousdynamic systems using change-point correlations,” in Proc. SIAMInt. Conf. Data Mining, New Orleans, USA, Jul. 2005, pp. 571–576.

[30] H. Nakanishi, S. Kanata, H. Hattori, T. Sawaragi, and Y. Horigu-chi, “Extraction of coordinative structures of motions by segmen-tation using singular spectrum transformation,” J. Adv. Comput.Intell. Intell. Inf., vol. 15, no. 8, pp. 1019–1029, Oct. 2011.

[31] A. Kendon, Gesture: Visible Action as Utterance. Cambridge, U.K.:Cambridge Univ. Press, 2004.

[32] L. A. Zadeh, “Toward a theory of fuzzy information granulationand its centrality in human reasoning and fuzzy logic,” Fuzzy SetsSyst., vol. 90, no. 2, pp. 111–127, Sep. 1997.

[33] K. M. Culhane, M. O’Connor, D. Lyons, and G. M. Lyons,“Accelerometers in rehabilitation medicine for older adults,” AgeAging, vol. 34, no. 6, pp. 556–560, Nov. 2005.

[34] J. J. Kavanagh and H. B. Menz, “Accelerometry: A technique forquantifying movement patterns during walking,” Gait Posture,vol. 28, no. 1, pp. 1–15, Jul. 2008.

[35] H. Lau and K. Tong, “The reliability of using accelerometer andgyroscope for gait event identification on persons with droppedfoot,” Gait Posture, vol. 27, no. 2, pp. 248–257, Feb. 2008.

[36] A. Salarian, H. Russmann, F. J. Vingerhoets, C. Dehollain, Y.Blanc, P. R. Burkhard, and K. Aminian, “Gait assessment inParkinson’s disease: Toward an ambulatory system for long-termmonitoring,” IEEE Trans. Biomed. Eng., vol. 51, no. 8, pp. 1434–1443, Aug. 2004.

[37] M. Yoneyama, Y. Kurihara, K. Watanabe, and H. Mitoma,“Accelerometry-based gait analysis and its application toparkinson’s disease assessment—part 2 : A new measure forquantifying walking behavior,” IEEE Trans. Neural Syst. Rehabil.Eng., vol. 21, no. 6, pp. 999–1005, Nov. 2013.

[38] M. Iosa, G. Morone, A. Fusco, L. Pratesi, M. Bragoni, P. Coiro, M.Multari, V. Venturiero, D. De Angelis, and S. Paolucci, Effects ofwalking endurance reduction on gait stability in patients withstroke, Stroke Res. Treatment, article ID 810415, p. 6, 2012.

[39] N. M. Salbach, S.J.T. Guilcher, and S. B. Jaglal, “Physicaltherapists’ perceptions and use of standardized assessments ofwalking ability post-stroke,” J. Rehabil, Med., vol. 43, no. 6,pp. 543–549, May 2011.

[40] W. H. Wu, A. A. Bui, M. A. Batalin, D. Liu, and W. J. Kaiser,“Incremental diagnosis method for intelligent wearable sensorsystems,” IEEE Trans. Inf. Technol. Biomed., vol. 11, no. 5, pp. 553–562, Sep. 2007.

JIANG ET AL.: KNOWLEDGE ACQUISITION METHOD BASED ON SINGULAR VALUE DECOMPOSITION FOR HUMAN MOTION ANALYSIS 3049

Page 13: 3038 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA …data ðX i;G;Y ;Zi;GÞ. From this time-series data ti;G ¼ ðX i;G;Y ;Zi;GÞ, n vectors by m data sampling are extracted by overlapping

Yinlai Jiang received the PhD degree in engi-neering from Kochi University of Technology,Japan, in 2008. He was a research associate inthe Kochi University of Technology from 2008 to2012. He is currently an assistant professor inthe Research Institute, Kochi University of Tech-nology. His current research interests includevisual cognition, medical and biological engineer-ing, and computational intelligence. He is a mem-ber of the IEEE.

Isao Hayashi received the PhD degree in engi-neering from Hannan University in 1991. Heworked as a researcher in Sharp Corporationfrom 1981 to 1983, and Panasonic Corporationfrom 1987 to 1993. He was with Hannan Uni-versity first as an assistant professor from1993 to 1994, then an associate professorfrom 1994 to 1997, and finally as a professorfrom 1997 to 2004. He has been a professor inthe Faculty of Informatics, Kansai University,since 2004. His current research interests

include visual cognition, embodied knowledge, and motion analysis.He is a member of the IEEE.

Shuoyu Wang received the PhD degree in elec-trical engineering from Hokkaido University,Japan, in 1993. He was an assistant professor inthe Department of Electronic Information Engi-neering, Yamagata University, from 1993 to 1997,and an associate professor in the Department ofIntelligent Mechanical Systems Engineering,Kochi University of Technology, from 1997 to2002. Since January 2002, he has been a profes-sor in the Department of Intelligent MechanicalSystems Engineering, Kochi University of Tech-

nology. His current research interests include walking rehabilitationrobots, control, and fuzzy reasoning. He is amember of the IEEE.

" For more information on this or any other computing topic,please visit our Digital Library at www.computer.org/publications/dlib.

3050 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 26, NO. 12, DECEMBER 2014