13
Research Article Vibration-Based Adaptive Novelty Detection Method for Monitoring Faults in a Kinematic Chain Jesus Adolfo Cariño-Corrales, 1 Juan Jose Saucedo-Dorantes, 2 Daniel Zurita-Millán, 1 Miguel Delgado-Prieto, 1 Juan Antonio Ortega-Redondo, 1 Roque Alfredo Osornio-Rios, 2 and Rene de Jesus Romero-Troncoso 3 1 MCIA Research Center, Department of Electronic Engineering, Technical University of Catalonia (UPC), Rbla. San Nebridi 22, Gaia Research Building, Terrassa, 08222 Barcelona, Spain 2 CA Mecatronica, Facultad de Ingenieria, Campus San Juan del Rio, Universidad Autonoma de Queretaro, Rio Moctezuma 249, Col. San Cayetano, 76807 San Juan del Rio, QRO, Mexico 3 CA Telematica, DICIS, Universidad de Guanajuato, Carr. Salamanca-Valle km 3.5 + 1.8, Palo Blanco, 36885 Salamanca, GTO, Mexico Correspondence should be addressed to Rene de Jesus Romero-Troncoso; [email protected] Received 26 April 2016; Accepted 30 August 2016 Academic Editor: Mariano Art´ es Copyright © 2016 Jesus Adolfo Cari˜ no-Corrales et al. is is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. is paper presents an adaptive novelty detection methodology applied to a kinematic chain for the monitoring of faults. e proposed approach has the premise that only information of the healthy operation of the machine is initially available and fault scenarios will eventually develop. is approach aims to cover some of the challenges presented when condition monitoring is applied under a continuous learning framework. e structure of the method is divided into two recursive stages: first, an offline stage for initialization and retraining of the feature reduction and novelty detection modules and, second, an online monitoring stage to continuously assess the condition of the machine. Contrary to classical static feature reduction approaches, the proposed method reformulates the features by employing first a Laplacian Score ranking and then the Fisher Score ranking for retraining. e proposed methodology is validated experimentally by monitoring the vibration measurements of a kinematic chain driven by an induction motor. Two faults are induced in the motor to validate the method performance to detect anomalies and adapt the feature reduction and novelty detection modules to the new information. e obtained results show the advantages of employing an adaptive approach for novelty detection and feature reduction making the proposed method suitable for industrial machinery diagnosis applications. 1. Introduction Condition-based maintenance has become accepted by the industrial manufacturing sector as a key factor to achieve the currently required high-competitiveness levels. In the field of electromechanical systems monitoring, an increased number of approaches of health monitoring schemes have been proposed during the last decades [1–3]. ese method- ologies have been demonstrated to be a reliable option as fault detection techniques when data of the possible faults is available. Nevertheless, in order to overcome the practical require- ments of industrial implementation, some considerations must be taken into account. ere is a necessity to develop methodologies capable of monitoring complicated industrial machinery, that is, diagnosis approaches capable of consider- ing complete electromechanical kinematic chains including electric motors, gearboxes, and additional mechanical ele- ments. It is important to mention that information regarding the possible faults that could occur on complicated machines is not usually available due to the high cost and obtaining this information requires a great amount of time. Hindawi Publishing Corporation Shock and Vibration Volume 2016, Article ID 2417856, 12 pages http://dx.doi.org/10.1155/2016/2417856

Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

Research ArticleVibration-Based Adaptive Novelty Detection Method forMonitoring Faults in a Kinematic Chain

Jesus Adolfo Carintildeo-Corrales1 Juan Jose Saucedo-Dorantes2

Daniel Zurita-Millaacuten1 Miguel Delgado-Prieto1 Juan Antonio Ortega-Redondo1

Roque Alfredo Osornio-Rios2 and Rene de Jesus Romero-Troncoso3

1MCIA Research Center Department of Electronic Engineering Technical University of Catalonia (UPC) Rbla San Nebridi 22Gaia Research Building Terrassa 08222 Barcelona Spain2CA Mecatronica Facultad de Ingenieria Campus San Juan del Rio Universidad Autonoma de Queretaro Rio Moctezuma 249Col San Cayetano 76807 San Juan del Rio QRO Mexico3CATelematica DICIS Universidad deGuanajuato Carr Salamanca-Valle km 35 + 18 Palo Blanco 36885 Salamanca GTOMexico

Correspondence should be addressed to Rene de Jesus Romero-Troncoso troncosohspdigitalorg

Received 26 April 2016 Accepted 30 August 2016

Academic Editor Mariano Artes

Copyright copy 2016 Jesus Adolfo Carino-Corrales et al This is an open access article distributed under the Creative CommonsAttribution License which permits unrestricted use distribution and reproduction in any medium provided the original work isproperly cited

This paper presents an adaptive novelty detection methodology applied to a kinematic chain for the monitoring of faults Theproposed approach has the premise that only information of the healthy operation of the machine is initially available and faultscenarios will eventually develop This approach aims to cover some of the challenges presented when condition monitoring isapplied under a continuous learning framework The structure of the method is divided into two recursive stages first an offlinestage for initialization and retraining of the feature reduction and novelty detection modules and second an online monitoringstage to continuously assess the condition of the machine Contrary to classical static feature reduction approaches the proposedmethod reformulates the features by employing first a Laplacian Score ranking and then the Fisher Score ranking for retrainingThe proposed methodology is validated experimentally by monitoring the vibration measurements of a kinematic chain driven byan induction motor Two faults are induced in the motor to validate the method performance to detect anomalies and adapt thefeature reduction and novelty detection modules to the new information The obtained results show the advantages of employingan adaptive approach for novelty detection and feature reduction making the proposed method suitable for industrial machinerydiagnosis applications

1 Introduction

Condition-based maintenance has become accepted by theindustrial manufacturing sector as a key factor to achievethe currently required high-competitiveness levels In thefield of electromechanical systems monitoring an increasednumber of approaches of health monitoring schemes havebeen proposed during the last decades [1ndash3] These method-ologies have been demonstrated to be a reliable option asfault detection techniques when data of the possible faults isavailable

Nevertheless in order to overcome the practical require-ments of industrial implementation some considerationsmust be taken into account There is a necessity to developmethodologies capable of monitoring complicated industrialmachinery that is diagnosis approaches capable of consider-ing complete electromechanical kinematic chains includingelectric motors gearboxes and additional mechanical ele-ments It is important to mention that information regardingthe possible faults that could occur on complicated machinesis not usually available due to the high cost and obtaining thisinformation requires a great amount of time

Hindawi Publishing CorporationShock and VibrationVolume 2016 Article ID 2417856 12 pageshttpdxdoiorg10115520162417856

2 Shock and Vibration

Unexpected events in the form of faulty scenarios notinitially considered or deviations over the normal operationtake place during the useful life of the machinery These situ-ations must be identified in order to avoid misclassificationsand the consequent incorrect maintenance action To copewith these challenging scenarios the condition-based main-tenance methods must be able to work under the assumptionthat only information of the healthy condition of themachineis available [4] Another relevant characteristic is the capacityto incorporate new knowledge Since new faults could appearduring monitoring the condition-based maintenance meth-ods must also be able to adapt and include new scenarios totheir initial knowledge [5 6]

These challenging scenarios are not always taken intoconsideration in conditionmonitoringmethodologies there-fore the application of classical methods on an industrialenvironment is limited or nonviable

In the field of electromechanical system monitoring agreat deal of health monitoring schemes based on vibrationshave been proposed during the last years [7ndash10] Mainlythese proposals can be divided into two categories regardingthe available information of the monitored machine and theclassification objective

The first category multifault detection consists in thedevelopment of a monitoring system capable of identifyingmultiple faults previously known [1] which implies that theinformation available to train themonitoring system includesmeasurements of the healthy state of the machine but alsomeasurements of the machine working under different faultyconditions Over the last years instead of looking for highlyspecific features for a component fault a trend towardscalculation of more general statistical features and the fusionof information to enhance the performance of the diagnosissystem has been carried out [11] Yet this approach is mostlycarried out at the laboratory scale with successful resultswhere controlled faults are provoked on the machine

However the information of the possible faults that couldappear on a machine is not always available consequentlya different approach must be considered if only informationof healthy condition of the machine is available The secondcategory corresponds to novelty detection also known asanomaly detection and it represents a solution to cope withthe lack of information available of the monitored machineThe objective of novelty detection is to detect novel eventsthat differ in some manner from the information of whichthemodelwas trainedNovelty detection framework has beensuccessfully applied to network intrusion medical diagnosisimage segmentation and handwritten digit recognition faultdetection and condition monitoring [4 6 12ndash14] Specialeffort has been made to improve condition monitoringnovelty detection applications based on vibrations Ma et alproposed a novelty detection approach to rotating machin-ery by means of monitoring the thresholds obtained froman extreme value distribution [15] McBain and Timusk pro-posed a feature extractionmethodology for novelty detectionapplied to a rotating mechanical system [16]

Most of the methodologies for novelty detection arelimited to a static analysis and the incorporation of the novelinformation to the novelty detection system is not usually

considered An approach to include adaptability to thenovelty framework based on vibrations is proposed byWanget al [17] The proposed monitoring scheme includes testingdata on the boundary of the novelty model based on SupportVector Data Description and retrains the model with thisinformation to gain robustness Nevertheless this approachdoes not take into consideration the possibility to includenovel scenarios during the monitored process

Other approaches to develop an adapting conditionmon-itoring schemewere presented by Filev et al where a practicalframework for autonomous monitoring of industrial equip-ment based on novelty detection is analyzed [18] and byCostaet al where a two-stage algorithm for real-time fault detectionand identification is presented [19] Both approaches providethe opportunity to incorporate novel detected faults tothe monitoring system nevertheless in both methods theincorporation is limited to updating the known data basebut an adaptation of the numerical features analyzed is notconsidered

The performance of a novelty detection system is highlydependent on the numerical features consideredWhen thereis no previous information of the possible faults that canoccur the application of a suitable numerical features analysisstrategy represents a critical challenge [4] Considering acontinuous monitoring framework where the initial infor-mation available is the healthy operating condition andlater on different faults are identified progressively when themachine condition deteriorates all the approaches previouslydiscussed do not modify the initial set of features when newinformation of faults is incorporated This static approachhas the advantage of providing a most adequate situation foronline adaptation nevertheless analyzing the informationof the faults detected during the monitoring process couldimprove the identification of amost adequate set of features todiscriminate the possible upcoming or already detected faultscenarios

To improve the performance of the novelty detection taska dynamic approach for reselection of the features includinginformation of new faults detected is proposed on this workTo the extent of the authorsrsquo knowledge the study regardingthe dynamic change of the feature space by considering newoperating scenarios is novel in the applications to electrome-chanical systems for fault diagnosis

Thereby the contribution of this study is to provide anadaptive methodology for novelty detection where the infor-mation of identified faults during the monitoring process isexploited to improve the anomaly detection task The meth-odology is divided into two recursive stages first an offlinestage for initialization and retraining of the feature reductionand novelty detectionmodules second an onlinemonitoringstage to continuously assess the condition of the machineThe novelty model employed is a one-class Support Vectormachine (OC-SVM) [20] which is a domain based modelsuccessfully employed on vibration signals monitoring [4]Contrary to classical static feature reduction approaches theproposed method reformulates the features by employingfirst a Laplacian Score ranking and then the Fisher Scoreranking for retraining

Shock and Vibration 3

The proposed methodology is tested using real datafrom a laboratory based kinematic chain where faults areinduced and the vibration signals from two accelerometersare monitored to assess the system condition To highlightthe benefits of the methodology proposed a comparison isperformed of classical feature reduction approaches to theadaptive feature reduction scheme proposed

This paper is organized as follows In Section 2 the-oretical aspects of the proposed method are describedSection 3 describes the method presented The experimentalsetup employed for assessment and the results obtained arepresented anddiscussed in Section 4Andfinally conclusionsand future work are summarized in Section 5

2 Theoretical Background

21 One-Class Support Vector Machine As mentioned theobjective of novelty detection is to detect abnormal eventsthat differ in some manner from the information of whichthemodel was trainedTheone-class SupportVectormachine(OC-SVM) which is a domain based model successfullyemployed in condition monitoring applications encloses thetraining data by means of a boundary or threshold whichrepresents the limit of normality The OC-SVM presents thepossibility to include a kernel formulation providing it withenough flexibility to adapt to the distribution of the trainingdata To assess the condition of the machine under test newmeasurements are analyzed regarding their position to theboundary created in the training phase The test measure-ments underlying inside the boundary are considered ldquonor-malrdquo or ldquoknownrdquo while the measurements underlying outsidethe boundary are considered ldquooutlierrdquo or ldquonovelrdquo Comparedto other novelty detection techniques the OC-SVM is robustto outliers during the training phase but has the disadvantageof presenting several configuration parameters which aredescribed later in this section

The OC-SVM was proposed by Scholkopf et al for esti-mating the support of a high-dimensional distribution [20]The OC-SVM classification objective is to separate one classof target samples from all other class samples In this typeof problem one class is characterized properly called targetclass while for the other class usually no measurements areavailable Considering 119883 = [1199091 119909119873]119879 isin 119877119873times119872 whichdenotes the normal data set and 119909119894 119894 = 1 119873 whichdenotes training samples (available measurements) charac-terized by119872 numerical features then in order to obtain theboundary an optimization model is considered as follows

min w22 + 1119873V

119873sum119894=1

120585119894 minus 120588Subject to w sdot Φ (119909) ge 120588 minus 120585119894 120585119894 ge 0

(1)

where V is a regularization parameter and 120585119894 is the slackvariable for the point119909119894The constantsw and 120588 are the normalvector and offset of the hyperplane respectively Thus thedecision boundary can be formulated as

119891 (119909) = w sdot Φ (119909) minus 120588 (2)

where 119909 isin 119877119872 andΦ is a higher dimensional projection vec-tor For the classification problem of two categories the datasets are not always linearly separable in the original spacethen Φ projects the original data sets into a higher dimen-sional space the so-called feature space where the data setscan be linearly separable However Φ is inexplicit in thepractical application and only the dot product from Φ(119909119894) sdotΦ(119909119895) is necessary to be known 119870 represents the kernelfunction Φ(119909119894) sdot Φ(119909119895) The most commonly used kernelfunction is the Gaussian

119870(119909119894 119909119895) = exp(minus10038171003817100381710038171003817119909119894 minus 11990911989510038171003817100381710038171003817221205902 ) (3)

In order to solve optimization problem (1) Lagrangemul-tipliers 119886119894 ge 0 and 120573119894 ge 0 (119894 = 1 119873) are introduced andthe Lagrange equation is formed as

119871 (w 120585 120588 a120573) = w22 + 1119873V

119873sum119894=1

120585119894 minus 120588minus 119873sum119894=1

119886119894 (w sdot Φ (119909119894) minus 120588 + 120585119894)minus 119873sum119894=1

120573119894120585119894(4)

The partial derivatives of the Lagrangian equation withrespect to w 120585 and 120588 are set to zero Then w and 119886119894 can beformulated as

w = 119873sum119894=1

119886119894Φ(119909119894) 119886119894 = 1119873V

minus 120573119894119873sum119894=1

119886119894 = 1(5)

Substitute (5) into Lagrangian equation (4) and its dualform is presented as

min a119879119867a

subject to 0 le 119886119894 le 1119873V119873sum119894=1

119886119894 = 1(6)

where a = [119886119894 119886119873]119879 and 119867 is the kernel matrix and thefactor of119867 that is119867119894119895 which can be expressed as

119867119894119895 = 119870 (119909119894 119909119895) = Φ (119909119894) sdot Φ (119909119895) (7)

Solve optimization problem (6) to get a and then 120588 can begiven as

120588 = 1119899119904119899119904sum119894=1

119873sum119895=1

119886119895119870(119909119894 119909119895) a (8)

where 119899119904 is the number of Support Vectors

4 Shock and Vibration

Tax and Duin proposed another form of OC-SVMwhichis called Support Vector Data Description [21]The basic ideaof SVDD is to construct aminimum-volume hypersphere in ahigh-dimensional feature space to enclose as much as normaldata points Both of these two forms of SVM have equivalentsolution if the diagonal entries of kernel matrix 119867 are equalto a constant

22 Dimensionality Reduction Techniques Working withhigh-dimensional data sets could complicate the learningpart of novelty detection methods not only because of possi-ble presence of noise and redundancy in the data but for otherreasons as well [22] The empty space phenomenon statesthat to cover the whole space a number of samples that growexponentially with dimensionality are needed The curse ofdimensionality implies that in order to learn successfully anumber of training examples that also grow exponentiallywith the dimensionality are needed The ldquoconcentration ofmeasurerdquo phenomenon seems to render distance measuresnot relevant to whatever concept is to be learnt as thedimension of the data increased For these reasons amongothers there is a necessity to apply dimensionality reductiontechniques in novelty detection applications Dimensionalityreduction strategies differ in the question of whether thelearning process is supervised or unsupervised The differ-ence between both learning processes is the availability oflabels to distinguish the different classes

Principal Component Analysis (PCA) is one of the mostcommonly used techniques for unsupervised dimensionalityreduction [23] It aims to find the linear projections that bestcapture the variability of the data

Another well-known dimensionality reduction techniqueis the Laplacian Score (LS) where the merit of each feature ismeasured according to its locality preservation power [24]A nearest neighbor based graph is constructed from thetraining set and analyzed to rank each feature individuallyaccording to a weighting approach selected for the graphedges If the labels of the classes of the analyzed data set aregiven the method changes to a supervised approach and thegraph is created based on these labels instead of the nearestneighbor approach Two variations for weighting the edgescan be applied Heat Kernel and Simpleminded To rank eachfeature its Laplacian Score is computed which is a measureof which extent the analyzed feature preserves the structurepresent in the graph divided by the variance of the feature Fora feature to be selected it must have a low LS which implieshigh variance and locality

Linear discriminant analysis (LDA) is one of the mostwell-known supervised techniques for linear dimensionalityreduction inmulticlass problems [25] LDA searches attemptsto maximize the linear separation between data pointsbelonging to different classes In contrast to most of theother dimensionality reduction techniques LDA as a featureextraction technique finds a linear mapping that maximizesthe linear class separation in the low-dimensional representa-tion of the data The criteria that are used to formulate linearclass separation in LDA are the within-class scatter and thebetween-class scatter

Another variant based on the Fisher coefficient is a featuredimensionality reduction approach called Fisher Score (FS)where the objective is to find a subset of features whichmaximize the Fisher coefficient [26] The main differencebetween LDA and Fisher Score is that LDA is a featureextraction approach and Fisher Score is a feature selectionapproach then instead of extracting new features fromthe subspace obtained FS ranks the features and selectthe reduced subspace of features selected which maximizesthe dimensionality reduction criterion in this case the Fishercriterion

The aforementioned linear feature reduction techniqueshave a different combination of objective (variance preserva-tion topology preservation and discrimination) andmethodof employment (unsupervised or supervised and extractionor selection) and have been widely used in the literature withsuccessful results [16 22] yet they have been applied under astatic framework where the set of features is reduced initiallyand no possible adaptation during the monitoring phase isconsidered

3 Methodology

Given that the data initially available is usually related to thehealthy condition of the machine under analysis condition-based monitoring schemes must be designed to overcometwo main challenges

(i) The identification of significant features to deal withthe characterization of the known conditions of the machineunder the consideration that the occurrence of additionalunknown faults must be detected

(ii) The adaptation of the condition-based monitoringscheme to update the considered data base of the machineonce unknown fault scenarios have been detected

Such requirements are addressed in this work by meansof the proposed adaptive novelty detection method shownin Figure 1 This methodology represents an important stepto the introduction of adaptive novelty detection schemes tothe development of electromechanical system diagnosis pro-cedures

The proposed method is composed of two stages anoffline stage and an online monitoring stageThemain objec-tives of the offline stage are first the analysis of the infor-mation available of the monitored machine to find a reducedset of numerical features to characterize the known machineconditions and second the design of the novelty model bymeans of the selection of the configuration parameters andtraining

Once a reduced set of characteristic features is obtainedand the novelty model is designed the online stage is carriedout During the online stage new measurements are contin-uously compared with the normality threshold 119879119899 definedduring the novelty model training in the offline stageThus ifa novel scenario is detected the supervision of an expert useris proposed in order to confirm and label the new conditionof the machine consequently the monitoring system isretrained to include the characteristics of the novel scenarioDetailed information of each stage and the retraining processis described in Sections 31ndash33

Shock and Vibration 5

Database Feature calculation

Feature reduction

Novelty model design

New measurement

Calculation of reducedfeatures

NoveltyUser

supervisionKnown

operation False alarm

Offl

ine

Onl

ine

RetrainingIncluding

information of new fault

No

Yes

Noveltymodel

Figure 1 Proposedmethodology for the novelty detection approachThemonitoringmethod is composed of an offline stage for initializationand retraining and an online stage for continuous monitoring

31 Offline Stage During the initialization it is assumed thatonly information of the machine operating under healthycondition is available in the database The first step is thecalculation of numerical features from the vibration mea-surements obtained during the machine operation Since theinformation of the possible faults of the monitored machineis not available yet during this initialization a generic set ofstatistical time-based numerical features is proposed to beextracted from each available vibration axis measurementThe proposed set of potential statistical time-frequency fea-tures is shown as follows

Max Value

119894 = max (119909) (9)

Root Mean Square (RMS)

RMS = radic 1119899 sdot119899sum119896=1

(119909119896)2 (10)

Crest Factor (CF)

CF = 119909119896RMS

(11)

Kurtosis

119896 = 119864 [(119909119896 minus 119909)4]1205904 (12)

Shape Factor

SF = RMS1119899 sdot sum119899119896=1 10038161003816100381610038161199091198961003816100381610038161003816 (13)

These features have been successfully employed for faultdetection in the last years [1 27]

The resulting number of numerical features is propor-tional to the number of available vibration axes collectedduring the acquisition However in order to allow the com-pression and visualization of the data a feature reductionmodule is implemented During such offline stage initializa-tion an unsupervised feature reduction approach must be

used a Laplacian Score ranking is proposed in this work asa good trade-off between simplicity and performance [24]to rank the features according to the topology preservationcapabilities The two or three first ranked features in terms ofLaplacian Score are selected

Next the novelty model is designedThere is a significantnumber of noveltymodels proposed in the literature [4] witheach one demonstrated to be a capable option under certaincircumstances An increasing amount of works implies thatdomain based novelty detection models present promisingresults [19 28] In this work a standard OC-SVMwith Gaus-sian kernel is used The design of the novelty model includesthe selection of the parameters for configuration and trainingemploying the known scenarios stored at the database Thenthe initialization of the offline stage finalizes with the designof the OC-SVM

32 Online Stage This stage continuously monitors thecondition of the machine to detect if an anomaly is presentTo accomplish this each of the new measurements of themachine acquired certain amount of predefined time Eachmeasurement is segmented and a set of features is calculatedfrom each segment The set of numerical features calculatedin this stage are reduced in the offline stage by means ofLaplacian Score

Thus each new measurement characterized by thenumerical features is analyzed by the novelty model In caseof no novelty detection it is assumed that the machine isworking under known conditions However if the analyzedmeasurement is detected as a novelty an alarm is triggeredin order to consider the user assessment Then if theoccurrence of a new scenario is confirmed the correspondingmeasurements are stored at the database and a retrainingprocedure is performed

33 Retraining Once the retraining is trigged the featurereduction and the novelty model design modules are modi-fied at the offline stage A diagram of the retraining procedureis presented in Figure 2

It must be noticed that two important contributiveaspects are proposed in this retraining approach first thereconsideration of the feature reductionmodule and secondthe incorporation of fault scenarios to the novelty model As

6 Shock and Vibration

New Fault Detected

Continuos Monitoring

Add Information to DDBB

No

Yes

Perform New Feature

Reduction

Novelty model redesign

A supervised method will be

used

Measurementsof the new fault

Data of all stored scenarios

included

Retr

aini

ng

New faultdetected

Continuousmonitoring

Add information to DDBB

Perform new feature

reduction

Novelty modelredesign

Figure 2 Proposed retraining approach First measurements char-acterizing the fault are stored and then the feature reductionmoduleand the novelty design module are modified to incorporate the newscenario encountered

it has been explained the feature reduction module duringits initialization is supported by an unsupervised LaplacianScore ranking due to the lack of additional scenarios yet oncethe information of a fault is available the feature reductionmodule could improve its performance by employing super-vised methods

Then the Laplacian approach is replaced by a FisherScore ranking approach in this work where the features aresorted according to the Fisher coefficient calculated fromeach feature It is important to mention that Laplacian Scorecan be configured towork also under a supervised frameworkand could obtain similar results than employing a Fisherapproach similarities between the two approaches are dis-cussed in [24] Nevertheless Fisher Score is used in thismethodology to search for a possible discriminative scenarioto increase the detection of novel scenarios A comparison ofresults obtained from both features reduction approaches ispresented in the results of this work

The consideration of a faulty scenario in the noveltymodel may contradict the principle of anomaly detectionwhere the objective is to detect healthy behaviors from therest Nevertheless the aim of an adaptive condition mon-itoring system should be to learnt from all the identifiedconditions to subsequently detect them if they are presentedagain by a fault detection module Indeed some workspresent a parallel structure for fault detection and noveltydetection modules [18 19] where the novelty module learnsthe known faults along with the healthy operation because itsobjective is to detect only new scenarios while the objectiveof the fault detection is to identify the condition of the

Gearbox

Load

VFD

DAS

Inductionmotor

AccelerometerEncoder

PC

z

yx

Figure 3 Test bench used for experimentation

machine including known faults This work is based on suchparallel structure approach where the fault scenariosmust betaken into consideration to be included in the novelty modeland then they could be identified by a complementary faultdetection module with a high confidence level

4 Experimental Results

41 Description of the Experimental Platform The test benchused for testing the kinematic chain with different faults andthe acquisition system used to capture the vibration signalsare shown in Figure 3 The test bench consists in a 1492-W-three-phase induction motor (WEG 00236ET3E145T-W22)with its rotational speed controlled through a variable fre-quency drive-VFD (WEG CFW08) the operating speed isfixed to 60Hz for all experiments It also consists of a 4 1 ratiogearbox (BALDORGCF4X01AA) and a DC generator (BAL-DOR CDP3604) which is used as the mechanical load com-prising around 20of the nominal loadThe vibration signalsare acquired using a triaxial accelerometer (LIS3L02AS4)mounted on a board with the signal conditioning andantialias filtering Two 12-bit 4-channel serial output sam-pling analog-to-digital converters (ADS7841) are used in thedata acquisition system (DAS) board Vibration signal fromthe perpendicular plane (119910-119911) to the motor shaft is acquiredusing a triaxial accelerometer LIS3L02AS4 mounted on aboard with the signal conditioning and antialiasing filteringSampling frequency is set to 3 kHz for vibration acquisitionThe data retrieved by the DAS is stored in a regular computer(PC)

Three scenarios are considered to verify the performanceof the proposed method the first one 119867 is the kinematicchain working under healthy condition and the other two 1198651and 1198652 represent the kinematic chain working under faultyconditions For 1198651 the motor is working with a half brokenbar and for 1198652 the motor is working with a fully brokenbar The detail of the failures is shown in Figure 4 The halfbroken bar failure is artificially produced by drilling a 6mmhole with a depth of 3mm that corresponds mostly to the

Shock and Vibration 7

(a) (b)

Figure 4 Detail of the faults produced in the test bench (a) corresponds to the 12 broken rotor bar and (b) to one broken rotor bar

22 of the section of the rotor bar and the full broken baris produced by a through-hole with a diameter of 6mm anda depth of 14mm which corresponds to the complete sectionof the rotor bar

The information stored from the kinematic chain consistsof an acquisition of 60 seconds of themachine working underthe three scenarios mentioned each acquisition is segmentedin 30 parts of 2 seconds and a set of features is calculatedfrom the 30 segmented measurements Since two axes aretaken into consideration a total of ten features are calculatedfrom each segmented acquisition of the machine workingunder the different scenarios mentioned The first step ofthe methodology is the offline stage where a reduced set offeatures is obtained and the novelty model is designed

42 Parameter Selection Regarding the Laplacian Score con-figuration a simple approach is followed for parameter tun-ing that is a value of 119896 = 3 is used for constructing the adja-cency graph and a ldquosimple mindedrdquo weighting approach isfollowed Since the proposed approach is compared to differ-ent feature reduction modules selecting generic parameterssettings is useful for the purpose of evaluation but ignores thatthere may be dependencies between the feature reductionmodel and the novelty model Regarding the design of thenovelty model the kernel used is the Gaussian and the valueof the configuration parameter is tuned tominimize the errorin the validation In all experiments 80 of the samplesare used for training and 20 for validation To train andadjust the parameter of the novelty model a fivefold cross-validation is used

43 Results and Discussions In order to highlight the contri-butions andmotivation of this work the outline of the resultsis presented as follows first a test is performed by a classicalapproach then the proposedmethodology is applied and theresults are compared The classical static approach impliesconserving the reduced features set obtained at the offline ini-tialization stage meanwhile the proposed dynamic approachimplies a possible reformulation of the reduced features setduring the retraining stage

Healthy

0

005

01

015

02

minus2 minus1minus3 1 2 30LS feature 1 RMS Y

minus3

minus2

minus1

0

1

2

3LS

feat

ure 2

kur

tosis

Y

Figure 5 Initial novelty model representation Limit of the noveltythreshold sdot sdot sdot and measurements used to train the model lowast

During the initialization stage the reduced set of featuresselected bymeans of the Laplacian Score ranking is composedof the RMS of the 119910-axis and the Kurtosis of the 119910-axisThe OC-SVM model is trained employing healthy conditiondata The resulting OC-SVM during training is shown inFigure 5 The marks lowast in the figure represent measurementsof the machine used to train the model in this case andin the subsequence figures only 1-fold of the fivefold cross-validation is displayedThe dotted line represents the noveltythreshold value all data inside the boundaries of the dottedline is considered normal The contour plot represents thenovelty score evaluation over different regions of the featuremap

Once the offline initialization stage is finished the onlinestage follows that is new measurements are obtained toassess the condition of the monitored machine To giverobustness to the novelty detection and avoid false alarms abatch consisting of 30 measurements is evaluated if 75 of

8 Shock and Vibration

0

005

01

015

02

minus4

minus3

minus2

minus1

0

1

2

3

4

5

60 2 4 8 10minus2minus4LS feature 1 RMS Y

KnownF1

LS fe

atur

e 2 k

urto

sis Y

Figure 6 Evaluation of the fault scenario 1198651 The novelty model istrained employing data from healthy operation condition

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

minus001

0

001

002

003

004

005

006

007

008

Known

LS fe

atur

e 2 k

urto

sis Y

Figure 7 Contour plot of the novelty model after including 1198651

the analyzed measurements are evaluated as novelty then thealarm is triggered Next the 1198651 scenario is presented to testthe performance of the model The plot of the scenario andthe novelty threshold obtained during training is presentedin Figure 6

As can be appreciated the new scenario lays outside thenovelty threshold so it is successfully detected as noveltyOnce a novel scenario is detected and identified as a fault bythe user it is incorporated to the database to consider it as partof the known scenarios and the novelty model is retrainedwithout changing the features to include this informationFigure 7 shows the feature space after the novelty modelis trained using healthy and 1198651 data as part of the knownscenarios

Once the model is retrained the third scenario 1198652 isintroduced The visual representation of the test is presentedin Figure 8 this scenario is not detected as novel because only50 of the measurements are labeled as novel by the model

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

KnownF2

LS fe

atur

e 2 k

urto

sis Y

Figure 8 Evaluation of the fault scenario 1198652 The novelty model istrained employing data from healthy and 1198651 scenarios

A similar result is obtained when the novelty model istrained using healthy and 1198652 data and is tested with data of1198651 A summary of the results of novelty detectionmaintainingthe same features obtained during initialization is shown inTable 1

As it can be seen in Table 1 using the reduced set obtainedduring initialization is easy to detect the novel scenarioswhen there is only information of the healthy condition butwhen a novel scenario is included in the database and thenoveltymodel is retrained the reduced set of features initiallyobtained does not necessarily provide a good representationto detect a new scenario during test

To improve these results the methodology presented inthis work proposes to evaluate again the feature reductionmodule each time a retraining is applied Following theoutline presented for results and parting from the first retrainwhere the scenario 1198651 is included in the database the featurereduction module is applied again but this time includinginformation from healthy condition and 1198651 scenario Sincetwo scenarios are taken into consideration and the labels areknown a supervised approach can be applied in this case aFisher Score ranking for feature reduction is employed Thenovelty model using the initial set of features and the noveltymodel after retraining with the new reduced set of featuresare shown in Figure 9

The new reduced set of features is composed of the RMSof the 119910-axis and the Kurtosis of the 119911-axis as can be appre-ciated the new set of features present a more discriminativedistribution of the three scenarios considered At the laststep (Figure 9(d)) when 1198652 scenario is included the FisherScore still ranked the same features of the last retraining(Figure 9(b)) as the highest yet it would be possible thata different set of features would be obtained The resultsobtained demonstrate the advantages of including the FisherScore reduction module to the retraining procedure the newdistribution of the scenarios avoided an overlapping of the

Shock and Vibration 9

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

0

005

01

015

02

KnownF1

LS fe

atur

e 2 k

urto

sis Y

(a)

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus001

0

001

002

003

004

005

006

007

008

Known

FS fe

atur

e 2 k

urto

sis Z

(b)

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

FS fe

atur

e 2 k

urto

sis Z

minus5

minus4

minus3

minus2

minus1

0

1

2

3

KnownF2

(c)

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus0005

0

0005

001

0015

002

0025

003

0035

Known

FS fe

atur

e 2 k

urto

sis Z

(d)

Figure 9 Process of evaluation and retraining employing the methodology proposed (a) Evaluation of the fault scenario 1198651 (b) Retrainingof the novelty model and reformulation of the reduced set of features including 1198651 (c) Evaluation of the fault scenario 1198652 (d) Retraining ofthe novelty model and reformulation of the reduced set of features including 1198652Table 1 Performance of the novelty detection using only healthy class data to reduce the number of features where DR stands fordimensionality reduction Different scenarios are included according to the information available to train and test the novelty model

DRPerformance using OC-SVM ()

Known Test Known Test Known Test119867 1198651 + 1198652 119867 + 1198651 1198652 119867 + 1198652 1198651LS 954 (plusmn11) 576 (plusmn67) 435 (plusmn64)

new scenario under test therefore the noveltymodel success-fully detected as novelty all measurements corresponding tothe 1198652 scenario

It is worth mentioning that if initially the scenario 1198652is used for training and the scenario 1198651 is used for testingthe new set of features obtained could be different from theaforementionedThe results achieved fromboth scenarios are

shown in Table 2 which also includes a comparison of theresults obtained employing PCA LDA and Laplacian Scoredimensionality reduction techniques in the retraining stepinstead of the Fisher Score proposed

Regarding the classical feature extraction techniquesPCA and LDA the test scenario is not identified as novelin both cases The Fisher Score and the Laplacian Score

10 Shock and Vibration

Table 2 Performance of the novelty detection employing a reduc-tion of features during retraining Different scenarios are includedaccording to the information available to train and test the noveltymodel

DRPerformance using OC-SVM ()

Known Test Known Test119867 + 1198651 1198652 119867 + 1198652 1198651PCA 590 (plusmn52) 642 (plusmn51)LDA 474 (plusmn61) 512 (plusmn61)LS 843 (plusmn24) 100 (plusmn00)Fisher S 100 (plusmn00) 100 (plusmn00)successfully identified the new scenarios as novel since thepercentage obtained in both cases is higher than the 75 pre-defined threshold in the methodology to activate the alarmBoth techniques achieved high scores but still the FisherScore provided a more appropriate selection of features

As mentioned in Section 2 LDA is a feature extractiontechnique based on the Fisher discriminant coefficient sosimilar results between Fisher Score ranking and LDA areexpected however employing Fisher Score ranking scoreachieved a better result this can be caused because the testconsists of novel scenarios and LDA finds the directions onthe feature space specialized for the two supervised scenariosemployed during training meanwhile Fisher Score rankingprovides a more general approach by selecting features

To test the robustness of the Laplacian Score and FisherScore approaches a comparative test is performed where theset of features is increased from 10 to 15 and by varyingthe number of reduced features selected The five featuresincluded in the original set are obtained from the119909-axis of theaccelerometer monitored where the features calculated arepresented in the aforementioned statistical time-frequencyfeatures these features were discarded initially because theyare not part of the perpendicular plane of the motor anddo not contribute significantly to the monitoring in factincluding these features could affect the performance of thefeature reduction and novelty detection modules The resultsare presented in Table 3

The features obtained from the Fisher Score still present abetter distribution to detect the new scenariosThe LaplacianScore performance is affected when irrelevant features areincluded in the feature set but increases when more dimen-sions are taken into consideration Since the objective of theFisher Score is not topology preservation contrary to LS itis capable of discarding all the irrelevant features that wereincluded and the performance is not affected

5 Conclusions

This work proposes an adaptive novelty detection methodol-ogy based on vibration analysis for the condition monitoringand diagnosis of the components of a kinematic chain Themethodology is based on the acquisition of vibration signalsthat are generated in the kinematic chain along with anadequate signal processing to extract features to characterizethe components and an adaptive novelty detection model to

Table 3 Performance of the novelty detection increasing the num-ber of initial features from 10 to 15 and varying the number of thereduced set of features

DRPerformance varying the number of features

reduced ()2 3 4

LS 590 (plusmn41) 643 (plusmn33) 886 (plusmn25)Fisher S 100 (plusmn00) 100 (plusmn00) 100 (plusmn00)detect anomalies The method is composed of two sequentialstages an offline stage to initialize and retrain the modulesand an online stage to continuously assess the conditionof the machine During initialization the model is trainedemploying only information from the machine workingunder healthy condition and two additional faulty scenariosare introduced to test the performance of the method underunknown operations

The adaptive novelty detection approach successfullydetected both novel scenarios and the model incorporatedthe information to avoid generating alarms if the same faultis detected A comparison between the proposed methodand classical dimensionality reduction approaches highlightsthe limitations of maintaining a static set of features duringmonitoring instead of reformulating the feature reductionmodule once new information is available In this particularstudy employing features reduced by Laplacian Score andFisher Score obtained similar results nevertheless it doesnot imply that a similar outcome will be present during theanalysis of other faults Fisher Score is encouraged to beemployed in this methodology rather than maintaining theLaplacian Score approach due to the similarity between themethod objective and the objective function of Fisher Scorein both the ideal case is to find the features to maximizethe distance between scenarios while maintaining compactclusters A specific comparison of performances between LSand Fisher Score was also included in which the Fisher Scoreobtained better results when irrelevant features are includedin the original set of features and when the dimensionalityof the reduced set is increased this highlights the advantagesand robustness of the feature selection approach by FisherScore compared to the LS

From an industrial perspective two aspects should bementioned First dealing with an identical operating sce-nario (ie healthy) the variation of the working conditions(speed and torque patterns) will affect the resulting valueof the estimated numerical features and consequently itsrepresentation in the feature space used for novelty modeldesign Thus the introduction of the working condition asan additional degree of freedom in the novelty detectionscheme implies the increase of the dimensional space thatthe novelty detectionmodel is managingThis fact implies anincrease of the risk of data overlapping (different scenariosunder different operating conditions showing similar featurevalues) and also an increase of complexity for boundariesdefinition during the novelty modelling (data spread) Thusthe proposed method is not limited to stationary conditionsbut to the repetition of the pattern of working condition used

Shock and Vibration 11

to train the model Indeed in practical industrial applicationwhere cyclostationary processes take place the proposedmethod should be executed during the same period (the sameworking conditions) in order to reduce data variability andincrease novelty performance In case of multiple workingconditions consideration parallel novelty model structurescould be generated in order to activate each one dependingon the corresponding working condition

Second the integration of additional physicalmagnitudesas stator currents temperatures and others represents atrendy approach in electromechanical fault diagnosis inorder to improve fault diagnosis resolution and perfor-mance The corresponding data fusion scheme implies ahigh-dimensional feature space that is multiple numericalfeatures estimated from multiple physical magnitudes con-sidered In this point it must be noted that the presentedmethodology proposes the reevaluation of the feature reduc-tionmodule each time a retraining is appliedThus the largerthe available pool of features is the higher the potentiality ofthe proposedmethod to increase the performance during thedata characterization is

Also the proposed method can be extended andimproved for further development this improvement couldinclude a diagnosis method to not only detect anomalies inthe kinematic chain but also identify the fault causing theabnormal behavior In this sense future work will include aspecific comparison of broken bar fault detection capabilitiesbetween the proposedmethodology and classical approaches

Competing Interests

The authors declare that they have no competing interests

Acknowledgments

This research was partially supported by CONACyT schol-arships Grants 278033 and 313604 and also by the SpanishMinistry of Education Culture and Sport under GrantFPU1300589 The authors would also like to thank a partialfinancial support by the Generalitat de Catalunya (GRCMCIA Grant no SGR 2014-101)

References

[1] H Henao G-A Capolino M Fernandez-Cabanas et alldquoTrends in fault diagnosis for electrical machines a review ofdiagnostic techniquesrdquo IEEE Industrial Electronics Magazinevol 8 no 2 pp 31ndash42 2014

[2] N Bayar S Darmoul S Hajri-Gabouj and H Pierreval ldquoFaultdetection diagnosis and recovery using Artificial Immune Sys-tems a reviewrdquoEngineeringApplications of Artificial Intelligencevol 46 pp 43ndash57 2015

[3] SWang JWan D Zhang D Li and C Zhang ldquoTowards smartfactory for Industry 40 a self-organized multi-agent systemwith big data based feedback and coordinationrdquo ComputerNetworks vol 101 pp 158ndash168 2016

[4] M A F Pimentel D A Clifton L Clifton and L TarassenkoldquoA review of novelty detectionrdquo Signal Processing vol 99 pp215ndash249 2014

[5] R Langone O Mauricio Agudelo B De Moor and J AK Suykens ldquoIncremental kernel spectral clustering for online

learning of non-stationary datardquo Neurocomputing vol 139 pp246ndash260 2014

[6] M-H Chen P-C Chang and J-L Wu ldquoA population-basedincremental learning approach with artificial immune systemfor network intrusion detectionrdquo Engineering Applications ofArtificial Intelligence vol 51 pp 171ndash181 2016

[7] S Khomfoi and L M Tolbert ldquoFault diagnosis and reconfigu-ration for multilevel inverter drive using AI-based techniquesrdquoIEEE Transactions on Industrial Electronics vol 54 no 6 pp2954ndash2968 2007

[8] A Bellini F Filippetti C Tassoni and G-A CapolinoldquoAdvances in diagnostic techniques for induction machinesrdquoIEEE Transactions on Industrial Electronics vol 55 no 12 pp4109ndash4126 2008

[9] V T Tran and B-S Yang ldquoAn intelligent condition-basedmaintenance platform for rotating machineryrdquo Expert Systemswith Applications vol 39 no 3 pp 2977ndash2988 2012

[10] P A Delgado-Arredondo A Garcia-Perez D Morinigo-Soteloet al ldquoComparative study of time-frequency decompositiontechniques for fault detection in induction motors using vibra-tion analysis during startup transientrdquo Shock and Vibration vol2015 Article ID 708034 14 pages 2015

[11] M D Prieto G Cirrincione A G Espinosa J A Ortegaand H Henao ldquoBearing fault detection by a novel condition-monitoring scheme based on statistical-time features and neu-ral networksrdquo IEEE Transactions on Industrial Electronics vol60 no 8 pp 3398ndash3407 2013

[12] P Perner ldquoConcepts for novelty detection and handling basedon a case-based reasoning process schemerdquo Engineering Appli-cations of Artificial Intelligence vol 22 no 1 pp 86ndash91 2009

[13] C Englund and A Verikas ldquoA hybrid approach to outlier detec-tion in the offset lithographic printing processrdquo EngineeringApplications of Artificial Intelligence vol 18 no 6 pp 759ndash7682005

[14] H Garces and D Sbarbaro ldquoOutliers detection in environmen-tal monitoring databasesrdquo Engineering Applications of ArtificialIntelligence vol 24 no 2 pp 341ndash349 2011

[15] S Ma S M Li and Y P Xiong ldquoUncertainty reduced noveltydetection approach applied to rotatingmachinery for conditionmonitoringrdquo Shock and Vibration vol 2015 Article ID 73721310 pages 2015

[16] J McBain and M Timusk ldquoFeature extraction for noveltydetection as applied to fault detection in machineryrdquo PatternRecognition Letters vol 32 no 7 pp 1054ndash1061 2011

[17] S Wang J Yub E Lapirac and J Lee ldquoA modified supportvector data description based novelty detection approach formachinery componentsrdquo Applied Soft Computing vol 13 no 2pp 1193ndash1205 2013

[18] D P Filev R B Chinnam F Tseng and P Baruah ldquoAn indus-trial strength novelty detection framework for autonomousequipment monitoring and diagnosticsrdquo IEEE Transactions onIndustrial Informatics vol 6 no 4 pp 767ndash779 2010

[19] B S J Costa P P Angelov and L A Guedes ldquoFully unsu-pervised fault detection and identification based on recursivedensity estimation and self-evolving cloud-based classifierrdquoNeurocomputing vol 150 pp 289ndash303 2015

[20] B Scholkopf R C Williamson A J Smola J Shawe-Taylorand J C Platt ldquoSupport vector method for novelty detectionrdquoin Neural Information Processing Systems vol 12 pp 582ndash5881999

12 Shock and Vibration

[21] D M J Tax and R P W Duin ldquoSupport vector domaindescriptionrdquo Pattern Recognition Letters vol 20 no 11ndash13 pp1191ndash1199 1999

[22] S D Villalba and P Cunningham ldquoAn evaluation of dimensionreduction techniques for one-class classificationrdquo ArtificialIntelligence Review vol 27 no 4 pp 273ndash294 2007

[23] I T Jolliffe Principal Component Analysis Springer New YorkNY USA 2nd edition 1986

[24] X He D Cai and P Niyogi ldquoLaplacian score for featureselectionrdquo inAdvances in Neural Information Processing Systems18 pp 507ndash514 MIT Press 2005

[25] A J Izenman ldquoLinear discriminant analysisrdquo in Modern Mul-tivariate Statistical Techniques A J Izenman Ed pp 237ndash280Springer New York NY USA 2008

[26] Q Gu Z Li and J Han ldquoGeneralized fisher score for featureselectionrdquo httpsarxivorgabs12023725

[27] Z Gao C Cecati and S X Ding ldquoA survey of fault diagnosisand fault-tolerant techniquesmdashpart II fault diagnosis withknowledge-based and hybridactive approachesrdquo IEEE Trans-actions on Industrial Electronics vol 62 no 6 pp 3768ndash37742015

[28] A E Lazzaretti D M J Tax H Vieira Neto and V HFerreira ldquoNovelty detection and multi-class classification inpower distribution voltage waveformsrdquo Expert Systems withApplications vol 45 pp 322ndash330 2016

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Page 2: Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

2 Shock and Vibration

Unexpected events in the form of faulty scenarios notinitially considered or deviations over the normal operationtake place during the useful life of the machinery These situ-ations must be identified in order to avoid misclassificationsand the consequent incorrect maintenance action To copewith these challenging scenarios the condition-based main-tenance methods must be able to work under the assumptionthat only information of the healthy condition of themachineis available [4] Another relevant characteristic is the capacityto incorporate new knowledge Since new faults could appearduring monitoring the condition-based maintenance meth-ods must also be able to adapt and include new scenarios totheir initial knowledge [5 6]

These challenging scenarios are not always taken intoconsideration in conditionmonitoringmethodologies there-fore the application of classical methods on an industrialenvironment is limited or nonviable

In the field of electromechanical system monitoring agreat deal of health monitoring schemes based on vibrationshave been proposed during the last years [7ndash10] Mainlythese proposals can be divided into two categories regardingthe available information of the monitored machine and theclassification objective

The first category multifault detection consists in thedevelopment of a monitoring system capable of identifyingmultiple faults previously known [1] which implies that theinformation available to train themonitoring system includesmeasurements of the healthy state of the machine but alsomeasurements of the machine working under different faultyconditions Over the last years instead of looking for highlyspecific features for a component fault a trend towardscalculation of more general statistical features and the fusionof information to enhance the performance of the diagnosissystem has been carried out [11] Yet this approach is mostlycarried out at the laboratory scale with successful resultswhere controlled faults are provoked on the machine

However the information of the possible faults that couldappear on a machine is not always available consequentlya different approach must be considered if only informationof healthy condition of the machine is available The secondcategory corresponds to novelty detection also known asanomaly detection and it represents a solution to cope withthe lack of information available of the monitored machineThe objective of novelty detection is to detect novel eventsthat differ in some manner from the information of whichthemodelwas trainedNovelty detection framework has beensuccessfully applied to network intrusion medical diagnosisimage segmentation and handwritten digit recognition faultdetection and condition monitoring [4 6 12ndash14] Specialeffort has been made to improve condition monitoringnovelty detection applications based on vibrations Ma et alproposed a novelty detection approach to rotating machin-ery by means of monitoring the thresholds obtained froman extreme value distribution [15] McBain and Timusk pro-posed a feature extractionmethodology for novelty detectionapplied to a rotating mechanical system [16]

Most of the methodologies for novelty detection arelimited to a static analysis and the incorporation of the novelinformation to the novelty detection system is not usually

considered An approach to include adaptability to thenovelty framework based on vibrations is proposed byWanget al [17] The proposed monitoring scheme includes testingdata on the boundary of the novelty model based on SupportVector Data Description and retrains the model with thisinformation to gain robustness Nevertheless this approachdoes not take into consideration the possibility to includenovel scenarios during the monitored process

Other approaches to develop an adapting conditionmon-itoring schemewere presented by Filev et al where a practicalframework for autonomous monitoring of industrial equip-ment based on novelty detection is analyzed [18] and byCostaet al where a two-stage algorithm for real-time fault detectionand identification is presented [19] Both approaches providethe opportunity to incorporate novel detected faults tothe monitoring system nevertheless in both methods theincorporation is limited to updating the known data basebut an adaptation of the numerical features analyzed is notconsidered

The performance of a novelty detection system is highlydependent on the numerical features consideredWhen thereis no previous information of the possible faults that canoccur the application of a suitable numerical features analysisstrategy represents a critical challenge [4] Considering acontinuous monitoring framework where the initial infor-mation available is the healthy operating condition andlater on different faults are identified progressively when themachine condition deteriorates all the approaches previouslydiscussed do not modify the initial set of features when newinformation of faults is incorporated This static approachhas the advantage of providing a most adequate situation foronline adaptation nevertheless analyzing the informationof the faults detected during the monitoring process couldimprove the identification of amost adequate set of features todiscriminate the possible upcoming or already detected faultscenarios

To improve the performance of the novelty detection taska dynamic approach for reselection of the features includinginformation of new faults detected is proposed on this workTo the extent of the authorsrsquo knowledge the study regardingthe dynamic change of the feature space by considering newoperating scenarios is novel in the applications to electrome-chanical systems for fault diagnosis

Thereby the contribution of this study is to provide anadaptive methodology for novelty detection where the infor-mation of identified faults during the monitoring process isexploited to improve the anomaly detection task The meth-odology is divided into two recursive stages first an offlinestage for initialization and retraining of the feature reductionand novelty detectionmodules second an onlinemonitoringstage to continuously assess the condition of the machineThe novelty model employed is a one-class Support Vectormachine (OC-SVM) [20] which is a domain based modelsuccessfully employed on vibration signals monitoring [4]Contrary to classical static feature reduction approaches theproposed method reformulates the features by employingfirst a Laplacian Score ranking and then the Fisher Scoreranking for retraining

Shock and Vibration 3

The proposed methodology is tested using real datafrom a laboratory based kinematic chain where faults areinduced and the vibration signals from two accelerometersare monitored to assess the system condition To highlightthe benefits of the methodology proposed a comparison isperformed of classical feature reduction approaches to theadaptive feature reduction scheme proposed

This paper is organized as follows In Section 2 the-oretical aspects of the proposed method are describedSection 3 describes the method presented The experimentalsetup employed for assessment and the results obtained arepresented anddiscussed in Section 4Andfinally conclusionsand future work are summarized in Section 5

2 Theoretical Background

21 One-Class Support Vector Machine As mentioned theobjective of novelty detection is to detect abnormal eventsthat differ in some manner from the information of whichthemodel was trainedTheone-class SupportVectormachine(OC-SVM) which is a domain based model successfullyemployed in condition monitoring applications encloses thetraining data by means of a boundary or threshold whichrepresents the limit of normality The OC-SVM presents thepossibility to include a kernel formulation providing it withenough flexibility to adapt to the distribution of the trainingdata To assess the condition of the machine under test newmeasurements are analyzed regarding their position to theboundary created in the training phase The test measure-ments underlying inside the boundary are considered ldquonor-malrdquo or ldquoknownrdquo while the measurements underlying outsidethe boundary are considered ldquooutlierrdquo or ldquonovelrdquo Comparedto other novelty detection techniques the OC-SVM is robustto outliers during the training phase but has the disadvantageof presenting several configuration parameters which aredescribed later in this section

The OC-SVM was proposed by Scholkopf et al for esti-mating the support of a high-dimensional distribution [20]The OC-SVM classification objective is to separate one classof target samples from all other class samples In this typeof problem one class is characterized properly called targetclass while for the other class usually no measurements areavailable Considering 119883 = [1199091 119909119873]119879 isin 119877119873times119872 whichdenotes the normal data set and 119909119894 119894 = 1 119873 whichdenotes training samples (available measurements) charac-terized by119872 numerical features then in order to obtain theboundary an optimization model is considered as follows

min w22 + 1119873V

119873sum119894=1

120585119894 minus 120588Subject to w sdot Φ (119909) ge 120588 minus 120585119894 120585119894 ge 0

(1)

where V is a regularization parameter and 120585119894 is the slackvariable for the point119909119894The constantsw and 120588 are the normalvector and offset of the hyperplane respectively Thus thedecision boundary can be formulated as

119891 (119909) = w sdot Φ (119909) minus 120588 (2)

where 119909 isin 119877119872 andΦ is a higher dimensional projection vec-tor For the classification problem of two categories the datasets are not always linearly separable in the original spacethen Φ projects the original data sets into a higher dimen-sional space the so-called feature space where the data setscan be linearly separable However Φ is inexplicit in thepractical application and only the dot product from Φ(119909119894) sdotΦ(119909119895) is necessary to be known 119870 represents the kernelfunction Φ(119909119894) sdot Φ(119909119895) The most commonly used kernelfunction is the Gaussian

119870(119909119894 119909119895) = exp(minus10038171003817100381710038171003817119909119894 minus 11990911989510038171003817100381710038171003817221205902 ) (3)

In order to solve optimization problem (1) Lagrangemul-tipliers 119886119894 ge 0 and 120573119894 ge 0 (119894 = 1 119873) are introduced andthe Lagrange equation is formed as

119871 (w 120585 120588 a120573) = w22 + 1119873V

119873sum119894=1

120585119894 minus 120588minus 119873sum119894=1

119886119894 (w sdot Φ (119909119894) minus 120588 + 120585119894)minus 119873sum119894=1

120573119894120585119894(4)

The partial derivatives of the Lagrangian equation withrespect to w 120585 and 120588 are set to zero Then w and 119886119894 can beformulated as

w = 119873sum119894=1

119886119894Φ(119909119894) 119886119894 = 1119873V

minus 120573119894119873sum119894=1

119886119894 = 1(5)

Substitute (5) into Lagrangian equation (4) and its dualform is presented as

min a119879119867a

subject to 0 le 119886119894 le 1119873V119873sum119894=1

119886119894 = 1(6)

where a = [119886119894 119886119873]119879 and 119867 is the kernel matrix and thefactor of119867 that is119867119894119895 which can be expressed as

119867119894119895 = 119870 (119909119894 119909119895) = Φ (119909119894) sdot Φ (119909119895) (7)

Solve optimization problem (6) to get a and then 120588 can begiven as

120588 = 1119899119904119899119904sum119894=1

119873sum119895=1

119886119895119870(119909119894 119909119895) a (8)

where 119899119904 is the number of Support Vectors

4 Shock and Vibration

Tax and Duin proposed another form of OC-SVMwhichis called Support Vector Data Description [21]The basic ideaof SVDD is to construct aminimum-volume hypersphere in ahigh-dimensional feature space to enclose as much as normaldata points Both of these two forms of SVM have equivalentsolution if the diagonal entries of kernel matrix 119867 are equalto a constant

22 Dimensionality Reduction Techniques Working withhigh-dimensional data sets could complicate the learningpart of novelty detection methods not only because of possi-ble presence of noise and redundancy in the data but for otherreasons as well [22] The empty space phenomenon statesthat to cover the whole space a number of samples that growexponentially with dimensionality are needed The curse ofdimensionality implies that in order to learn successfully anumber of training examples that also grow exponentiallywith the dimensionality are needed The ldquoconcentration ofmeasurerdquo phenomenon seems to render distance measuresnot relevant to whatever concept is to be learnt as thedimension of the data increased For these reasons amongothers there is a necessity to apply dimensionality reductiontechniques in novelty detection applications Dimensionalityreduction strategies differ in the question of whether thelearning process is supervised or unsupervised The differ-ence between both learning processes is the availability oflabels to distinguish the different classes

Principal Component Analysis (PCA) is one of the mostcommonly used techniques for unsupervised dimensionalityreduction [23] It aims to find the linear projections that bestcapture the variability of the data

Another well-known dimensionality reduction techniqueis the Laplacian Score (LS) where the merit of each feature ismeasured according to its locality preservation power [24]A nearest neighbor based graph is constructed from thetraining set and analyzed to rank each feature individuallyaccording to a weighting approach selected for the graphedges If the labels of the classes of the analyzed data set aregiven the method changes to a supervised approach and thegraph is created based on these labels instead of the nearestneighbor approach Two variations for weighting the edgescan be applied Heat Kernel and Simpleminded To rank eachfeature its Laplacian Score is computed which is a measureof which extent the analyzed feature preserves the structurepresent in the graph divided by the variance of the feature Fora feature to be selected it must have a low LS which implieshigh variance and locality

Linear discriminant analysis (LDA) is one of the mostwell-known supervised techniques for linear dimensionalityreduction inmulticlass problems [25] LDA searches attemptsto maximize the linear separation between data pointsbelonging to different classes In contrast to most of theother dimensionality reduction techniques LDA as a featureextraction technique finds a linear mapping that maximizesthe linear class separation in the low-dimensional representa-tion of the data The criteria that are used to formulate linearclass separation in LDA are the within-class scatter and thebetween-class scatter

Another variant based on the Fisher coefficient is a featuredimensionality reduction approach called Fisher Score (FS)where the objective is to find a subset of features whichmaximize the Fisher coefficient [26] The main differencebetween LDA and Fisher Score is that LDA is a featureextraction approach and Fisher Score is a feature selectionapproach then instead of extracting new features fromthe subspace obtained FS ranks the features and selectthe reduced subspace of features selected which maximizesthe dimensionality reduction criterion in this case the Fishercriterion

The aforementioned linear feature reduction techniqueshave a different combination of objective (variance preserva-tion topology preservation and discrimination) andmethodof employment (unsupervised or supervised and extractionor selection) and have been widely used in the literature withsuccessful results [16 22] yet they have been applied under astatic framework where the set of features is reduced initiallyand no possible adaptation during the monitoring phase isconsidered

3 Methodology

Given that the data initially available is usually related to thehealthy condition of the machine under analysis condition-based monitoring schemes must be designed to overcometwo main challenges

(i) The identification of significant features to deal withthe characterization of the known conditions of the machineunder the consideration that the occurrence of additionalunknown faults must be detected

(ii) The adaptation of the condition-based monitoringscheme to update the considered data base of the machineonce unknown fault scenarios have been detected

Such requirements are addressed in this work by meansof the proposed adaptive novelty detection method shownin Figure 1 This methodology represents an important stepto the introduction of adaptive novelty detection schemes tothe development of electromechanical system diagnosis pro-cedures

The proposed method is composed of two stages anoffline stage and an online monitoring stageThemain objec-tives of the offline stage are first the analysis of the infor-mation available of the monitored machine to find a reducedset of numerical features to characterize the known machineconditions and second the design of the novelty model bymeans of the selection of the configuration parameters andtraining

Once a reduced set of characteristic features is obtainedand the novelty model is designed the online stage is carriedout During the online stage new measurements are contin-uously compared with the normality threshold 119879119899 definedduring the novelty model training in the offline stageThus ifa novel scenario is detected the supervision of an expert useris proposed in order to confirm and label the new conditionof the machine consequently the monitoring system isretrained to include the characteristics of the novel scenarioDetailed information of each stage and the retraining processis described in Sections 31ndash33

Shock and Vibration 5

Database Feature calculation

Feature reduction

Novelty model design

New measurement

Calculation of reducedfeatures

NoveltyUser

supervisionKnown

operation False alarm

Offl

ine

Onl

ine

RetrainingIncluding

information of new fault

No

Yes

Noveltymodel

Figure 1 Proposedmethodology for the novelty detection approachThemonitoringmethod is composed of an offline stage for initializationand retraining and an online stage for continuous monitoring

31 Offline Stage During the initialization it is assumed thatonly information of the machine operating under healthycondition is available in the database The first step is thecalculation of numerical features from the vibration mea-surements obtained during the machine operation Since theinformation of the possible faults of the monitored machineis not available yet during this initialization a generic set ofstatistical time-based numerical features is proposed to beextracted from each available vibration axis measurementThe proposed set of potential statistical time-frequency fea-tures is shown as follows

Max Value

119894 = max (119909) (9)

Root Mean Square (RMS)

RMS = radic 1119899 sdot119899sum119896=1

(119909119896)2 (10)

Crest Factor (CF)

CF = 119909119896RMS

(11)

Kurtosis

119896 = 119864 [(119909119896 minus 119909)4]1205904 (12)

Shape Factor

SF = RMS1119899 sdot sum119899119896=1 10038161003816100381610038161199091198961003816100381610038161003816 (13)

These features have been successfully employed for faultdetection in the last years [1 27]

The resulting number of numerical features is propor-tional to the number of available vibration axes collectedduring the acquisition However in order to allow the com-pression and visualization of the data a feature reductionmodule is implemented During such offline stage initializa-tion an unsupervised feature reduction approach must be

used a Laplacian Score ranking is proposed in this work asa good trade-off between simplicity and performance [24]to rank the features according to the topology preservationcapabilities The two or three first ranked features in terms ofLaplacian Score are selected

Next the novelty model is designedThere is a significantnumber of noveltymodels proposed in the literature [4] witheach one demonstrated to be a capable option under certaincircumstances An increasing amount of works implies thatdomain based novelty detection models present promisingresults [19 28] In this work a standard OC-SVMwith Gaus-sian kernel is used The design of the novelty model includesthe selection of the parameters for configuration and trainingemploying the known scenarios stored at the database Thenthe initialization of the offline stage finalizes with the designof the OC-SVM

32 Online Stage This stage continuously monitors thecondition of the machine to detect if an anomaly is presentTo accomplish this each of the new measurements of themachine acquired certain amount of predefined time Eachmeasurement is segmented and a set of features is calculatedfrom each segment The set of numerical features calculatedin this stage are reduced in the offline stage by means ofLaplacian Score

Thus each new measurement characterized by thenumerical features is analyzed by the novelty model In caseof no novelty detection it is assumed that the machine isworking under known conditions However if the analyzedmeasurement is detected as a novelty an alarm is triggeredin order to consider the user assessment Then if theoccurrence of a new scenario is confirmed the correspondingmeasurements are stored at the database and a retrainingprocedure is performed

33 Retraining Once the retraining is trigged the featurereduction and the novelty model design modules are modi-fied at the offline stage A diagram of the retraining procedureis presented in Figure 2

It must be noticed that two important contributiveaspects are proposed in this retraining approach first thereconsideration of the feature reductionmodule and secondthe incorporation of fault scenarios to the novelty model As

6 Shock and Vibration

New Fault Detected

Continuos Monitoring

Add Information to DDBB

No

Yes

Perform New Feature

Reduction

Novelty model redesign

A supervised method will be

used

Measurementsof the new fault

Data of all stored scenarios

included

Retr

aini

ng

New faultdetected

Continuousmonitoring

Add information to DDBB

Perform new feature

reduction

Novelty modelredesign

Figure 2 Proposed retraining approach First measurements char-acterizing the fault are stored and then the feature reductionmoduleand the novelty design module are modified to incorporate the newscenario encountered

it has been explained the feature reduction module duringits initialization is supported by an unsupervised LaplacianScore ranking due to the lack of additional scenarios yet oncethe information of a fault is available the feature reductionmodule could improve its performance by employing super-vised methods

Then the Laplacian approach is replaced by a FisherScore ranking approach in this work where the features aresorted according to the Fisher coefficient calculated fromeach feature It is important to mention that Laplacian Scorecan be configured towork also under a supervised frameworkand could obtain similar results than employing a Fisherapproach similarities between the two approaches are dis-cussed in [24] Nevertheless Fisher Score is used in thismethodology to search for a possible discriminative scenarioto increase the detection of novel scenarios A comparison ofresults obtained from both features reduction approaches ispresented in the results of this work

The consideration of a faulty scenario in the noveltymodel may contradict the principle of anomaly detectionwhere the objective is to detect healthy behaviors from therest Nevertheless the aim of an adaptive condition mon-itoring system should be to learnt from all the identifiedconditions to subsequently detect them if they are presentedagain by a fault detection module Indeed some workspresent a parallel structure for fault detection and noveltydetection modules [18 19] where the novelty module learnsthe known faults along with the healthy operation because itsobjective is to detect only new scenarios while the objectiveof the fault detection is to identify the condition of the

Gearbox

Load

VFD

DAS

Inductionmotor

AccelerometerEncoder

PC

z

yx

Figure 3 Test bench used for experimentation

machine including known faults This work is based on suchparallel structure approach where the fault scenariosmust betaken into consideration to be included in the novelty modeland then they could be identified by a complementary faultdetection module with a high confidence level

4 Experimental Results

41 Description of the Experimental Platform The test benchused for testing the kinematic chain with different faults andthe acquisition system used to capture the vibration signalsare shown in Figure 3 The test bench consists in a 1492-W-three-phase induction motor (WEG 00236ET3E145T-W22)with its rotational speed controlled through a variable fre-quency drive-VFD (WEG CFW08) the operating speed isfixed to 60Hz for all experiments It also consists of a 4 1 ratiogearbox (BALDORGCF4X01AA) and a DC generator (BAL-DOR CDP3604) which is used as the mechanical load com-prising around 20of the nominal loadThe vibration signalsare acquired using a triaxial accelerometer (LIS3L02AS4)mounted on a board with the signal conditioning andantialias filtering Two 12-bit 4-channel serial output sam-pling analog-to-digital converters (ADS7841) are used in thedata acquisition system (DAS) board Vibration signal fromthe perpendicular plane (119910-119911) to the motor shaft is acquiredusing a triaxial accelerometer LIS3L02AS4 mounted on aboard with the signal conditioning and antialiasing filteringSampling frequency is set to 3 kHz for vibration acquisitionThe data retrieved by the DAS is stored in a regular computer(PC)

Three scenarios are considered to verify the performanceof the proposed method the first one 119867 is the kinematicchain working under healthy condition and the other two 1198651and 1198652 represent the kinematic chain working under faultyconditions For 1198651 the motor is working with a half brokenbar and for 1198652 the motor is working with a fully brokenbar The detail of the failures is shown in Figure 4 The halfbroken bar failure is artificially produced by drilling a 6mmhole with a depth of 3mm that corresponds mostly to the

Shock and Vibration 7

(a) (b)

Figure 4 Detail of the faults produced in the test bench (a) corresponds to the 12 broken rotor bar and (b) to one broken rotor bar

22 of the section of the rotor bar and the full broken baris produced by a through-hole with a diameter of 6mm anda depth of 14mm which corresponds to the complete sectionof the rotor bar

The information stored from the kinematic chain consistsof an acquisition of 60 seconds of themachine working underthe three scenarios mentioned each acquisition is segmentedin 30 parts of 2 seconds and a set of features is calculatedfrom the 30 segmented measurements Since two axes aretaken into consideration a total of ten features are calculatedfrom each segmented acquisition of the machine workingunder the different scenarios mentioned The first step ofthe methodology is the offline stage where a reduced set offeatures is obtained and the novelty model is designed

42 Parameter Selection Regarding the Laplacian Score con-figuration a simple approach is followed for parameter tun-ing that is a value of 119896 = 3 is used for constructing the adja-cency graph and a ldquosimple mindedrdquo weighting approach isfollowed Since the proposed approach is compared to differ-ent feature reduction modules selecting generic parameterssettings is useful for the purpose of evaluation but ignores thatthere may be dependencies between the feature reductionmodel and the novelty model Regarding the design of thenovelty model the kernel used is the Gaussian and the valueof the configuration parameter is tuned tominimize the errorin the validation In all experiments 80 of the samplesare used for training and 20 for validation To train andadjust the parameter of the novelty model a fivefold cross-validation is used

43 Results and Discussions In order to highlight the contri-butions andmotivation of this work the outline of the resultsis presented as follows first a test is performed by a classicalapproach then the proposedmethodology is applied and theresults are compared The classical static approach impliesconserving the reduced features set obtained at the offline ini-tialization stage meanwhile the proposed dynamic approachimplies a possible reformulation of the reduced features setduring the retraining stage

Healthy

0

005

01

015

02

minus2 minus1minus3 1 2 30LS feature 1 RMS Y

minus3

minus2

minus1

0

1

2

3LS

feat

ure 2

kur

tosis

Y

Figure 5 Initial novelty model representation Limit of the noveltythreshold sdot sdot sdot and measurements used to train the model lowast

During the initialization stage the reduced set of featuresselected bymeans of the Laplacian Score ranking is composedof the RMS of the 119910-axis and the Kurtosis of the 119910-axisThe OC-SVM model is trained employing healthy conditiondata The resulting OC-SVM during training is shown inFigure 5 The marks lowast in the figure represent measurementsof the machine used to train the model in this case andin the subsequence figures only 1-fold of the fivefold cross-validation is displayedThe dotted line represents the noveltythreshold value all data inside the boundaries of the dottedline is considered normal The contour plot represents thenovelty score evaluation over different regions of the featuremap

Once the offline initialization stage is finished the onlinestage follows that is new measurements are obtained toassess the condition of the monitored machine To giverobustness to the novelty detection and avoid false alarms abatch consisting of 30 measurements is evaluated if 75 of

8 Shock and Vibration

0

005

01

015

02

minus4

minus3

minus2

minus1

0

1

2

3

4

5

60 2 4 8 10minus2minus4LS feature 1 RMS Y

KnownF1

LS fe

atur

e 2 k

urto

sis Y

Figure 6 Evaluation of the fault scenario 1198651 The novelty model istrained employing data from healthy operation condition

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

minus001

0

001

002

003

004

005

006

007

008

Known

LS fe

atur

e 2 k

urto

sis Y

Figure 7 Contour plot of the novelty model after including 1198651

the analyzed measurements are evaluated as novelty then thealarm is triggered Next the 1198651 scenario is presented to testthe performance of the model The plot of the scenario andthe novelty threshold obtained during training is presentedin Figure 6

As can be appreciated the new scenario lays outside thenovelty threshold so it is successfully detected as noveltyOnce a novel scenario is detected and identified as a fault bythe user it is incorporated to the database to consider it as partof the known scenarios and the novelty model is retrainedwithout changing the features to include this informationFigure 7 shows the feature space after the novelty modelis trained using healthy and 1198651 data as part of the knownscenarios

Once the model is retrained the third scenario 1198652 isintroduced The visual representation of the test is presentedin Figure 8 this scenario is not detected as novel because only50 of the measurements are labeled as novel by the model

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

KnownF2

LS fe

atur

e 2 k

urto

sis Y

Figure 8 Evaluation of the fault scenario 1198652 The novelty model istrained employing data from healthy and 1198651 scenarios

A similar result is obtained when the novelty model istrained using healthy and 1198652 data and is tested with data of1198651 A summary of the results of novelty detectionmaintainingthe same features obtained during initialization is shown inTable 1

As it can be seen in Table 1 using the reduced set obtainedduring initialization is easy to detect the novel scenarioswhen there is only information of the healthy condition butwhen a novel scenario is included in the database and thenoveltymodel is retrained the reduced set of features initiallyobtained does not necessarily provide a good representationto detect a new scenario during test

To improve these results the methodology presented inthis work proposes to evaluate again the feature reductionmodule each time a retraining is applied Following theoutline presented for results and parting from the first retrainwhere the scenario 1198651 is included in the database the featurereduction module is applied again but this time includinginformation from healthy condition and 1198651 scenario Sincetwo scenarios are taken into consideration and the labels areknown a supervised approach can be applied in this case aFisher Score ranking for feature reduction is employed Thenovelty model using the initial set of features and the noveltymodel after retraining with the new reduced set of featuresare shown in Figure 9

The new reduced set of features is composed of the RMSof the 119910-axis and the Kurtosis of the 119911-axis as can be appre-ciated the new set of features present a more discriminativedistribution of the three scenarios considered At the laststep (Figure 9(d)) when 1198652 scenario is included the FisherScore still ranked the same features of the last retraining(Figure 9(b)) as the highest yet it would be possible thata different set of features would be obtained The resultsobtained demonstrate the advantages of including the FisherScore reduction module to the retraining procedure the newdistribution of the scenarios avoided an overlapping of the

Shock and Vibration 9

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

0

005

01

015

02

KnownF1

LS fe

atur

e 2 k

urto

sis Y

(a)

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus001

0

001

002

003

004

005

006

007

008

Known

FS fe

atur

e 2 k

urto

sis Z

(b)

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

FS fe

atur

e 2 k

urto

sis Z

minus5

minus4

minus3

minus2

minus1

0

1

2

3

KnownF2

(c)

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus0005

0

0005

001

0015

002

0025

003

0035

Known

FS fe

atur

e 2 k

urto

sis Z

(d)

Figure 9 Process of evaluation and retraining employing the methodology proposed (a) Evaluation of the fault scenario 1198651 (b) Retrainingof the novelty model and reformulation of the reduced set of features including 1198651 (c) Evaluation of the fault scenario 1198652 (d) Retraining ofthe novelty model and reformulation of the reduced set of features including 1198652Table 1 Performance of the novelty detection using only healthy class data to reduce the number of features where DR stands fordimensionality reduction Different scenarios are included according to the information available to train and test the novelty model

DRPerformance using OC-SVM ()

Known Test Known Test Known Test119867 1198651 + 1198652 119867 + 1198651 1198652 119867 + 1198652 1198651LS 954 (plusmn11) 576 (plusmn67) 435 (plusmn64)

new scenario under test therefore the noveltymodel success-fully detected as novelty all measurements corresponding tothe 1198652 scenario

It is worth mentioning that if initially the scenario 1198652is used for training and the scenario 1198651 is used for testingthe new set of features obtained could be different from theaforementionedThe results achieved fromboth scenarios are

shown in Table 2 which also includes a comparison of theresults obtained employing PCA LDA and Laplacian Scoredimensionality reduction techniques in the retraining stepinstead of the Fisher Score proposed

Regarding the classical feature extraction techniquesPCA and LDA the test scenario is not identified as novelin both cases The Fisher Score and the Laplacian Score

10 Shock and Vibration

Table 2 Performance of the novelty detection employing a reduc-tion of features during retraining Different scenarios are includedaccording to the information available to train and test the noveltymodel

DRPerformance using OC-SVM ()

Known Test Known Test119867 + 1198651 1198652 119867 + 1198652 1198651PCA 590 (plusmn52) 642 (plusmn51)LDA 474 (plusmn61) 512 (plusmn61)LS 843 (plusmn24) 100 (plusmn00)Fisher S 100 (plusmn00) 100 (plusmn00)successfully identified the new scenarios as novel since thepercentage obtained in both cases is higher than the 75 pre-defined threshold in the methodology to activate the alarmBoth techniques achieved high scores but still the FisherScore provided a more appropriate selection of features

As mentioned in Section 2 LDA is a feature extractiontechnique based on the Fisher discriminant coefficient sosimilar results between Fisher Score ranking and LDA areexpected however employing Fisher Score ranking scoreachieved a better result this can be caused because the testconsists of novel scenarios and LDA finds the directions onthe feature space specialized for the two supervised scenariosemployed during training meanwhile Fisher Score rankingprovides a more general approach by selecting features

To test the robustness of the Laplacian Score and FisherScore approaches a comparative test is performed where theset of features is increased from 10 to 15 and by varyingthe number of reduced features selected The five featuresincluded in the original set are obtained from the119909-axis of theaccelerometer monitored where the features calculated arepresented in the aforementioned statistical time-frequencyfeatures these features were discarded initially because theyare not part of the perpendicular plane of the motor anddo not contribute significantly to the monitoring in factincluding these features could affect the performance of thefeature reduction and novelty detection modules The resultsare presented in Table 3

The features obtained from the Fisher Score still present abetter distribution to detect the new scenariosThe LaplacianScore performance is affected when irrelevant features areincluded in the feature set but increases when more dimen-sions are taken into consideration Since the objective of theFisher Score is not topology preservation contrary to LS itis capable of discarding all the irrelevant features that wereincluded and the performance is not affected

5 Conclusions

This work proposes an adaptive novelty detection methodol-ogy based on vibration analysis for the condition monitoringand diagnosis of the components of a kinematic chain Themethodology is based on the acquisition of vibration signalsthat are generated in the kinematic chain along with anadequate signal processing to extract features to characterizethe components and an adaptive novelty detection model to

Table 3 Performance of the novelty detection increasing the num-ber of initial features from 10 to 15 and varying the number of thereduced set of features

DRPerformance varying the number of features

reduced ()2 3 4

LS 590 (plusmn41) 643 (plusmn33) 886 (plusmn25)Fisher S 100 (plusmn00) 100 (plusmn00) 100 (plusmn00)detect anomalies The method is composed of two sequentialstages an offline stage to initialize and retrain the modulesand an online stage to continuously assess the conditionof the machine During initialization the model is trainedemploying only information from the machine workingunder healthy condition and two additional faulty scenariosare introduced to test the performance of the method underunknown operations

The adaptive novelty detection approach successfullydetected both novel scenarios and the model incorporatedthe information to avoid generating alarms if the same faultis detected A comparison between the proposed methodand classical dimensionality reduction approaches highlightsthe limitations of maintaining a static set of features duringmonitoring instead of reformulating the feature reductionmodule once new information is available In this particularstudy employing features reduced by Laplacian Score andFisher Score obtained similar results nevertheless it doesnot imply that a similar outcome will be present during theanalysis of other faults Fisher Score is encouraged to beemployed in this methodology rather than maintaining theLaplacian Score approach due to the similarity between themethod objective and the objective function of Fisher Scorein both the ideal case is to find the features to maximizethe distance between scenarios while maintaining compactclusters A specific comparison of performances between LSand Fisher Score was also included in which the Fisher Scoreobtained better results when irrelevant features are includedin the original set of features and when the dimensionalityof the reduced set is increased this highlights the advantagesand robustness of the feature selection approach by FisherScore compared to the LS

From an industrial perspective two aspects should bementioned First dealing with an identical operating sce-nario (ie healthy) the variation of the working conditions(speed and torque patterns) will affect the resulting valueof the estimated numerical features and consequently itsrepresentation in the feature space used for novelty modeldesign Thus the introduction of the working condition asan additional degree of freedom in the novelty detectionscheme implies the increase of the dimensional space thatthe novelty detectionmodel is managingThis fact implies anincrease of the risk of data overlapping (different scenariosunder different operating conditions showing similar featurevalues) and also an increase of complexity for boundariesdefinition during the novelty modelling (data spread) Thusthe proposed method is not limited to stationary conditionsbut to the repetition of the pattern of working condition used

Shock and Vibration 11

to train the model Indeed in practical industrial applicationwhere cyclostationary processes take place the proposedmethod should be executed during the same period (the sameworking conditions) in order to reduce data variability andincrease novelty performance In case of multiple workingconditions consideration parallel novelty model structurescould be generated in order to activate each one dependingon the corresponding working condition

Second the integration of additional physicalmagnitudesas stator currents temperatures and others represents atrendy approach in electromechanical fault diagnosis inorder to improve fault diagnosis resolution and perfor-mance The corresponding data fusion scheme implies ahigh-dimensional feature space that is multiple numericalfeatures estimated from multiple physical magnitudes con-sidered In this point it must be noted that the presentedmethodology proposes the reevaluation of the feature reduc-tionmodule each time a retraining is appliedThus the largerthe available pool of features is the higher the potentiality ofthe proposedmethod to increase the performance during thedata characterization is

Also the proposed method can be extended andimproved for further development this improvement couldinclude a diagnosis method to not only detect anomalies inthe kinematic chain but also identify the fault causing theabnormal behavior In this sense future work will include aspecific comparison of broken bar fault detection capabilitiesbetween the proposedmethodology and classical approaches

Competing Interests

The authors declare that they have no competing interests

Acknowledgments

This research was partially supported by CONACyT schol-arships Grants 278033 and 313604 and also by the SpanishMinistry of Education Culture and Sport under GrantFPU1300589 The authors would also like to thank a partialfinancial support by the Generalitat de Catalunya (GRCMCIA Grant no SGR 2014-101)

References

[1] H Henao G-A Capolino M Fernandez-Cabanas et alldquoTrends in fault diagnosis for electrical machines a review ofdiagnostic techniquesrdquo IEEE Industrial Electronics Magazinevol 8 no 2 pp 31ndash42 2014

[2] N Bayar S Darmoul S Hajri-Gabouj and H Pierreval ldquoFaultdetection diagnosis and recovery using Artificial Immune Sys-tems a reviewrdquoEngineeringApplications of Artificial Intelligencevol 46 pp 43ndash57 2015

[3] SWang JWan D Zhang D Li and C Zhang ldquoTowards smartfactory for Industry 40 a self-organized multi-agent systemwith big data based feedback and coordinationrdquo ComputerNetworks vol 101 pp 158ndash168 2016

[4] M A F Pimentel D A Clifton L Clifton and L TarassenkoldquoA review of novelty detectionrdquo Signal Processing vol 99 pp215ndash249 2014

[5] R Langone O Mauricio Agudelo B De Moor and J AK Suykens ldquoIncremental kernel spectral clustering for online

learning of non-stationary datardquo Neurocomputing vol 139 pp246ndash260 2014

[6] M-H Chen P-C Chang and J-L Wu ldquoA population-basedincremental learning approach with artificial immune systemfor network intrusion detectionrdquo Engineering Applications ofArtificial Intelligence vol 51 pp 171ndash181 2016

[7] S Khomfoi and L M Tolbert ldquoFault diagnosis and reconfigu-ration for multilevel inverter drive using AI-based techniquesrdquoIEEE Transactions on Industrial Electronics vol 54 no 6 pp2954ndash2968 2007

[8] A Bellini F Filippetti C Tassoni and G-A CapolinoldquoAdvances in diagnostic techniques for induction machinesrdquoIEEE Transactions on Industrial Electronics vol 55 no 12 pp4109ndash4126 2008

[9] V T Tran and B-S Yang ldquoAn intelligent condition-basedmaintenance platform for rotating machineryrdquo Expert Systemswith Applications vol 39 no 3 pp 2977ndash2988 2012

[10] P A Delgado-Arredondo A Garcia-Perez D Morinigo-Soteloet al ldquoComparative study of time-frequency decompositiontechniques for fault detection in induction motors using vibra-tion analysis during startup transientrdquo Shock and Vibration vol2015 Article ID 708034 14 pages 2015

[11] M D Prieto G Cirrincione A G Espinosa J A Ortegaand H Henao ldquoBearing fault detection by a novel condition-monitoring scheme based on statistical-time features and neu-ral networksrdquo IEEE Transactions on Industrial Electronics vol60 no 8 pp 3398ndash3407 2013

[12] P Perner ldquoConcepts for novelty detection and handling basedon a case-based reasoning process schemerdquo Engineering Appli-cations of Artificial Intelligence vol 22 no 1 pp 86ndash91 2009

[13] C Englund and A Verikas ldquoA hybrid approach to outlier detec-tion in the offset lithographic printing processrdquo EngineeringApplications of Artificial Intelligence vol 18 no 6 pp 759ndash7682005

[14] H Garces and D Sbarbaro ldquoOutliers detection in environmen-tal monitoring databasesrdquo Engineering Applications of ArtificialIntelligence vol 24 no 2 pp 341ndash349 2011

[15] S Ma S M Li and Y P Xiong ldquoUncertainty reduced noveltydetection approach applied to rotatingmachinery for conditionmonitoringrdquo Shock and Vibration vol 2015 Article ID 73721310 pages 2015

[16] J McBain and M Timusk ldquoFeature extraction for noveltydetection as applied to fault detection in machineryrdquo PatternRecognition Letters vol 32 no 7 pp 1054ndash1061 2011

[17] S Wang J Yub E Lapirac and J Lee ldquoA modified supportvector data description based novelty detection approach formachinery componentsrdquo Applied Soft Computing vol 13 no 2pp 1193ndash1205 2013

[18] D P Filev R B Chinnam F Tseng and P Baruah ldquoAn indus-trial strength novelty detection framework for autonomousequipment monitoring and diagnosticsrdquo IEEE Transactions onIndustrial Informatics vol 6 no 4 pp 767ndash779 2010

[19] B S J Costa P P Angelov and L A Guedes ldquoFully unsu-pervised fault detection and identification based on recursivedensity estimation and self-evolving cloud-based classifierrdquoNeurocomputing vol 150 pp 289ndash303 2015

[20] B Scholkopf R C Williamson A J Smola J Shawe-Taylorand J C Platt ldquoSupport vector method for novelty detectionrdquoin Neural Information Processing Systems vol 12 pp 582ndash5881999

12 Shock and Vibration

[21] D M J Tax and R P W Duin ldquoSupport vector domaindescriptionrdquo Pattern Recognition Letters vol 20 no 11ndash13 pp1191ndash1199 1999

[22] S D Villalba and P Cunningham ldquoAn evaluation of dimensionreduction techniques for one-class classificationrdquo ArtificialIntelligence Review vol 27 no 4 pp 273ndash294 2007

[23] I T Jolliffe Principal Component Analysis Springer New YorkNY USA 2nd edition 1986

[24] X He D Cai and P Niyogi ldquoLaplacian score for featureselectionrdquo inAdvances in Neural Information Processing Systems18 pp 507ndash514 MIT Press 2005

[25] A J Izenman ldquoLinear discriminant analysisrdquo in Modern Mul-tivariate Statistical Techniques A J Izenman Ed pp 237ndash280Springer New York NY USA 2008

[26] Q Gu Z Li and J Han ldquoGeneralized fisher score for featureselectionrdquo httpsarxivorgabs12023725

[27] Z Gao C Cecati and S X Ding ldquoA survey of fault diagnosisand fault-tolerant techniquesmdashpart II fault diagnosis withknowledge-based and hybridactive approachesrdquo IEEE Trans-actions on Industrial Electronics vol 62 no 6 pp 3768ndash37742015

[28] A E Lazzaretti D M J Tax H Vieira Neto and V HFerreira ldquoNovelty detection and multi-class classification inpower distribution voltage waveformsrdquo Expert Systems withApplications vol 45 pp 322ndash330 2016

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Page 3: Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

Shock and Vibration 3

The proposed methodology is tested using real datafrom a laboratory based kinematic chain where faults areinduced and the vibration signals from two accelerometersare monitored to assess the system condition To highlightthe benefits of the methodology proposed a comparison isperformed of classical feature reduction approaches to theadaptive feature reduction scheme proposed

This paper is organized as follows In Section 2 the-oretical aspects of the proposed method are describedSection 3 describes the method presented The experimentalsetup employed for assessment and the results obtained arepresented anddiscussed in Section 4Andfinally conclusionsand future work are summarized in Section 5

2 Theoretical Background

21 One-Class Support Vector Machine As mentioned theobjective of novelty detection is to detect abnormal eventsthat differ in some manner from the information of whichthemodel was trainedTheone-class SupportVectormachine(OC-SVM) which is a domain based model successfullyemployed in condition monitoring applications encloses thetraining data by means of a boundary or threshold whichrepresents the limit of normality The OC-SVM presents thepossibility to include a kernel formulation providing it withenough flexibility to adapt to the distribution of the trainingdata To assess the condition of the machine under test newmeasurements are analyzed regarding their position to theboundary created in the training phase The test measure-ments underlying inside the boundary are considered ldquonor-malrdquo or ldquoknownrdquo while the measurements underlying outsidethe boundary are considered ldquooutlierrdquo or ldquonovelrdquo Comparedto other novelty detection techniques the OC-SVM is robustto outliers during the training phase but has the disadvantageof presenting several configuration parameters which aredescribed later in this section

The OC-SVM was proposed by Scholkopf et al for esti-mating the support of a high-dimensional distribution [20]The OC-SVM classification objective is to separate one classof target samples from all other class samples In this typeof problem one class is characterized properly called targetclass while for the other class usually no measurements areavailable Considering 119883 = [1199091 119909119873]119879 isin 119877119873times119872 whichdenotes the normal data set and 119909119894 119894 = 1 119873 whichdenotes training samples (available measurements) charac-terized by119872 numerical features then in order to obtain theboundary an optimization model is considered as follows

min w22 + 1119873V

119873sum119894=1

120585119894 minus 120588Subject to w sdot Φ (119909) ge 120588 minus 120585119894 120585119894 ge 0

(1)

where V is a regularization parameter and 120585119894 is the slackvariable for the point119909119894The constantsw and 120588 are the normalvector and offset of the hyperplane respectively Thus thedecision boundary can be formulated as

119891 (119909) = w sdot Φ (119909) minus 120588 (2)

where 119909 isin 119877119872 andΦ is a higher dimensional projection vec-tor For the classification problem of two categories the datasets are not always linearly separable in the original spacethen Φ projects the original data sets into a higher dimen-sional space the so-called feature space where the data setscan be linearly separable However Φ is inexplicit in thepractical application and only the dot product from Φ(119909119894) sdotΦ(119909119895) is necessary to be known 119870 represents the kernelfunction Φ(119909119894) sdot Φ(119909119895) The most commonly used kernelfunction is the Gaussian

119870(119909119894 119909119895) = exp(minus10038171003817100381710038171003817119909119894 minus 11990911989510038171003817100381710038171003817221205902 ) (3)

In order to solve optimization problem (1) Lagrangemul-tipliers 119886119894 ge 0 and 120573119894 ge 0 (119894 = 1 119873) are introduced andthe Lagrange equation is formed as

119871 (w 120585 120588 a120573) = w22 + 1119873V

119873sum119894=1

120585119894 minus 120588minus 119873sum119894=1

119886119894 (w sdot Φ (119909119894) minus 120588 + 120585119894)minus 119873sum119894=1

120573119894120585119894(4)

The partial derivatives of the Lagrangian equation withrespect to w 120585 and 120588 are set to zero Then w and 119886119894 can beformulated as

w = 119873sum119894=1

119886119894Φ(119909119894) 119886119894 = 1119873V

minus 120573119894119873sum119894=1

119886119894 = 1(5)

Substitute (5) into Lagrangian equation (4) and its dualform is presented as

min a119879119867a

subject to 0 le 119886119894 le 1119873V119873sum119894=1

119886119894 = 1(6)

where a = [119886119894 119886119873]119879 and 119867 is the kernel matrix and thefactor of119867 that is119867119894119895 which can be expressed as

119867119894119895 = 119870 (119909119894 119909119895) = Φ (119909119894) sdot Φ (119909119895) (7)

Solve optimization problem (6) to get a and then 120588 can begiven as

120588 = 1119899119904119899119904sum119894=1

119873sum119895=1

119886119895119870(119909119894 119909119895) a (8)

where 119899119904 is the number of Support Vectors

4 Shock and Vibration

Tax and Duin proposed another form of OC-SVMwhichis called Support Vector Data Description [21]The basic ideaof SVDD is to construct aminimum-volume hypersphere in ahigh-dimensional feature space to enclose as much as normaldata points Both of these two forms of SVM have equivalentsolution if the diagonal entries of kernel matrix 119867 are equalto a constant

22 Dimensionality Reduction Techniques Working withhigh-dimensional data sets could complicate the learningpart of novelty detection methods not only because of possi-ble presence of noise and redundancy in the data but for otherreasons as well [22] The empty space phenomenon statesthat to cover the whole space a number of samples that growexponentially with dimensionality are needed The curse ofdimensionality implies that in order to learn successfully anumber of training examples that also grow exponentiallywith the dimensionality are needed The ldquoconcentration ofmeasurerdquo phenomenon seems to render distance measuresnot relevant to whatever concept is to be learnt as thedimension of the data increased For these reasons amongothers there is a necessity to apply dimensionality reductiontechniques in novelty detection applications Dimensionalityreduction strategies differ in the question of whether thelearning process is supervised or unsupervised The differ-ence between both learning processes is the availability oflabels to distinguish the different classes

Principal Component Analysis (PCA) is one of the mostcommonly used techniques for unsupervised dimensionalityreduction [23] It aims to find the linear projections that bestcapture the variability of the data

Another well-known dimensionality reduction techniqueis the Laplacian Score (LS) where the merit of each feature ismeasured according to its locality preservation power [24]A nearest neighbor based graph is constructed from thetraining set and analyzed to rank each feature individuallyaccording to a weighting approach selected for the graphedges If the labels of the classes of the analyzed data set aregiven the method changes to a supervised approach and thegraph is created based on these labels instead of the nearestneighbor approach Two variations for weighting the edgescan be applied Heat Kernel and Simpleminded To rank eachfeature its Laplacian Score is computed which is a measureof which extent the analyzed feature preserves the structurepresent in the graph divided by the variance of the feature Fora feature to be selected it must have a low LS which implieshigh variance and locality

Linear discriminant analysis (LDA) is one of the mostwell-known supervised techniques for linear dimensionalityreduction inmulticlass problems [25] LDA searches attemptsto maximize the linear separation between data pointsbelonging to different classes In contrast to most of theother dimensionality reduction techniques LDA as a featureextraction technique finds a linear mapping that maximizesthe linear class separation in the low-dimensional representa-tion of the data The criteria that are used to formulate linearclass separation in LDA are the within-class scatter and thebetween-class scatter

Another variant based on the Fisher coefficient is a featuredimensionality reduction approach called Fisher Score (FS)where the objective is to find a subset of features whichmaximize the Fisher coefficient [26] The main differencebetween LDA and Fisher Score is that LDA is a featureextraction approach and Fisher Score is a feature selectionapproach then instead of extracting new features fromthe subspace obtained FS ranks the features and selectthe reduced subspace of features selected which maximizesthe dimensionality reduction criterion in this case the Fishercriterion

The aforementioned linear feature reduction techniqueshave a different combination of objective (variance preserva-tion topology preservation and discrimination) andmethodof employment (unsupervised or supervised and extractionor selection) and have been widely used in the literature withsuccessful results [16 22] yet they have been applied under astatic framework where the set of features is reduced initiallyand no possible adaptation during the monitoring phase isconsidered

3 Methodology

Given that the data initially available is usually related to thehealthy condition of the machine under analysis condition-based monitoring schemes must be designed to overcometwo main challenges

(i) The identification of significant features to deal withthe characterization of the known conditions of the machineunder the consideration that the occurrence of additionalunknown faults must be detected

(ii) The adaptation of the condition-based monitoringscheme to update the considered data base of the machineonce unknown fault scenarios have been detected

Such requirements are addressed in this work by meansof the proposed adaptive novelty detection method shownin Figure 1 This methodology represents an important stepto the introduction of adaptive novelty detection schemes tothe development of electromechanical system diagnosis pro-cedures

The proposed method is composed of two stages anoffline stage and an online monitoring stageThemain objec-tives of the offline stage are first the analysis of the infor-mation available of the monitored machine to find a reducedset of numerical features to characterize the known machineconditions and second the design of the novelty model bymeans of the selection of the configuration parameters andtraining

Once a reduced set of characteristic features is obtainedand the novelty model is designed the online stage is carriedout During the online stage new measurements are contin-uously compared with the normality threshold 119879119899 definedduring the novelty model training in the offline stageThus ifa novel scenario is detected the supervision of an expert useris proposed in order to confirm and label the new conditionof the machine consequently the monitoring system isretrained to include the characteristics of the novel scenarioDetailed information of each stage and the retraining processis described in Sections 31ndash33

Shock and Vibration 5

Database Feature calculation

Feature reduction

Novelty model design

New measurement

Calculation of reducedfeatures

NoveltyUser

supervisionKnown

operation False alarm

Offl

ine

Onl

ine

RetrainingIncluding

information of new fault

No

Yes

Noveltymodel

Figure 1 Proposedmethodology for the novelty detection approachThemonitoringmethod is composed of an offline stage for initializationand retraining and an online stage for continuous monitoring

31 Offline Stage During the initialization it is assumed thatonly information of the machine operating under healthycondition is available in the database The first step is thecalculation of numerical features from the vibration mea-surements obtained during the machine operation Since theinformation of the possible faults of the monitored machineis not available yet during this initialization a generic set ofstatistical time-based numerical features is proposed to beextracted from each available vibration axis measurementThe proposed set of potential statistical time-frequency fea-tures is shown as follows

Max Value

119894 = max (119909) (9)

Root Mean Square (RMS)

RMS = radic 1119899 sdot119899sum119896=1

(119909119896)2 (10)

Crest Factor (CF)

CF = 119909119896RMS

(11)

Kurtosis

119896 = 119864 [(119909119896 minus 119909)4]1205904 (12)

Shape Factor

SF = RMS1119899 sdot sum119899119896=1 10038161003816100381610038161199091198961003816100381610038161003816 (13)

These features have been successfully employed for faultdetection in the last years [1 27]

The resulting number of numerical features is propor-tional to the number of available vibration axes collectedduring the acquisition However in order to allow the com-pression and visualization of the data a feature reductionmodule is implemented During such offline stage initializa-tion an unsupervised feature reduction approach must be

used a Laplacian Score ranking is proposed in this work asa good trade-off between simplicity and performance [24]to rank the features according to the topology preservationcapabilities The two or three first ranked features in terms ofLaplacian Score are selected

Next the novelty model is designedThere is a significantnumber of noveltymodels proposed in the literature [4] witheach one demonstrated to be a capable option under certaincircumstances An increasing amount of works implies thatdomain based novelty detection models present promisingresults [19 28] In this work a standard OC-SVMwith Gaus-sian kernel is used The design of the novelty model includesthe selection of the parameters for configuration and trainingemploying the known scenarios stored at the database Thenthe initialization of the offline stage finalizes with the designof the OC-SVM

32 Online Stage This stage continuously monitors thecondition of the machine to detect if an anomaly is presentTo accomplish this each of the new measurements of themachine acquired certain amount of predefined time Eachmeasurement is segmented and a set of features is calculatedfrom each segment The set of numerical features calculatedin this stage are reduced in the offline stage by means ofLaplacian Score

Thus each new measurement characterized by thenumerical features is analyzed by the novelty model In caseof no novelty detection it is assumed that the machine isworking under known conditions However if the analyzedmeasurement is detected as a novelty an alarm is triggeredin order to consider the user assessment Then if theoccurrence of a new scenario is confirmed the correspondingmeasurements are stored at the database and a retrainingprocedure is performed

33 Retraining Once the retraining is trigged the featurereduction and the novelty model design modules are modi-fied at the offline stage A diagram of the retraining procedureis presented in Figure 2

It must be noticed that two important contributiveaspects are proposed in this retraining approach first thereconsideration of the feature reductionmodule and secondthe incorporation of fault scenarios to the novelty model As

6 Shock and Vibration

New Fault Detected

Continuos Monitoring

Add Information to DDBB

No

Yes

Perform New Feature

Reduction

Novelty model redesign

A supervised method will be

used

Measurementsof the new fault

Data of all stored scenarios

included

Retr

aini

ng

New faultdetected

Continuousmonitoring

Add information to DDBB

Perform new feature

reduction

Novelty modelredesign

Figure 2 Proposed retraining approach First measurements char-acterizing the fault are stored and then the feature reductionmoduleand the novelty design module are modified to incorporate the newscenario encountered

it has been explained the feature reduction module duringits initialization is supported by an unsupervised LaplacianScore ranking due to the lack of additional scenarios yet oncethe information of a fault is available the feature reductionmodule could improve its performance by employing super-vised methods

Then the Laplacian approach is replaced by a FisherScore ranking approach in this work where the features aresorted according to the Fisher coefficient calculated fromeach feature It is important to mention that Laplacian Scorecan be configured towork also under a supervised frameworkand could obtain similar results than employing a Fisherapproach similarities between the two approaches are dis-cussed in [24] Nevertheless Fisher Score is used in thismethodology to search for a possible discriminative scenarioto increase the detection of novel scenarios A comparison ofresults obtained from both features reduction approaches ispresented in the results of this work

The consideration of a faulty scenario in the noveltymodel may contradict the principle of anomaly detectionwhere the objective is to detect healthy behaviors from therest Nevertheless the aim of an adaptive condition mon-itoring system should be to learnt from all the identifiedconditions to subsequently detect them if they are presentedagain by a fault detection module Indeed some workspresent a parallel structure for fault detection and noveltydetection modules [18 19] where the novelty module learnsthe known faults along with the healthy operation because itsobjective is to detect only new scenarios while the objectiveof the fault detection is to identify the condition of the

Gearbox

Load

VFD

DAS

Inductionmotor

AccelerometerEncoder

PC

z

yx

Figure 3 Test bench used for experimentation

machine including known faults This work is based on suchparallel structure approach where the fault scenariosmust betaken into consideration to be included in the novelty modeland then they could be identified by a complementary faultdetection module with a high confidence level

4 Experimental Results

41 Description of the Experimental Platform The test benchused for testing the kinematic chain with different faults andthe acquisition system used to capture the vibration signalsare shown in Figure 3 The test bench consists in a 1492-W-three-phase induction motor (WEG 00236ET3E145T-W22)with its rotational speed controlled through a variable fre-quency drive-VFD (WEG CFW08) the operating speed isfixed to 60Hz for all experiments It also consists of a 4 1 ratiogearbox (BALDORGCF4X01AA) and a DC generator (BAL-DOR CDP3604) which is used as the mechanical load com-prising around 20of the nominal loadThe vibration signalsare acquired using a triaxial accelerometer (LIS3L02AS4)mounted on a board with the signal conditioning andantialias filtering Two 12-bit 4-channel serial output sam-pling analog-to-digital converters (ADS7841) are used in thedata acquisition system (DAS) board Vibration signal fromthe perpendicular plane (119910-119911) to the motor shaft is acquiredusing a triaxial accelerometer LIS3L02AS4 mounted on aboard with the signal conditioning and antialiasing filteringSampling frequency is set to 3 kHz for vibration acquisitionThe data retrieved by the DAS is stored in a regular computer(PC)

Three scenarios are considered to verify the performanceof the proposed method the first one 119867 is the kinematicchain working under healthy condition and the other two 1198651and 1198652 represent the kinematic chain working under faultyconditions For 1198651 the motor is working with a half brokenbar and for 1198652 the motor is working with a fully brokenbar The detail of the failures is shown in Figure 4 The halfbroken bar failure is artificially produced by drilling a 6mmhole with a depth of 3mm that corresponds mostly to the

Shock and Vibration 7

(a) (b)

Figure 4 Detail of the faults produced in the test bench (a) corresponds to the 12 broken rotor bar and (b) to one broken rotor bar

22 of the section of the rotor bar and the full broken baris produced by a through-hole with a diameter of 6mm anda depth of 14mm which corresponds to the complete sectionof the rotor bar

The information stored from the kinematic chain consistsof an acquisition of 60 seconds of themachine working underthe three scenarios mentioned each acquisition is segmentedin 30 parts of 2 seconds and a set of features is calculatedfrom the 30 segmented measurements Since two axes aretaken into consideration a total of ten features are calculatedfrom each segmented acquisition of the machine workingunder the different scenarios mentioned The first step ofthe methodology is the offline stage where a reduced set offeatures is obtained and the novelty model is designed

42 Parameter Selection Regarding the Laplacian Score con-figuration a simple approach is followed for parameter tun-ing that is a value of 119896 = 3 is used for constructing the adja-cency graph and a ldquosimple mindedrdquo weighting approach isfollowed Since the proposed approach is compared to differ-ent feature reduction modules selecting generic parameterssettings is useful for the purpose of evaluation but ignores thatthere may be dependencies between the feature reductionmodel and the novelty model Regarding the design of thenovelty model the kernel used is the Gaussian and the valueof the configuration parameter is tuned tominimize the errorin the validation In all experiments 80 of the samplesare used for training and 20 for validation To train andadjust the parameter of the novelty model a fivefold cross-validation is used

43 Results and Discussions In order to highlight the contri-butions andmotivation of this work the outline of the resultsis presented as follows first a test is performed by a classicalapproach then the proposedmethodology is applied and theresults are compared The classical static approach impliesconserving the reduced features set obtained at the offline ini-tialization stage meanwhile the proposed dynamic approachimplies a possible reformulation of the reduced features setduring the retraining stage

Healthy

0

005

01

015

02

minus2 minus1minus3 1 2 30LS feature 1 RMS Y

minus3

minus2

minus1

0

1

2

3LS

feat

ure 2

kur

tosis

Y

Figure 5 Initial novelty model representation Limit of the noveltythreshold sdot sdot sdot and measurements used to train the model lowast

During the initialization stage the reduced set of featuresselected bymeans of the Laplacian Score ranking is composedof the RMS of the 119910-axis and the Kurtosis of the 119910-axisThe OC-SVM model is trained employing healthy conditiondata The resulting OC-SVM during training is shown inFigure 5 The marks lowast in the figure represent measurementsof the machine used to train the model in this case andin the subsequence figures only 1-fold of the fivefold cross-validation is displayedThe dotted line represents the noveltythreshold value all data inside the boundaries of the dottedline is considered normal The contour plot represents thenovelty score evaluation over different regions of the featuremap

Once the offline initialization stage is finished the onlinestage follows that is new measurements are obtained toassess the condition of the monitored machine To giverobustness to the novelty detection and avoid false alarms abatch consisting of 30 measurements is evaluated if 75 of

8 Shock and Vibration

0

005

01

015

02

minus4

minus3

minus2

minus1

0

1

2

3

4

5

60 2 4 8 10minus2minus4LS feature 1 RMS Y

KnownF1

LS fe

atur

e 2 k

urto

sis Y

Figure 6 Evaluation of the fault scenario 1198651 The novelty model istrained employing data from healthy operation condition

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

minus001

0

001

002

003

004

005

006

007

008

Known

LS fe

atur

e 2 k

urto

sis Y

Figure 7 Contour plot of the novelty model after including 1198651

the analyzed measurements are evaluated as novelty then thealarm is triggered Next the 1198651 scenario is presented to testthe performance of the model The plot of the scenario andthe novelty threshold obtained during training is presentedin Figure 6

As can be appreciated the new scenario lays outside thenovelty threshold so it is successfully detected as noveltyOnce a novel scenario is detected and identified as a fault bythe user it is incorporated to the database to consider it as partof the known scenarios and the novelty model is retrainedwithout changing the features to include this informationFigure 7 shows the feature space after the novelty modelis trained using healthy and 1198651 data as part of the knownscenarios

Once the model is retrained the third scenario 1198652 isintroduced The visual representation of the test is presentedin Figure 8 this scenario is not detected as novel because only50 of the measurements are labeled as novel by the model

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

KnownF2

LS fe

atur

e 2 k

urto

sis Y

Figure 8 Evaluation of the fault scenario 1198652 The novelty model istrained employing data from healthy and 1198651 scenarios

A similar result is obtained when the novelty model istrained using healthy and 1198652 data and is tested with data of1198651 A summary of the results of novelty detectionmaintainingthe same features obtained during initialization is shown inTable 1

As it can be seen in Table 1 using the reduced set obtainedduring initialization is easy to detect the novel scenarioswhen there is only information of the healthy condition butwhen a novel scenario is included in the database and thenoveltymodel is retrained the reduced set of features initiallyobtained does not necessarily provide a good representationto detect a new scenario during test

To improve these results the methodology presented inthis work proposes to evaluate again the feature reductionmodule each time a retraining is applied Following theoutline presented for results and parting from the first retrainwhere the scenario 1198651 is included in the database the featurereduction module is applied again but this time includinginformation from healthy condition and 1198651 scenario Sincetwo scenarios are taken into consideration and the labels areknown a supervised approach can be applied in this case aFisher Score ranking for feature reduction is employed Thenovelty model using the initial set of features and the noveltymodel after retraining with the new reduced set of featuresare shown in Figure 9

The new reduced set of features is composed of the RMSof the 119910-axis and the Kurtosis of the 119911-axis as can be appre-ciated the new set of features present a more discriminativedistribution of the three scenarios considered At the laststep (Figure 9(d)) when 1198652 scenario is included the FisherScore still ranked the same features of the last retraining(Figure 9(b)) as the highest yet it would be possible thata different set of features would be obtained The resultsobtained demonstrate the advantages of including the FisherScore reduction module to the retraining procedure the newdistribution of the scenarios avoided an overlapping of the

Shock and Vibration 9

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

0

005

01

015

02

KnownF1

LS fe

atur

e 2 k

urto

sis Y

(a)

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus001

0

001

002

003

004

005

006

007

008

Known

FS fe

atur

e 2 k

urto

sis Z

(b)

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

FS fe

atur

e 2 k

urto

sis Z

minus5

minus4

minus3

minus2

minus1

0

1

2

3

KnownF2

(c)

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus0005

0

0005

001

0015

002

0025

003

0035

Known

FS fe

atur

e 2 k

urto

sis Z

(d)

Figure 9 Process of evaluation and retraining employing the methodology proposed (a) Evaluation of the fault scenario 1198651 (b) Retrainingof the novelty model and reformulation of the reduced set of features including 1198651 (c) Evaluation of the fault scenario 1198652 (d) Retraining ofthe novelty model and reformulation of the reduced set of features including 1198652Table 1 Performance of the novelty detection using only healthy class data to reduce the number of features where DR stands fordimensionality reduction Different scenarios are included according to the information available to train and test the novelty model

DRPerformance using OC-SVM ()

Known Test Known Test Known Test119867 1198651 + 1198652 119867 + 1198651 1198652 119867 + 1198652 1198651LS 954 (plusmn11) 576 (plusmn67) 435 (plusmn64)

new scenario under test therefore the noveltymodel success-fully detected as novelty all measurements corresponding tothe 1198652 scenario

It is worth mentioning that if initially the scenario 1198652is used for training and the scenario 1198651 is used for testingthe new set of features obtained could be different from theaforementionedThe results achieved fromboth scenarios are

shown in Table 2 which also includes a comparison of theresults obtained employing PCA LDA and Laplacian Scoredimensionality reduction techniques in the retraining stepinstead of the Fisher Score proposed

Regarding the classical feature extraction techniquesPCA and LDA the test scenario is not identified as novelin both cases The Fisher Score and the Laplacian Score

10 Shock and Vibration

Table 2 Performance of the novelty detection employing a reduc-tion of features during retraining Different scenarios are includedaccording to the information available to train and test the noveltymodel

DRPerformance using OC-SVM ()

Known Test Known Test119867 + 1198651 1198652 119867 + 1198652 1198651PCA 590 (plusmn52) 642 (plusmn51)LDA 474 (plusmn61) 512 (plusmn61)LS 843 (plusmn24) 100 (plusmn00)Fisher S 100 (plusmn00) 100 (plusmn00)successfully identified the new scenarios as novel since thepercentage obtained in both cases is higher than the 75 pre-defined threshold in the methodology to activate the alarmBoth techniques achieved high scores but still the FisherScore provided a more appropriate selection of features

As mentioned in Section 2 LDA is a feature extractiontechnique based on the Fisher discriminant coefficient sosimilar results between Fisher Score ranking and LDA areexpected however employing Fisher Score ranking scoreachieved a better result this can be caused because the testconsists of novel scenarios and LDA finds the directions onthe feature space specialized for the two supervised scenariosemployed during training meanwhile Fisher Score rankingprovides a more general approach by selecting features

To test the robustness of the Laplacian Score and FisherScore approaches a comparative test is performed where theset of features is increased from 10 to 15 and by varyingthe number of reduced features selected The five featuresincluded in the original set are obtained from the119909-axis of theaccelerometer monitored where the features calculated arepresented in the aforementioned statistical time-frequencyfeatures these features were discarded initially because theyare not part of the perpendicular plane of the motor anddo not contribute significantly to the monitoring in factincluding these features could affect the performance of thefeature reduction and novelty detection modules The resultsare presented in Table 3

The features obtained from the Fisher Score still present abetter distribution to detect the new scenariosThe LaplacianScore performance is affected when irrelevant features areincluded in the feature set but increases when more dimen-sions are taken into consideration Since the objective of theFisher Score is not topology preservation contrary to LS itis capable of discarding all the irrelevant features that wereincluded and the performance is not affected

5 Conclusions

This work proposes an adaptive novelty detection methodol-ogy based on vibration analysis for the condition monitoringand diagnosis of the components of a kinematic chain Themethodology is based on the acquisition of vibration signalsthat are generated in the kinematic chain along with anadequate signal processing to extract features to characterizethe components and an adaptive novelty detection model to

Table 3 Performance of the novelty detection increasing the num-ber of initial features from 10 to 15 and varying the number of thereduced set of features

DRPerformance varying the number of features

reduced ()2 3 4

LS 590 (plusmn41) 643 (plusmn33) 886 (plusmn25)Fisher S 100 (plusmn00) 100 (plusmn00) 100 (plusmn00)detect anomalies The method is composed of two sequentialstages an offline stage to initialize and retrain the modulesand an online stage to continuously assess the conditionof the machine During initialization the model is trainedemploying only information from the machine workingunder healthy condition and two additional faulty scenariosare introduced to test the performance of the method underunknown operations

The adaptive novelty detection approach successfullydetected both novel scenarios and the model incorporatedthe information to avoid generating alarms if the same faultis detected A comparison between the proposed methodand classical dimensionality reduction approaches highlightsthe limitations of maintaining a static set of features duringmonitoring instead of reformulating the feature reductionmodule once new information is available In this particularstudy employing features reduced by Laplacian Score andFisher Score obtained similar results nevertheless it doesnot imply that a similar outcome will be present during theanalysis of other faults Fisher Score is encouraged to beemployed in this methodology rather than maintaining theLaplacian Score approach due to the similarity between themethod objective and the objective function of Fisher Scorein both the ideal case is to find the features to maximizethe distance between scenarios while maintaining compactclusters A specific comparison of performances between LSand Fisher Score was also included in which the Fisher Scoreobtained better results when irrelevant features are includedin the original set of features and when the dimensionalityof the reduced set is increased this highlights the advantagesand robustness of the feature selection approach by FisherScore compared to the LS

From an industrial perspective two aspects should bementioned First dealing with an identical operating sce-nario (ie healthy) the variation of the working conditions(speed and torque patterns) will affect the resulting valueof the estimated numerical features and consequently itsrepresentation in the feature space used for novelty modeldesign Thus the introduction of the working condition asan additional degree of freedom in the novelty detectionscheme implies the increase of the dimensional space thatthe novelty detectionmodel is managingThis fact implies anincrease of the risk of data overlapping (different scenariosunder different operating conditions showing similar featurevalues) and also an increase of complexity for boundariesdefinition during the novelty modelling (data spread) Thusthe proposed method is not limited to stationary conditionsbut to the repetition of the pattern of working condition used

Shock and Vibration 11

to train the model Indeed in practical industrial applicationwhere cyclostationary processes take place the proposedmethod should be executed during the same period (the sameworking conditions) in order to reduce data variability andincrease novelty performance In case of multiple workingconditions consideration parallel novelty model structurescould be generated in order to activate each one dependingon the corresponding working condition

Second the integration of additional physicalmagnitudesas stator currents temperatures and others represents atrendy approach in electromechanical fault diagnosis inorder to improve fault diagnosis resolution and perfor-mance The corresponding data fusion scheme implies ahigh-dimensional feature space that is multiple numericalfeatures estimated from multiple physical magnitudes con-sidered In this point it must be noted that the presentedmethodology proposes the reevaluation of the feature reduc-tionmodule each time a retraining is appliedThus the largerthe available pool of features is the higher the potentiality ofthe proposedmethod to increase the performance during thedata characterization is

Also the proposed method can be extended andimproved for further development this improvement couldinclude a diagnosis method to not only detect anomalies inthe kinematic chain but also identify the fault causing theabnormal behavior In this sense future work will include aspecific comparison of broken bar fault detection capabilitiesbetween the proposedmethodology and classical approaches

Competing Interests

The authors declare that they have no competing interests

Acknowledgments

This research was partially supported by CONACyT schol-arships Grants 278033 and 313604 and also by the SpanishMinistry of Education Culture and Sport under GrantFPU1300589 The authors would also like to thank a partialfinancial support by the Generalitat de Catalunya (GRCMCIA Grant no SGR 2014-101)

References

[1] H Henao G-A Capolino M Fernandez-Cabanas et alldquoTrends in fault diagnosis for electrical machines a review ofdiagnostic techniquesrdquo IEEE Industrial Electronics Magazinevol 8 no 2 pp 31ndash42 2014

[2] N Bayar S Darmoul S Hajri-Gabouj and H Pierreval ldquoFaultdetection diagnosis and recovery using Artificial Immune Sys-tems a reviewrdquoEngineeringApplications of Artificial Intelligencevol 46 pp 43ndash57 2015

[3] SWang JWan D Zhang D Li and C Zhang ldquoTowards smartfactory for Industry 40 a self-organized multi-agent systemwith big data based feedback and coordinationrdquo ComputerNetworks vol 101 pp 158ndash168 2016

[4] M A F Pimentel D A Clifton L Clifton and L TarassenkoldquoA review of novelty detectionrdquo Signal Processing vol 99 pp215ndash249 2014

[5] R Langone O Mauricio Agudelo B De Moor and J AK Suykens ldquoIncremental kernel spectral clustering for online

learning of non-stationary datardquo Neurocomputing vol 139 pp246ndash260 2014

[6] M-H Chen P-C Chang and J-L Wu ldquoA population-basedincremental learning approach with artificial immune systemfor network intrusion detectionrdquo Engineering Applications ofArtificial Intelligence vol 51 pp 171ndash181 2016

[7] S Khomfoi and L M Tolbert ldquoFault diagnosis and reconfigu-ration for multilevel inverter drive using AI-based techniquesrdquoIEEE Transactions on Industrial Electronics vol 54 no 6 pp2954ndash2968 2007

[8] A Bellini F Filippetti C Tassoni and G-A CapolinoldquoAdvances in diagnostic techniques for induction machinesrdquoIEEE Transactions on Industrial Electronics vol 55 no 12 pp4109ndash4126 2008

[9] V T Tran and B-S Yang ldquoAn intelligent condition-basedmaintenance platform for rotating machineryrdquo Expert Systemswith Applications vol 39 no 3 pp 2977ndash2988 2012

[10] P A Delgado-Arredondo A Garcia-Perez D Morinigo-Soteloet al ldquoComparative study of time-frequency decompositiontechniques for fault detection in induction motors using vibra-tion analysis during startup transientrdquo Shock and Vibration vol2015 Article ID 708034 14 pages 2015

[11] M D Prieto G Cirrincione A G Espinosa J A Ortegaand H Henao ldquoBearing fault detection by a novel condition-monitoring scheme based on statistical-time features and neu-ral networksrdquo IEEE Transactions on Industrial Electronics vol60 no 8 pp 3398ndash3407 2013

[12] P Perner ldquoConcepts for novelty detection and handling basedon a case-based reasoning process schemerdquo Engineering Appli-cations of Artificial Intelligence vol 22 no 1 pp 86ndash91 2009

[13] C Englund and A Verikas ldquoA hybrid approach to outlier detec-tion in the offset lithographic printing processrdquo EngineeringApplications of Artificial Intelligence vol 18 no 6 pp 759ndash7682005

[14] H Garces and D Sbarbaro ldquoOutliers detection in environmen-tal monitoring databasesrdquo Engineering Applications of ArtificialIntelligence vol 24 no 2 pp 341ndash349 2011

[15] S Ma S M Li and Y P Xiong ldquoUncertainty reduced noveltydetection approach applied to rotatingmachinery for conditionmonitoringrdquo Shock and Vibration vol 2015 Article ID 73721310 pages 2015

[16] J McBain and M Timusk ldquoFeature extraction for noveltydetection as applied to fault detection in machineryrdquo PatternRecognition Letters vol 32 no 7 pp 1054ndash1061 2011

[17] S Wang J Yub E Lapirac and J Lee ldquoA modified supportvector data description based novelty detection approach formachinery componentsrdquo Applied Soft Computing vol 13 no 2pp 1193ndash1205 2013

[18] D P Filev R B Chinnam F Tseng and P Baruah ldquoAn indus-trial strength novelty detection framework for autonomousequipment monitoring and diagnosticsrdquo IEEE Transactions onIndustrial Informatics vol 6 no 4 pp 767ndash779 2010

[19] B S J Costa P P Angelov and L A Guedes ldquoFully unsu-pervised fault detection and identification based on recursivedensity estimation and self-evolving cloud-based classifierrdquoNeurocomputing vol 150 pp 289ndash303 2015

[20] B Scholkopf R C Williamson A J Smola J Shawe-Taylorand J C Platt ldquoSupport vector method for novelty detectionrdquoin Neural Information Processing Systems vol 12 pp 582ndash5881999

12 Shock and Vibration

[21] D M J Tax and R P W Duin ldquoSupport vector domaindescriptionrdquo Pattern Recognition Letters vol 20 no 11ndash13 pp1191ndash1199 1999

[22] S D Villalba and P Cunningham ldquoAn evaluation of dimensionreduction techniques for one-class classificationrdquo ArtificialIntelligence Review vol 27 no 4 pp 273ndash294 2007

[23] I T Jolliffe Principal Component Analysis Springer New YorkNY USA 2nd edition 1986

[24] X He D Cai and P Niyogi ldquoLaplacian score for featureselectionrdquo inAdvances in Neural Information Processing Systems18 pp 507ndash514 MIT Press 2005

[25] A J Izenman ldquoLinear discriminant analysisrdquo in Modern Mul-tivariate Statistical Techniques A J Izenman Ed pp 237ndash280Springer New York NY USA 2008

[26] Q Gu Z Li and J Han ldquoGeneralized fisher score for featureselectionrdquo httpsarxivorgabs12023725

[27] Z Gao C Cecati and S X Ding ldquoA survey of fault diagnosisand fault-tolerant techniquesmdashpart II fault diagnosis withknowledge-based and hybridactive approachesrdquo IEEE Trans-actions on Industrial Electronics vol 62 no 6 pp 3768ndash37742015

[28] A E Lazzaretti D M J Tax H Vieira Neto and V HFerreira ldquoNovelty detection and multi-class classification inpower distribution voltage waveformsrdquo Expert Systems withApplications vol 45 pp 322ndash330 2016

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Page 4: Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

4 Shock and Vibration

Tax and Duin proposed another form of OC-SVMwhichis called Support Vector Data Description [21]The basic ideaof SVDD is to construct aminimum-volume hypersphere in ahigh-dimensional feature space to enclose as much as normaldata points Both of these two forms of SVM have equivalentsolution if the diagonal entries of kernel matrix 119867 are equalto a constant

22 Dimensionality Reduction Techniques Working withhigh-dimensional data sets could complicate the learningpart of novelty detection methods not only because of possi-ble presence of noise and redundancy in the data but for otherreasons as well [22] The empty space phenomenon statesthat to cover the whole space a number of samples that growexponentially with dimensionality are needed The curse ofdimensionality implies that in order to learn successfully anumber of training examples that also grow exponentiallywith the dimensionality are needed The ldquoconcentration ofmeasurerdquo phenomenon seems to render distance measuresnot relevant to whatever concept is to be learnt as thedimension of the data increased For these reasons amongothers there is a necessity to apply dimensionality reductiontechniques in novelty detection applications Dimensionalityreduction strategies differ in the question of whether thelearning process is supervised or unsupervised The differ-ence between both learning processes is the availability oflabels to distinguish the different classes

Principal Component Analysis (PCA) is one of the mostcommonly used techniques for unsupervised dimensionalityreduction [23] It aims to find the linear projections that bestcapture the variability of the data

Another well-known dimensionality reduction techniqueis the Laplacian Score (LS) where the merit of each feature ismeasured according to its locality preservation power [24]A nearest neighbor based graph is constructed from thetraining set and analyzed to rank each feature individuallyaccording to a weighting approach selected for the graphedges If the labels of the classes of the analyzed data set aregiven the method changes to a supervised approach and thegraph is created based on these labels instead of the nearestneighbor approach Two variations for weighting the edgescan be applied Heat Kernel and Simpleminded To rank eachfeature its Laplacian Score is computed which is a measureof which extent the analyzed feature preserves the structurepresent in the graph divided by the variance of the feature Fora feature to be selected it must have a low LS which implieshigh variance and locality

Linear discriminant analysis (LDA) is one of the mostwell-known supervised techniques for linear dimensionalityreduction inmulticlass problems [25] LDA searches attemptsto maximize the linear separation between data pointsbelonging to different classes In contrast to most of theother dimensionality reduction techniques LDA as a featureextraction technique finds a linear mapping that maximizesthe linear class separation in the low-dimensional representa-tion of the data The criteria that are used to formulate linearclass separation in LDA are the within-class scatter and thebetween-class scatter

Another variant based on the Fisher coefficient is a featuredimensionality reduction approach called Fisher Score (FS)where the objective is to find a subset of features whichmaximize the Fisher coefficient [26] The main differencebetween LDA and Fisher Score is that LDA is a featureextraction approach and Fisher Score is a feature selectionapproach then instead of extracting new features fromthe subspace obtained FS ranks the features and selectthe reduced subspace of features selected which maximizesthe dimensionality reduction criterion in this case the Fishercriterion

The aforementioned linear feature reduction techniqueshave a different combination of objective (variance preserva-tion topology preservation and discrimination) andmethodof employment (unsupervised or supervised and extractionor selection) and have been widely used in the literature withsuccessful results [16 22] yet they have been applied under astatic framework where the set of features is reduced initiallyand no possible adaptation during the monitoring phase isconsidered

3 Methodology

Given that the data initially available is usually related to thehealthy condition of the machine under analysis condition-based monitoring schemes must be designed to overcometwo main challenges

(i) The identification of significant features to deal withthe characterization of the known conditions of the machineunder the consideration that the occurrence of additionalunknown faults must be detected

(ii) The adaptation of the condition-based monitoringscheme to update the considered data base of the machineonce unknown fault scenarios have been detected

Such requirements are addressed in this work by meansof the proposed adaptive novelty detection method shownin Figure 1 This methodology represents an important stepto the introduction of adaptive novelty detection schemes tothe development of electromechanical system diagnosis pro-cedures

The proposed method is composed of two stages anoffline stage and an online monitoring stageThemain objec-tives of the offline stage are first the analysis of the infor-mation available of the monitored machine to find a reducedset of numerical features to characterize the known machineconditions and second the design of the novelty model bymeans of the selection of the configuration parameters andtraining

Once a reduced set of characteristic features is obtainedand the novelty model is designed the online stage is carriedout During the online stage new measurements are contin-uously compared with the normality threshold 119879119899 definedduring the novelty model training in the offline stageThus ifa novel scenario is detected the supervision of an expert useris proposed in order to confirm and label the new conditionof the machine consequently the monitoring system isretrained to include the characteristics of the novel scenarioDetailed information of each stage and the retraining processis described in Sections 31ndash33

Shock and Vibration 5

Database Feature calculation

Feature reduction

Novelty model design

New measurement

Calculation of reducedfeatures

NoveltyUser

supervisionKnown

operation False alarm

Offl

ine

Onl

ine

RetrainingIncluding

information of new fault

No

Yes

Noveltymodel

Figure 1 Proposedmethodology for the novelty detection approachThemonitoringmethod is composed of an offline stage for initializationand retraining and an online stage for continuous monitoring

31 Offline Stage During the initialization it is assumed thatonly information of the machine operating under healthycondition is available in the database The first step is thecalculation of numerical features from the vibration mea-surements obtained during the machine operation Since theinformation of the possible faults of the monitored machineis not available yet during this initialization a generic set ofstatistical time-based numerical features is proposed to beextracted from each available vibration axis measurementThe proposed set of potential statistical time-frequency fea-tures is shown as follows

Max Value

119894 = max (119909) (9)

Root Mean Square (RMS)

RMS = radic 1119899 sdot119899sum119896=1

(119909119896)2 (10)

Crest Factor (CF)

CF = 119909119896RMS

(11)

Kurtosis

119896 = 119864 [(119909119896 minus 119909)4]1205904 (12)

Shape Factor

SF = RMS1119899 sdot sum119899119896=1 10038161003816100381610038161199091198961003816100381610038161003816 (13)

These features have been successfully employed for faultdetection in the last years [1 27]

The resulting number of numerical features is propor-tional to the number of available vibration axes collectedduring the acquisition However in order to allow the com-pression and visualization of the data a feature reductionmodule is implemented During such offline stage initializa-tion an unsupervised feature reduction approach must be

used a Laplacian Score ranking is proposed in this work asa good trade-off between simplicity and performance [24]to rank the features according to the topology preservationcapabilities The two or three first ranked features in terms ofLaplacian Score are selected

Next the novelty model is designedThere is a significantnumber of noveltymodels proposed in the literature [4] witheach one demonstrated to be a capable option under certaincircumstances An increasing amount of works implies thatdomain based novelty detection models present promisingresults [19 28] In this work a standard OC-SVMwith Gaus-sian kernel is used The design of the novelty model includesthe selection of the parameters for configuration and trainingemploying the known scenarios stored at the database Thenthe initialization of the offline stage finalizes with the designof the OC-SVM

32 Online Stage This stage continuously monitors thecondition of the machine to detect if an anomaly is presentTo accomplish this each of the new measurements of themachine acquired certain amount of predefined time Eachmeasurement is segmented and a set of features is calculatedfrom each segment The set of numerical features calculatedin this stage are reduced in the offline stage by means ofLaplacian Score

Thus each new measurement characterized by thenumerical features is analyzed by the novelty model In caseof no novelty detection it is assumed that the machine isworking under known conditions However if the analyzedmeasurement is detected as a novelty an alarm is triggeredin order to consider the user assessment Then if theoccurrence of a new scenario is confirmed the correspondingmeasurements are stored at the database and a retrainingprocedure is performed

33 Retraining Once the retraining is trigged the featurereduction and the novelty model design modules are modi-fied at the offline stage A diagram of the retraining procedureis presented in Figure 2

It must be noticed that two important contributiveaspects are proposed in this retraining approach first thereconsideration of the feature reductionmodule and secondthe incorporation of fault scenarios to the novelty model As

6 Shock and Vibration

New Fault Detected

Continuos Monitoring

Add Information to DDBB

No

Yes

Perform New Feature

Reduction

Novelty model redesign

A supervised method will be

used

Measurementsof the new fault

Data of all stored scenarios

included

Retr

aini

ng

New faultdetected

Continuousmonitoring

Add information to DDBB

Perform new feature

reduction

Novelty modelredesign

Figure 2 Proposed retraining approach First measurements char-acterizing the fault are stored and then the feature reductionmoduleand the novelty design module are modified to incorporate the newscenario encountered

it has been explained the feature reduction module duringits initialization is supported by an unsupervised LaplacianScore ranking due to the lack of additional scenarios yet oncethe information of a fault is available the feature reductionmodule could improve its performance by employing super-vised methods

Then the Laplacian approach is replaced by a FisherScore ranking approach in this work where the features aresorted according to the Fisher coefficient calculated fromeach feature It is important to mention that Laplacian Scorecan be configured towork also under a supervised frameworkand could obtain similar results than employing a Fisherapproach similarities between the two approaches are dis-cussed in [24] Nevertheless Fisher Score is used in thismethodology to search for a possible discriminative scenarioto increase the detection of novel scenarios A comparison ofresults obtained from both features reduction approaches ispresented in the results of this work

The consideration of a faulty scenario in the noveltymodel may contradict the principle of anomaly detectionwhere the objective is to detect healthy behaviors from therest Nevertheless the aim of an adaptive condition mon-itoring system should be to learnt from all the identifiedconditions to subsequently detect them if they are presentedagain by a fault detection module Indeed some workspresent a parallel structure for fault detection and noveltydetection modules [18 19] where the novelty module learnsthe known faults along with the healthy operation because itsobjective is to detect only new scenarios while the objectiveof the fault detection is to identify the condition of the

Gearbox

Load

VFD

DAS

Inductionmotor

AccelerometerEncoder

PC

z

yx

Figure 3 Test bench used for experimentation

machine including known faults This work is based on suchparallel structure approach where the fault scenariosmust betaken into consideration to be included in the novelty modeland then they could be identified by a complementary faultdetection module with a high confidence level

4 Experimental Results

41 Description of the Experimental Platform The test benchused for testing the kinematic chain with different faults andthe acquisition system used to capture the vibration signalsare shown in Figure 3 The test bench consists in a 1492-W-three-phase induction motor (WEG 00236ET3E145T-W22)with its rotational speed controlled through a variable fre-quency drive-VFD (WEG CFW08) the operating speed isfixed to 60Hz for all experiments It also consists of a 4 1 ratiogearbox (BALDORGCF4X01AA) and a DC generator (BAL-DOR CDP3604) which is used as the mechanical load com-prising around 20of the nominal loadThe vibration signalsare acquired using a triaxial accelerometer (LIS3L02AS4)mounted on a board with the signal conditioning andantialias filtering Two 12-bit 4-channel serial output sam-pling analog-to-digital converters (ADS7841) are used in thedata acquisition system (DAS) board Vibration signal fromthe perpendicular plane (119910-119911) to the motor shaft is acquiredusing a triaxial accelerometer LIS3L02AS4 mounted on aboard with the signal conditioning and antialiasing filteringSampling frequency is set to 3 kHz for vibration acquisitionThe data retrieved by the DAS is stored in a regular computer(PC)

Three scenarios are considered to verify the performanceof the proposed method the first one 119867 is the kinematicchain working under healthy condition and the other two 1198651and 1198652 represent the kinematic chain working under faultyconditions For 1198651 the motor is working with a half brokenbar and for 1198652 the motor is working with a fully brokenbar The detail of the failures is shown in Figure 4 The halfbroken bar failure is artificially produced by drilling a 6mmhole with a depth of 3mm that corresponds mostly to the

Shock and Vibration 7

(a) (b)

Figure 4 Detail of the faults produced in the test bench (a) corresponds to the 12 broken rotor bar and (b) to one broken rotor bar

22 of the section of the rotor bar and the full broken baris produced by a through-hole with a diameter of 6mm anda depth of 14mm which corresponds to the complete sectionof the rotor bar

The information stored from the kinematic chain consistsof an acquisition of 60 seconds of themachine working underthe three scenarios mentioned each acquisition is segmentedin 30 parts of 2 seconds and a set of features is calculatedfrom the 30 segmented measurements Since two axes aretaken into consideration a total of ten features are calculatedfrom each segmented acquisition of the machine workingunder the different scenarios mentioned The first step ofthe methodology is the offline stage where a reduced set offeatures is obtained and the novelty model is designed

42 Parameter Selection Regarding the Laplacian Score con-figuration a simple approach is followed for parameter tun-ing that is a value of 119896 = 3 is used for constructing the adja-cency graph and a ldquosimple mindedrdquo weighting approach isfollowed Since the proposed approach is compared to differ-ent feature reduction modules selecting generic parameterssettings is useful for the purpose of evaluation but ignores thatthere may be dependencies between the feature reductionmodel and the novelty model Regarding the design of thenovelty model the kernel used is the Gaussian and the valueof the configuration parameter is tuned tominimize the errorin the validation In all experiments 80 of the samplesare used for training and 20 for validation To train andadjust the parameter of the novelty model a fivefold cross-validation is used

43 Results and Discussions In order to highlight the contri-butions andmotivation of this work the outline of the resultsis presented as follows first a test is performed by a classicalapproach then the proposedmethodology is applied and theresults are compared The classical static approach impliesconserving the reduced features set obtained at the offline ini-tialization stage meanwhile the proposed dynamic approachimplies a possible reformulation of the reduced features setduring the retraining stage

Healthy

0

005

01

015

02

minus2 minus1minus3 1 2 30LS feature 1 RMS Y

minus3

minus2

minus1

0

1

2

3LS

feat

ure 2

kur

tosis

Y

Figure 5 Initial novelty model representation Limit of the noveltythreshold sdot sdot sdot and measurements used to train the model lowast

During the initialization stage the reduced set of featuresselected bymeans of the Laplacian Score ranking is composedof the RMS of the 119910-axis and the Kurtosis of the 119910-axisThe OC-SVM model is trained employing healthy conditiondata The resulting OC-SVM during training is shown inFigure 5 The marks lowast in the figure represent measurementsof the machine used to train the model in this case andin the subsequence figures only 1-fold of the fivefold cross-validation is displayedThe dotted line represents the noveltythreshold value all data inside the boundaries of the dottedline is considered normal The contour plot represents thenovelty score evaluation over different regions of the featuremap

Once the offline initialization stage is finished the onlinestage follows that is new measurements are obtained toassess the condition of the monitored machine To giverobustness to the novelty detection and avoid false alarms abatch consisting of 30 measurements is evaluated if 75 of

8 Shock and Vibration

0

005

01

015

02

minus4

minus3

minus2

minus1

0

1

2

3

4

5

60 2 4 8 10minus2minus4LS feature 1 RMS Y

KnownF1

LS fe

atur

e 2 k

urto

sis Y

Figure 6 Evaluation of the fault scenario 1198651 The novelty model istrained employing data from healthy operation condition

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

minus001

0

001

002

003

004

005

006

007

008

Known

LS fe

atur

e 2 k

urto

sis Y

Figure 7 Contour plot of the novelty model after including 1198651

the analyzed measurements are evaluated as novelty then thealarm is triggered Next the 1198651 scenario is presented to testthe performance of the model The plot of the scenario andthe novelty threshold obtained during training is presentedin Figure 6

As can be appreciated the new scenario lays outside thenovelty threshold so it is successfully detected as noveltyOnce a novel scenario is detected and identified as a fault bythe user it is incorporated to the database to consider it as partof the known scenarios and the novelty model is retrainedwithout changing the features to include this informationFigure 7 shows the feature space after the novelty modelis trained using healthy and 1198651 data as part of the knownscenarios

Once the model is retrained the third scenario 1198652 isintroduced The visual representation of the test is presentedin Figure 8 this scenario is not detected as novel because only50 of the measurements are labeled as novel by the model

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

KnownF2

LS fe

atur

e 2 k

urto

sis Y

Figure 8 Evaluation of the fault scenario 1198652 The novelty model istrained employing data from healthy and 1198651 scenarios

A similar result is obtained when the novelty model istrained using healthy and 1198652 data and is tested with data of1198651 A summary of the results of novelty detectionmaintainingthe same features obtained during initialization is shown inTable 1

As it can be seen in Table 1 using the reduced set obtainedduring initialization is easy to detect the novel scenarioswhen there is only information of the healthy condition butwhen a novel scenario is included in the database and thenoveltymodel is retrained the reduced set of features initiallyobtained does not necessarily provide a good representationto detect a new scenario during test

To improve these results the methodology presented inthis work proposes to evaluate again the feature reductionmodule each time a retraining is applied Following theoutline presented for results and parting from the first retrainwhere the scenario 1198651 is included in the database the featurereduction module is applied again but this time includinginformation from healthy condition and 1198651 scenario Sincetwo scenarios are taken into consideration and the labels areknown a supervised approach can be applied in this case aFisher Score ranking for feature reduction is employed Thenovelty model using the initial set of features and the noveltymodel after retraining with the new reduced set of featuresare shown in Figure 9

The new reduced set of features is composed of the RMSof the 119910-axis and the Kurtosis of the 119911-axis as can be appre-ciated the new set of features present a more discriminativedistribution of the three scenarios considered At the laststep (Figure 9(d)) when 1198652 scenario is included the FisherScore still ranked the same features of the last retraining(Figure 9(b)) as the highest yet it would be possible thata different set of features would be obtained The resultsobtained demonstrate the advantages of including the FisherScore reduction module to the retraining procedure the newdistribution of the scenarios avoided an overlapping of the

Shock and Vibration 9

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

0

005

01

015

02

KnownF1

LS fe

atur

e 2 k

urto

sis Y

(a)

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus001

0

001

002

003

004

005

006

007

008

Known

FS fe

atur

e 2 k

urto

sis Z

(b)

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

FS fe

atur

e 2 k

urto

sis Z

minus5

minus4

minus3

minus2

minus1

0

1

2

3

KnownF2

(c)

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus0005

0

0005

001

0015

002

0025

003

0035

Known

FS fe

atur

e 2 k

urto

sis Z

(d)

Figure 9 Process of evaluation and retraining employing the methodology proposed (a) Evaluation of the fault scenario 1198651 (b) Retrainingof the novelty model and reformulation of the reduced set of features including 1198651 (c) Evaluation of the fault scenario 1198652 (d) Retraining ofthe novelty model and reformulation of the reduced set of features including 1198652Table 1 Performance of the novelty detection using only healthy class data to reduce the number of features where DR stands fordimensionality reduction Different scenarios are included according to the information available to train and test the novelty model

DRPerformance using OC-SVM ()

Known Test Known Test Known Test119867 1198651 + 1198652 119867 + 1198651 1198652 119867 + 1198652 1198651LS 954 (plusmn11) 576 (plusmn67) 435 (plusmn64)

new scenario under test therefore the noveltymodel success-fully detected as novelty all measurements corresponding tothe 1198652 scenario

It is worth mentioning that if initially the scenario 1198652is used for training and the scenario 1198651 is used for testingthe new set of features obtained could be different from theaforementionedThe results achieved fromboth scenarios are

shown in Table 2 which also includes a comparison of theresults obtained employing PCA LDA and Laplacian Scoredimensionality reduction techniques in the retraining stepinstead of the Fisher Score proposed

Regarding the classical feature extraction techniquesPCA and LDA the test scenario is not identified as novelin both cases The Fisher Score and the Laplacian Score

10 Shock and Vibration

Table 2 Performance of the novelty detection employing a reduc-tion of features during retraining Different scenarios are includedaccording to the information available to train and test the noveltymodel

DRPerformance using OC-SVM ()

Known Test Known Test119867 + 1198651 1198652 119867 + 1198652 1198651PCA 590 (plusmn52) 642 (plusmn51)LDA 474 (plusmn61) 512 (plusmn61)LS 843 (plusmn24) 100 (plusmn00)Fisher S 100 (plusmn00) 100 (plusmn00)successfully identified the new scenarios as novel since thepercentage obtained in both cases is higher than the 75 pre-defined threshold in the methodology to activate the alarmBoth techniques achieved high scores but still the FisherScore provided a more appropriate selection of features

As mentioned in Section 2 LDA is a feature extractiontechnique based on the Fisher discriminant coefficient sosimilar results between Fisher Score ranking and LDA areexpected however employing Fisher Score ranking scoreachieved a better result this can be caused because the testconsists of novel scenarios and LDA finds the directions onthe feature space specialized for the two supervised scenariosemployed during training meanwhile Fisher Score rankingprovides a more general approach by selecting features

To test the robustness of the Laplacian Score and FisherScore approaches a comparative test is performed where theset of features is increased from 10 to 15 and by varyingthe number of reduced features selected The five featuresincluded in the original set are obtained from the119909-axis of theaccelerometer monitored where the features calculated arepresented in the aforementioned statistical time-frequencyfeatures these features were discarded initially because theyare not part of the perpendicular plane of the motor anddo not contribute significantly to the monitoring in factincluding these features could affect the performance of thefeature reduction and novelty detection modules The resultsare presented in Table 3

The features obtained from the Fisher Score still present abetter distribution to detect the new scenariosThe LaplacianScore performance is affected when irrelevant features areincluded in the feature set but increases when more dimen-sions are taken into consideration Since the objective of theFisher Score is not topology preservation contrary to LS itis capable of discarding all the irrelevant features that wereincluded and the performance is not affected

5 Conclusions

This work proposes an adaptive novelty detection methodol-ogy based on vibration analysis for the condition monitoringand diagnosis of the components of a kinematic chain Themethodology is based on the acquisition of vibration signalsthat are generated in the kinematic chain along with anadequate signal processing to extract features to characterizethe components and an adaptive novelty detection model to

Table 3 Performance of the novelty detection increasing the num-ber of initial features from 10 to 15 and varying the number of thereduced set of features

DRPerformance varying the number of features

reduced ()2 3 4

LS 590 (plusmn41) 643 (plusmn33) 886 (plusmn25)Fisher S 100 (plusmn00) 100 (plusmn00) 100 (plusmn00)detect anomalies The method is composed of two sequentialstages an offline stage to initialize and retrain the modulesand an online stage to continuously assess the conditionof the machine During initialization the model is trainedemploying only information from the machine workingunder healthy condition and two additional faulty scenariosare introduced to test the performance of the method underunknown operations

The adaptive novelty detection approach successfullydetected both novel scenarios and the model incorporatedthe information to avoid generating alarms if the same faultis detected A comparison between the proposed methodand classical dimensionality reduction approaches highlightsthe limitations of maintaining a static set of features duringmonitoring instead of reformulating the feature reductionmodule once new information is available In this particularstudy employing features reduced by Laplacian Score andFisher Score obtained similar results nevertheless it doesnot imply that a similar outcome will be present during theanalysis of other faults Fisher Score is encouraged to beemployed in this methodology rather than maintaining theLaplacian Score approach due to the similarity between themethod objective and the objective function of Fisher Scorein both the ideal case is to find the features to maximizethe distance between scenarios while maintaining compactclusters A specific comparison of performances between LSand Fisher Score was also included in which the Fisher Scoreobtained better results when irrelevant features are includedin the original set of features and when the dimensionalityof the reduced set is increased this highlights the advantagesand robustness of the feature selection approach by FisherScore compared to the LS

From an industrial perspective two aspects should bementioned First dealing with an identical operating sce-nario (ie healthy) the variation of the working conditions(speed and torque patterns) will affect the resulting valueof the estimated numerical features and consequently itsrepresentation in the feature space used for novelty modeldesign Thus the introduction of the working condition asan additional degree of freedom in the novelty detectionscheme implies the increase of the dimensional space thatthe novelty detectionmodel is managingThis fact implies anincrease of the risk of data overlapping (different scenariosunder different operating conditions showing similar featurevalues) and also an increase of complexity for boundariesdefinition during the novelty modelling (data spread) Thusthe proposed method is not limited to stationary conditionsbut to the repetition of the pattern of working condition used

Shock and Vibration 11

to train the model Indeed in practical industrial applicationwhere cyclostationary processes take place the proposedmethod should be executed during the same period (the sameworking conditions) in order to reduce data variability andincrease novelty performance In case of multiple workingconditions consideration parallel novelty model structurescould be generated in order to activate each one dependingon the corresponding working condition

Second the integration of additional physicalmagnitudesas stator currents temperatures and others represents atrendy approach in electromechanical fault diagnosis inorder to improve fault diagnosis resolution and perfor-mance The corresponding data fusion scheme implies ahigh-dimensional feature space that is multiple numericalfeatures estimated from multiple physical magnitudes con-sidered In this point it must be noted that the presentedmethodology proposes the reevaluation of the feature reduc-tionmodule each time a retraining is appliedThus the largerthe available pool of features is the higher the potentiality ofthe proposedmethod to increase the performance during thedata characterization is

Also the proposed method can be extended andimproved for further development this improvement couldinclude a diagnosis method to not only detect anomalies inthe kinematic chain but also identify the fault causing theabnormal behavior In this sense future work will include aspecific comparison of broken bar fault detection capabilitiesbetween the proposedmethodology and classical approaches

Competing Interests

The authors declare that they have no competing interests

Acknowledgments

This research was partially supported by CONACyT schol-arships Grants 278033 and 313604 and also by the SpanishMinistry of Education Culture and Sport under GrantFPU1300589 The authors would also like to thank a partialfinancial support by the Generalitat de Catalunya (GRCMCIA Grant no SGR 2014-101)

References

[1] H Henao G-A Capolino M Fernandez-Cabanas et alldquoTrends in fault diagnosis for electrical machines a review ofdiagnostic techniquesrdquo IEEE Industrial Electronics Magazinevol 8 no 2 pp 31ndash42 2014

[2] N Bayar S Darmoul S Hajri-Gabouj and H Pierreval ldquoFaultdetection diagnosis and recovery using Artificial Immune Sys-tems a reviewrdquoEngineeringApplications of Artificial Intelligencevol 46 pp 43ndash57 2015

[3] SWang JWan D Zhang D Li and C Zhang ldquoTowards smartfactory for Industry 40 a self-organized multi-agent systemwith big data based feedback and coordinationrdquo ComputerNetworks vol 101 pp 158ndash168 2016

[4] M A F Pimentel D A Clifton L Clifton and L TarassenkoldquoA review of novelty detectionrdquo Signal Processing vol 99 pp215ndash249 2014

[5] R Langone O Mauricio Agudelo B De Moor and J AK Suykens ldquoIncremental kernel spectral clustering for online

learning of non-stationary datardquo Neurocomputing vol 139 pp246ndash260 2014

[6] M-H Chen P-C Chang and J-L Wu ldquoA population-basedincremental learning approach with artificial immune systemfor network intrusion detectionrdquo Engineering Applications ofArtificial Intelligence vol 51 pp 171ndash181 2016

[7] S Khomfoi and L M Tolbert ldquoFault diagnosis and reconfigu-ration for multilevel inverter drive using AI-based techniquesrdquoIEEE Transactions on Industrial Electronics vol 54 no 6 pp2954ndash2968 2007

[8] A Bellini F Filippetti C Tassoni and G-A CapolinoldquoAdvances in diagnostic techniques for induction machinesrdquoIEEE Transactions on Industrial Electronics vol 55 no 12 pp4109ndash4126 2008

[9] V T Tran and B-S Yang ldquoAn intelligent condition-basedmaintenance platform for rotating machineryrdquo Expert Systemswith Applications vol 39 no 3 pp 2977ndash2988 2012

[10] P A Delgado-Arredondo A Garcia-Perez D Morinigo-Soteloet al ldquoComparative study of time-frequency decompositiontechniques for fault detection in induction motors using vibra-tion analysis during startup transientrdquo Shock and Vibration vol2015 Article ID 708034 14 pages 2015

[11] M D Prieto G Cirrincione A G Espinosa J A Ortegaand H Henao ldquoBearing fault detection by a novel condition-monitoring scheme based on statistical-time features and neu-ral networksrdquo IEEE Transactions on Industrial Electronics vol60 no 8 pp 3398ndash3407 2013

[12] P Perner ldquoConcepts for novelty detection and handling basedon a case-based reasoning process schemerdquo Engineering Appli-cations of Artificial Intelligence vol 22 no 1 pp 86ndash91 2009

[13] C Englund and A Verikas ldquoA hybrid approach to outlier detec-tion in the offset lithographic printing processrdquo EngineeringApplications of Artificial Intelligence vol 18 no 6 pp 759ndash7682005

[14] H Garces and D Sbarbaro ldquoOutliers detection in environmen-tal monitoring databasesrdquo Engineering Applications of ArtificialIntelligence vol 24 no 2 pp 341ndash349 2011

[15] S Ma S M Li and Y P Xiong ldquoUncertainty reduced noveltydetection approach applied to rotatingmachinery for conditionmonitoringrdquo Shock and Vibration vol 2015 Article ID 73721310 pages 2015

[16] J McBain and M Timusk ldquoFeature extraction for noveltydetection as applied to fault detection in machineryrdquo PatternRecognition Letters vol 32 no 7 pp 1054ndash1061 2011

[17] S Wang J Yub E Lapirac and J Lee ldquoA modified supportvector data description based novelty detection approach formachinery componentsrdquo Applied Soft Computing vol 13 no 2pp 1193ndash1205 2013

[18] D P Filev R B Chinnam F Tseng and P Baruah ldquoAn indus-trial strength novelty detection framework for autonomousequipment monitoring and diagnosticsrdquo IEEE Transactions onIndustrial Informatics vol 6 no 4 pp 767ndash779 2010

[19] B S J Costa P P Angelov and L A Guedes ldquoFully unsu-pervised fault detection and identification based on recursivedensity estimation and self-evolving cloud-based classifierrdquoNeurocomputing vol 150 pp 289ndash303 2015

[20] B Scholkopf R C Williamson A J Smola J Shawe-Taylorand J C Platt ldquoSupport vector method for novelty detectionrdquoin Neural Information Processing Systems vol 12 pp 582ndash5881999

12 Shock and Vibration

[21] D M J Tax and R P W Duin ldquoSupport vector domaindescriptionrdquo Pattern Recognition Letters vol 20 no 11ndash13 pp1191ndash1199 1999

[22] S D Villalba and P Cunningham ldquoAn evaluation of dimensionreduction techniques for one-class classificationrdquo ArtificialIntelligence Review vol 27 no 4 pp 273ndash294 2007

[23] I T Jolliffe Principal Component Analysis Springer New YorkNY USA 2nd edition 1986

[24] X He D Cai and P Niyogi ldquoLaplacian score for featureselectionrdquo inAdvances in Neural Information Processing Systems18 pp 507ndash514 MIT Press 2005

[25] A J Izenman ldquoLinear discriminant analysisrdquo in Modern Mul-tivariate Statistical Techniques A J Izenman Ed pp 237ndash280Springer New York NY USA 2008

[26] Q Gu Z Li and J Han ldquoGeneralized fisher score for featureselectionrdquo httpsarxivorgabs12023725

[27] Z Gao C Cecati and S X Ding ldquoA survey of fault diagnosisand fault-tolerant techniquesmdashpart II fault diagnosis withknowledge-based and hybridactive approachesrdquo IEEE Trans-actions on Industrial Electronics vol 62 no 6 pp 3768ndash37742015

[28] A E Lazzaretti D M J Tax H Vieira Neto and V HFerreira ldquoNovelty detection and multi-class classification inpower distribution voltage waveformsrdquo Expert Systems withApplications vol 45 pp 322ndash330 2016

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Page 5: Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

Shock and Vibration 5

Database Feature calculation

Feature reduction

Novelty model design

New measurement

Calculation of reducedfeatures

NoveltyUser

supervisionKnown

operation False alarm

Offl

ine

Onl

ine

RetrainingIncluding

information of new fault

No

Yes

Noveltymodel

Figure 1 Proposedmethodology for the novelty detection approachThemonitoringmethod is composed of an offline stage for initializationand retraining and an online stage for continuous monitoring

31 Offline Stage During the initialization it is assumed thatonly information of the machine operating under healthycondition is available in the database The first step is thecalculation of numerical features from the vibration mea-surements obtained during the machine operation Since theinformation of the possible faults of the monitored machineis not available yet during this initialization a generic set ofstatistical time-based numerical features is proposed to beextracted from each available vibration axis measurementThe proposed set of potential statistical time-frequency fea-tures is shown as follows

Max Value

119894 = max (119909) (9)

Root Mean Square (RMS)

RMS = radic 1119899 sdot119899sum119896=1

(119909119896)2 (10)

Crest Factor (CF)

CF = 119909119896RMS

(11)

Kurtosis

119896 = 119864 [(119909119896 minus 119909)4]1205904 (12)

Shape Factor

SF = RMS1119899 sdot sum119899119896=1 10038161003816100381610038161199091198961003816100381610038161003816 (13)

These features have been successfully employed for faultdetection in the last years [1 27]

The resulting number of numerical features is propor-tional to the number of available vibration axes collectedduring the acquisition However in order to allow the com-pression and visualization of the data a feature reductionmodule is implemented During such offline stage initializa-tion an unsupervised feature reduction approach must be

used a Laplacian Score ranking is proposed in this work asa good trade-off between simplicity and performance [24]to rank the features according to the topology preservationcapabilities The two or three first ranked features in terms ofLaplacian Score are selected

Next the novelty model is designedThere is a significantnumber of noveltymodels proposed in the literature [4] witheach one demonstrated to be a capable option under certaincircumstances An increasing amount of works implies thatdomain based novelty detection models present promisingresults [19 28] In this work a standard OC-SVMwith Gaus-sian kernel is used The design of the novelty model includesthe selection of the parameters for configuration and trainingemploying the known scenarios stored at the database Thenthe initialization of the offline stage finalizes with the designof the OC-SVM

32 Online Stage This stage continuously monitors thecondition of the machine to detect if an anomaly is presentTo accomplish this each of the new measurements of themachine acquired certain amount of predefined time Eachmeasurement is segmented and a set of features is calculatedfrom each segment The set of numerical features calculatedin this stage are reduced in the offline stage by means ofLaplacian Score

Thus each new measurement characterized by thenumerical features is analyzed by the novelty model In caseof no novelty detection it is assumed that the machine isworking under known conditions However if the analyzedmeasurement is detected as a novelty an alarm is triggeredin order to consider the user assessment Then if theoccurrence of a new scenario is confirmed the correspondingmeasurements are stored at the database and a retrainingprocedure is performed

33 Retraining Once the retraining is trigged the featurereduction and the novelty model design modules are modi-fied at the offline stage A diagram of the retraining procedureis presented in Figure 2

It must be noticed that two important contributiveaspects are proposed in this retraining approach first thereconsideration of the feature reductionmodule and secondthe incorporation of fault scenarios to the novelty model As

6 Shock and Vibration

New Fault Detected

Continuos Monitoring

Add Information to DDBB

No

Yes

Perform New Feature

Reduction

Novelty model redesign

A supervised method will be

used

Measurementsof the new fault

Data of all stored scenarios

included

Retr

aini

ng

New faultdetected

Continuousmonitoring

Add information to DDBB

Perform new feature

reduction

Novelty modelredesign

Figure 2 Proposed retraining approach First measurements char-acterizing the fault are stored and then the feature reductionmoduleand the novelty design module are modified to incorporate the newscenario encountered

it has been explained the feature reduction module duringits initialization is supported by an unsupervised LaplacianScore ranking due to the lack of additional scenarios yet oncethe information of a fault is available the feature reductionmodule could improve its performance by employing super-vised methods

Then the Laplacian approach is replaced by a FisherScore ranking approach in this work where the features aresorted according to the Fisher coefficient calculated fromeach feature It is important to mention that Laplacian Scorecan be configured towork also under a supervised frameworkand could obtain similar results than employing a Fisherapproach similarities between the two approaches are dis-cussed in [24] Nevertheless Fisher Score is used in thismethodology to search for a possible discriminative scenarioto increase the detection of novel scenarios A comparison ofresults obtained from both features reduction approaches ispresented in the results of this work

The consideration of a faulty scenario in the noveltymodel may contradict the principle of anomaly detectionwhere the objective is to detect healthy behaviors from therest Nevertheless the aim of an adaptive condition mon-itoring system should be to learnt from all the identifiedconditions to subsequently detect them if they are presentedagain by a fault detection module Indeed some workspresent a parallel structure for fault detection and noveltydetection modules [18 19] where the novelty module learnsthe known faults along with the healthy operation because itsobjective is to detect only new scenarios while the objectiveof the fault detection is to identify the condition of the

Gearbox

Load

VFD

DAS

Inductionmotor

AccelerometerEncoder

PC

z

yx

Figure 3 Test bench used for experimentation

machine including known faults This work is based on suchparallel structure approach where the fault scenariosmust betaken into consideration to be included in the novelty modeland then they could be identified by a complementary faultdetection module with a high confidence level

4 Experimental Results

41 Description of the Experimental Platform The test benchused for testing the kinematic chain with different faults andthe acquisition system used to capture the vibration signalsare shown in Figure 3 The test bench consists in a 1492-W-three-phase induction motor (WEG 00236ET3E145T-W22)with its rotational speed controlled through a variable fre-quency drive-VFD (WEG CFW08) the operating speed isfixed to 60Hz for all experiments It also consists of a 4 1 ratiogearbox (BALDORGCF4X01AA) and a DC generator (BAL-DOR CDP3604) which is used as the mechanical load com-prising around 20of the nominal loadThe vibration signalsare acquired using a triaxial accelerometer (LIS3L02AS4)mounted on a board with the signal conditioning andantialias filtering Two 12-bit 4-channel serial output sam-pling analog-to-digital converters (ADS7841) are used in thedata acquisition system (DAS) board Vibration signal fromthe perpendicular plane (119910-119911) to the motor shaft is acquiredusing a triaxial accelerometer LIS3L02AS4 mounted on aboard with the signal conditioning and antialiasing filteringSampling frequency is set to 3 kHz for vibration acquisitionThe data retrieved by the DAS is stored in a regular computer(PC)

Three scenarios are considered to verify the performanceof the proposed method the first one 119867 is the kinematicchain working under healthy condition and the other two 1198651and 1198652 represent the kinematic chain working under faultyconditions For 1198651 the motor is working with a half brokenbar and for 1198652 the motor is working with a fully brokenbar The detail of the failures is shown in Figure 4 The halfbroken bar failure is artificially produced by drilling a 6mmhole with a depth of 3mm that corresponds mostly to the

Shock and Vibration 7

(a) (b)

Figure 4 Detail of the faults produced in the test bench (a) corresponds to the 12 broken rotor bar and (b) to one broken rotor bar

22 of the section of the rotor bar and the full broken baris produced by a through-hole with a diameter of 6mm anda depth of 14mm which corresponds to the complete sectionof the rotor bar

The information stored from the kinematic chain consistsof an acquisition of 60 seconds of themachine working underthe three scenarios mentioned each acquisition is segmentedin 30 parts of 2 seconds and a set of features is calculatedfrom the 30 segmented measurements Since two axes aretaken into consideration a total of ten features are calculatedfrom each segmented acquisition of the machine workingunder the different scenarios mentioned The first step ofthe methodology is the offline stage where a reduced set offeatures is obtained and the novelty model is designed

42 Parameter Selection Regarding the Laplacian Score con-figuration a simple approach is followed for parameter tun-ing that is a value of 119896 = 3 is used for constructing the adja-cency graph and a ldquosimple mindedrdquo weighting approach isfollowed Since the proposed approach is compared to differ-ent feature reduction modules selecting generic parameterssettings is useful for the purpose of evaluation but ignores thatthere may be dependencies between the feature reductionmodel and the novelty model Regarding the design of thenovelty model the kernel used is the Gaussian and the valueof the configuration parameter is tuned tominimize the errorin the validation In all experiments 80 of the samplesare used for training and 20 for validation To train andadjust the parameter of the novelty model a fivefold cross-validation is used

43 Results and Discussions In order to highlight the contri-butions andmotivation of this work the outline of the resultsis presented as follows first a test is performed by a classicalapproach then the proposedmethodology is applied and theresults are compared The classical static approach impliesconserving the reduced features set obtained at the offline ini-tialization stage meanwhile the proposed dynamic approachimplies a possible reformulation of the reduced features setduring the retraining stage

Healthy

0

005

01

015

02

minus2 minus1minus3 1 2 30LS feature 1 RMS Y

minus3

minus2

minus1

0

1

2

3LS

feat

ure 2

kur

tosis

Y

Figure 5 Initial novelty model representation Limit of the noveltythreshold sdot sdot sdot and measurements used to train the model lowast

During the initialization stage the reduced set of featuresselected bymeans of the Laplacian Score ranking is composedof the RMS of the 119910-axis and the Kurtosis of the 119910-axisThe OC-SVM model is trained employing healthy conditiondata The resulting OC-SVM during training is shown inFigure 5 The marks lowast in the figure represent measurementsof the machine used to train the model in this case andin the subsequence figures only 1-fold of the fivefold cross-validation is displayedThe dotted line represents the noveltythreshold value all data inside the boundaries of the dottedline is considered normal The contour plot represents thenovelty score evaluation over different regions of the featuremap

Once the offline initialization stage is finished the onlinestage follows that is new measurements are obtained toassess the condition of the monitored machine To giverobustness to the novelty detection and avoid false alarms abatch consisting of 30 measurements is evaluated if 75 of

8 Shock and Vibration

0

005

01

015

02

minus4

minus3

minus2

minus1

0

1

2

3

4

5

60 2 4 8 10minus2minus4LS feature 1 RMS Y

KnownF1

LS fe

atur

e 2 k

urto

sis Y

Figure 6 Evaluation of the fault scenario 1198651 The novelty model istrained employing data from healthy operation condition

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

minus001

0

001

002

003

004

005

006

007

008

Known

LS fe

atur

e 2 k

urto

sis Y

Figure 7 Contour plot of the novelty model after including 1198651

the analyzed measurements are evaluated as novelty then thealarm is triggered Next the 1198651 scenario is presented to testthe performance of the model The plot of the scenario andthe novelty threshold obtained during training is presentedin Figure 6

As can be appreciated the new scenario lays outside thenovelty threshold so it is successfully detected as noveltyOnce a novel scenario is detected and identified as a fault bythe user it is incorporated to the database to consider it as partof the known scenarios and the novelty model is retrainedwithout changing the features to include this informationFigure 7 shows the feature space after the novelty modelis trained using healthy and 1198651 data as part of the knownscenarios

Once the model is retrained the third scenario 1198652 isintroduced The visual representation of the test is presentedin Figure 8 this scenario is not detected as novel because only50 of the measurements are labeled as novel by the model

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

KnownF2

LS fe

atur

e 2 k

urto

sis Y

Figure 8 Evaluation of the fault scenario 1198652 The novelty model istrained employing data from healthy and 1198651 scenarios

A similar result is obtained when the novelty model istrained using healthy and 1198652 data and is tested with data of1198651 A summary of the results of novelty detectionmaintainingthe same features obtained during initialization is shown inTable 1

As it can be seen in Table 1 using the reduced set obtainedduring initialization is easy to detect the novel scenarioswhen there is only information of the healthy condition butwhen a novel scenario is included in the database and thenoveltymodel is retrained the reduced set of features initiallyobtained does not necessarily provide a good representationto detect a new scenario during test

To improve these results the methodology presented inthis work proposes to evaluate again the feature reductionmodule each time a retraining is applied Following theoutline presented for results and parting from the first retrainwhere the scenario 1198651 is included in the database the featurereduction module is applied again but this time includinginformation from healthy condition and 1198651 scenario Sincetwo scenarios are taken into consideration and the labels areknown a supervised approach can be applied in this case aFisher Score ranking for feature reduction is employed Thenovelty model using the initial set of features and the noveltymodel after retraining with the new reduced set of featuresare shown in Figure 9

The new reduced set of features is composed of the RMSof the 119910-axis and the Kurtosis of the 119911-axis as can be appre-ciated the new set of features present a more discriminativedistribution of the three scenarios considered At the laststep (Figure 9(d)) when 1198652 scenario is included the FisherScore still ranked the same features of the last retraining(Figure 9(b)) as the highest yet it would be possible thata different set of features would be obtained The resultsobtained demonstrate the advantages of including the FisherScore reduction module to the retraining procedure the newdistribution of the scenarios avoided an overlapping of the

Shock and Vibration 9

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

0

005

01

015

02

KnownF1

LS fe

atur

e 2 k

urto

sis Y

(a)

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus001

0

001

002

003

004

005

006

007

008

Known

FS fe

atur

e 2 k

urto

sis Z

(b)

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

FS fe

atur

e 2 k

urto

sis Z

minus5

minus4

minus3

minus2

minus1

0

1

2

3

KnownF2

(c)

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus0005

0

0005

001

0015

002

0025

003

0035

Known

FS fe

atur

e 2 k

urto

sis Z

(d)

Figure 9 Process of evaluation and retraining employing the methodology proposed (a) Evaluation of the fault scenario 1198651 (b) Retrainingof the novelty model and reformulation of the reduced set of features including 1198651 (c) Evaluation of the fault scenario 1198652 (d) Retraining ofthe novelty model and reformulation of the reduced set of features including 1198652Table 1 Performance of the novelty detection using only healthy class data to reduce the number of features where DR stands fordimensionality reduction Different scenarios are included according to the information available to train and test the novelty model

DRPerformance using OC-SVM ()

Known Test Known Test Known Test119867 1198651 + 1198652 119867 + 1198651 1198652 119867 + 1198652 1198651LS 954 (plusmn11) 576 (plusmn67) 435 (plusmn64)

new scenario under test therefore the noveltymodel success-fully detected as novelty all measurements corresponding tothe 1198652 scenario

It is worth mentioning that if initially the scenario 1198652is used for training and the scenario 1198651 is used for testingthe new set of features obtained could be different from theaforementionedThe results achieved fromboth scenarios are

shown in Table 2 which also includes a comparison of theresults obtained employing PCA LDA and Laplacian Scoredimensionality reduction techniques in the retraining stepinstead of the Fisher Score proposed

Regarding the classical feature extraction techniquesPCA and LDA the test scenario is not identified as novelin both cases The Fisher Score and the Laplacian Score

10 Shock and Vibration

Table 2 Performance of the novelty detection employing a reduc-tion of features during retraining Different scenarios are includedaccording to the information available to train and test the noveltymodel

DRPerformance using OC-SVM ()

Known Test Known Test119867 + 1198651 1198652 119867 + 1198652 1198651PCA 590 (plusmn52) 642 (plusmn51)LDA 474 (plusmn61) 512 (plusmn61)LS 843 (plusmn24) 100 (plusmn00)Fisher S 100 (plusmn00) 100 (plusmn00)successfully identified the new scenarios as novel since thepercentage obtained in both cases is higher than the 75 pre-defined threshold in the methodology to activate the alarmBoth techniques achieved high scores but still the FisherScore provided a more appropriate selection of features

As mentioned in Section 2 LDA is a feature extractiontechnique based on the Fisher discriminant coefficient sosimilar results between Fisher Score ranking and LDA areexpected however employing Fisher Score ranking scoreachieved a better result this can be caused because the testconsists of novel scenarios and LDA finds the directions onthe feature space specialized for the two supervised scenariosemployed during training meanwhile Fisher Score rankingprovides a more general approach by selecting features

To test the robustness of the Laplacian Score and FisherScore approaches a comparative test is performed where theset of features is increased from 10 to 15 and by varyingthe number of reduced features selected The five featuresincluded in the original set are obtained from the119909-axis of theaccelerometer monitored where the features calculated arepresented in the aforementioned statistical time-frequencyfeatures these features were discarded initially because theyare not part of the perpendicular plane of the motor anddo not contribute significantly to the monitoring in factincluding these features could affect the performance of thefeature reduction and novelty detection modules The resultsare presented in Table 3

The features obtained from the Fisher Score still present abetter distribution to detect the new scenariosThe LaplacianScore performance is affected when irrelevant features areincluded in the feature set but increases when more dimen-sions are taken into consideration Since the objective of theFisher Score is not topology preservation contrary to LS itis capable of discarding all the irrelevant features that wereincluded and the performance is not affected

5 Conclusions

This work proposes an adaptive novelty detection methodol-ogy based on vibration analysis for the condition monitoringand diagnosis of the components of a kinematic chain Themethodology is based on the acquisition of vibration signalsthat are generated in the kinematic chain along with anadequate signal processing to extract features to characterizethe components and an adaptive novelty detection model to

Table 3 Performance of the novelty detection increasing the num-ber of initial features from 10 to 15 and varying the number of thereduced set of features

DRPerformance varying the number of features

reduced ()2 3 4

LS 590 (plusmn41) 643 (plusmn33) 886 (plusmn25)Fisher S 100 (plusmn00) 100 (plusmn00) 100 (plusmn00)detect anomalies The method is composed of two sequentialstages an offline stage to initialize and retrain the modulesand an online stage to continuously assess the conditionof the machine During initialization the model is trainedemploying only information from the machine workingunder healthy condition and two additional faulty scenariosare introduced to test the performance of the method underunknown operations

The adaptive novelty detection approach successfullydetected both novel scenarios and the model incorporatedthe information to avoid generating alarms if the same faultis detected A comparison between the proposed methodand classical dimensionality reduction approaches highlightsthe limitations of maintaining a static set of features duringmonitoring instead of reformulating the feature reductionmodule once new information is available In this particularstudy employing features reduced by Laplacian Score andFisher Score obtained similar results nevertheless it doesnot imply that a similar outcome will be present during theanalysis of other faults Fisher Score is encouraged to beemployed in this methodology rather than maintaining theLaplacian Score approach due to the similarity between themethod objective and the objective function of Fisher Scorein both the ideal case is to find the features to maximizethe distance between scenarios while maintaining compactclusters A specific comparison of performances between LSand Fisher Score was also included in which the Fisher Scoreobtained better results when irrelevant features are includedin the original set of features and when the dimensionalityof the reduced set is increased this highlights the advantagesand robustness of the feature selection approach by FisherScore compared to the LS

From an industrial perspective two aspects should bementioned First dealing with an identical operating sce-nario (ie healthy) the variation of the working conditions(speed and torque patterns) will affect the resulting valueof the estimated numerical features and consequently itsrepresentation in the feature space used for novelty modeldesign Thus the introduction of the working condition asan additional degree of freedom in the novelty detectionscheme implies the increase of the dimensional space thatthe novelty detectionmodel is managingThis fact implies anincrease of the risk of data overlapping (different scenariosunder different operating conditions showing similar featurevalues) and also an increase of complexity for boundariesdefinition during the novelty modelling (data spread) Thusthe proposed method is not limited to stationary conditionsbut to the repetition of the pattern of working condition used

Shock and Vibration 11

to train the model Indeed in practical industrial applicationwhere cyclostationary processes take place the proposedmethod should be executed during the same period (the sameworking conditions) in order to reduce data variability andincrease novelty performance In case of multiple workingconditions consideration parallel novelty model structurescould be generated in order to activate each one dependingon the corresponding working condition

Second the integration of additional physicalmagnitudesas stator currents temperatures and others represents atrendy approach in electromechanical fault diagnosis inorder to improve fault diagnosis resolution and perfor-mance The corresponding data fusion scheme implies ahigh-dimensional feature space that is multiple numericalfeatures estimated from multiple physical magnitudes con-sidered In this point it must be noted that the presentedmethodology proposes the reevaluation of the feature reduc-tionmodule each time a retraining is appliedThus the largerthe available pool of features is the higher the potentiality ofthe proposedmethod to increase the performance during thedata characterization is

Also the proposed method can be extended andimproved for further development this improvement couldinclude a diagnosis method to not only detect anomalies inthe kinematic chain but also identify the fault causing theabnormal behavior In this sense future work will include aspecific comparison of broken bar fault detection capabilitiesbetween the proposedmethodology and classical approaches

Competing Interests

The authors declare that they have no competing interests

Acknowledgments

This research was partially supported by CONACyT schol-arships Grants 278033 and 313604 and also by the SpanishMinistry of Education Culture and Sport under GrantFPU1300589 The authors would also like to thank a partialfinancial support by the Generalitat de Catalunya (GRCMCIA Grant no SGR 2014-101)

References

[1] H Henao G-A Capolino M Fernandez-Cabanas et alldquoTrends in fault diagnosis for electrical machines a review ofdiagnostic techniquesrdquo IEEE Industrial Electronics Magazinevol 8 no 2 pp 31ndash42 2014

[2] N Bayar S Darmoul S Hajri-Gabouj and H Pierreval ldquoFaultdetection diagnosis and recovery using Artificial Immune Sys-tems a reviewrdquoEngineeringApplications of Artificial Intelligencevol 46 pp 43ndash57 2015

[3] SWang JWan D Zhang D Li and C Zhang ldquoTowards smartfactory for Industry 40 a self-organized multi-agent systemwith big data based feedback and coordinationrdquo ComputerNetworks vol 101 pp 158ndash168 2016

[4] M A F Pimentel D A Clifton L Clifton and L TarassenkoldquoA review of novelty detectionrdquo Signal Processing vol 99 pp215ndash249 2014

[5] R Langone O Mauricio Agudelo B De Moor and J AK Suykens ldquoIncremental kernel spectral clustering for online

learning of non-stationary datardquo Neurocomputing vol 139 pp246ndash260 2014

[6] M-H Chen P-C Chang and J-L Wu ldquoA population-basedincremental learning approach with artificial immune systemfor network intrusion detectionrdquo Engineering Applications ofArtificial Intelligence vol 51 pp 171ndash181 2016

[7] S Khomfoi and L M Tolbert ldquoFault diagnosis and reconfigu-ration for multilevel inverter drive using AI-based techniquesrdquoIEEE Transactions on Industrial Electronics vol 54 no 6 pp2954ndash2968 2007

[8] A Bellini F Filippetti C Tassoni and G-A CapolinoldquoAdvances in diagnostic techniques for induction machinesrdquoIEEE Transactions on Industrial Electronics vol 55 no 12 pp4109ndash4126 2008

[9] V T Tran and B-S Yang ldquoAn intelligent condition-basedmaintenance platform for rotating machineryrdquo Expert Systemswith Applications vol 39 no 3 pp 2977ndash2988 2012

[10] P A Delgado-Arredondo A Garcia-Perez D Morinigo-Soteloet al ldquoComparative study of time-frequency decompositiontechniques for fault detection in induction motors using vibra-tion analysis during startup transientrdquo Shock and Vibration vol2015 Article ID 708034 14 pages 2015

[11] M D Prieto G Cirrincione A G Espinosa J A Ortegaand H Henao ldquoBearing fault detection by a novel condition-monitoring scheme based on statistical-time features and neu-ral networksrdquo IEEE Transactions on Industrial Electronics vol60 no 8 pp 3398ndash3407 2013

[12] P Perner ldquoConcepts for novelty detection and handling basedon a case-based reasoning process schemerdquo Engineering Appli-cations of Artificial Intelligence vol 22 no 1 pp 86ndash91 2009

[13] C Englund and A Verikas ldquoA hybrid approach to outlier detec-tion in the offset lithographic printing processrdquo EngineeringApplications of Artificial Intelligence vol 18 no 6 pp 759ndash7682005

[14] H Garces and D Sbarbaro ldquoOutliers detection in environmen-tal monitoring databasesrdquo Engineering Applications of ArtificialIntelligence vol 24 no 2 pp 341ndash349 2011

[15] S Ma S M Li and Y P Xiong ldquoUncertainty reduced noveltydetection approach applied to rotatingmachinery for conditionmonitoringrdquo Shock and Vibration vol 2015 Article ID 73721310 pages 2015

[16] J McBain and M Timusk ldquoFeature extraction for noveltydetection as applied to fault detection in machineryrdquo PatternRecognition Letters vol 32 no 7 pp 1054ndash1061 2011

[17] S Wang J Yub E Lapirac and J Lee ldquoA modified supportvector data description based novelty detection approach formachinery componentsrdquo Applied Soft Computing vol 13 no 2pp 1193ndash1205 2013

[18] D P Filev R B Chinnam F Tseng and P Baruah ldquoAn indus-trial strength novelty detection framework for autonomousequipment monitoring and diagnosticsrdquo IEEE Transactions onIndustrial Informatics vol 6 no 4 pp 767ndash779 2010

[19] B S J Costa P P Angelov and L A Guedes ldquoFully unsu-pervised fault detection and identification based on recursivedensity estimation and self-evolving cloud-based classifierrdquoNeurocomputing vol 150 pp 289ndash303 2015

[20] B Scholkopf R C Williamson A J Smola J Shawe-Taylorand J C Platt ldquoSupport vector method for novelty detectionrdquoin Neural Information Processing Systems vol 12 pp 582ndash5881999

12 Shock and Vibration

[21] D M J Tax and R P W Duin ldquoSupport vector domaindescriptionrdquo Pattern Recognition Letters vol 20 no 11ndash13 pp1191ndash1199 1999

[22] S D Villalba and P Cunningham ldquoAn evaluation of dimensionreduction techniques for one-class classificationrdquo ArtificialIntelligence Review vol 27 no 4 pp 273ndash294 2007

[23] I T Jolliffe Principal Component Analysis Springer New YorkNY USA 2nd edition 1986

[24] X He D Cai and P Niyogi ldquoLaplacian score for featureselectionrdquo inAdvances in Neural Information Processing Systems18 pp 507ndash514 MIT Press 2005

[25] A J Izenman ldquoLinear discriminant analysisrdquo in Modern Mul-tivariate Statistical Techniques A J Izenman Ed pp 237ndash280Springer New York NY USA 2008

[26] Q Gu Z Li and J Han ldquoGeneralized fisher score for featureselectionrdquo httpsarxivorgabs12023725

[27] Z Gao C Cecati and S X Ding ldquoA survey of fault diagnosisand fault-tolerant techniquesmdashpart II fault diagnosis withknowledge-based and hybridactive approachesrdquo IEEE Trans-actions on Industrial Electronics vol 62 no 6 pp 3768ndash37742015

[28] A E Lazzaretti D M J Tax H Vieira Neto and V HFerreira ldquoNovelty detection and multi-class classification inpower distribution voltage waveformsrdquo Expert Systems withApplications vol 45 pp 322ndash330 2016

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Page 6: Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

6 Shock and Vibration

New Fault Detected

Continuos Monitoring

Add Information to DDBB

No

Yes

Perform New Feature

Reduction

Novelty model redesign

A supervised method will be

used

Measurementsof the new fault

Data of all stored scenarios

included

Retr

aini

ng

New faultdetected

Continuousmonitoring

Add information to DDBB

Perform new feature

reduction

Novelty modelredesign

Figure 2 Proposed retraining approach First measurements char-acterizing the fault are stored and then the feature reductionmoduleand the novelty design module are modified to incorporate the newscenario encountered

it has been explained the feature reduction module duringits initialization is supported by an unsupervised LaplacianScore ranking due to the lack of additional scenarios yet oncethe information of a fault is available the feature reductionmodule could improve its performance by employing super-vised methods

Then the Laplacian approach is replaced by a FisherScore ranking approach in this work where the features aresorted according to the Fisher coefficient calculated fromeach feature It is important to mention that Laplacian Scorecan be configured towork also under a supervised frameworkand could obtain similar results than employing a Fisherapproach similarities between the two approaches are dis-cussed in [24] Nevertheless Fisher Score is used in thismethodology to search for a possible discriminative scenarioto increase the detection of novel scenarios A comparison ofresults obtained from both features reduction approaches ispresented in the results of this work

The consideration of a faulty scenario in the noveltymodel may contradict the principle of anomaly detectionwhere the objective is to detect healthy behaviors from therest Nevertheless the aim of an adaptive condition mon-itoring system should be to learnt from all the identifiedconditions to subsequently detect them if they are presentedagain by a fault detection module Indeed some workspresent a parallel structure for fault detection and noveltydetection modules [18 19] where the novelty module learnsthe known faults along with the healthy operation because itsobjective is to detect only new scenarios while the objectiveof the fault detection is to identify the condition of the

Gearbox

Load

VFD

DAS

Inductionmotor

AccelerometerEncoder

PC

z

yx

Figure 3 Test bench used for experimentation

machine including known faults This work is based on suchparallel structure approach where the fault scenariosmust betaken into consideration to be included in the novelty modeland then they could be identified by a complementary faultdetection module with a high confidence level

4 Experimental Results

41 Description of the Experimental Platform The test benchused for testing the kinematic chain with different faults andthe acquisition system used to capture the vibration signalsare shown in Figure 3 The test bench consists in a 1492-W-three-phase induction motor (WEG 00236ET3E145T-W22)with its rotational speed controlled through a variable fre-quency drive-VFD (WEG CFW08) the operating speed isfixed to 60Hz for all experiments It also consists of a 4 1 ratiogearbox (BALDORGCF4X01AA) and a DC generator (BAL-DOR CDP3604) which is used as the mechanical load com-prising around 20of the nominal loadThe vibration signalsare acquired using a triaxial accelerometer (LIS3L02AS4)mounted on a board with the signal conditioning andantialias filtering Two 12-bit 4-channel serial output sam-pling analog-to-digital converters (ADS7841) are used in thedata acquisition system (DAS) board Vibration signal fromthe perpendicular plane (119910-119911) to the motor shaft is acquiredusing a triaxial accelerometer LIS3L02AS4 mounted on aboard with the signal conditioning and antialiasing filteringSampling frequency is set to 3 kHz for vibration acquisitionThe data retrieved by the DAS is stored in a regular computer(PC)

Three scenarios are considered to verify the performanceof the proposed method the first one 119867 is the kinematicchain working under healthy condition and the other two 1198651and 1198652 represent the kinematic chain working under faultyconditions For 1198651 the motor is working with a half brokenbar and for 1198652 the motor is working with a fully brokenbar The detail of the failures is shown in Figure 4 The halfbroken bar failure is artificially produced by drilling a 6mmhole with a depth of 3mm that corresponds mostly to the

Shock and Vibration 7

(a) (b)

Figure 4 Detail of the faults produced in the test bench (a) corresponds to the 12 broken rotor bar and (b) to one broken rotor bar

22 of the section of the rotor bar and the full broken baris produced by a through-hole with a diameter of 6mm anda depth of 14mm which corresponds to the complete sectionof the rotor bar

The information stored from the kinematic chain consistsof an acquisition of 60 seconds of themachine working underthe three scenarios mentioned each acquisition is segmentedin 30 parts of 2 seconds and a set of features is calculatedfrom the 30 segmented measurements Since two axes aretaken into consideration a total of ten features are calculatedfrom each segmented acquisition of the machine workingunder the different scenarios mentioned The first step ofthe methodology is the offline stage where a reduced set offeatures is obtained and the novelty model is designed

42 Parameter Selection Regarding the Laplacian Score con-figuration a simple approach is followed for parameter tun-ing that is a value of 119896 = 3 is used for constructing the adja-cency graph and a ldquosimple mindedrdquo weighting approach isfollowed Since the proposed approach is compared to differ-ent feature reduction modules selecting generic parameterssettings is useful for the purpose of evaluation but ignores thatthere may be dependencies between the feature reductionmodel and the novelty model Regarding the design of thenovelty model the kernel used is the Gaussian and the valueof the configuration parameter is tuned tominimize the errorin the validation In all experiments 80 of the samplesare used for training and 20 for validation To train andadjust the parameter of the novelty model a fivefold cross-validation is used

43 Results and Discussions In order to highlight the contri-butions andmotivation of this work the outline of the resultsis presented as follows first a test is performed by a classicalapproach then the proposedmethodology is applied and theresults are compared The classical static approach impliesconserving the reduced features set obtained at the offline ini-tialization stage meanwhile the proposed dynamic approachimplies a possible reformulation of the reduced features setduring the retraining stage

Healthy

0

005

01

015

02

minus2 minus1minus3 1 2 30LS feature 1 RMS Y

minus3

minus2

minus1

0

1

2

3LS

feat

ure 2

kur

tosis

Y

Figure 5 Initial novelty model representation Limit of the noveltythreshold sdot sdot sdot and measurements used to train the model lowast

During the initialization stage the reduced set of featuresselected bymeans of the Laplacian Score ranking is composedof the RMS of the 119910-axis and the Kurtosis of the 119910-axisThe OC-SVM model is trained employing healthy conditiondata The resulting OC-SVM during training is shown inFigure 5 The marks lowast in the figure represent measurementsof the machine used to train the model in this case andin the subsequence figures only 1-fold of the fivefold cross-validation is displayedThe dotted line represents the noveltythreshold value all data inside the boundaries of the dottedline is considered normal The contour plot represents thenovelty score evaluation over different regions of the featuremap

Once the offline initialization stage is finished the onlinestage follows that is new measurements are obtained toassess the condition of the monitored machine To giverobustness to the novelty detection and avoid false alarms abatch consisting of 30 measurements is evaluated if 75 of

8 Shock and Vibration

0

005

01

015

02

minus4

minus3

minus2

minus1

0

1

2

3

4

5

60 2 4 8 10minus2minus4LS feature 1 RMS Y

KnownF1

LS fe

atur

e 2 k

urto

sis Y

Figure 6 Evaluation of the fault scenario 1198651 The novelty model istrained employing data from healthy operation condition

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

minus001

0

001

002

003

004

005

006

007

008

Known

LS fe

atur

e 2 k

urto

sis Y

Figure 7 Contour plot of the novelty model after including 1198651

the analyzed measurements are evaluated as novelty then thealarm is triggered Next the 1198651 scenario is presented to testthe performance of the model The plot of the scenario andthe novelty threshold obtained during training is presentedin Figure 6

As can be appreciated the new scenario lays outside thenovelty threshold so it is successfully detected as noveltyOnce a novel scenario is detected and identified as a fault bythe user it is incorporated to the database to consider it as partof the known scenarios and the novelty model is retrainedwithout changing the features to include this informationFigure 7 shows the feature space after the novelty modelis trained using healthy and 1198651 data as part of the knownscenarios

Once the model is retrained the third scenario 1198652 isintroduced The visual representation of the test is presentedin Figure 8 this scenario is not detected as novel because only50 of the measurements are labeled as novel by the model

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

KnownF2

LS fe

atur

e 2 k

urto

sis Y

Figure 8 Evaluation of the fault scenario 1198652 The novelty model istrained employing data from healthy and 1198651 scenarios

A similar result is obtained when the novelty model istrained using healthy and 1198652 data and is tested with data of1198651 A summary of the results of novelty detectionmaintainingthe same features obtained during initialization is shown inTable 1

As it can be seen in Table 1 using the reduced set obtainedduring initialization is easy to detect the novel scenarioswhen there is only information of the healthy condition butwhen a novel scenario is included in the database and thenoveltymodel is retrained the reduced set of features initiallyobtained does not necessarily provide a good representationto detect a new scenario during test

To improve these results the methodology presented inthis work proposes to evaluate again the feature reductionmodule each time a retraining is applied Following theoutline presented for results and parting from the first retrainwhere the scenario 1198651 is included in the database the featurereduction module is applied again but this time includinginformation from healthy condition and 1198651 scenario Sincetwo scenarios are taken into consideration and the labels areknown a supervised approach can be applied in this case aFisher Score ranking for feature reduction is employed Thenovelty model using the initial set of features and the noveltymodel after retraining with the new reduced set of featuresare shown in Figure 9

The new reduced set of features is composed of the RMSof the 119910-axis and the Kurtosis of the 119911-axis as can be appre-ciated the new set of features present a more discriminativedistribution of the three scenarios considered At the laststep (Figure 9(d)) when 1198652 scenario is included the FisherScore still ranked the same features of the last retraining(Figure 9(b)) as the highest yet it would be possible thata different set of features would be obtained The resultsobtained demonstrate the advantages of including the FisherScore reduction module to the retraining procedure the newdistribution of the scenarios avoided an overlapping of the

Shock and Vibration 9

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

0

005

01

015

02

KnownF1

LS fe

atur

e 2 k

urto

sis Y

(a)

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus001

0

001

002

003

004

005

006

007

008

Known

FS fe

atur

e 2 k

urto

sis Z

(b)

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

FS fe

atur

e 2 k

urto

sis Z

minus5

minus4

minus3

minus2

minus1

0

1

2

3

KnownF2

(c)

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus0005

0

0005

001

0015

002

0025

003

0035

Known

FS fe

atur

e 2 k

urto

sis Z

(d)

Figure 9 Process of evaluation and retraining employing the methodology proposed (a) Evaluation of the fault scenario 1198651 (b) Retrainingof the novelty model and reformulation of the reduced set of features including 1198651 (c) Evaluation of the fault scenario 1198652 (d) Retraining ofthe novelty model and reformulation of the reduced set of features including 1198652Table 1 Performance of the novelty detection using only healthy class data to reduce the number of features where DR stands fordimensionality reduction Different scenarios are included according to the information available to train and test the novelty model

DRPerformance using OC-SVM ()

Known Test Known Test Known Test119867 1198651 + 1198652 119867 + 1198651 1198652 119867 + 1198652 1198651LS 954 (plusmn11) 576 (plusmn67) 435 (plusmn64)

new scenario under test therefore the noveltymodel success-fully detected as novelty all measurements corresponding tothe 1198652 scenario

It is worth mentioning that if initially the scenario 1198652is used for training and the scenario 1198651 is used for testingthe new set of features obtained could be different from theaforementionedThe results achieved fromboth scenarios are

shown in Table 2 which also includes a comparison of theresults obtained employing PCA LDA and Laplacian Scoredimensionality reduction techniques in the retraining stepinstead of the Fisher Score proposed

Regarding the classical feature extraction techniquesPCA and LDA the test scenario is not identified as novelin both cases The Fisher Score and the Laplacian Score

10 Shock and Vibration

Table 2 Performance of the novelty detection employing a reduc-tion of features during retraining Different scenarios are includedaccording to the information available to train and test the noveltymodel

DRPerformance using OC-SVM ()

Known Test Known Test119867 + 1198651 1198652 119867 + 1198652 1198651PCA 590 (plusmn52) 642 (plusmn51)LDA 474 (plusmn61) 512 (plusmn61)LS 843 (plusmn24) 100 (plusmn00)Fisher S 100 (plusmn00) 100 (plusmn00)successfully identified the new scenarios as novel since thepercentage obtained in both cases is higher than the 75 pre-defined threshold in the methodology to activate the alarmBoth techniques achieved high scores but still the FisherScore provided a more appropriate selection of features

As mentioned in Section 2 LDA is a feature extractiontechnique based on the Fisher discriminant coefficient sosimilar results between Fisher Score ranking and LDA areexpected however employing Fisher Score ranking scoreachieved a better result this can be caused because the testconsists of novel scenarios and LDA finds the directions onthe feature space specialized for the two supervised scenariosemployed during training meanwhile Fisher Score rankingprovides a more general approach by selecting features

To test the robustness of the Laplacian Score and FisherScore approaches a comparative test is performed where theset of features is increased from 10 to 15 and by varyingthe number of reduced features selected The five featuresincluded in the original set are obtained from the119909-axis of theaccelerometer monitored where the features calculated arepresented in the aforementioned statistical time-frequencyfeatures these features were discarded initially because theyare not part of the perpendicular plane of the motor anddo not contribute significantly to the monitoring in factincluding these features could affect the performance of thefeature reduction and novelty detection modules The resultsare presented in Table 3

The features obtained from the Fisher Score still present abetter distribution to detect the new scenariosThe LaplacianScore performance is affected when irrelevant features areincluded in the feature set but increases when more dimen-sions are taken into consideration Since the objective of theFisher Score is not topology preservation contrary to LS itis capable of discarding all the irrelevant features that wereincluded and the performance is not affected

5 Conclusions

This work proposes an adaptive novelty detection methodol-ogy based on vibration analysis for the condition monitoringand diagnosis of the components of a kinematic chain Themethodology is based on the acquisition of vibration signalsthat are generated in the kinematic chain along with anadequate signal processing to extract features to characterizethe components and an adaptive novelty detection model to

Table 3 Performance of the novelty detection increasing the num-ber of initial features from 10 to 15 and varying the number of thereduced set of features

DRPerformance varying the number of features

reduced ()2 3 4

LS 590 (plusmn41) 643 (plusmn33) 886 (plusmn25)Fisher S 100 (plusmn00) 100 (plusmn00) 100 (plusmn00)detect anomalies The method is composed of two sequentialstages an offline stage to initialize and retrain the modulesand an online stage to continuously assess the conditionof the machine During initialization the model is trainedemploying only information from the machine workingunder healthy condition and two additional faulty scenariosare introduced to test the performance of the method underunknown operations

The adaptive novelty detection approach successfullydetected both novel scenarios and the model incorporatedthe information to avoid generating alarms if the same faultis detected A comparison between the proposed methodand classical dimensionality reduction approaches highlightsthe limitations of maintaining a static set of features duringmonitoring instead of reformulating the feature reductionmodule once new information is available In this particularstudy employing features reduced by Laplacian Score andFisher Score obtained similar results nevertheless it doesnot imply that a similar outcome will be present during theanalysis of other faults Fisher Score is encouraged to beemployed in this methodology rather than maintaining theLaplacian Score approach due to the similarity between themethod objective and the objective function of Fisher Scorein both the ideal case is to find the features to maximizethe distance between scenarios while maintaining compactclusters A specific comparison of performances between LSand Fisher Score was also included in which the Fisher Scoreobtained better results when irrelevant features are includedin the original set of features and when the dimensionalityof the reduced set is increased this highlights the advantagesand robustness of the feature selection approach by FisherScore compared to the LS

From an industrial perspective two aspects should bementioned First dealing with an identical operating sce-nario (ie healthy) the variation of the working conditions(speed and torque patterns) will affect the resulting valueof the estimated numerical features and consequently itsrepresentation in the feature space used for novelty modeldesign Thus the introduction of the working condition asan additional degree of freedom in the novelty detectionscheme implies the increase of the dimensional space thatthe novelty detectionmodel is managingThis fact implies anincrease of the risk of data overlapping (different scenariosunder different operating conditions showing similar featurevalues) and also an increase of complexity for boundariesdefinition during the novelty modelling (data spread) Thusthe proposed method is not limited to stationary conditionsbut to the repetition of the pattern of working condition used

Shock and Vibration 11

to train the model Indeed in practical industrial applicationwhere cyclostationary processes take place the proposedmethod should be executed during the same period (the sameworking conditions) in order to reduce data variability andincrease novelty performance In case of multiple workingconditions consideration parallel novelty model structurescould be generated in order to activate each one dependingon the corresponding working condition

Second the integration of additional physicalmagnitudesas stator currents temperatures and others represents atrendy approach in electromechanical fault diagnosis inorder to improve fault diagnosis resolution and perfor-mance The corresponding data fusion scheme implies ahigh-dimensional feature space that is multiple numericalfeatures estimated from multiple physical magnitudes con-sidered In this point it must be noted that the presentedmethodology proposes the reevaluation of the feature reduc-tionmodule each time a retraining is appliedThus the largerthe available pool of features is the higher the potentiality ofthe proposedmethod to increase the performance during thedata characterization is

Also the proposed method can be extended andimproved for further development this improvement couldinclude a diagnosis method to not only detect anomalies inthe kinematic chain but also identify the fault causing theabnormal behavior In this sense future work will include aspecific comparison of broken bar fault detection capabilitiesbetween the proposedmethodology and classical approaches

Competing Interests

The authors declare that they have no competing interests

Acknowledgments

This research was partially supported by CONACyT schol-arships Grants 278033 and 313604 and also by the SpanishMinistry of Education Culture and Sport under GrantFPU1300589 The authors would also like to thank a partialfinancial support by the Generalitat de Catalunya (GRCMCIA Grant no SGR 2014-101)

References

[1] H Henao G-A Capolino M Fernandez-Cabanas et alldquoTrends in fault diagnosis for electrical machines a review ofdiagnostic techniquesrdquo IEEE Industrial Electronics Magazinevol 8 no 2 pp 31ndash42 2014

[2] N Bayar S Darmoul S Hajri-Gabouj and H Pierreval ldquoFaultdetection diagnosis and recovery using Artificial Immune Sys-tems a reviewrdquoEngineeringApplications of Artificial Intelligencevol 46 pp 43ndash57 2015

[3] SWang JWan D Zhang D Li and C Zhang ldquoTowards smartfactory for Industry 40 a self-organized multi-agent systemwith big data based feedback and coordinationrdquo ComputerNetworks vol 101 pp 158ndash168 2016

[4] M A F Pimentel D A Clifton L Clifton and L TarassenkoldquoA review of novelty detectionrdquo Signal Processing vol 99 pp215ndash249 2014

[5] R Langone O Mauricio Agudelo B De Moor and J AK Suykens ldquoIncremental kernel spectral clustering for online

learning of non-stationary datardquo Neurocomputing vol 139 pp246ndash260 2014

[6] M-H Chen P-C Chang and J-L Wu ldquoA population-basedincremental learning approach with artificial immune systemfor network intrusion detectionrdquo Engineering Applications ofArtificial Intelligence vol 51 pp 171ndash181 2016

[7] S Khomfoi and L M Tolbert ldquoFault diagnosis and reconfigu-ration for multilevel inverter drive using AI-based techniquesrdquoIEEE Transactions on Industrial Electronics vol 54 no 6 pp2954ndash2968 2007

[8] A Bellini F Filippetti C Tassoni and G-A CapolinoldquoAdvances in diagnostic techniques for induction machinesrdquoIEEE Transactions on Industrial Electronics vol 55 no 12 pp4109ndash4126 2008

[9] V T Tran and B-S Yang ldquoAn intelligent condition-basedmaintenance platform for rotating machineryrdquo Expert Systemswith Applications vol 39 no 3 pp 2977ndash2988 2012

[10] P A Delgado-Arredondo A Garcia-Perez D Morinigo-Soteloet al ldquoComparative study of time-frequency decompositiontechniques for fault detection in induction motors using vibra-tion analysis during startup transientrdquo Shock and Vibration vol2015 Article ID 708034 14 pages 2015

[11] M D Prieto G Cirrincione A G Espinosa J A Ortegaand H Henao ldquoBearing fault detection by a novel condition-monitoring scheme based on statistical-time features and neu-ral networksrdquo IEEE Transactions on Industrial Electronics vol60 no 8 pp 3398ndash3407 2013

[12] P Perner ldquoConcepts for novelty detection and handling basedon a case-based reasoning process schemerdquo Engineering Appli-cations of Artificial Intelligence vol 22 no 1 pp 86ndash91 2009

[13] C Englund and A Verikas ldquoA hybrid approach to outlier detec-tion in the offset lithographic printing processrdquo EngineeringApplications of Artificial Intelligence vol 18 no 6 pp 759ndash7682005

[14] H Garces and D Sbarbaro ldquoOutliers detection in environmen-tal monitoring databasesrdquo Engineering Applications of ArtificialIntelligence vol 24 no 2 pp 341ndash349 2011

[15] S Ma S M Li and Y P Xiong ldquoUncertainty reduced noveltydetection approach applied to rotatingmachinery for conditionmonitoringrdquo Shock and Vibration vol 2015 Article ID 73721310 pages 2015

[16] J McBain and M Timusk ldquoFeature extraction for noveltydetection as applied to fault detection in machineryrdquo PatternRecognition Letters vol 32 no 7 pp 1054ndash1061 2011

[17] S Wang J Yub E Lapirac and J Lee ldquoA modified supportvector data description based novelty detection approach formachinery componentsrdquo Applied Soft Computing vol 13 no 2pp 1193ndash1205 2013

[18] D P Filev R B Chinnam F Tseng and P Baruah ldquoAn indus-trial strength novelty detection framework for autonomousequipment monitoring and diagnosticsrdquo IEEE Transactions onIndustrial Informatics vol 6 no 4 pp 767ndash779 2010

[19] B S J Costa P P Angelov and L A Guedes ldquoFully unsu-pervised fault detection and identification based on recursivedensity estimation and self-evolving cloud-based classifierrdquoNeurocomputing vol 150 pp 289ndash303 2015

[20] B Scholkopf R C Williamson A J Smola J Shawe-Taylorand J C Platt ldquoSupport vector method for novelty detectionrdquoin Neural Information Processing Systems vol 12 pp 582ndash5881999

12 Shock and Vibration

[21] D M J Tax and R P W Duin ldquoSupport vector domaindescriptionrdquo Pattern Recognition Letters vol 20 no 11ndash13 pp1191ndash1199 1999

[22] S D Villalba and P Cunningham ldquoAn evaluation of dimensionreduction techniques for one-class classificationrdquo ArtificialIntelligence Review vol 27 no 4 pp 273ndash294 2007

[23] I T Jolliffe Principal Component Analysis Springer New YorkNY USA 2nd edition 1986

[24] X He D Cai and P Niyogi ldquoLaplacian score for featureselectionrdquo inAdvances in Neural Information Processing Systems18 pp 507ndash514 MIT Press 2005

[25] A J Izenman ldquoLinear discriminant analysisrdquo in Modern Mul-tivariate Statistical Techniques A J Izenman Ed pp 237ndash280Springer New York NY USA 2008

[26] Q Gu Z Li and J Han ldquoGeneralized fisher score for featureselectionrdquo httpsarxivorgabs12023725

[27] Z Gao C Cecati and S X Ding ldquoA survey of fault diagnosisand fault-tolerant techniquesmdashpart II fault diagnosis withknowledge-based and hybridactive approachesrdquo IEEE Trans-actions on Industrial Electronics vol 62 no 6 pp 3768ndash37742015

[28] A E Lazzaretti D M J Tax H Vieira Neto and V HFerreira ldquoNovelty detection and multi-class classification inpower distribution voltage waveformsrdquo Expert Systems withApplications vol 45 pp 322ndash330 2016

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Page 7: Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

Shock and Vibration 7

(a) (b)

Figure 4 Detail of the faults produced in the test bench (a) corresponds to the 12 broken rotor bar and (b) to one broken rotor bar

22 of the section of the rotor bar and the full broken baris produced by a through-hole with a diameter of 6mm anda depth of 14mm which corresponds to the complete sectionof the rotor bar

The information stored from the kinematic chain consistsof an acquisition of 60 seconds of themachine working underthe three scenarios mentioned each acquisition is segmentedin 30 parts of 2 seconds and a set of features is calculatedfrom the 30 segmented measurements Since two axes aretaken into consideration a total of ten features are calculatedfrom each segmented acquisition of the machine workingunder the different scenarios mentioned The first step ofthe methodology is the offline stage where a reduced set offeatures is obtained and the novelty model is designed

42 Parameter Selection Regarding the Laplacian Score con-figuration a simple approach is followed for parameter tun-ing that is a value of 119896 = 3 is used for constructing the adja-cency graph and a ldquosimple mindedrdquo weighting approach isfollowed Since the proposed approach is compared to differ-ent feature reduction modules selecting generic parameterssettings is useful for the purpose of evaluation but ignores thatthere may be dependencies between the feature reductionmodel and the novelty model Regarding the design of thenovelty model the kernel used is the Gaussian and the valueof the configuration parameter is tuned tominimize the errorin the validation In all experiments 80 of the samplesare used for training and 20 for validation To train andadjust the parameter of the novelty model a fivefold cross-validation is used

43 Results and Discussions In order to highlight the contri-butions andmotivation of this work the outline of the resultsis presented as follows first a test is performed by a classicalapproach then the proposedmethodology is applied and theresults are compared The classical static approach impliesconserving the reduced features set obtained at the offline ini-tialization stage meanwhile the proposed dynamic approachimplies a possible reformulation of the reduced features setduring the retraining stage

Healthy

0

005

01

015

02

minus2 minus1minus3 1 2 30LS feature 1 RMS Y

minus3

minus2

minus1

0

1

2

3LS

feat

ure 2

kur

tosis

Y

Figure 5 Initial novelty model representation Limit of the noveltythreshold sdot sdot sdot and measurements used to train the model lowast

During the initialization stage the reduced set of featuresselected bymeans of the Laplacian Score ranking is composedof the RMS of the 119910-axis and the Kurtosis of the 119910-axisThe OC-SVM model is trained employing healthy conditiondata The resulting OC-SVM during training is shown inFigure 5 The marks lowast in the figure represent measurementsof the machine used to train the model in this case andin the subsequence figures only 1-fold of the fivefold cross-validation is displayedThe dotted line represents the noveltythreshold value all data inside the boundaries of the dottedline is considered normal The contour plot represents thenovelty score evaluation over different regions of the featuremap

Once the offline initialization stage is finished the onlinestage follows that is new measurements are obtained toassess the condition of the monitored machine To giverobustness to the novelty detection and avoid false alarms abatch consisting of 30 measurements is evaluated if 75 of

8 Shock and Vibration

0

005

01

015

02

minus4

minus3

minus2

minus1

0

1

2

3

4

5

60 2 4 8 10minus2minus4LS feature 1 RMS Y

KnownF1

LS fe

atur

e 2 k

urto

sis Y

Figure 6 Evaluation of the fault scenario 1198651 The novelty model istrained employing data from healthy operation condition

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

minus001

0

001

002

003

004

005

006

007

008

Known

LS fe

atur

e 2 k

urto

sis Y

Figure 7 Contour plot of the novelty model after including 1198651

the analyzed measurements are evaluated as novelty then thealarm is triggered Next the 1198651 scenario is presented to testthe performance of the model The plot of the scenario andthe novelty threshold obtained during training is presentedin Figure 6

As can be appreciated the new scenario lays outside thenovelty threshold so it is successfully detected as noveltyOnce a novel scenario is detected and identified as a fault bythe user it is incorporated to the database to consider it as partof the known scenarios and the novelty model is retrainedwithout changing the features to include this informationFigure 7 shows the feature space after the novelty modelis trained using healthy and 1198651 data as part of the knownscenarios

Once the model is retrained the third scenario 1198652 isintroduced The visual representation of the test is presentedin Figure 8 this scenario is not detected as novel because only50 of the measurements are labeled as novel by the model

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

KnownF2

LS fe

atur

e 2 k

urto

sis Y

Figure 8 Evaluation of the fault scenario 1198652 The novelty model istrained employing data from healthy and 1198651 scenarios

A similar result is obtained when the novelty model istrained using healthy and 1198652 data and is tested with data of1198651 A summary of the results of novelty detectionmaintainingthe same features obtained during initialization is shown inTable 1

As it can be seen in Table 1 using the reduced set obtainedduring initialization is easy to detect the novel scenarioswhen there is only information of the healthy condition butwhen a novel scenario is included in the database and thenoveltymodel is retrained the reduced set of features initiallyobtained does not necessarily provide a good representationto detect a new scenario during test

To improve these results the methodology presented inthis work proposes to evaluate again the feature reductionmodule each time a retraining is applied Following theoutline presented for results and parting from the first retrainwhere the scenario 1198651 is included in the database the featurereduction module is applied again but this time includinginformation from healthy condition and 1198651 scenario Sincetwo scenarios are taken into consideration and the labels areknown a supervised approach can be applied in this case aFisher Score ranking for feature reduction is employed Thenovelty model using the initial set of features and the noveltymodel after retraining with the new reduced set of featuresare shown in Figure 9

The new reduced set of features is composed of the RMSof the 119910-axis and the Kurtosis of the 119911-axis as can be appre-ciated the new set of features present a more discriminativedistribution of the three scenarios considered At the laststep (Figure 9(d)) when 1198652 scenario is included the FisherScore still ranked the same features of the last retraining(Figure 9(b)) as the highest yet it would be possible thata different set of features would be obtained The resultsobtained demonstrate the advantages of including the FisherScore reduction module to the retraining procedure the newdistribution of the scenarios avoided an overlapping of the

Shock and Vibration 9

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

0

005

01

015

02

KnownF1

LS fe

atur

e 2 k

urto

sis Y

(a)

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus001

0

001

002

003

004

005

006

007

008

Known

FS fe

atur

e 2 k

urto

sis Z

(b)

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

FS fe

atur

e 2 k

urto

sis Z

minus5

minus4

minus3

minus2

minus1

0

1

2

3

KnownF2

(c)

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus0005

0

0005

001

0015

002

0025

003

0035

Known

FS fe

atur

e 2 k

urto

sis Z

(d)

Figure 9 Process of evaluation and retraining employing the methodology proposed (a) Evaluation of the fault scenario 1198651 (b) Retrainingof the novelty model and reformulation of the reduced set of features including 1198651 (c) Evaluation of the fault scenario 1198652 (d) Retraining ofthe novelty model and reformulation of the reduced set of features including 1198652Table 1 Performance of the novelty detection using only healthy class data to reduce the number of features where DR stands fordimensionality reduction Different scenarios are included according to the information available to train and test the novelty model

DRPerformance using OC-SVM ()

Known Test Known Test Known Test119867 1198651 + 1198652 119867 + 1198651 1198652 119867 + 1198652 1198651LS 954 (plusmn11) 576 (plusmn67) 435 (plusmn64)

new scenario under test therefore the noveltymodel success-fully detected as novelty all measurements corresponding tothe 1198652 scenario

It is worth mentioning that if initially the scenario 1198652is used for training and the scenario 1198651 is used for testingthe new set of features obtained could be different from theaforementionedThe results achieved fromboth scenarios are

shown in Table 2 which also includes a comparison of theresults obtained employing PCA LDA and Laplacian Scoredimensionality reduction techniques in the retraining stepinstead of the Fisher Score proposed

Regarding the classical feature extraction techniquesPCA and LDA the test scenario is not identified as novelin both cases The Fisher Score and the Laplacian Score

10 Shock and Vibration

Table 2 Performance of the novelty detection employing a reduc-tion of features during retraining Different scenarios are includedaccording to the information available to train and test the noveltymodel

DRPerformance using OC-SVM ()

Known Test Known Test119867 + 1198651 1198652 119867 + 1198652 1198651PCA 590 (plusmn52) 642 (plusmn51)LDA 474 (plusmn61) 512 (plusmn61)LS 843 (plusmn24) 100 (plusmn00)Fisher S 100 (plusmn00) 100 (plusmn00)successfully identified the new scenarios as novel since thepercentage obtained in both cases is higher than the 75 pre-defined threshold in the methodology to activate the alarmBoth techniques achieved high scores but still the FisherScore provided a more appropriate selection of features

As mentioned in Section 2 LDA is a feature extractiontechnique based on the Fisher discriminant coefficient sosimilar results between Fisher Score ranking and LDA areexpected however employing Fisher Score ranking scoreachieved a better result this can be caused because the testconsists of novel scenarios and LDA finds the directions onthe feature space specialized for the two supervised scenariosemployed during training meanwhile Fisher Score rankingprovides a more general approach by selecting features

To test the robustness of the Laplacian Score and FisherScore approaches a comparative test is performed where theset of features is increased from 10 to 15 and by varyingthe number of reduced features selected The five featuresincluded in the original set are obtained from the119909-axis of theaccelerometer monitored where the features calculated arepresented in the aforementioned statistical time-frequencyfeatures these features were discarded initially because theyare not part of the perpendicular plane of the motor anddo not contribute significantly to the monitoring in factincluding these features could affect the performance of thefeature reduction and novelty detection modules The resultsare presented in Table 3

The features obtained from the Fisher Score still present abetter distribution to detect the new scenariosThe LaplacianScore performance is affected when irrelevant features areincluded in the feature set but increases when more dimen-sions are taken into consideration Since the objective of theFisher Score is not topology preservation contrary to LS itis capable of discarding all the irrelevant features that wereincluded and the performance is not affected

5 Conclusions

This work proposes an adaptive novelty detection methodol-ogy based on vibration analysis for the condition monitoringand diagnosis of the components of a kinematic chain Themethodology is based on the acquisition of vibration signalsthat are generated in the kinematic chain along with anadequate signal processing to extract features to characterizethe components and an adaptive novelty detection model to

Table 3 Performance of the novelty detection increasing the num-ber of initial features from 10 to 15 and varying the number of thereduced set of features

DRPerformance varying the number of features

reduced ()2 3 4

LS 590 (plusmn41) 643 (plusmn33) 886 (plusmn25)Fisher S 100 (plusmn00) 100 (plusmn00) 100 (plusmn00)detect anomalies The method is composed of two sequentialstages an offline stage to initialize and retrain the modulesand an online stage to continuously assess the conditionof the machine During initialization the model is trainedemploying only information from the machine workingunder healthy condition and two additional faulty scenariosare introduced to test the performance of the method underunknown operations

The adaptive novelty detection approach successfullydetected both novel scenarios and the model incorporatedthe information to avoid generating alarms if the same faultis detected A comparison between the proposed methodand classical dimensionality reduction approaches highlightsthe limitations of maintaining a static set of features duringmonitoring instead of reformulating the feature reductionmodule once new information is available In this particularstudy employing features reduced by Laplacian Score andFisher Score obtained similar results nevertheless it doesnot imply that a similar outcome will be present during theanalysis of other faults Fisher Score is encouraged to beemployed in this methodology rather than maintaining theLaplacian Score approach due to the similarity between themethod objective and the objective function of Fisher Scorein both the ideal case is to find the features to maximizethe distance between scenarios while maintaining compactclusters A specific comparison of performances between LSand Fisher Score was also included in which the Fisher Scoreobtained better results when irrelevant features are includedin the original set of features and when the dimensionalityof the reduced set is increased this highlights the advantagesand robustness of the feature selection approach by FisherScore compared to the LS

From an industrial perspective two aspects should bementioned First dealing with an identical operating sce-nario (ie healthy) the variation of the working conditions(speed and torque patterns) will affect the resulting valueof the estimated numerical features and consequently itsrepresentation in the feature space used for novelty modeldesign Thus the introduction of the working condition asan additional degree of freedom in the novelty detectionscheme implies the increase of the dimensional space thatthe novelty detectionmodel is managingThis fact implies anincrease of the risk of data overlapping (different scenariosunder different operating conditions showing similar featurevalues) and also an increase of complexity for boundariesdefinition during the novelty modelling (data spread) Thusthe proposed method is not limited to stationary conditionsbut to the repetition of the pattern of working condition used

Shock and Vibration 11

to train the model Indeed in practical industrial applicationwhere cyclostationary processes take place the proposedmethod should be executed during the same period (the sameworking conditions) in order to reduce data variability andincrease novelty performance In case of multiple workingconditions consideration parallel novelty model structurescould be generated in order to activate each one dependingon the corresponding working condition

Second the integration of additional physicalmagnitudesas stator currents temperatures and others represents atrendy approach in electromechanical fault diagnosis inorder to improve fault diagnosis resolution and perfor-mance The corresponding data fusion scheme implies ahigh-dimensional feature space that is multiple numericalfeatures estimated from multiple physical magnitudes con-sidered In this point it must be noted that the presentedmethodology proposes the reevaluation of the feature reduc-tionmodule each time a retraining is appliedThus the largerthe available pool of features is the higher the potentiality ofthe proposedmethod to increase the performance during thedata characterization is

Also the proposed method can be extended andimproved for further development this improvement couldinclude a diagnosis method to not only detect anomalies inthe kinematic chain but also identify the fault causing theabnormal behavior In this sense future work will include aspecific comparison of broken bar fault detection capabilitiesbetween the proposedmethodology and classical approaches

Competing Interests

The authors declare that they have no competing interests

Acknowledgments

This research was partially supported by CONACyT schol-arships Grants 278033 and 313604 and also by the SpanishMinistry of Education Culture and Sport under GrantFPU1300589 The authors would also like to thank a partialfinancial support by the Generalitat de Catalunya (GRCMCIA Grant no SGR 2014-101)

References

[1] H Henao G-A Capolino M Fernandez-Cabanas et alldquoTrends in fault diagnosis for electrical machines a review ofdiagnostic techniquesrdquo IEEE Industrial Electronics Magazinevol 8 no 2 pp 31ndash42 2014

[2] N Bayar S Darmoul S Hajri-Gabouj and H Pierreval ldquoFaultdetection diagnosis and recovery using Artificial Immune Sys-tems a reviewrdquoEngineeringApplications of Artificial Intelligencevol 46 pp 43ndash57 2015

[3] SWang JWan D Zhang D Li and C Zhang ldquoTowards smartfactory for Industry 40 a self-organized multi-agent systemwith big data based feedback and coordinationrdquo ComputerNetworks vol 101 pp 158ndash168 2016

[4] M A F Pimentel D A Clifton L Clifton and L TarassenkoldquoA review of novelty detectionrdquo Signal Processing vol 99 pp215ndash249 2014

[5] R Langone O Mauricio Agudelo B De Moor and J AK Suykens ldquoIncremental kernel spectral clustering for online

learning of non-stationary datardquo Neurocomputing vol 139 pp246ndash260 2014

[6] M-H Chen P-C Chang and J-L Wu ldquoA population-basedincremental learning approach with artificial immune systemfor network intrusion detectionrdquo Engineering Applications ofArtificial Intelligence vol 51 pp 171ndash181 2016

[7] S Khomfoi and L M Tolbert ldquoFault diagnosis and reconfigu-ration for multilevel inverter drive using AI-based techniquesrdquoIEEE Transactions on Industrial Electronics vol 54 no 6 pp2954ndash2968 2007

[8] A Bellini F Filippetti C Tassoni and G-A CapolinoldquoAdvances in diagnostic techniques for induction machinesrdquoIEEE Transactions on Industrial Electronics vol 55 no 12 pp4109ndash4126 2008

[9] V T Tran and B-S Yang ldquoAn intelligent condition-basedmaintenance platform for rotating machineryrdquo Expert Systemswith Applications vol 39 no 3 pp 2977ndash2988 2012

[10] P A Delgado-Arredondo A Garcia-Perez D Morinigo-Soteloet al ldquoComparative study of time-frequency decompositiontechniques for fault detection in induction motors using vibra-tion analysis during startup transientrdquo Shock and Vibration vol2015 Article ID 708034 14 pages 2015

[11] M D Prieto G Cirrincione A G Espinosa J A Ortegaand H Henao ldquoBearing fault detection by a novel condition-monitoring scheme based on statistical-time features and neu-ral networksrdquo IEEE Transactions on Industrial Electronics vol60 no 8 pp 3398ndash3407 2013

[12] P Perner ldquoConcepts for novelty detection and handling basedon a case-based reasoning process schemerdquo Engineering Appli-cations of Artificial Intelligence vol 22 no 1 pp 86ndash91 2009

[13] C Englund and A Verikas ldquoA hybrid approach to outlier detec-tion in the offset lithographic printing processrdquo EngineeringApplications of Artificial Intelligence vol 18 no 6 pp 759ndash7682005

[14] H Garces and D Sbarbaro ldquoOutliers detection in environmen-tal monitoring databasesrdquo Engineering Applications of ArtificialIntelligence vol 24 no 2 pp 341ndash349 2011

[15] S Ma S M Li and Y P Xiong ldquoUncertainty reduced noveltydetection approach applied to rotatingmachinery for conditionmonitoringrdquo Shock and Vibration vol 2015 Article ID 73721310 pages 2015

[16] J McBain and M Timusk ldquoFeature extraction for noveltydetection as applied to fault detection in machineryrdquo PatternRecognition Letters vol 32 no 7 pp 1054ndash1061 2011

[17] S Wang J Yub E Lapirac and J Lee ldquoA modified supportvector data description based novelty detection approach formachinery componentsrdquo Applied Soft Computing vol 13 no 2pp 1193ndash1205 2013

[18] D P Filev R B Chinnam F Tseng and P Baruah ldquoAn indus-trial strength novelty detection framework for autonomousequipment monitoring and diagnosticsrdquo IEEE Transactions onIndustrial Informatics vol 6 no 4 pp 767ndash779 2010

[19] B S J Costa P P Angelov and L A Guedes ldquoFully unsu-pervised fault detection and identification based on recursivedensity estimation and self-evolving cloud-based classifierrdquoNeurocomputing vol 150 pp 289ndash303 2015

[20] B Scholkopf R C Williamson A J Smola J Shawe-Taylorand J C Platt ldquoSupport vector method for novelty detectionrdquoin Neural Information Processing Systems vol 12 pp 582ndash5881999

12 Shock and Vibration

[21] D M J Tax and R P W Duin ldquoSupport vector domaindescriptionrdquo Pattern Recognition Letters vol 20 no 11ndash13 pp1191ndash1199 1999

[22] S D Villalba and P Cunningham ldquoAn evaluation of dimensionreduction techniques for one-class classificationrdquo ArtificialIntelligence Review vol 27 no 4 pp 273ndash294 2007

[23] I T Jolliffe Principal Component Analysis Springer New YorkNY USA 2nd edition 1986

[24] X He D Cai and P Niyogi ldquoLaplacian score for featureselectionrdquo inAdvances in Neural Information Processing Systems18 pp 507ndash514 MIT Press 2005

[25] A J Izenman ldquoLinear discriminant analysisrdquo in Modern Mul-tivariate Statistical Techniques A J Izenman Ed pp 237ndash280Springer New York NY USA 2008

[26] Q Gu Z Li and J Han ldquoGeneralized fisher score for featureselectionrdquo httpsarxivorgabs12023725

[27] Z Gao C Cecati and S X Ding ldquoA survey of fault diagnosisand fault-tolerant techniquesmdashpart II fault diagnosis withknowledge-based and hybridactive approachesrdquo IEEE Trans-actions on Industrial Electronics vol 62 no 6 pp 3768ndash37742015

[28] A E Lazzaretti D M J Tax H Vieira Neto and V HFerreira ldquoNovelty detection and multi-class classification inpower distribution voltage waveformsrdquo Expert Systems withApplications vol 45 pp 322ndash330 2016

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Page 8: Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

8 Shock and Vibration

0

005

01

015

02

minus4

minus3

minus2

minus1

0

1

2

3

4

5

60 2 4 8 10minus2minus4LS feature 1 RMS Y

KnownF1

LS fe

atur

e 2 k

urto

sis Y

Figure 6 Evaluation of the fault scenario 1198651 The novelty model istrained employing data from healthy operation condition

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

minus001

0

001

002

003

004

005

006

007

008

Known

LS fe

atur

e 2 k

urto

sis Y

Figure 7 Contour plot of the novelty model after including 1198651

the analyzed measurements are evaluated as novelty then thealarm is triggered Next the 1198651 scenario is presented to testthe performance of the model The plot of the scenario andthe novelty threshold obtained during training is presentedin Figure 6

As can be appreciated the new scenario lays outside thenovelty threshold so it is successfully detected as noveltyOnce a novel scenario is detected and identified as a fault bythe user it is incorporated to the database to consider it as partof the known scenarios and the novelty model is retrainedwithout changing the features to include this informationFigure 7 shows the feature space after the novelty modelis trained using healthy and 1198651 data as part of the knownscenarios

Once the model is retrained the third scenario 1198652 isintroduced The visual representation of the test is presentedin Figure 8 this scenario is not detected as novel because only50 of the measurements are labeled as novel by the model

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

KnownF2

LS fe

atur

e 2 k

urto

sis Y

Figure 8 Evaluation of the fault scenario 1198652 The novelty model istrained employing data from healthy and 1198651 scenarios

A similar result is obtained when the novelty model istrained using healthy and 1198652 data and is tested with data of1198651 A summary of the results of novelty detectionmaintainingthe same features obtained during initialization is shown inTable 1

As it can be seen in Table 1 using the reduced set obtainedduring initialization is easy to detect the novel scenarioswhen there is only information of the healthy condition butwhen a novel scenario is included in the database and thenoveltymodel is retrained the reduced set of features initiallyobtained does not necessarily provide a good representationto detect a new scenario during test

To improve these results the methodology presented inthis work proposes to evaluate again the feature reductionmodule each time a retraining is applied Following theoutline presented for results and parting from the first retrainwhere the scenario 1198651 is included in the database the featurereduction module is applied again but this time includinginformation from healthy condition and 1198651 scenario Sincetwo scenarios are taken into consideration and the labels areknown a supervised approach can be applied in this case aFisher Score ranking for feature reduction is employed Thenovelty model using the initial set of features and the noveltymodel after retraining with the new reduced set of featuresare shown in Figure 9

The new reduced set of features is composed of the RMSof the 119910-axis and the Kurtosis of the 119911-axis as can be appre-ciated the new set of features present a more discriminativedistribution of the three scenarios considered At the laststep (Figure 9(d)) when 1198652 scenario is included the FisherScore still ranked the same features of the last retraining(Figure 9(b)) as the highest yet it would be possible thata different set of features would be obtained The resultsobtained demonstrate the advantages of including the FisherScore reduction module to the retraining procedure the newdistribution of the scenarios avoided an overlapping of the

Shock and Vibration 9

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

0

005

01

015

02

KnownF1

LS fe

atur

e 2 k

urto

sis Y

(a)

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus001

0

001

002

003

004

005

006

007

008

Known

FS fe

atur

e 2 k

urto

sis Z

(b)

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

FS fe

atur

e 2 k

urto

sis Z

minus5

minus4

minus3

minus2

minus1

0

1

2

3

KnownF2

(c)

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus0005

0

0005

001

0015

002

0025

003

0035

Known

FS fe

atur

e 2 k

urto

sis Z

(d)

Figure 9 Process of evaluation and retraining employing the methodology proposed (a) Evaluation of the fault scenario 1198651 (b) Retrainingof the novelty model and reformulation of the reduced set of features including 1198651 (c) Evaluation of the fault scenario 1198652 (d) Retraining ofthe novelty model and reformulation of the reduced set of features including 1198652Table 1 Performance of the novelty detection using only healthy class data to reduce the number of features where DR stands fordimensionality reduction Different scenarios are included according to the information available to train and test the novelty model

DRPerformance using OC-SVM ()

Known Test Known Test Known Test119867 1198651 + 1198652 119867 + 1198651 1198652 119867 + 1198652 1198651LS 954 (plusmn11) 576 (plusmn67) 435 (plusmn64)

new scenario under test therefore the noveltymodel success-fully detected as novelty all measurements corresponding tothe 1198652 scenario

It is worth mentioning that if initially the scenario 1198652is used for training and the scenario 1198651 is used for testingthe new set of features obtained could be different from theaforementionedThe results achieved fromboth scenarios are

shown in Table 2 which also includes a comparison of theresults obtained employing PCA LDA and Laplacian Scoredimensionality reduction techniques in the retraining stepinstead of the Fisher Score proposed

Regarding the classical feature extraction techniquesPCA and LDA the test scenario is not identified as novelin both cases The Fisher Score and the Laplacian Score

10 Shock and Vibration

Table 2 Performance of the novelty detection employing a reduc-tion of features during retraining Different scenarios are includedaccording to the information available to train and test the noveltymodel

DRPerformance using OC-SVM ()

Known Test Known Test119867 + 1198651 1198652 119867 + 1198652 1198651PCA 590 (plusmn52) 642 (plusmn51)LDA 474 (plusmn61) 512 (plusmn61)LS 843 (plusmn24) 100 (plusmn00)Fisher S 100 (plusmn00) 100 (plusmn00)successfully identified the new scenarios as novel since thepercentage obtained in both cases is higher than the 75 pre-defined threshold in the methodology to activate the alarmBoth techniques achieved high scores but still the FisherScore provided a more appropriate selection of features

As mentioned in Section 2 LDA is a feature extractiontechnique based on the Fisher discriminant coefficient sosimilar results between Fisher Score ranking and LDA areexpected however employing Fisher Score ranking scoreachieved a better result this can be caused because the testconsists of novel scenarios and LDA finds the directions onthe feature space specialized for the two supervised scenariosemployed during training meanwhile Fisher Score rankingprovides a more general approach by selecting features

To test the robustness of the Laplacian Score and FisherScore approaches a comparative test is performed where theset of features is increased from 10 to 15 and by varyingthe number of reduced features selected The five featuresincluded in the original set are obtained from the119909-axis of theaccelerometer monitored where the features calculated arepresented in the aforementioned statistical time-frequencyfeatures these features were discarded initially because theyare not part of the perpendicular plane of the motor anddo not contribute significantly to the monitoring in factincluding these features could affect the performance of thefeature reduction and novelty detection modules The resultsare presented in Table 3

The features obtained from the Fisher Score still present abetter distribution to detect the new scenariosThe LaplacianScore performance is affected when irrelevant features areincluded in the feature set but increases when more dimen-sions are taken into consideration Since the objective of theFisher Score is not topology preservation contrary to LS itis capable of discarding all the irrelevant features that wereincluded and the performance is not affected

5 Conclusions

This work proposes an adaptive novelty detection methodol-ogy based on vibration analysis for the condition monitoringand diagnosis of the components of a kinematic chain Themethodology is based on the acquisition of vibration signalsthat are generated in the kinematic chain along with anadequate signal processing to extract features to characterizethe components and an adaptive novelty detection model to

Table 3 Performance of the novelty detection increasing the num-ber of initial features from 10 to 15 and varying the number of thereduced set of features

DRPerformance varying the number of features

reduced ()2 3 4

LS 590 (plusmn41) 643 (plusmn33) 886 (plusmn25)Fisher S 100 (plusmn00) 100 (plusmn00) 100 (plusmn00)detect anomalies The method is composed of two sequentialstages an offline stage to initialize and retrain the modulesand an online stage to continuously assess the conditionof the machine During initialization the model is trainedemploying only information from the machine workingunder healthy condition and two additional faulty scenariosare introduced to test the performance of the method underunknown operations

The adaptive novelty detection approach successfullydetected both novel scenarios and the model incorporatedthe information to avoid generating alarms if the same faultis detected A comparison between the proposed methodand classical dimensionality reduction approaches highlightsthe limitations of maintaining a static set of features duringmonitoring instead of reformulating the feature reductionmodule once new information is available In this particularstudy employing features reduced by Laplacian Score andFisher Score obtained similar results nevertheless it doesnot imply that a similar outcome will be present during theanalysis of other faults Fisher Score is encouraged to beemployed in this methodology rather than maintaining theLaplacian Score approach due to the similarity between themethod objective and the objective function of Fisher Scorein both the ideal case is to find the features to maximizethe distance between scenarios while maintaining compactclusters A specific comparison of performances between LSand Fisher Score was also included in which the Fisher Scoreobtained better results when irrelevant features are includedin the original set of features and when the dimensionalityof the reduced set is increased this highlights the advantagesand robustness of the feature selection approach by FisherScore compared to the LS

From an industrial perspective two aspects should bementioned First dealing with an identical operating sce-nario (ie healthy) the variation of the working conditions(speed and torque patterns) will affect the resulting valueof the estimated numerical features and consequently itsrepresentation in the feature space used for novelty modeldesign Thus the introduction of the working condition asan additional degree of freedom in the novelty detectionscheme implies the increase of the dimensional space thatthe novelty detectionmodel is managingThis fact implies anincrease of the risk of data overlapping (different scenariosunder different operating conditions showing similar featurevalues) and also an increase of complexity for boundariesdefinition during the novelty modelling (data spread) Thusthe proposed method is not limited to stationary conditionsbut to the repetition of the pattern of working condition used

Shock and Vibration 11

to train the model Indeed in practical industrial applicationwhere cyclostationary processes take place the proposedmethod should be executed during the same period (the sameworking conditions) in order to reduce data variability andincrease novelty performance In case of multiple workingconditions consideration parallel novelty model structurescould be generated in order to activate each one dependingon the corresponding working condition

Second the integration of additional physicalmagnitudesas stator currents temperatures and others represents atrendy approach in electromechanical fault diagnosis inorder to improve fault diagnosis resolution and perfor-mance The corresponding data fusion scheme implies ahigh-dimensional feature space that is multiple numericalfeatures estimated from multiple physical magnitudes con-sidered In this point it must be noted that the presentedmethodology proposes the reevaluation of the feature reduc-tionmodule each time a retraining is appliedThus the largerthe available pool of features is the higher the potentiality ofthe proposedmethod to increase the performance during thedata characterization is

Also the proposed method can be extended andimproved for further development this improvement couldinclude a diagnosis method to not only detect anomalies inthe kinematic chain but also identify the fault causing theabnormal behavior In this sense future work will include aspecific comparison of broken bar fault detection capabilitiesbetween the proposedmethodology and classical approaches

Competing Interests

The authors declare that they have no competing interests

Acknowledgments

This research was partially supported by CONACyT schol-arships Grants 278033 and 313604 and also by the SpanishMinistry of Education Culture and Sport under GrantFPU1300589 The authors would also like to thank a partialfinancial support by the Generalitat de Catalunya (GRCMCIA Grant no SGR 2014-101)

References

[1] H Henao G-A Capolino M Fernandez-Cabanas et alldquoTrends in fault diagnosis for electrical machines a review ofdiagnostic techniquesrdquo IEEE Industrial Electronics Magazinevol 8 no 2 pp 31ndash42 2014

[2] N Bayar S Darmoul S Hajri-Gabouj and H Pierreval ldquoFaultdetection diagnosis and recovery using Artificial Immune Sys-tems a reviewrdquoEngineeringApplications of Artificial Intelligencevol 46 pp 43ndash57 2015

[3] SWang JWan D Zhang D Li and C Zhang ldquoTowards smartfactory for Industry 40 a self-organized multi-agent systemwith big data based feedback and coordinationrdquo ComputerNetworks vol 101 pp 158ndash168 2016

[4] M A F Pimentel D A Clifton L Clifton and L TarassenkoldquoA review of novelty detectionrdquo Signal Processing vol 99 pp215ndash249 2014

[5] R Langone O Mauricio Agudelo B De Moor and J AK Suykens ldquoIncremental kernel spectral clustering for online

learning of non-stationary datardquo Neurocomputing vol 139 pp246ndash260 2014

[6] M-H Chen P-C Chang and J-L Wu ldquoA population-basedincremental learning approach with artificial immune systemfor network intrusion detectionrdquo Engineering Applications ofArtificial Intelligence vol 51 pp 171ndash181 2016

[7] S Khomfoi and L M Tolbert ldquoFault diagnosis and reconfigu-ration for multilevel inverter drive using AI-based techniquesrdquoIEEE Transactions on Industrial Electronics vol 54 no 6 pp2954ndash2968 2007

[8] A Bellini F Filippetti C Tassoni and G-A CapolinoldquoAdvances in diagnostic techniques for induction machinesrdquoIEEE Transactions on Industrial Electronics vol 55 no 12 pp4109ndash4126 2008

[9] V T Tran and B-S Yang ldquoAn intelligent condition-basedmaintenance platform for rotating machineryrdquo Expert Systemswith Applications vol 39 no 3 pp 2977ndash2988 2012

[10] P A Delgado-Arredondo A Garcia-Perez D Morinigo-Soteloet al ldquoComparative study of time-frequency decompositiontechniques for fault detection in induction motors using vibra-tion analysis during startup transientrdquo Shock and Vibration vol2015 Article ID 708034 14 pages 2015

[11] M D Prieto G Cirrincione A G Espinosa J A Ortegaand H Henao ldquoBearing fault detection by a novel condition-monitoring scheme based on statistical-time features and neu-ral networksrdquo IEEE Transactions on Industrial Electronics vol60 no 8 pp 3398ndash3407 2013

[12] P Perner ldquoConcepts for novelty detection and handling basedon a case-based reasoning process schemerdquo Engineering Appli-cations of Artificial Intelligence vol 22 no 1 pp 86ndash91 2009

[13] C Englund and A Verikas ldquoA hybrid approach to outlier detec-tion in the offset lithographic printing processrdquo EngineeringApplications of Artificial Intelligence vol 18 no 6 pp 759ndash7682005

[14] H Garces and D Sbarbaro ldquoOutliers detection in environmen-tal monitoring databasesrdquo Engineering Applications of ArtificialIntelligence vol 24 no 2 pp 341ndash349 2011

[15] S Ma S M Li and Y P Xiong ldquoUncertainty reduced noveltydetection approach applied to rotatingmachinery for conditionmonitoringrdquo Shock and Vibration vol 2015 Article ID 73721310 pages 2015

[16] J McBain and M Timusk ldquoFeature extraction for noveltydetection as applied to fault detection in machineryrdquo PatternRecognition Letters vol 32 no 7 pp 1054ndash1061 2011

[17] S Wang J Yub E Lapirac and J Lee ldquoA modified supportvector data description based novelty detection approach formachinery componentsrdquo Applied Soft Computing vol 13 no 2pp 1193ndash1205 2013

[18] D P Filev R B Chinnam F Tseng and P Baruah ldquoAn indus-trial strength novelty detection framework for autonomousequipment monitoring and diagnosticsrdquo IEEE Transactions onIndustrial Informatics vol 6 no 4 pp 767ndash779 2010

[19] B S J Costa P P Angelov and L A Guedes ldquoFully unsu-pervised fault detection and identification based on recursivedensity estimation and self-evolving cloud-based classifierrdquoNeurocomputing vol 150 pp 289ndash303 2015

[20] B Scholkopf R C Williamson A J Smola J Shawe-Taylorand J C Platt ldquoSupport vector method for novelty detectionrdquoin Neural Information Processing Systems vol 12 pp 582ndash5881999

12 Shock and Vibration

[21] D M J Tax and R P W Duin ldquoSupport vector domaindescriptionrdquo Pattern Recognition Letters vol 20 no 11ndash13 pp1191ndash1199 1999

[22] S D Villalba and P Cunningham ldquoAn evaluation of dimensionreduction techniques for one-class classificationrdquo ArtificialIntelligence Review vol 27 no 4 pp 273ndash294 2007

[23] I T Jolliffe Principal Component Analysis Springer New YorkNY USA 2nd edition 1986

[24] X He D Cai and P Niyogi ldquoLaplacian score for featureselectionrdquo inAdvances in Neural Information Processing Systems18 pp 507ndash514 MIT Press 2005

[25] A J Izenman ldquoLinear discriminant analysisrdquo in Modern Mul-tivariate Statistical Techniques A J Izenman Ed pp 237ndash280Springer New York NY USA 2008

[26] Q Gu Z Li and J Han ldquoGeneralized fisher score for featureselectionrdquo httpsarxivorgabs12023725

[27] Z Gao C Cecati and S X Ding ldquoA survey of fault diagnosisand fault-tolerant techniquesmdashpart II fault diagnosis withknowledge-based and hybridactive approachesrdquo IEEE Trans-actions on Industrial Electronics vol 62 no 6 pp 3768ndash37742015

[28] A E Lazzaretti D M J Tax H Vieira Neto and V HFerreira ldquoNovelty detection and multi-class classification inpower distribution voltage waveformsrdquo Expert Systems withApplications vol 45 pp 322ndash330 2016

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Page 9: Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

Shock and Vibration 9

minus2 0 2 4 6 8 10minus4LS feature 1 RMS Y

minus4

minus3

minus2

minus1

0

1

2

3

4

5

0

005

01

015

02

KnownF1

LS fe

atur

e 2 k

urto

sis Y

(a)

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus001

0

001

002

003

004

005

006

007

008

Known

FS fe

atur

e 2 k

urto

sis Z

(b)

minus001

0

001

002

003

004

005

006

007

008

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

FS fe

atur

e 2 k

urto

sis Z

minus5

minus4

minus3

minus2

minus1

0

1

2

3

KnownF2

(c)

minus2 0 2 4 6 8 10minus4FS feature 1 RMS Y

minus5

minus4

minus3

minus2

minus1

0

1

2

3

minus0005

0

0005

001

0015

002

0025

003

0035

Known

FS fe

atur

e 2 k

urto

sis Z

(d)

Figure 9 Process of evaluation and retraining employing the methodology proposed (a) Evaluation of the fault scenario 1198651 (b) Retrainingof the novelty model and reformulation of the reduced set of features including 1198651 (c) Evaluation of the fault scenario 1198652 (d) Retraining ofthe novelty model and reformulation of the reduced set of features including 1198652Table 1 Performance of the novelty detection using only healthy class data to reduce the number of features where DR stands fordimensionality reduction Different scenarios are included according to the information available to train and test the novelty model

DRPerformance using OC-SVM ()

Known Test Known Test Known Test119867 1198651 + 1198652 119867 + 1198651 1198652 119867 + 1198652 1198651LS 954 (plusmn11) 576 (plusmn67) 435 (plusmn64)

new scenario under test therefore the noveltymodel success-fully detected as novelty all measurements corresponding tothe 1198652 scenario

It is worth mentioning that if initially the scenario 1198652is used for training and the scenario 1198651 is used for testingthe new set of features obtained could be different from theaforementionedThe results achieved fromboth scenarios are

shown in Table 2 which also includes a comparison of theresults obtained employing PCA LDA and Laplacian Scoredimensionality reduction techniques in the retraining stepinstead of the Fisher Score proposed

Regarding the classical feature extraction techniquesPCA and LDA the test scenario is not identified as novelin both cases The Fisher Score and the Laplacian Score

10 Shock and Vibration

Table 2 Performance of the novelty detection employing a reduc-tion of features during retraining Different scenarios are includedaccording to the information available to train and test the noveltymodel

DRPerformance using OC-SVM ()

Known Test Known Test119867 + 1198651 1198652 119867 + 1198652 1198651PCA 590 (plusmn52) 642 (plusmn51)LDA 474 (plusmn61) 512 (plusmn61)LS 843 (plusmn24) 100 (plusmn00)Fisher S 100 (plusmn00) 100 (plusmn00)successfully identified the new scenarios as novel since thepercentage obtained in both cases is higher than the 75 pre-defined threshold in the methodology to activate the alarmBoth techniques achieved high scores but still the FisherScore provided a more appropriate selection of features

As mentioned in Section 2 LDA is a feature extractiontechnique based on the Fisher discriminant coefficient sosimilar results between Fisher Score ranking and LDA areexpected however employing Fisher Score ranking scoreachieved a better result this can be caused because the testconsists of novel scenarios and LDA finds the directions onthe feature space specialized for the two supervised scenariosemployed during training meanwhile Fisher Score rankingprovides a more general approach by selecting features

To test the robustness of the Laplacian Score and FisherScore approaches a comparative test is performed where theset of features is increased from 10 to 15 and by varyingthe number of reduced features selected The five featuresincluded in the original set are obtained from the119909-axis of theaccelerometer monitored where the features calculated arepresented in the aforementioned statistical time-frequencyfeatures these features were discarded initially because theyare not part of the perpendicular plane of the motor anddo not contribute significantly to the monitoring in factincluding these features could affect the performance of thefeature reduction and novelty detection modules The resultsare presented in Table 3

The features obtained from the Fisher Score still present abetter distribution to detect the new scenariosThe LaplacianScore performance is affected when irrelevant features areincluded in the feature set but increases when more dimen-sions are taken into consideration Since the objective of theFisher Score is not topology preservation contrary to LS itis capable of discarding all the irrelevant features that wereincluded and the performance is not affected

5 Conclusions

This work proposes an adaptive novelty detection methodol-ogy based on vibration analysis for the condition monitoringand diagnosis of the components of a kinematic chain Themethodology is based on the acquisition of vibration signalsthat are generated in the kinematic chain along with anadequate signal processing to extract features to characterizethe components and an adaptive novelty detection model to

Table 3 Performance of the novelty detection increasing the num-ber of initial features from 10 to 15 and varying the number of thereduced set of features

DRPerformance varying the number of features

reduced ()2 3 4

LS 590 (plusmn41) 643 (plusmn33) 886 (plusmn25)Fisher S 100 (plusmn00) 100 (plusmn00) 100 (plusmn00)detect anomalies The method is composed of two sequentialstages an offline stage to initialize and retrain the modulesand an online stage to continuously assess the conditionof the machine During initialization the model is trainedemploying only information from the machine workingunder healthy condition and two additional faulty scenariosare introduced to test the performance of the method underunknown operations

The adaptive novelty detection approach successfullydetected both novel scenarios and the model incorporatedthe information to avoid generating alarms if the same faultis detected A comparison between the proposed methodand classical dimensionality reduction approaches highlightsthe limitations of maintaining a static set of features duringmonitoring instead of reformulating the feature reductionmodule once new information is available In this particularstudy employing features reduced by Laplacian Score andFisher Score obtained similar results nevertheless it doesnot imply that a similar outcome will be present during theanalysis of other faults Fisher Score is encouraged to beemployed in this methodology rather than maintaining theLaplacian Score approach due to the similarity between themethod objective and the objective function of Fisher Scorein both the ideal case is to find the features to maximizethe distance between scenarios while maintaining compactclusters A specific comparison of performances between LSand Fisher Score was also included in which the Fisher Scoreobtained better results when irrelevant features are includedin the original set of features and when the dimensionalityof the reduced set is increased this highlights the advantagesand robustness of the feature selection approach by FisherScore compared to the LS

From an industrial perspective two aspects should bementioned First dealing with an identical operating sce-nario (ie healthy) the variation of the working conditions(speed and torque patterns) will affect the resulting valueof the estimated numerical features and consequently itsrepresentation in the feature space used for novelty modeldesign Thus the introduction of the working condition asan additional degree of freedom in the novelty detectionscheme implies the increase of the dimensional space thatthe novelty detectionmodel is managingThis fact implies anincrease of the risk of data overlapping (different scenariosunder different operating conditions showing similar featurevalues) and also an increase of complexity for boundariesdefinition during the novelty modelling (data spread) Thusthe proposed method is not limited to stationary conditionsbut to the repetition of the pattern of working condition used

Shock and Vibration 11

to train the model Indeed in practical industrial applicationwhere cyclostationary processes take place the proposedmethod should be executed during the same period (the sameworking conditions) in order to reduce data variability andincrease novelty performance In case of multiple workingconditions consideration parallel novelty model structurescould be generated in order to activate each one dependingon the corresponding working condition

Second the integration of additional physicalmagnitudesas stator currents temperatures and others represents atrendy approach in electromechanical fault diagnosis inorder to improve fault diagnosis resolution and perfor-mance The corresponding data fusion scheme implies ahigh-dimensional feature space that is multiple numericalfeatures estimated from multiple physical magnitudes con-sidered In this point it must be noted that the presentedmethodology proposes the reevaluation of the feature reduc-tionmodule each time a retraining is appliedThus the largerthe available pool of features is the higher the potentiality ofthe proposedmethod to increase the performance during thedata characterization is

Also the proposed method can be extended andimproved for further development this improvement couldinclude a diagnosis method to not only detect anomalies inthe kinematic chain but also identify the fault causing theabnormal behavior In this sense future work will include aspecific comparison of broken bar fault detection capabilitiesbetween the proposedmethodology and classical approaches

Competing Interests

The authors declare that they have no competing interests

Acknowledgments

This research was partially supported by CONACyT schol-arships Grants 278033 and 313604 and also by the SpanishMinistry of Education Culture and Sport under GrantFPU1300589 The authors would also like to thank a partialfinancial support by the Generalitat de Catalunya (GRCMCIA Grant no SGR 2014-101)

References

[1] H Henao G-A Capolino M Fernandez-Cabanas et alldquoTrends in fault diagnosis for electrical machines a review ofdiagnostic techniquesrdquo IEEE Industrial Electronics Magazinevol 8 no 2 pp 31ndash42 2014

[2] N Bayar S Darmoul S Hajri-Gabouj and H Pierreval ldquoFaultdetection diagnosis and recovery using Artificial Immune Sys-tems a reviewrdquoEngineeringApplications of Artificial Intelligencevol 46 pp 43ndash57 2015

[3] SWang JWan D Zhang D Li and C Zhang ldquoTowards smartfactory for Industry 40 a self-organized multi-agent systemwith big data based feedback and coordinationrdquo ComputerNetworks vol 101 pp 158ndash168 2016

[4] M A F Pimentel D A Clifton L Clifton and L TarassenkoldquoA review of novelty detectionrdquo Signal Processing vol 99 pp215ndash249 2014

[5] R Langone O Mauricio Agudelo B De Moor and J AK Suykens ldquoIncremental kernel spectral clustering for online

learning of non-stationary datardquo Neurocomputing vol 139 pp246ndash260 2014

[6] M-H Chen P-C Chang and J-L Wu ldquoA population-basedincremental learning approach with artificial immune systemfor network intrusion detectionrdquo Engineering Applications ofArtificial Intelligence vol 51 pp 171ndash181 2016

[7] S Khomfoi and L M Tolbert ldquoFault diagnosis and reconfigu-ration for multilevel inverter drive using AI-based techniquesrdquoIEEE Transactions on Industrial Electronics vol 54 no 6 pp2954ndash2968 2007

[8] A Bellini F Filippetti C Tassoni and G-A CapolinoldquoAdvances in diagnostic techniques for induction machinesrdquoIEEE Transactions on Industrial Electronics vol 55 no 12 pp4109ndash4126 2008

[9] V T Tran and B-S Yang ldquoAn intelligent condition-basedmaintenance platform for rotating machineryrdquo Expert Systemswith Applications vol 39 no 3 pp 2977ndash2988 2012

[10] P A Delgado-Arredondo A Garcia-Perez D Morinigo-Soteloet al ldquoComparative study of time-frequency decompositiontechniques for fault detection in induction motors using vibra-tion analysis during startup transientrdquo Shock and Vibration vol2015 Article ID 708034 14 pages 2015

[11] M D Prieto G Cirrincione A G Espinosa J A Ortegaand H Henao ldquoBearing fault detection by a novel condition-monitoring scheme based on statistical-time features and neu-ral networksrdquo IEEE Transactions on Industrial Electronics vol60 no 8 pp 3398ndash3407 2013

[12] P Perner ldquoConcepts for novelty detection and handling basedon a case-based reasoning process schemerdquo Engineering Appli-cations of Artificial Intelligence vol 22 no 1 pp 86ndash91 2009

[13] C Englund and A Verikas ldquoA hybrid approach to outlier detec-tion in the offset lithographic printing processrdquo EngineeringApplications of Artificial Intelligence vol 18 no 6 pp 759ndash7682005

[14] H Garces and D Sbarbaro ldquoOutliers detection in environmen-tal monitoring databasesrdquo Engineering Applications of ArtificialIntelligence vol 24 no 2 pp 341ndash349 2011

[15] S Ma S M Li and Y P Xiong ldquoUncertainty reduced noveltydetection approach applied to rotatingmachinery for conditionmonitoringrdquo Shock and Vibration vol 2015 Article ID 73721310 pages 2015

[16] J McBain and M Timusk ldquoFeature extraction for noveltydetection as applied to fault detection in machineryrdquo PatternRecognition Letters vol 32 no 7 pp 1054ndash1061 2011

[17] S Wang J Yub E Lapirac and J Lee ldquoA modified supportvector data description based novelty detection approach formachinery componentsrdquo Applied Soft Computing vol 13 no 2pp 1193ndash1205 2013

[18] D P Filev R B Chinnam F Tseng and P Baruah ldquoAn indus-trial strength novelty detection framework for autonomousequipment monitoring and diagnosticsrdquo IEEE Transactions onIndustrial Informatics vol 6 no 4 pp 767ndash779 2010

[19] B S J Costa P P Angelov and L A Guedes ldquoFully unsu-pervised fault detection and identification based on recursivedensity estimation and self-evolving cloud-based classifierrdquoNeurocomputing vol 150 pp 289ndash303 2015

[20] B Scholkopf R C Williamson A J Smola J Shawe-Taylorand J C Platt ldquoSupport vector method for novelty detectionrdquoin Neural Information Processing Systems vol 12 pp 582ndash5881999

12 Shock and Vibration

[21] D M J Tax and R P W Duin ldquoSupport vector domaindescriptionrdquo Pattern Recognition Letters vol 20 no 11ndash13 pp1191ndash1199 1999

[22] S D Villalba and P Cunningham ldquoAn evaluation of dimensionreduction techniques for one-class classificationrdquo ArtificialIntelligence Review vol 27 no 4 pp 273ndash294 2007

[23] I T Jolliffe Principal Component Analysis Springer New YorkNY USA 2nd edition 1986

[24] X He D Cai and P Niyogi ldquoLaplacian score for featureselectionrdquo inAdvances in Neural Information Processing Systems18 pp 507ndash514 MIT Press 2005

[25] A J Izenman ldquoLinear discriminant analysisrdquo in Modern Mul-tivariate Statistical Techniques A J Izenman Ed pp 237ndash280Springer New York NY USA 2008

[26] Q Gu Z Li and J Han ldquoGeneralized fisher score for featureselectionrdquo httpsarxivorgabs12023725

[27] Z Gao C Cecati and S X Ding ldquoA survey of fault diagnosisand fault-tolerant techniquesmdashpart II fault diagnosis withknowledge-based and hybridactive approachesrdquo IEEE Trans-actions on Industrial Electronics vol 62 no 6 pp 3768ndash37742015

[28] A E Lazzaretti D M J Tax H Vieira Neto and V HFerreira ldquoNovelty detection and multi-class classification inpower distribution voltage waveformsrdquo Expert Systems withApplications vol 45 pp 322ndash330 2016

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Page 10: Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

10 Shock and Vibration

Table 2 Performance of the novelty detection employing a reduc-tion of features during retraining Different scenarios are includedaccording to the information available to train and test the noveltymodel

DRPerformance using OC-SVM ()

Known Test Known Test119867 + 1198651 1198652 119867 + 1198652 1198651PCA 590 (plusmn52) 642 (plusmn51)LDA 474 (plusmn61) 512 (plusmn61)LS 843 (plusmn24) 100 (plusmn00)Fisher S 100 (plusmn00) 100 (plusmn00)successfully identified the new scenarios as novel since thepercentage obtained in both cases is higher than the 75 pre-defined threshold in the methodology to activate the alarmBoth techniques achieved high scores but still the FisherScore provided a more appropriate selection of features

As mentioned in Section 2 LDA is a feature extractiontechnique based on the Fisher discriminant coefficient sosimilar results between Fisher Score ranking and LDA areexpected however employing Fisher Score ranking scoreachieved a better result this can be caused because the testconsists of novel scenarios and LDA finds the directions onthe feature space specialized for the two supervised scenariosemployed during training meanwhile Fisher Score rankingprovides a more general approach by selecting features

To test the robustness of the Laplacian Score and FisherScore approaches a comparative test is performed where theset of features is increased from 10 to 15 and by varyingthe number of reduced features selected The five featuresincluded in the original set are obtained from the119909-axis of theaccelerometer monitored where the features calculated arepresented in the aforementioned statistical time-frequencyfeatures these features were discarded initially because theyare not part of the perpendicular plane of the motor anddo not contribute significantly to the monitoring in factincluding these features could affect the performance of thefeature reduction and novelty detection modules The resultsare presented in Table 3

The features obtained from the Fisher Score still present abetter distribution to detect the new scenariosThe LaplacianScore performance is affected when irrelevant features areincluded in the feature set but increases when more dimen-sions are taken into consideration Since the objective of theFisher Score is not topology preservation contrary to LS itis capable of discarding all the irrelevant features that wereincluded and the performance is not affected

5 Conclusions

This work proposes an adaptive novelty detection methodol-ogy based on vibration analysis for the condition monitoringand diagnosis of the components of a kinematic chain Themethodology is based on the acquisition of vibration signalsthat are generated in the kinematic chain along with anadequate signal processing to extract features to characterizethe components and an adaptive novelty detection model to

Table 3 Performance of the novelty detection increasing the num-ber of initial features from 10 to 15 and varying the number of thereduced set of features

DRPerformance varying the number of features

reduced ()2 3 4

LS 590 (plusmn41) 643 (plusmn33) 886 (plusmn25)Fisher S 100 (plusmn00) 100 (plusmn00) 100 (plusmn00)detect anomalies The method is composed of two sequentialstages an offline stage to initialize and retrain the modulesand an online stage to continuously assess the conditionof the machine During initialization the model is trainedemploying only information from the machine workingunder healthy condition and two additional faulty scenariosare introduced to test the performance of the method underunknown operations

The adaptive novelty detection approach successfullydetected both novel scenarios and the model incorporatedthe information to avoid generating alarms if the same faultis detected A comparison between the proposed methodand classical dimensionality reduction approaches highlightsthe limitations of maintaining a static set of features duringmonitoring instead of reformulating the feature reductionmodule once new information is available In this particularstudy employing features reduced by Laplacian Score andFisher Score obtained similar results nevertheless it doesnot imply that a similar outcome will be present during theanalysis of other faults Fisher Score is encouraged to beemployed in this methodology rather than maintaining theLaplacian Score approach due to the similarity between themethod objective and the objective function of Fisher Scorein both the ideal case is to find the features to maximizethe distance between scenarios while maintaining compactclusters A specific comparison of performances between LSand Fisher Score was also included in which the Fisher Scoreobtained better results when irrelevant features are includedin the original set of features and when the dimensionalityof the reduced set is increased this highlights the advantagesand robustness of the feature selection approach by FisherScore compared to the LS

From an industrial perspective two aspects should bementioned First dealing with an identical operating sce-nario (ie healthy) the variation of the working conditions(speed and torque patterns) will affect the resulting valueof the estimated numerical features and consequently itsrepresentation in the feature space used for novelty modeldesign Thus the introduction of the working condition asan additional degree of freedom in the novelty detectionscheme implies the increase of the dimensional space thatthe novelty detectionmodel is managingThis fact implies anincrease of the risk of data overlapping (different scenariosunder different operating conditions showing similar featurevalues) and also an increase of complexity for boundariesdefinition during the novelty modelling (data spread) Thusthe proposed method is not limited to stationary conditionsbut to the repetition of the pattern of working condition used

Shock and Vibration 11

to train the model Indeed in practical industrial applicationwhere cyclostationary processes take place the proposedmethod should be executed during the same period (the sameworking conditions) in order to reduce data variability andincrease novelty performance In case of multiple workingconditions consideration parallel novelty model structurescould be generated in order to activate each one dependingon the corresponding working condition

Second the integration of additional physicalmagnitudesas stator currents temperatures and others represents atrendy approach in electromechanical fault diagnosis inorder to improve fault diagnosis resolution and perfor-mance The corresponding data fusion scheme implies ahigh-dimensional feature space that is multiple numericalfeatures estimated from multiple physical magnitudes con-sidered In this point it must be noted that the presentedmethodology proposes the reevaluation of the feature reduc-tionmodule each time a retraining is appliedThus the largerthe available pool of features is the higher the potentiality ofthe proposedmethod to increase the performance during thedata characterization is

Also the proposed method can be extended andimproved for further development this improvement couldinclude a diagnosis method to not only detect anomalies inthe kinematic chain but also identify the fault causing theabnormal behavior In this sense future work will include aspecific comparison of broken bar fault detection capabilitiesbetween the proposedmethodology and classical approaches

Competing Interests

The authors declare that they have no competing interests

Acknowledgments

This research was partially supported by CONACyT schol-arships Grants 278033 and 313604 and also by the SpanishMinistry of Education Culture and Sport under GrantFPU1300589 The authors would also like to thank a partialfinancial support by the Generalitat de Catalunya (GRCMCIA Grant no SGR 2014-101)

References

[1] H Henao G-A Capolino M Fernandez-Cabanas et alldquoTrends in fault diagnosis for electrical machines a review ofdiagnostic techniquesrdquo IEEE Industrial Electronics Magazinevol 8 no 2 pp 31ndash42 2014

[2] N Bayar S Darmoul S Hajri-Gabouj and H Pierreval ldquoFaultdetection diagnosis and recovery using Artificial Immune Sys-tems a reviewrdquoEngineeringApplications of Artificial Intelligencevol 46 pp 43ndash57 2015

[3] SWang JWan D Zhang D Li and C Zhang ldquoTowards smartfactory for Industry 40 a self-organized multi-agent systemwith big data based feedback and coordinationrdquo ComputerNetworks vol 101 pp 158ndash168 2016

[4] M A F Pimentel D A Clifton L Clifton and L TarassenkoldquoA review of novelty detectionrdquo Signal Processing vol 99 pp215ndash249 2014

[5] R Langone O Mauricio Agudelo B De Moor and J AK Suykens ldquoIncremental kernel spectral clustering for online

learning of non-stationary datardquo Neurocomputing vol 139 pp246ndash260 2014

[6] M-H Chen P-C Chang and J-L Wu ldquoA population-basedincremental learning approach with artificial immune systemfor network intrusion detectionrdquo Engineering Applications ofArtificial Intelligence vol 51 pp 171ndash181 2016

[7] S Khomfoi and L M Tolbert ldquoFault diagnosis and reconfigu-ration for multilevel inverter drive using AI-based techniquesrdquoIEEE Transactions on Industrial Electronics vol 54 no 6 pp2954ndash2968 2007

[8] A Bellini F Filippetti C Tassoni and G-A CapolinoldquoAdvances in diagnostic techniques for induction machinesrdquoIEEE Transactions on Industrial Electronics vol 55 no 12 pp4109ndash4126 2008

[9] V T Tran and B-S Yang ldquoAn intelligent condition-basedmaintenance platform for rotating machineryrdquo Expert Systemswith Applications vol 39 no 3 pp 2977ndash2988 2012

[10] P A Delgado-Arredondo A Garcia-Perez D Morinigo-Soteloet al ldquoComparative study of time-frequency decompositiontechniques for fault detection in induction motors using vibra-tion analysis during startup transientrdquo Shock and Vibration vol2015 Article ID 708034 14 pages 2015

[11] M D Prieto G Cirrincione A G Espinosa J A Ortegaand H Henao ldquoBearing fault detection by a novel condition-monitoring scheme based on statistical-time features and neu-ral networksrdquo IEEE Transactions on Industrial Electronics vol60 no 8 pp 3398ndash3407 2013

[12] P Perner ldquoConcepts for novelty detection and handling basedon a case-based reasoning process schemerdquo Engineering Appli-cations of Artificial Intelligence vol 22 no 1 pp 86ndash91 2009

[13] C Englund and A Verikas ldquoA hybrid approach to outlier detec-tion in the offset lithographic printing processrdquo EngineeringApplications of Artificial Intelligence vol 18 no 6 pp 759ndash7682005

[14] H Garces and D Sbarbaro ldquoOutliers detection in environmen-tal monitoring databasesrdquo Engineering Applications of ArtificialIntelligence vol 24 no 2 pp 341ndash349 2011

[15] S Ma S M Li and Y P Xiong ldquoUncertainty reduced noveltydetection approach applied to rotatingmachinery for conditionmonitoringrdquo Shock and Vibration vol 2015 Article ID 73721310 pages 2015

[16] J McBain and M Timusk ldquoFeature extraction for noveltydetection as applied to fault detection in machineryrdquo PatternRecognition Letters vol 32 no 7 pp 1054ndash1061 2011

[17] S Wang J Yub E Lapirac and J Lee ldquoA modified supportvector data description based novelty detection approach formachinery componentsrdquo Applied Soft Computing vol 13 no 2pp 1193ndash1205 2013

[18] D P Filev R B Chinnam F Tseng and P Baruah ldquoAn indus-trial strength novelty detection framework for autonomousequipment monitoring and diagnosticsrdquo IEEE Transactions onIndustrial Informatics vol 6 no 4 pp 767ndash779 2010

[19] B S J Costa P P Angelov and L A Guedes ldquoFully unsu-pervised fault detection and identification based on recursivedensity estimation and self-evolving cloud-based classifierrdquoNeurocomputing vol 150 pp 289ndash303 2015

[20] B Scholkopf R C Williamson A J Smola J Shawe-Taylorand J C Platt ldquoSupport vector method for novelty detectionrdquoin Neural Information Processing Systems vol 12 pp 582ndash5881999

12 Shock and Vibration

[21] D M J Tax and R P W Duin ldquoSupport vector domaindescriptionrdquo Pattern Recognition Letters vol 20 no 11ndash13 pp1191ndash1199 1999

[22] S D Villalba and P Cunningham ldquoAn evaluation of dimensionreduction techniques for one-class classificationrdquo ArtificialIntelligence Review vol 27 no 4 pp 273ndash294 2007

[23] I T Jolliffe Principal Component Analysis Springer New YorkNY USA 2nd edition 1986

[24] X He D Cai and P Niyogi ldquoLaplacian score for featureselectionrdquo inAdvances in Neural Information Processing Systems18 pp 507ndash514 MIT Press 2005

[25] A J Izenman ldquoLinear discriminant analysisrdquo in Modern Mul-tivariate Statistical Techniques A J Izenman Ed pp 237ndash280Springer New York NY USA 2008

[26] Q Gu Z Li and J Han ldquoGeneralized fisher score for featureselectionrdquo httpsarxivorgabs12023725

[27] Z Gao C Cecati and S X Ding ldquoA survey of fault diagnosisand fault-tolerant techniquesmdashpart II fault diagnosis withknowledge-based and hybridactive approachesrdquo IEEE Trans-actions on Industrial Electronics vol 62 no 6 pp 3768ndash37742015

[28] A E Lazzaretti D M J Tax H Vieira Neto and V HFerreira ldquoNovelty detection and multi-class classification inpower distribution voltage waveformsrdquo Expert Systems withApplications vol 45 pp 322ndash330 2016

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Page 11: Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

Shock and Vibration 11

to train the model Indeed in practical industrial applicationwhere cyclostationary processes take place the proposedmethod should be executed during the same period (the sameworking conditions) in order to reduce data variability andincrease novelty performance In case of multiple workingconditions consideration parallel novelty model structurescould be generated in order to activate each one dependingon the corresponding working condition

Second the integration of additional physicalmagnitudesas stator currents temperatures and others represents atrendy approach in electromechanical fault diagnosis inorder to improve fault diagnosis resolution and perfor-mance The corresponding data fusion scheme implies ahigh-dimensional feature space that is multiple numericalfeatures estimated from multiple physical magnitudes con-sidered In this point it must be noted that the presentedmethodology proposes the reevaluation of the feature reduc-tionmodule each time a retraining is appliedThus the largerthe available pool of features is the higher the potentiality ofthe proposedmethod to increase the performance during thedata characterization is

Also the proposed method can be extended andimproved for further development this improvement couldinclude a diagnosis method to not only detect anomalies inthe kinematic chain but also identify the fault causing theabnormal behavior In this sense future work will include aspecific comparison of broken bar fault detection capabilitiesbetween the proposedmethodology and classical approaches

Competing Interests

The authors declare that they have no competing interests

Acknowledgments

This research was partially supported by CONACyT schol-arships Grants 278033 and 313604 and also by the SpanishMinistry of Education Culture and Sport under GrantFPU1300589 The authors would also like to thank a partialfinancial support by the Generalitat de Catalunya (GRCMCIA Grant no SGR 2014-101)

References

[1] H Henao G-A Capolino M Fernandez-Cabanas et alldquoTrends in fault diagnosis for electrical machines a review ofdiagnostic techniquesrdquo IEEE Industrial Electronics Magazinevol 8 no 2 pp 31ndash42 2014

[2] N Bayar S Darmoul S Hajri-Gabouj and H Pierreval ldquoFaultdetection diagnosis and recovery using Artificial Immune Sys-tems a reviewrdquoEngineeringApplications of Artificial Intelligencevol 46 pp 43ndash57 2015

[3] SWang JWan D Zhang D Li and C Zhang ldquoTowards smartfactory for Industry 40 a self-organized multi-agent systemwith big data based feedback and coordinationrdquo ComputerNetworks vol 101 pp 158ndash168 2016

[4] M A F Pimentel D A Clifton L Clifton and L TarassenkoldquoA review of novelty detectionrdquo Signal Processing vol 99 pp215ndash249 2014

[5] R Langone O Mauricio Agudelo B De Moor and J AK Suykens ldquoIncremental kernel spectral clustering for online

learning of non-stationary datardquo Neurocomputing vol 139 pp246ndash260 2014

[6] M-H Chen P-C Chang and J-L Wu ldquoA population-basedincremental learning approach with artificial immune systemfor network intrusion detectionrdquo Engineering Applications ofArtificial Intelligence vol 51 pp 171ndash181 2016

[7] S Khomfoi and L M Tolbert ldquoFault diagnosis and reconfigu-ration for multilevel inverter drive using AI-based techniquesrdquoIEEE Transactions on Industrial Electronics vol 54 no 6 pp2954ndash2968 2007

[8] A Bellini F Filippetti C Tassoni and G-A CapolinoldquoAdvances in diagnostic techniques for induction machinesrdquoIEEE Transactions on Industrial Electronics vol 55 no 12 pp4109ndash4126 2008

[9] V T Tran and B-S Yang ldquoAn intelligent condition-basedmaintenance platform for rotating machineryrdquo Expert Systemswith Applications vol 39 no 3 pp 2977ndash2988 2012

[10] P A Delgado-Arredondo A Garcia-Perez D Morinigo-Soteloet al ldquoComparative study of time-frequency decompositiontechniques for fault detection in induction motors using vibra-tion analysis during startup transientrdquo Shock and Vibration vol2015 Article ID 708034 14 pages 2015

[11] M D Prieto G Cirrincione A G Espinosa J A Ortegaand H Henao ldquoBearing fault detection by a novel condition-monitoring scheme based on statistical-time features and neu-ral networksrdquo IEEE Transactions on Industrial Electronics vol60 no 8 pp 3398ndash3407 2013

[12] P Perner ldquoConcepts for novelty detection and handling basedon a case-based reasoning process schemerdquo Engineering Appli-cations of Artificial Intelligence vol 22 no 1 pp 86ndash91 2009

[13] C Englund and A Verikas ldquoA hybrid approach to outlier detec-tion in the offset lithographic printing processrdquo EngineeringApplications of Artificial Intelligence vol 18 no 6 pp 759ndash7682005

[14] H Garces and D Sbarbaro ldquoOutliers detection in environmen-tal monitoring databasesrdquo Engineering Applications of ArtificialIntelligence vol 24 no 2 pp 341ndash349 2011

[15] S Ma S M Li and Y P Xiong ldquoUncertainty reduced noveltydetection approach applied to rotatingmachinery for conditionmonitoringrdquo Shock and Vibration vol 2015 Article ID 73721310 pages 2015

[16] J McBain and M Timusk ldquoFeature extraction for noveltydetection as applied to fault detection in machineryrdquo PatternRecognition Letters vol 32 no 7 pp 1054ndash1061 2011

[17] S Wang J Yub E Lapirac and J Lee ldquoA modified supportvector data description based novelty detection approach formachinery componentsrdquo Applied Soft Computing vol 13 no 2pp 1193ndash1205 2013

[18] D P Filev R B Chinnam F Tseng and P Baruah ldquoAn indus-trial strength novelty detection framework for autonomousequipment monitoring and diagnosticsrdquo IEEE Transactions onIndustrial Informatics vol 6 no 4 pp 767ndash779 2010

[19] B S J Costa P P Angelov and L A Guedes ldquoFully unsu-pervised fault detection and identification based on recursivedensity estimation and self-evolving cloud-based classifierrdquoNeurocomputing vol 150 pp 289ndash303 2015

[20] B Scholkopf R C Williamson A J Smola J Shawe-Taylorand J C Platt ldquoSupport vector method for novelty detectionrdquoin Neural Information Processing Systems vol 12 pp 582ndash5881999

12 Shock and Vibration

[21] D M J Tax and R P W Duin ldquoSupport vector domaindescriptionrdquo Pattern Recognition Letters vol 20 no 11ndash13 pp1191ndash1199 1999

[22] S D Villalba and P Cunningham ldquoAn evaluation of dimensionreduction techniques for one-class classificationrdquo ArtificialIntelligence Review vol 27 no 4 pp 273ndash294 2007

[23] I T Jolliffe Principal Component Analysis Springer New YorkNY USA 2nd edition 1986

[24] X He D Cai and P Niyogi ldquoLaplacian score for featureselectionrdquo inAdvances in Neural Information Processing Systems18 pp 507ndash514 MIT Press 2005

[25] A J Izenman ldquoLinear discriminant analysisrdquo in Modern Mul-tivariate Statistical Techniques A J Izenman Ed pp 237ndash280Springer New York NY USA 2008

[26] Q Gu Z Li and J Han ldquoGeneralized fisher score for featureselectionrdquo httpsarxivorgabs12023725

[27] Z Gao C Cecati and S X Ding ldquoA survey of fault diagnosisand fault-tolerant techniquesmdashpart II fault diagnosis withknowledge-based and hybridactive approachesrdquo IEEE Trans-actions on Industrial Electronics vol 62 no 6 pp 3768ndash37742015

[28] A E Lazzaretti D M J Tax H Vieira Neto and V HFerreira ldquoNovelty detection and multi-class classification inpower distribution voltage waveformsrdquo Expert Systems withApplications vol 45 pp 322ndash330 2016

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Page 12: Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

12 Shock and Vibration

[21] D M J Tax and R P W Duin ldquoSupport vector domaindescriptionrdquo Pattern Recognition Letters vol 20 no 11ndash13 pp1191ndash1199 1999

[22] S D Villalba and P Cunningham ldquoAn evaluation of dimensionreduction techniques for one-class classificationrdquo ArtificialIntelligence Review vol 27 no 4 pp 273ndash294 2007

[23] I T Jolliffe Principal Component Analysis Springer New YorkNY USA 2nd edition 1986

[24] X He D Cai and P Niyogi ldquoLaplacian score for featureselectionrdquo inAdvances in Neural Information Processing Systems18 pp 507ndash514 MIT Press 2005

[25] A J Izenman ldquoLinear discriminant analysisrdquo in Modern Mul-tivariate Statistical Techniques A J Izenman Ed pp 237ndash280Springer New York NY USA 2008

[26] Q Gu Z Li and J Han ldquoGeneralized fisher score for featureselectionrdquo httpsarxivorgabs12023725

[27] Z Gao C Cecati and S X Ding ldquoA survey of fault diagnosisand fault-tolerant techniquesmdashpart II fault diagnosis withknowledge-based and hybridactive approachesrdquo IEEE Trans-actions on Industrial Electronics vol 62 no 6 pp 3768ndash37742015

[28] A E Lazzaretti D M J Tax H Vieira Neto and V HFerreira ldquoNovelty detection and multi-class classification inpower distribution voltage waveformsrdquo Expert Systems withApplications vol 45 pp 322ndash330 2016

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Page 13: Research Article Vibration-Based Adaptive Novelty ...downloads.hindawi.com/journals/sv/2016/2417856.pdf · applied under a continuous learning framework. e structure of the method

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of