1
Modelling procedures for directed network of data blocks
Agnar Höskuldsson, Centre for Advanced Data Analysis, Copenhagen
Data structures:
- Directed network of data blocks
- Input data blocks
- Output data blocks
- Intermediate data blocks
Methods
- Optimization procedures for each passage through the network
- Balanced optimization of fit and prediction (H-principle)
- Scores, loadings, loading weights, regression coefficients for each data block
- Methods of regression analysis applicable at each data block
- Evaluation procedures at each data block
- Graphic procedures at each data block
2
Chemometric methods

1. Regression estimation, X, Y.
Traditional presentation: Yest = XB, and standard deviations for B.
Latent structure:
X = TP' + X0. X0 is not used.
Y = TQ' + Y0. Y0 is not explained.
2. Fit and precision. Both fit and precision are controlled.
3. Selection of score vectors
- As large as possible
- Describe Y as well as possible
- Modelling stops when no more are found (cross-validation)
4. Graphic analysis of latent structure
- Score and loading plots
- Plots of weight (and loading weight) vectors
Chemometric methods
3
5. Covariance as measure of relationship
X'Y for scaled data measures strength.
X1'Y = 0 implies that X1 is removed from the analysis.
6. Causal analysis, T = XR
From score plots we can infer about the original measurement values.
Control charts for score values can be related to contribution charts.
7. Analysis of X
Most of the analysis time is devoted to understanding the structure of X.
Plots are marked by symbols to better identify points in score or loading plots.
8. Model validation
Cross-validation is used to validate the results.
Bootstrapping (re-sampling from data) is used to establish confidence intervals.
Chemometric methods
4
9. Different methods
Different types of data/situations may require different types of methods.
One is looking for interpretations of the latent structure found.
10. Theory generation
Results from analysis are used to establish views/theories on the data.
Results motivate further analysis (groupings, non-linearity, etc.).
5
Partitioning data, 1
[Diagram: measurement data blocks X1, X2, ..., XL, response data blocks Y1, Y2, and reference data blocks Z1, Z2, Z3]
6
Partitioning data, 2
- There is often a natural sub-division of data.
- It is often required to study the role of a sub-block.
- A data block with few variables may 'disappear' among one with many variables; e.g. optical instruments often give many variables.
[Diagram: instrumental data X = (X1 X2 X3), labelled engineering, chemical process and quality, and response data Y = (Y1 Y2), labelled chemical results]
7
Path diagram 1
[Path diagram: data blocks X1, ..., X7 connected by directed arrows]

Examples:
- Production process
- Organisational data
- Diagram for sub-processes
- Causal diagram
8
Path diagram 2, schematic application of modelling
[Path diagram: input blocks X1, X2, X3 lead to X4 and X5; X4 and X5 lead to X6; X6 leads to X7. New samples x10, x20, x30 enter at the input blocks.]

x10 is a new sample from X1, x20 is a new one from X2, x30 is a new one from X3.
How do they generate new samples for X4, X5, X6 and X7?
Resulting estimating equations:
X4,est = X1B14 + X2B24 + X3B34
X5,est = X1B15 + X2B25 + X3B35
X6,est = X4B46 + X5B56
X7,est = X6B67
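As an illustration (not part of the slides), the propagation of new input samples through the estimating equations can be sketched in NumPy. All coefficient matrices, block dimensions and samples below are random placeholders; in practice the B matrices come from fitting the network.

```python
import numpy as np

rng = np.random.default_rng(3)
# Illustrative coefficient matrices (in practice fitted from the data blocks);
# B14 maps the 4 variables of X1 to the 2 variables of X4, and so on.
B14, B24, B34 = rng.standard_normal((4, 2)), rng.standard_normal((3, 2)), rng.standard_normal((5, 2))
B15, B25, B35 = rng.standard_normal((4, 3)), rng.standard_normal((3, 3)), rng.standard_normal((5, 3))
B46, B56 = rng.standard_normal((2, 2)), rng.standard_normal((3, 2))
B67 = rng.standard_normal((2, 1))

# New samples from the input blocks (row vectors)
x10, x20, x30 = rng.standard_normal(4), rng.standard_normal(3), rng.standard_normal(5)

# Follow the estimating equations through the network
x40 = x10 @ B14 + x20 @ B24 + x30 @ B34
x50 = x10 @ B15 + x20 @ B25 + x30 @ B35
x60 = x40 @ B46 + x50 @ B56
x70 = x60 @ B67
print(x70.shape)   # (1,)
```

Each estimated sample is immediately reusable as input for the next layer of the network, which is the point of the directed structure.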
9
Path diagram 3
[Path diagram: the same network of blocks X1, ..., X7, with the input blocks aligned to time t1 and the later blocks to time t2]

Data blocks can be aligned to time. Modelling can start at time t2.
10
Notation and schematic illustrations
X: instrumental data; Y: response data

[Diagram: weight vector w gives the score vector t in X; loading vector q and Y-score vector u in Y]

w: weight vector (to be found)
t: score vector, t = Xw = w1x1 + ... + wKxK
q: loading vector, q = Y't = [(y1't), ..., (yM't)]
u: Y-score vector, u = Yq = q1y1 + ... + qMyM

Vectors are collected into matrices, e.g. T = (t1, ..., tA).

Adjustments:
X ← X − tp'/(t't)
Y ← Y − tq'/(t't)
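A minimal sketch of this notation in NumPy (not from the slides): the data are random, and the weight vector w is here taken proportional to X'y1 for the first response, a common PLS-type choice; the slides leave the optimisation of w to later sections.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((20, 5))   # instrumental data: 20 samples, 5 variables
Y = rng.standard_normal((20, 2))   # response data: 2 response variables

# Weight vector w: PLS1-style choice w = X'y1, normalised to unit length
# (an assumption for this sketch)
w = X.T @ Y[:, 0]
w /= np.linalg.norm(w)

t = X @ w          # score vector, t = Xw = w1x1 + ... + wKxK
q = Y.T @ t        # loading vector, q = [(y1't), ..., (yM't)]
u = Y @ q          # Y-score vector, u = Yq = q1y1 + ... + qMyM

# Adjustments (deflation) before the next component is computed
p = X.T @ t
X = X - np.outer(t, p) / (t @ t)
Y = Y - np.outer(t, q) / (t @ t)

print(np.allclose(X.T @ t, 0))   # True: the adjusted X is orthogonal to t
```

After the adjustment, X and Y carry no information along t, so the next component is computed from what remains.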
11
Conjugate vectors 1

r: t = Xw, p = X't; pa'rb = 0 for a ≠ b.

r: t = Xq; qa'rb = 0 for a ≠ b.

r and s: t = Xw, p = X'v; pa'rb = 0 and ta'sb = 0 for a ≠ b.
12
Conjugate vectors 2
The conjugate vectors R = (r1, r2, ..., rA) satisfy T = XR.

Latent structure solution:

X = TP' + X0, where X0 is the part of X that is not used
Y = TQ' + Y0, where Y0 is the part of Y that could not be explained

Y = TQ' + Y0 = X(RQ') + Y0 = XB + Y0, for B = RQ'
The conjugate vectors are always computed together with the score vectors.
When regression on score vectors has been computed, the regression on the original variables is computed as shown.
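A small numerical sketch of these relations (random data, two components; as assumptions for the sketch, w is chosen as X'y1 and the loadings are normalised by t't so that X = TP' + X0 holds exactly). It confirms T = XR on the original X and B = RQ'.

```python
import numpy as np

rng = np.random.default_rng(0)
X0 = rng.standard_normal((30, 6))   # keep the original X for checking T = XR
Y0 = rng.standard_normal((30, 2))
X, Y = X0.copy(), Y0.copy()

R_list, P_list, Q_list, T_list = [], [], [], []
for a in range(2):                  # two components
    w = X.T @ Y[:, 0]               # illustrative weight choice
    w /= np.linalg.norm(w)
    t = X @ w
    p = X.T @ t / (t @ t)           # normalised loadings: X = TP' + X0 exactly
    q = Y.T @ t / (t @ t)
    # conjugate vector: t_a = X0 r_a holds on the *original* data
    r = w.copy()
    for r_b, p_b in zip(R_list, P_list):
        r -= r_b * (p_b @ w)
    X = X - np.outer(t, p)          # deflation
    Y = Y - np.outer(t, q)
    R_list.append(r); P_list.append(p); Q_list.append(q); T_list.append(t)

R, Q, T = map(np.column_stack, (R_list, Q_list, T_list))
print(np.allclose(X0 @ R, T))       # True: T = XR for the original X
B = R @ Q.T                         # regression on the original variables
print(np.allclose(X0 @ B, T @ Q.T)) # True: Y_est = XB = TQ'
```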
13
Optimization procedure, 1
Two data blocks X1, X2: find the weight vector w1 such that, with t1 = X1w1 and q2 = X2't1, |q2|² is maximized.

One data block X1: find the weight vector w1 such that |t1|², with t1 = X1w1, is maximized.
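Both criteria have closed-form solutions, which can be checked numerically (random data; the unit-length constraint on w1 is the usual convention, assumed here). The one-block criterion |t1|² is solved by the dominant eigenvector of X1'X1, the two-block criterion |q2|² by the dominant right singular vector of X2'X1.

```python
import numpy as np

rng = np.random.default_rng(6)
X1 = rng.standard_normal((30, 5))
X2 = rng.standard_normal((30, 4))

# One block: |t1|² = w'X1'X1w is maximised over unit-length w by the
# dominant eigenvector of X1'X1 (the first principal component direction)
eigvals, eigvecs = np.linalg.eigh(X1.T @ X1)
w_one = eigvecs[:, -1]               # eigenvector of the largest eigenvalue
t1 = X1 @ w_one
print(np.isclose(t1 @ t1, eigvals[-1]))    # True

# Two blocks: |q2|² = |X2'X1w|² is maximised over unit-length w by the
# dominant right singular vector of X2'X1
U, s, Vt = np.linalg.svd(X2.T @ X1)
w_two = Vt[0]
q2 = X2.T @ (X1 @ w_two)
print(np.isclose(q2 @ q2, s[0] ** 2))      # True
```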
14
Three data blocks
[Diagram: optimization passage through the network. Starting from |qz|² maximized for a reference block Z, weight vectors w give score vectors t and loading vectors q that are passed from block to block: X basis, Y estimated, Y basis, Z estimated.]

Adjustments:
t1 describes X1: X1 ← X1 − t1p1'/(t1't1), p1 = X1't1.
t1 describes X2: X2 ← X2 − t1q2'/(t1't1), q2 = X2't1.
q2 describes X3: X3 ← X3 − t3q2'/(q2'q2), t3 = X3q2.
t3 describes X4: X4 ← X4 − t3q4'/(t3't3), q4 = X4't3.
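The four adjustment steps can be sketched in NumPy. The data and the weight vector are random and the block sizes illustrative; the dimension assumptions are that X1 and X2 share objects, X2 and X3 share variables, and X3 and X4 share objects.

```python
import numpy as np

rng = np.random.default_rng(4)
X1 = rng.standard_normal((10, 4))   # input block, 10 objects
X2 = rng.standard_normal((10, 6))   # same objects as X1
X3 = rng.standard_normal((14, 6))   # same variables as X2, 14 objects
X4 = rng.standard_normal((14, 3))   # same objects as X3

w = rng.standard_normal(4)          # some weight vector for X1 (normally optimised)
t1 = X1 @ w

# t1 describes X1 and X2 (deflation along objects)
p1 = X1.T @ t1
q2 = X2.T @ t1
X1 = X1 - np.outer(t1, p1) / (t1 @ t1)
X2 = X2 - np.outer(t1, q2) / (t1 @ t1)

# q2 carries the information on to X3 (deflation along variables)
t3 = X3 @ q2
X3 = X3 - np.outer(t3, q2) / (q2 @ q2)

# t3 describes X4
q4 = X4.T @ t3
X4 = X4 - np.outer(t3, q4) / (t3 @ t3)

# Each block is now orthogonal to the vector that described it
print(np.allclose(X1.T @ t1, 0), np.allclose(X3 @ q2, 0), np.allclose(X4.T @ t3, 0))
```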
15
Optimization procedure, 2
Two input and two output data blocks:

[Diagram: input blocks X1, X2 with weight vectors w1, w2 and score vectors t1, t2; output blocks X3, X4 with loading vectors q13, q23, q14, q24]

Find w1 and w2: maximize |q13 + q23 + q14 + q24|²

Two input, one intermediate and one output data block:

[Diagram: input blocks X1, X2 with w1, w2 and t1, t2; intermediate block X3 with loading vectors q13, q23; output block X4 with loading vectors q134, q234]

Find w1 and w2: maximize |q134 + q234|²
16
Balanced optimization of fit and prediction (H-principle)

Linear regression
In linear regression we are looking for a weight vector w such that the resulting score vector t = Xw is good!

The basic measure of quality is the prediction variance for a sample x0. Under standard assumptions, and assuming negligible bias, it can be written as

F(w) = Var(y(x0)) = k[1 − (y't)²/(t't)][1 + t0²/(t't)].

It can be shown that F(cw) = F(w) for all c > 0. Choose c such that (t't) = 1. Then

F(w) = k[1 − (y't)²][1 + t0²].

To get a prediction variance as small as possible, it is natural to choose w such that (y't)² becomes as large as possible:

maximize (y't)² = maximize |q|² (PLS regression)
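A quick numerical check of the last step (random data; as an assumption, the maximisation is taken over unit-length w, the usual PLS convention, rather than unit-length t): w proportional to X'y maximises (y't)², by the Cauchy-Schwarz inequality applied to (X'y)'w.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.standard_normal((25, 4))
y = rng.standard_normal(25)

# PLS weight: w proportional to X'y maximises (y't)² = (y'Xw)² over
# unit-length w, since (y'Xw)² = ((X'y)'w)² <= |X'y|² |w|²
w = X.T @ y
w /= np.linalg.norm(w)
best = (y @ (X @ w)) ** 2

# No random unit-length weight vector does better
for _ in range(1000):
    v = rng.standard_normal(4)
    v /= np.linalg.norm(v)
    assert (y @ (X @ v)) ** 2 <= best + 1e-12
```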
17
Optimization procedure, 3
Weighing along objects (rows) (the same algorithm, but using the transposes):

[Diagram: weight vector v1 along the rows of X1 gives the loading p1, which gives the score t2 in X2]
Task: find the weight vector v1 that maximizes |t2|².

[Diagram: as above, extended with a third block X3 and its loading q3]
Task: find the weight vector v1 that maximizes |q3|².
18
Optimization procedure, 4
[Diagram: w1 gives the score t1 in X1; the loading p1 is used as weight vector for X2, giving t2; q3 is the loading of X3]

Task: find the weight vector w1 that maximizes |q3|², where

q3 = X3't2 = X3'X2p1 = X3'X2X1't1 = X3'X2X1'X1w1
Regression equations:
X3,est = X2B23
X2,est = X1B12
X1,est = X1B11
If p1 is a good weight vector for X2, a good result may be expected.
Pre-processing may be needed to find variables in X1 and in X2 that are highly correlated to each other.
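The chain of substitutions can be verified numerically (random data; the dimension assumptions are that X1 and X2 share the same variables and that X2 and X3 share the same objects):

```python
import numpy as np

rng = np.random.default_rng(2)
K = 5                                # X1 and X2 share the same K variables
X1 = rng.standard_normal((12, K))    # first block: 12 objects
X2 = rng.standard_normal((18, K))    # second block: 18 objects
X3 = rng.standard_normal((18, 3))    # third block: same objects as X2
w1 = rng.standard_normal(K)

t1 = X1 @ w1                         # score vector in X1
p1 = X1.T @ t1                       # loading of X1, used as weight vector for X2
t2 = X2 @ p1                         # score vector in X2
q3 = X3.T @ t2                       # loading of X3: the quantity to maximise

# The whole chain collapses to q3 = X3'X2X1'X1w1
print(np.allclose(q3, X3.T @ X2 @ X1.T @ X1 @ w1))   # True
```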
19
Three types of reports
Reports:
- How a data block is doing in a network
- How a data block can be described by the data blocks that lead to it
- How a data block can be described by one data block that leads to it

[Diagram: block Xi viewed in the full network, described by all of its predecessors Xi-1, Xi-2, Xi-3, or described by a single predecessor Xi-2]
20
Production data, 1
[Diagram: X1 and X2 lead to Y]

X1: Process parameters, 8 variables
X2: NIR data, 1560 variables (reduced to 120)

No   |X2|²    |Y|²     |X|²     |Y|²
 1   78,961   51,483   74,969   51,964
 2   91,538   67,559   86,786   69,553
 3   96,351   76,291   91,627   80,643
 4   97,942   81,383   95,373   85,058
 5   98,620   83,900   95,919   89,056
 6   98,967   85,705   97,054   90,050
 7   99,205   87,917   97,508   91,990
 8   99,294   90,472   97,990   93,455
 9   99,349   92,183   98,667   94,020
10   99,426   92,947   98,896   94,708
11   99,606   93,084   99,103   95,082
12   99,657   93,376   99,202   95,740

X1 'disappears' in the NIR data X2.
21
Production data, 2
At each step:

[Diagram: X1 and X2, with weight vectors w1, w2 and score vectors t1, t2, describe Y]

Results for X1, process parameters:
5 score vectors explain 11.92% of Y.

No  Step    |Y|²
 1    1    4,957
 2    2    9,315
 3    5   10,393
 4    6   10,929
 5    8   11,920

Results for X2, NIR data:
12 score vectors explain 84.141% of Y.

No  Step    |Y|²
 1    1   51,483
 2    2   69,121
 3    3   73,070
 4    4   76,506
 5    5   78,669
 6    6   80,923
 7    7   82,129
 8    8   82,552
 9    9   83,132
10   10   83,590
11   11   83,881
12   12   84,141

In total 96.06% = 11.920% + 84.14% of Y is explained.
At each step the score vectors are evaluated. Non-significant ones are excluded.
22
Production data, 3
[Plot of estimated versus observed quality variable using only score vectors for the process parameters; both axes run from −0.4 to 0.3. R² = 0.7512]

[Diagram: blocks X1, X2 and Y with R²-values 75.12%, 87.75% and 96.06%]

The process parameters contribute marginally, by 11.92%. But if only they were used, they would explain 75.12% of the variation of Y.
23
Directed network of data blocks
[Diagram: input blocks, intermediate blocks and output blocks connected by directed arrows]

Input blocks: give weight vectors for the initial score vectors.
Intermediate blocks: are described by previous blocks and give score vectors for the succeeding blocks.
Output blocks: are described by previous blocks.
24
Magnitudes computed between two data blocks
[Diagram: data blocks Xi → Xk in a path]

Ti: score vectors
Qi: loading vectors
Bi: regression coefficients
Measures of precision
Measures of fit
Etc.

Different views:
a) As a part of a path
b) If the results are viewed marginally
c) If only Xi → Xk is considered
25
Stages in batch processes
[Diagram: batches over time, divided into stages 1, 2, ..., K with data blocks X1, X2, ..., XK and the final quality Y]

Paths:
X1 → X2 → ... → XK → Y: given a sample x10, the path model gives estimated samples for the later blocks.
[X1 X2 X3] → X4 → Y: given values of (x10 x20 x30), estimates for the values of x4 and y are given.
[X1 X2 X3] → [X4 X5] → Y: given values of (x10 x20 x30), estimates for the values of (x4 x5) and y are given.
26
Schematic illustration of the modelling task for sequential processes

[Diagram: stages over time. X1: initial conditions; X2, X3: known process parameters up to now; X4: the next stage; later stages follow; Y: the response]
27
Plots of score vectors
[Diagram: score vectors t1, t2, ..., tL from the blocks X1, X2, ..., XL; plots of t1 versus t2 (X1 - X2) up to t1 versus tL (X1 - XL)]

The plots will show how the changes are relative to the first data block.
28
Graphic software to specify paths
[Diagram: blocks X1, X2, X3 connected to X4, X5, ..., XL on the screen]

Blocks are dragged onto the screen and the relationships are specified.
29
Pre-processing of data
• Centring. If desired, centring of the data is carried out.
• Scaling. In the computations all variables are scaled to unit length (or unit standard deviation if centred). It is checked whether scaling disturbs a variable, e.g. if it is constant except for two values, or if the variable is at the noise level. When the analysis has been completed, values are scaled back so that units are in original values.
• Redundant variable. It is investigated whether a variable does not contribute to the explanation of any of the variables that the present block leads to. If it is redundant, it is eliminated from the analysis.
• Redundant data block. It is investigated whether a data block can provide a significant description of the blocks that it is connected to later in the network. If it cannot contribute to the description of those blocks, it is removed from the network.
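The centring/scaling steps and the scaling back can be sketched as follows (random data; the noise-level threshold is an illustrative placeholder, not a value from the slides):

```python
import numpy as np

rng = np.random.default_rng(5)
# Random data with different scales and offsets per variable
X = rng.standard_normal((20, 5)) * rng.uniform(0.5, 10, 5) + rng.uniform(-3, 3, 5)

mean = X.mean(axis=0)
Xc = X - mean                        # centring (if desired)

std = Xc.std(axis=0, ddof=1)
noise_level = 1e-8                   # illustrative threshold for near-constant variables
keep = std > noise_level             # variables at the noise level are flagged
Xs = Xc[:, keep] / std[keep]         # scaling to unit standard deviation

# ... analysis on Xs ...

# Scale back so that units are in original values
X_back = Xs * std[keep] + mean[keep]
print(np.allclose(X_back, X[:, keep]))   # True
```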
30
Post-processing of results
Score vectors computed in the passages through the network are evaluated in the analysis at each passage. Apart from the input blocks, the score vectors found between passages are not independent. The score vectors found in a relationship Xi → Xj are evaluated to see whether all are significant or whether some should be removed for this relationship.

Cross-validation as in standard regression methods.
Confidence intervals for parameters by a resampling technique.
31
International workshop on
Multi-block and Path Methods
24-30 May 2009, Mijas, Malaga, Spain