Quality of Curve Fitting P M V Subbarao Professor Mechanical Engineering Department Suitability of A...

Preview:

Citation preview

Quality of Curve Fitting

P M V SubbaraoProfessor

Mechanical Engineering Department

Suitability of A Model to a Data Set…..

Goodness of fit and the correlation coefficient

• A measure of how good the regression curve as a representation of the data.

• It is possible to fit two curves to data by • (a) treating x as the independent variable : y=ax+b, y as

the dependent variable or by• (b) treating y as the independent variable and x as the

dependent variable. • This is described by a relation of the form x= a'y +b'. • The procedure followed earlier can be followed again to

find best values of a’ and b’.

N

iii

N

ii

N

ii yxybya

11

'

1

2'

N

ii

N

ii xNbya

1

'

1

'

In matrix form

N

iii

N

ii

N

ii

N

ii

N

ii

yx

x

a

b

yy

yN

1

1

'

'

1

2

1

1

N

iii

N

ii

N

ii yxxbxa

111

2

N

ii

N

ii ybNxa

11

N

iii

N

ii

N

ii

N

ii

N

ii

yx

y

a

b

xx

xN

1

1

1

2

1

1

Recast the second fit line as:

'

'

'

1

a

bx

ay

'

1

ais the slope of this second line, which not same as the first line

2

11

2

111'''

N

ii

N

ii

N

ii

N

ii

N

iii

yyN

yxyxNabyax

•The ratio of the slopes of the two lines is a measure of how good the form of the fit is to the data.•In view of this the correlation coefficient ρ defined through the relation

'2

line Regression second of Slope

line Regressionfirst of slopeaa

2

11

2

111'''

N

ii

N

ii

N

ii

N

ii

N

iii

yyN

yxyxNabyax

2

11

2

111

N

ii

N

ii

N

ii

N

ii

N

iii

xxN

yxyxNabaxy

2

11

2

2

11

2

2

1112

N

ii

N

ii

N

ii

N

ii

N

ii

N

ii

N

iii

yyNxxN

yxyxN

2

11

2

2

11

2

111

N

ii

N

ii

N

ii

N

ii

N

ii

N

ii

N

iii

yyNxxN

yxyxN

Correlation Coefficient

• The sign of the correlation coefficient is determined by the sign of the covariance.

• If the regression line has a negative slope the correlation coefficient is negative

• while it is positive if the regression line has a positive slope. • The correlation is said to be perfect if ρ = ± 1.• The correlation is poor if ρ ≈ 0.• Absolute value of the correlation coefficient should be greater

than 0.5 to indicate that y and x are related!• In the case of a non-linear fit a quantity known as the index of

correlation is defined to determine the goodness of the fit. • The fit is termed good if the variance of the deviates is much

less than the variance of the y’s. • It is required that the index of correlation defined below to be

close to ±1 for the fit to be considered good.

N

i

N

ii

i

N

iii

N

yy

xfy

1

2

1

1

2

1

2=1.000 2=0.991 2=0.904

2=0.821 2=0.493 2=0.0526

Multi-Variable Regression Analysis

• Cases considered so far, involved one independent variable and one dependent variable.

• Sometimes the dependent variable may be a function of more than one variable.

• For example, the relation of the form

• is a common type of relationship for flow through an Orifice or Venturi.

• mass flow rate is a dependent variable and others are independent variables.

pipe

orifice

d

dApTpfm ,,,,

Set up a mathematical model as:e

pipe

orificedcb

d

dAp

RT

pam

Taking logarithm both sides

pipe

orifice

d

deAdpc

RT

pbam lnlnlnlnlnln

Simply: eodncmblay ln

where y is the dependent variable, l, m, n, o and p are independent variables and a, b, c, d, e are the fit parameters.

The least square method may be used to determine the fit parameters.

Let the data be available for set of N values of y, l, m, n, o, p values.

The quantity to be minimized is given by

N

iiiiiii fpeodncmblayError

1

2

What is the permissible value of N ?

The normal linear equations are obtained by the usual process of setting the first partial derivatives with respect to the fit parameters to zero.

N

iiiiiii fpeodncmblay

a

Error

1

02

N

iiiiiiii fpeodncmblayl

b

Error

1

02

N

ii

N

ii

N

ii

N

ii

N

ii

N

ii ypfoendmclbNa

111111

N

iii

N

iii

N

iii

N

iii

N

iii

N

ii

N

ii ylplfolenldmlclbla

111111

2

1

These equations are solved simultaneously to get the six fit parameters.

We may also calculate the index of correlation as an indicator of the quality of the fit. This calculation is left to you!

Power Law Curve for Multi Variable Regression

AnalysisTrue Power of Power Law……

Newton’s Law of Viscosity/Cooling

• 1701: Sir Newton published a paper titled: Scala Graduum Caloris.

• How to Realize the Law?• A general heat transfer surface may not be isothermal !?!• Fluid temperature will vary from inlet to exit !?!?!• The local velocity of flow will also vary from inlet to exit ?!?!• How to use Newton’s Law in a Real life?

Scale Analysis

Define characteristic parameters:

L : length

u ∞ : free stream velocity

T ∞ : free stream temperature

CA, ∞ : free stream concentration of species A

General parameters:

x, y : positions (independent variables)

u, v : velocities (dependent variables)

T : temperature (dependent variable)

C : species concentration (dependent variable)

also, recall that momentum requires a pressure gradient for the movement of a fluid:

p : pressure (dependent variable)

Define dimensionless variables:

L

xx *

L

yy *

u

uu*

u

vv*

s

s

TT

TT

sAA

sAA

CC

CCC

A

,,

,*

2*

u

pp

Similarity parameters can be derived that relate one set of flow conditions to geometrically similar surfaces for a different set of flow conditions:

0*

*

*

*

x

v

x

u

2*

*2

*

*

*

**

*

**

Re

1

y

u

x

p

y

vv

x

uu

L

2*

2

**

**

PrRe

1

yyv

xu

L

2*

*2

*

**

*

**

Re

1

y

C

Scy

Cv

x

Cu

AAA

L

Reynolds Analogy

2*

*2

*

**

*

**

Re

1

y

u

y

vv

x

uu

L

2*

2

**

**

Re

1

yyv

xu

L

2*

*2

*

**

*

**

Re

1

y

C

y

Cv

x

Cu

AAA

L

Reynolds Analogy

**ACu

At the wall :

*

*

**

*

y

C

yy

u A

Prandtl’s Momentum Boundary Layer

Define a parameter that describes a dimensionless temperature gradient at a fluid-surface interface:

Pr,,Re,*

**

0*

* dx

dpxf

k

hL

yNu L

fluidy

Local Nusselt Number:

Average Nusselt Number

Pr,,Re

*

*

dx

dpfNu Lavg

Define a parameter that describes a dimensionless concentration gradient at a fluid-surface interface:

Scdx

dpxf

D

Lh

y

CSh L

AB

m

y

A ,,Re,*

**

0

*

*

*

Local Sherwood Number:

Average Sherwood Number

Sc

dx

dpfSh Lavg ,,Re

*

*

Boundary Layer Analogies

• Heat and Mass Transfer Analogy:

• Two or more processes governed by dimensionless equations of the same form.

• Accordingly heat and mass transfer relations for a particular geometry are interchangeable.

nL

fluidy

prdx

dpxf

k

hL

yNu

*

**

0*

,Re,*

nL

AB

m

y

A Scdx

dpxf

D

Lh

y

CSh

*

**

0

*

*

,Re,*

then

nn Sc

ShNu

Pr

ShNuC f 2

Re

Replacing Nu and Sh by The Stanton number (St) and mass transfer Stanton number (Stm) respectively,

PrRe

Nu

VC

hSt

p

Sc

Sh

V

hSt m

m Re

mf StSt

C

2

Closing Remarks

• The goal of any experimental activity is to get the maximum realistic information about a system.

• It is not always true that higher number of measurements will give maximum realistic information.

• Larger the number of measurements, huge will be the total error that enters into the measurement equation.

• Larger number of measurements lead to more costly experimentation.

• It is important to obtain maximum realistic information with the minimum number of well designed experiments.

• An experimental program recognizes the major “factors” that affect the outcome of the experiment.

• The factors may be identified by looking at all the quantities that may affect the outcome of the experiment.

• The most important among these may be identified using a few exploratory experiments or

• from past experience or based on some underlying theory or hypothesis.

• The next thing one has to do is to choose the number of levels for each of the factors.

• The data will be gathered for these values of the factors by performing the experiments by maintaining the levels at these values.

Recommended