16
23 Jan 2012 Background shape estimates using sidebands Paul Dauncey G. Davies, D. Futyan, J. Hays, M. Jarvis, M. Kenzie, C. Seez, J. Virdee, N. Wardle Imperial College London 1 Paul Dauncey

Background shape estimates using sidebands

  • Upload
    arich

  • View
    39

  • Download
    0

Embed Size (px)

DESCRIPTION

Background shape estimates using sidebands. Paul Dauncey G. Davies, D. Futyan, J. Hays, M. Jarvis, M. Kenzie, C. Seez, J. Virdee, N. Wardle Imperial College London. Shapes determined from sidebands. David Futyan presented sideband approach at last H gg meeting - PowerPoint PPT Presentation

Citation preview

Page 1: Background shape estimates using sidebands

23 Jan 2012

Background shape estimates using sidebands

Paul Dauncey

G. Davies, D. Futyan, J. Hays, M. Jarvis, M. Kenzie, C. Seez, J. Virdee, N. Wardle

Imperial College London

1Paul Dauncey

Page 2: Background shape estimates using sidebands

23 Jan 2012

Shapes determined from sidebands

2Paul Dauncey

•David Futyan presented sideband approach at last H meeting• https://indico.cern.ch/conferenceDisplay.py?confId=169763

• Data-driven method used to estimate background shape for Higgs limit, using 2D BDT including mass

• Today, show update of the method

Mass-factorised

“kinematic” BDT

Slice in BDT, fit to mass

Fit to BDT output shape

Select |M/MH|<2%

around signal

2D BDT

mass

Page 3: Background shape estimates using sidebands

23 Jan 2012

BDT output shapes

3Paul Dauncey

Background

Signal

• For this particular example with MH 120 GeV, BDT has 7 output bins

• Limit extracted from fitting observed shape in data to background + signal

• Critical to have accurate estimate of background shape in data• Also need robust estimate of shape errors, including bin-to-bin correlations

• Error matrix of background shape nuisance parameters in Higgs limit fit

Page 4: Background shape estimates using sidebands

23 Jan 2012

Sideband windows

4Paul Dauncey

• Zeroth order approximation; shape in sum of sidebands ~ same as shape in signal window• Sidebands give good estimate of shape of background in signal region

• Need procedure to quantify this statement

• Create several sideband windows, all with |M/MH| < 2%

• Three either side of signal region used for David’s results last week

Page 5: Background shape estimates using sidebands

23 Jan 2012

Study using 15 windows

5Paul Dauncey

•Whole mass range from 100 to 180 GeV divided into windows of same size in M/MH

• Allows 15 windows in total

•Due to careful construction of BDT input variables• Fractions per bin are almost

independent of central mass of any window

Page 6: Background shape estimates using sidebands

23 Jan 2012

Fraction dependence per BDT output bin

6Paul Dauncey

Bin 0 Bin 2Bin 1

Bin 4

Bin 3

Bin 5 Bin 6

• Some residual mass dependence of the fractions per bin is seen• Mainly (but possibly not entirely) due to background composition

changing with mass

• Small effect; first order correction applied to each sideband by using linear dependence from above fits

• This method was used by David for results shown last week

Page 7: Background shape estimates using sidebands

23 Jan 2012

Method shown had some approximations

7Paul Dauncey

• For a given Higgs mass, same data were used twice• Once in 15-window linear fit, again in sidebands

• Only 6 of the 15 of windows used for sidebands, so a weak correlation

•A Higgs signal might distort the 15-window linear fits• David showed this is a tiny effect last week

• The 15-window linear fits were done independently for each BDT bin• No fit constraint on fractions having to sum to one

• Scaling correction applied afterwards

• The 15-window linear fits were done with least-squares• Assuming error = N of expected value for each bin

Page 8: Background shape estimates using sidebands

23 Jan 2012

LL fit method

8Paul Dauncey

• Today, describe modified method for sidebands• Streamlines previous procedure; all done in one fit

• Statistically robust with no re-use of same data

• Allows accurate extraction of errors to use as nuisance parameters

•Want to fit to fraction in each BDT bin, for each mass window

•Want these fractions to have linear dependence on mass• Fraction fbn = pb0 + pb1(m−m0), for BDT bin b, mass m centred in window n

• Constant m0 can be any convenient value; take as Higgs mass

• With this choice, the pb0 give the fractions in the signal window directly

• Fractions must sum to 1 over BDT bins, for every mass window• b fbn = 1 = b pb0 + pb1(m−m0)

• Only possible for all m−m0 if b pb0 = 1 and b pb1 = 0

• Force constraint by setting p00 = 1 − b≠0 pb0 and p01 = − b≠0 pb1

• 7−1=6 each of pb0 and pb1 parameters for b≠0 →12 parameters

Page 9: Background shape estimates using sidebands

23 Jan 2012

LL fit method continued

9Paul Dauncey

• 2D fit to sideband windows × BDT output bins simultaneously• Normalisations of each sideband window used are free parameters

• One normalisation parameter per sideband

• Makes NO assumption (Pol5, Pow2, etc) on mass spectrum shape

• Fit with LL using full Poisson likelihood for each data bin• Correct even for low occupancy bins

• Binned LL so gives effective 2 goodness-of-fit measure

•NDoF count depends on number of windows NW

• 12 fraction parameters + Nw normalisation parameters

• Fit to 7×Nw data values, so NDoF = 6Nw−12

• E.g. For 3 sideband windows as used previously• Nw = 6 so NDoF = 24

Page 10: Background shape estimates using sidebands

23 Jan 2012

Consistency check fit

10Paul Dauncey

• Consistency fit to all NW=15 windows• Equivalent to that done by David previously

• Fit gives 2/NDoF = 83.75/78, probability = 30.8%• Linear assumption is reasonable, even over 15 windows

• Results effectively identical to those shown on slide 6• Lack of fraction sum constraint and N errors used previously were good

approximations

Bin 0 Bin 1 Bin 2 Bin 3

Bin 4 Bin 5 Bin 6

Page 11: Background shape estimates using sidebands

23 Jan 2012

Background fraction estimate fits

11Paul Dauncey

•Actual fit used for limit shapes has 3 sideband windows either side of signal region• Assumes linearity over mass range equivalent to 9.5 of the 15 sidebands

• Seems good assumption, given that fit to 15 windows looks reasonable

MH window

21 high mass sideband bins

21 low mass sideband bins E.g. fractions in BDT bin 1

Fit gives direct estimation of fraction

at MH

Page 12: Background shape estimates using sidebands

23 Jan 2012

Example: fit results for MH=120 GeV

12Paul Dauncey

• Errors from fit are always smaller than Poisson N errors of bin contents (used for limit fit)• Worse case: fit error ~ 1/3 N error

• Fit errors checked against toys; agree within 10%• Error estimate robust

1 fit

N

Linear y Log y

Page 13: Background shape estimates using sidebands

23 Jan 2012

Systematic (singular)

13Paul Dauncey

• Critical point: there is only ONE assumption in this whole method• BDT output fractions are assumed to be linear over fit range of sidebands

• Looks like a perfectly sensible assumption even over whole mass range

• Coming up with method to estimate a systematic associated with this assumption; report on this in later meeting

• Suspect dependence is mainly driven by change of background composition with mass• I.e. Born/box vs QCD prompt-prompt vs QCD prompt-fake vs QCD fake-

fake...

• Cannot accurately check due to lack of MC statistics

• With MC factor of ×10 data, could even predict linear dependence and check for consistency

• If found agreement, systematic could then be based on physical quantity; i.e. degree of uncertainty in relative background contributions

Page 14: Background shape estimates using sidebands

23 Jan 2012

Comparison of expected limits

14Paul Dauncey

Previous NewBaseline

•Very little difference; approximations in previous study ARE good• For both, only the systematic for the linear assumption is not included

• Minor difference at high mass seems to be due to adding constraint on fractions; one less DoF in new method

• Higgs expected limits (from Nick)

• Including full correlation matrix of nuisance parameters from fit

• Code for this implemented in h2gglobe

Page 15: Background shape estimates using sidebands

23 Jan 2012

Conclusions

15Paul Dauncey

• Streamlined method for handling of sidebands• Previous (accurate) approximations no longer needed

• Statistically robust, error matrix estimate accurate

• Results effectively identical

•Nuisance parameters from shape for limit fit are not large compared with expected data statistics• Arise directly from statistics of sidebands so will scale in the same way

with luminosity

•Only one (apparently very reasonable) assumption in the whole method• Systematic due to this still to be evaluated

• Given simple assumption, expected to be small and well-controlled

• No assumption on (and hence no systematic from) shape of mass spectrum

Page 16: Background shape estimates using sidebands

23 Jan 2012

Backup Slides

16Paul Dauncey