Efficiency loss for high pt tracks #248

areinsvo · 2020-01-06T20:32:44Z

As discussed, there is an ~20% efficiency loss for mkFit relative to CMSSW for tracks with pt > ~50 GeV.

This can be seen in the standalone, MTV-like validation plots where sim tracks are required to have a corresponding seed (plots here are with the high pt 10μ sample using the offline quadruplet seeds).

This was also seen in Matti’s plots using the HLT quadruplet seeds (compare black and red curves).

One (unproven) theory for the efficiency loss might be that perhaps the sizes of the search windows are too small for straight tracks that have have small errors. For instance, as far as I can tell, there is a minimum dq value set, but there is no minimum window size in dphi. See code here.

It is important to note that for the SIMVAL_MTV_SEED plots, the ONLY way to lose efficiency is if we add incorrect hits to the track. The sim tracks in the denominator of the efficiency are required to be matched to a seed, meaning that all 4 hits in the seed belong to the sim track. See here. This means that if we add 0 hits to the seed, the 4-hit track will by definition be matched to the sim track and be in the numerator of our efficiency. Therefore, the efficiency loss shown in the plots above has to have a different origin than what Matevz was investigating for triplets in PR242.

To investigate this issue, I will make a list of specific tracks that are affected, using the high pt 10μ sample. Slava is working on remaking the 10μ sample using HLT quadruplets for seeds, so we can confirm that fixing the issue offline also fixes the issue in the HLT configuration.

osschar · 2020-01-06T21:00:11Z

dphi_min is taken in, see:

mkFit/mkFit/MkFinder.cc

Line 225 in e864ed9

const auto calcdphi = [&](float dphi2) {

This all got a bit convoluted following several rounds of optimizations :)

areinsvo · 2020-01-06T21:02:19Z

Good to know. Thanks for pointing that out!

areinsvo · 2020-01-23T17:55:58Z

For posterity:
Matevz discovered that the affected tracks had negative entries in the covariance matrix. Manually changing the negative covariances to 1 allowed the seeds to get processed, which fixed the efficiency issue, at the expense of an increase in duplicate rate:
11.4% before, 18.4% after for offline high pt 10muon events
8.5% before, 9.4% after for ttbar PU50 events

All of the plots can be seen in PR250. Once that PR gets merged, we can probably close this issue, unless people prefer to keep it open to remind ourselves to revisit the duplicate rate increase.

areinsvo · 2020-02-20T17:05:40Z

We no longer have a loss of efficiency at high pt, so I’m going to close this issue. I opened a follow-up issue for implementing a proper fix for the negative entries in the covariance matrix.

areinsvo mentioned this issue Jan 23, 2020

Duplicate rate increase at high pt #251

Open

areinsvo mentioned this issue Feb 20, 2020

Negative entries in covariance matrix #254

Open

areinsvo closed this as completed Feb 20, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Efficiency loss for high pt tracks #248

Efficiency loss for high pt tracks #248

areinsvo commented Jan 6, 2020

osschar commented Jan 6, 2020

areinsvo commented Jan 6, 2020

areinsvo commented Jan 23, 2020

areinsvo commented Feb 20, 2020

Efficiency loss for high pt tracks #248

Efficiency loss for high pt tracks #248

Comments

areinsvo commented Jan 6, 2020

osschar commented Jan 6, 2020

areinsvo commented Jan 6, 2020

areinsvo commented Jan 23, 2020

areinsvo commented Feb 20, 2020