lsoffi · JessWYWong · Apr 18, 2020 · Apr 18, 2020 · Jun 9, 2020 · Jun 9, 2020
diff --git a/README_LooseID.md b/README_LooseID.md
@@ -0,0 +1,217 @@
+# egm_tnp_analysis
+
+
+## Install stable branch
+
+>git clone -b egm_tnp_CleanedCodeForUL_17March2020  https://github.com/lsoffi/egm_tnp_analysis
+
+> cd egm_tnp_analysis
+
+> source etc/scripts/setupUL.sh (need ROOT>6.10)
+
+> make
+
+## Quick description
+
+Package to handle analysis of tnp trees. The main tool is the python fitter
+
+   ===> tnpEGM_fitter.py
+
+The interface between the user and the fitter is solely done via the settings file
+
+   ===> etc/config/settings.py
+        - set the flags (i.e. Working points) that can be tested
+        - set the different samples and location
+        - set the fitting bins
+        - set the different cuts to be used
+        - set the output directory
+
+Help message:
+>    python tnpEGM_fitter.py --help
+
+The settings have always to be passed to the fitter
+>    python tnpEGM_fitter.py etc/config/settings.py
+
+Several "settings*.py" files are setup for different eras and are located all in etc/config/
+
+
+## The different fitting steps
+Everything will be done for a specific flag (so the settings can be the same for different flags). Hence, the flag to be used must be specified each time (named myWP in following).
+
+**1. Create the bining.** To each bin is associated a cut that can be tuned bin by bin in the settings.py
+   * After setting up the settings.py check bins 
+
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py  --flag passingLoose100XV2 --checkBins
+
+   * if  you need additinal cuts for some bins (cleaning cuts), tune cuts in the settings.py, then recheck. 
+     Once satisfied, create the bining
+
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py  --flag passingLoose100XV2 --createBins
+
+   * CAUTION: when recreacting bins, the output directory is overwritten! So be sure to not redo that once you are at step2
+
+**2. Create the histograms** with the different cuts... this is the longest step. Histograms will not be re-done later
+
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --createHists
+
+**3. Do your first round of fits.**
+   * nominal fit
+
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit
+
+   * MC fit to constrain alternate signal parameters [note this is the only MC fit that makes sense]
+
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --mcSig --altSig
+
+   * Alternate signal fit (using constraints from previous fits)
+
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit  --altSig
+
+   * Alternate background fit (using constraints from previous fits)
+
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit  --altBkg
+
+**4. Check fits and redo failed ones.** (there is a web index.php in the plot directory to vizualize from the web)
+   * can redo a given bin using its bin number ib. 
+     The bin number can be found from --checkBins, directly in the ouput dir (or web interface)
+
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --iBin ib
+
+   * the initial parameters can be tuned for this particular bin in the settings.py file. 
+      Once the fit is good enough, do not redo all fits, just fix next failed fit.
+      One can redo any kind of fit bin by bin. For instance the MC with altSig fit (if the constraint parameters were bad in the altSig for instance)
+
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --mcSig --altSig --iBin ib
+
+**4. Change fits function for 20 < pT < 35 bins
+   * add an extra Gaussian for altSig
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --mcSig --altSig --addGaus --iBin 1
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --mcSig --altSig --addGaus --iBin 3
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --mcSig --altSig --addGaus --iBin 4
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --mcSig --altSig --addGaus --iBin 5
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --mcSig --altSig --addGaus --iBin 6
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --mcSig --altSig --addGaus --iBin 8
+
+>   python tnpEGM_fitter.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --altSig --addGaus --iBin 1
+>   python tnpEGM_fitter.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --altSig --addGaus --iBin 3
+>   python tnpEGM_fitter.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --altSig --addGaus --iBin 4
+>   python tnpEGM_fitter.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --altSig --addGaus --iBin 5
+>   python tnpEGM_fitter.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --altSig --addGaus --iBin 6
+>   python tnpEGM_fitter.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit --altSig --addGaus --iBin 8
+
+   * rerun altBkg
+>   python tnpEGM_fitter_tuning.py etc/config/settings_pho_UL2017_LooseID.py --flag passingLoose100XV2 --doFit  --altBkg
+
+
+**5. egm txt ouput file.** Once all fits are fine, put everything in the egm format txt file
+
+>   python tnpEGM_fitter_tuning.py etc/config/setting_pho_UL2017_LooseID.py  --flag passingLoose100XV2 --sumUp
+
+
+## The settings file
+
+The settings file includes all the necessary information for a given setup of fit
+
+**- General settings.**
+
+    * flag: this is the Working point in the tnpTree  (pass: flagCut ; fail !flagCut). The name of the flag myWP is the one to be passed
+to the fitter. One can handle complex flags with a cut string (root cut string):
+> flag = { 'myWP' : myWPCutString } 
+
+    * baseOutDir: the output directory (will be created by the fitter)
+
+**- Sample definition.**
+
+    * tnpTreeDir: the directory in the tnpTree (different for phoID, eleID, reco, hlt)
+
+    * samplesDef: these are the main info
+      - data: data ntuple
+      - mcNom: nominal MC sample
+      - mcAlt: MC for generator syst
+      - tagSel: usually same as nominal MC + different base cuts: check the tag selection syst
+
+     The sample themselves are defined in etc/inputs/tnpSampleDef.py  (the attribute nEvts, lumi are not necessary for the fit per-se and can be omitted).
+     A list of samples for ICHEP2016 from official egm production is already setup properly in the package.
+     Then in the settings.py the sample can be specified further:
+     - sample.set_mctruth() : force mc truth on a MC sample
+     - sample.rename('xxx') : if a sample is used 2 times (like with 2 different sets of cuts), it has to be renamed for the second use
+     - sample.set_cut(cut)  : add a cut to define the sample (like a run range for data or an additional tag selection for alt tag selection syst)
+     - sample.set_weight('totWeight') : name of the weight to be used for MC reweighting (totWeight in this example). Note: the tool can handle a pu Tree to reweight a MC with different PU scenario (ask for further explanations and/or settings_rwPU.py example)
+
+
+**- Cuts.**
+
+    * cutBase: Define here the main cut
+    * additionalCuts: can be used for cleaning cuts (or put additionalCuts = None)
+
+**- Fitting parameters.**
+
+    Define in this section the init parameters for the different fit, can be tuned to improve convergence.
+
+
+## Changes for UL preparation:
+
+IMPORTANT-1: one single tnpSampleDef.py file for all eras.
+
+IMPORTANT-2: I used a specific name system that i would ask you to follow and maintain also in the future for next eras:
+
+If i want to run on PromptReco2018, then:
+
+1. In  tnpSampleDef.py i add an eos directory called eosPromptReco2018 and a module called PromptReco2018 with files inside
+
+2. I use a dedicated setting_ele_PromptReco2018.py and setting_pho_PromptReco2018.py where inside:
+
+    2.1 i specifically call the module  PromptReco2018
+
+    2.2 I call the output directory results/PromptReco2018/blabla
+
+
+At the moment the eras are:
+
+LegacyReReco2016
+
+ReReco2017
+
+PromptReco2018
+
+UL2017
+
+We will add soon:
+
+UL2018
+
+UL2016
+
+====================
+
+##  Update PU weights 
+
+1. Pileup files have to be computed with: python etc/scripts/pureweight.py
+
+Here one has to update the name of the directory whre the files will be located and the corresponding names.
+
+
+2.This pyhton uses the following: libPython/puReweighter.py.
+
+Here one nees to add the PU MC mix numbers that are available here: http://cmslxr.fnal.gov/source/SimGeneral/MixingModule/python/?v=CMSSW_9_4_0
+
+One also needs to update sample names here:
+etc/inputs/tnpSampleDef.py
+
+3.The data PU distrubtions can be computed using the following instructions (similar to what is done in step1):
+
+pileupCalc.py -i /afs/cern.ch/cms/CAF/CMSCOMM/COMM_DQM/certification/Collisions17/13TeV/PromptReco/Cert_294927-306462_13TeV_PromptReco_Collisions17_JSON.txt --inputLumiJSON /afs/cern.ch/cms/CAF/CMSCOMM/COMM_DQM/certification/Collisions17/13TeV/PileUp/pileup_latest.txt --calcMode true --minBiasXsec 69200 --maxPileupBin 100 --numPileupBins 100 pileup_2017_41fb.root
+
+Other pu files for each run, like pileup_2017_RUNB.root, pileup_2017_RUNC.root etc, can be copied from previous location. The previous location of pu directory can be found in github. For example, in this version, the location is,  
+/eos/cms/store/group/phys_egamma/swmukher/tnp/ID_V2_2017/PU
+
+
+The nvtx and rho histos are not needed because we will use the pu method (type = 0) for the reweight.
+
+NB: Before using these py in order to load the needed libraires one has to run: 
+export  PYTHONPATH=$PYTHONPATH:/afs/cern.ch/user/s/soffi/scratch0/TEST/CMSSW-10-0-0-pre3/src/egm_tnp_analysis 
+
+
+#### adding remote (Fabrice version)
+git remote add origin git@github.com:fcouderc/egm_tnp_analysis.git
+