You can find a compiled (maybe not the latest) version of the talk here: http://ekpwww.ekp.kit.edu/~tkeck/MultivariateClassificationLecture.pdf
The lecture explains algorithms and concepts used in multivariate classification. Each chapter explains a specific algorithm and an associated idea or concept. Most parts apply to machine learning in general.
The lecture is intended to engage the interest of students; it is not meant to be used as a stand-alone introduction to the topic without verbal explanations.
The content of the lecture is roughly separated into two parts. The first part focuses on traditional methods used for multivariate classification in High Energy Physics (excluding neural networks). The second part introduces neural networks and recent developments in the field of Deep Learning. This second part is not HEP-specific and summarizes interesting ideas from other fields such as image recognition.
- Motivation for Multivariate Classification in HEP
- Neyman-Pearson Lemma & Supervised Learning
- Discriminant Analysis & Analytical Solutions
- Decision Trees & Model Complexity
- Boosted Decision Trees & Ensemble Methods
- Support Vector Machines & Kernel Trick
- sPlot & Data Driven Techniques
- Artificial Neural Networks & Deep Learning
- Convolutional Neural Networks & Representation Learning
- Recurrent Neural Networks & Sequential Data Processing
- Relational Neural Networks & Network Architectures
- Adversarial Networks & Generative Models
- Playing Games & Reinforcement Learning
- Conclusion
- Backup
This lecture uses the work of other people, in particular as interesting examples or applications of Deep Learning techniques. The authors are also cited on the corresponding slides.
- Higgs Paper used in the Motivation: https://arxiv.org/abs/1207.7235
- First usage of Deep Learning in HEP (to my knowledge): https://arxiv.org/abs/1402.4735
- On the complexity of bird detection in images: https://xkcd.com/1425/
- Park Or Bird from Flickr: http://code.flickr.net/2014/10/20/introducing-flickr-park-or-bird/ (parkorbird.flickr.com seems to be offline)
- Blog post by Andrej Karpathy on recurrent neural networks: http://karpathy.github.io/2015/05/21/rnn-effectiveness/
- Neural image caption generator by Google: https://arxiv.org/abs/1411.4555
- Neural network for relational reasoning by DeepMind: https://arxiv.org/abs/1706.01427
- GANs applied to faces by indico: https://arxiv.org/abs/1511.06434
- Using adversarial networks in HEP: https://arxiv.org/abs/1611.01046
- Adversarial examples by I. Goodfellow et al.: https://arxiv.org/abs/1412.6572
- Playing Atari with Reinforcement Learning by DeepMind: https://arxiv.org/abs/1312.5602
- AlphaGo by DeepMind: https://www.nature.com/nature/journal/v529/n7587/full/nature16961.html
Some of the images are taken from Wikipedia:
- https://en.wikipedia.org/wiki/Support_vector_machine#/media/File:Svm_max_sep_hyperplane_with_margin.png
- https://en.wikipedia.org/wiki/Backpropagation_through_time#/media/File:Unfold_through_time.png
- https://en.wikipedia.org/wiki/Long_short-term_memory#/media/File:Peephole_Long_Short-Term_Memory.svg
- https://commons.wikimedia.org/wiki/File:GO(GAME).JPG
- https://en.wikipedia.org/wiki/File:Supermarkt.jpg
- https://en.wikipedia.org/wiki/File:Philippine-stock-market-board.jpg
The iceberg image about deep learning is taken from a talk by H. Voss, and different versions of it can be found all over the internet. I couldn't identify the original source, so if you think you are the original author of the image, please contact me so I can add a reference or remove the image.
An incomplete list of interesting books and papers
Christopher M. Bishop. Pattern Recognition and Machine Learning.
Trevor Hastie, Robert Tibshirani, and Jerome Friedman. The Elements of Statistical Learning.
J. Han, M. Kamber, and J. Pei. Data Mining: Concepts and Techniques.
Focused on HEP
O. Behnke, K. Kröninger, G. Schott, T. Schörner-Sadenius. Data Analysis in High Energy Physics: A Practical Guide to Statistical Methods.
Boosted decision trees are the workhorse of classification and regression in HEP. They offer good out-of-the-box performance, are reasonably fast, and are robust. A minimal usage sketch follows the references below.
Original papers
Jerome H. Friedman. „Stochastic gradient boosting“ http://statweb.stanford.edu/~jhf/ftp/stobst.pdf
Jerome H. Friedman. „Greedy Function Approximation: A Gradient Boosting Machine“ http://statweb.stanford.edu/~jhf/ftp/trebst.pdf
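As a minimal illustration (a sketch on toy data, not part of the lecture material), a gradient-boosted classifier can be trained with scikit-learn; setting subsample < 1.0 selects the stochastic variant described in the first paper:

```python
# Minimal stochastic gradient boosting sketch with scikit-learn (toy data).
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=10000, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# subsample < 1.0 trains each tree on a random fraction of the events,
# which corresponds to Friedman's "stochastic" gradient boosting.
clf = GradientBoostingClassifier(n_estimators=100, max_depth=3,
                                 learning_rate=0.1, subsample=0.5,
                                 random_state=0)
clf.fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))
```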
uGBoost
Boosting to uniformity makes it possible to enforce a uniform selection efficiency of the classifier with respect to a chosen variable, e.g. to leave that variable untouched for a subsequent fit (see the sketch after the references below).
Justin Stevens and Mike Williams. „uBoost: A boosting method for producing uniform selection efficiencies from multivariate classifiers“. https://arxiv.org/abs/1305.7248
Alex Rogozhnikov et al. „New approaches for boosting to uniformity“. http://iopscience.iop.org/article/10.1088/1748-0221/10/03/T03002/meta
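A minimal sketch of boosting to uniformity using the hep_ml package (listed further below). The uBoostClassifier parameters shown here follow the hep_ml documentation, but treat the exact API as an assumption and check the package for details:

```python
# Illustrative toy example: enforce a selection efficiency that is uniform
# in the candidate mass, so the mass spectrum can still be fitted afterwards.
# NOTE: the uBoostClassifier signature is taken from the hep_ml docs and
# should be double-checked against the installed version.
import numpy as np
import pandas as pd
from hep_ml.uboost import uBoostClassifier

n = 5000
data = pd.DataFrame({
    "mass": np.random.uniform(5.0, 5.5, n),  # variable to keep untouched
    "x1":   np.random.normal(size=n),
    "x2":   np.random.normal(size=n),
})
labels = (data["x1"] + np.random.normal(size=n) > 0).astype(int)  # toy labels

# Uniformity is requested in "mass" for the signal (label 1); the
# classifier itself is trained on the other features only.
clf = uBoostClassifier(uniform_features=["mass"], uniform_label=1,
                       train_features=["x1", "x2"], n_estimators=50)
clf.fit(data, labels)
signal_probability = clf.predict_proba(data)[:, 1]
```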
Deep Learning is the revolution currently ongoing in the field of machine learning. Everything from self-driving cars and speech recognition to playing Go can be accomplished using Deep Learning. There is a lot of research going on in HEP on how to take advantage of Deep Learning in our analyses.
Standard textbook
I. Goodfellow, Y. Bengio, A. Courville. Deep Learning (Adaptive Computation and Machine Learning) available online http://www.deeplearningbook.org/
First paper on usage in HEP (to my knowledge)
Pierre Baldi, Peter Sadowski, and Daniel Whiteson. „Searching for Exotic Particles in High-Energy Physics with Deep Learning“ https://arxiv.org/abs/1402.4735
Why does Deep Learning work?
Henry W. Lin, Max Tegmark, and David Rolnick. Why does deep and cheap learning work so well? https://arxiv.org/abs/1608.08225
Famous papers by the founding fathers of Deep Learning
Yann Lecun, Yoshua Bengio, and Geoffrey Hinton. „Deep learning“. https://www.cs.toronto.edu/~hinton/absps/NatureDeepReview.pdf
Yoshua Bengio, Aaron C. Courville, and Pascal Vincent. „Unsupervised Feature Learning and Deep Learning: A Review and New Perspectives“. https://arxiv.org/abs/1206.5538
Adversarial Networks:
Adversarial networks make it possible to prevent a neural network from using certain information in its prediction; the corresponding minimax objective is sketched below the reference.
Gilles Louppe, Michael Kagan, and Kyle Cranmer. „Learning to Pivot with Adversarial Networks“. https://arxiv.org/abs/1611.01046
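In a nutshell, the paper trains the classifier f (parameters θ_f) jointly against an adversary r (parameters θ_r) that tries to infer the protected quantity from the classifier output; the hyper-parameter λ steers the trade-off between classification power and independence:

```latex
% Pivot objective from Louppe, Kagan, Cranmer (arXiv:1611.01046):
% L_f is the classification loss, L_r the adversary's loss.
\hat{\theta}_f,\, \hat{\theta}_r
  = \arg\min_{\theta_f} \max_{\theta_r}
    \left[ \mathcal{L}_f(\theta_f) - \lambda\, \mathcal{L}_r(\theta_f, \theta_r) \right]
```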
Hyperparameter Optimization
All multivariate methods have hyper-parameters, i.e. parameters which influence the performance of the algorithm and have to be set by the user. It is common to optimize these hyper-parameters automatically. There are four main approaches: grid search, random search, gradient-based optimization, and Bayesian optimization. A minimal random-search sketch follows the references below.
Random search
James Bergstra and Yoshua Bengio. „Random Search for Hyper-parameter Optimization“ http://www.jmlr.org/papers/volume13/bergstra12a/bergstra12a.pdf
Gradient-based
Dougal Maclaurin, David Duvenaud, and Ryan Adams. „Gradient-based Hyperparameter Optimization through Reversible Learning“. http://jmlr.org/proceedings/papers/v37/maclaurin15.pdf
Bayesian
Jasper Snoek, Hugo Larochelle, and Ryan P Adams. „Practical Bayesian Optimization of Machine Learning Algorithms“. http://papers.nips.cc/paper/4522-practical-bayesian-optimization-of-machine-learning-algorithms.pdf
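A minimal random-search sketch with scikit-learn's RandomizedSearchCV (toy data; the hyper-parameter ranges are illustrative, not recommendations):

```python
# Random search over BDT hyper-parameters with scikit-learn (toy example).
from scipy.stats import randint, uniform
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import RandomizedSearchCV

X, y = make_classification(n_samples=2000, n_features=10, random_state=0)

# Each hyper-parameter is given a distribution to sample from.
param_distributions = {
    "n_estimators": randint(50, 300),
    "max_depth": randint(2, 6),
    "learning_rate": uniform(0.01, 0.3),  # uniform on [0.01, 0.31]
    "subsample": uniform(0.5, 0.5),       # uniform on [0.5, 1.0]
}

search = RandomizedSearchCV(GradientBoostingClassifier(random_state=0),
                            param_distributions, n_iter=20, cv=3,
                            random_state=0)
search.fit(X, y)
print("best parameters:", search.best_params_)
print("best CV score:", search.best_score_)
```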
sPlot
With sPlot you can train a classifier directly on data. Similar methods are side-band subtraction and training data vs. Monte Carlo; both are described in the second paper below. A sketch of the sWeight training trick follows the references.
Muriel Pivk and Francois R. Le Diberder. „SPlot: A Statistical tool to unfold data distributions“. https://arxiv.org/abs/physics/0402083
D. Martschei, M. Feindt, S. Honc, and J. Wagner-Kuhr. „Advanced event reweighting using multivariate analysis“. http://iopscience.iop.org/article/10.1088/1742-6596/368/1/012028
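A hedged sketch of the basic trick on toy numbers: every data event enters the training twice, once as signal weighted with its signal sWeight and once as background weighted with its background sWeight. Real sWeights come from an sPlot fit and can be negative, which not every implementation handles gracefully:

```python
# Training a classifier directly on data using sWeights (illustrative toy).
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier

n = 5000
X = np.random.normal(size=(n, 3))        # discriminating features of the data
s_weights = np.random.uniform(0, 1, n)   # toy signal sWeights (from an sPlot fit)
b_weights = 1.0 - s_weights              # toy background sWeights

# Each event appears twice: once as signal (label 1) with its signal
# sWeight and once as background (label 0) with its background sWeight.
X_train = np.vstack([X, X])
y_train = np.concatenate([np.ones(n), np.zeros(n)])
w_train = np.concatenate([s_weights, b_weights])

# NOTE: real sWeights can be negative; implementations differ in how
# well they cope with negative sample weights.
clf = GradientBoostingClassifier(n_estimators=100)
clf.fit(X_train, y_train, sample_weight=w_train)
```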
FastBDT
https://github.com/thomaskeck/FastBDT
Thomas Keck. „FastBDT: A speed-optimized and cache-friendly implementation of stochastic gradient-boosted decision trees for multivariate classification“. http://arxiv.org/abs/1609.06119.
TMVA
Andreas Hoecker et al. „TMVA: Toolkit for Multivariate Data Analysis“. https://arxiv.org/abs/physics/0703039
FANN
S. Nissen. Implementation of a Fast Artificial Neural Network Library (fann). http://fann.sourceforge.net/fann.pdf
SKLearn
Website http://scikit-learn.org/
F. Pedregosa et al. „Scikit-learn: Machine Learning in Python“. http://www.jmlr.org/papers/volume12/pedregosa11a/pedregosa11a.pdf
hep_ml
Website https://arogozhnikov.github.io/hep_ml/
XGBoost
Website https://xgboost.readthedocs.io/en/latest/
Tianqi Chen and Carlos Guestrin. „XGBoost: A Scalable Tree Boosting System“. https://arxiv.org/abs/1603.02754
Tensorflow
Website https://www.tensorflow.org/
Martin Abadi et al. „TensorFlow: A system for large-scale machine learning“ https://arxiv.org/abs/1605.08695
Theano
Website http://deeplearning.net/software/theano/
Rami Al-Rfou et al. „Theano: A Python framework for fast computation of mathematical expressions“ https://arxiv.org/abs/1605.02688
NeuroBayes
M. Feindt and U. Kerzel. „The NeuroBayes neural network package“ http://www-ekp.physik.uni-karlsruhe.de/~feindt/acat05-neurobayes