Logo ČVUT
CZECH TECHNICAL UNIVERSITY IN PRAGUE
STUDY PLANS
2024/2025

Advanced data processing in nuclear and subnuclear physics

The course is not on the list Without time-table
Code Completion Credits Range
D02STAT ZK
Garant předmětu:
Lecturer:
Tutor:
Supervisor:
Department of Physics
Synopsis:

The student will gain theoretical and practical experience with the use of advanced techniques of statistical data analysis, which are currently used in the processing of data in high-energy physics experiments. These include, for example, unfolding, methods based on the Kálmán filter and machine learning methods such as decision trees and neural networks. The theoretical foundations of regression and classification will be discussed. The student will acquire practical knowledge of methods related to data pre-processing, training of machine learning algorithms (supervised machine learning), reliability validation (bias) and overtraining (overtraining). The aim of the exercise is to analyze real experimental data from the open HEPData database and in practice to compare classifiers obtained by different methods.

Requirements:
Syllabus of lectures:

1. Theory of statistical regression and inference

2. Statistical deconvolution methods

3. Optimization and the Kálmán filter

4. Non-parametric methods of regression and classification

4.1. Decision trees

4.2. Neural networks

Syllabus of tutorials:
Study Objective:

The student will gain theoretical and practical experience with the use of advanced techniques of statistical data analysis, which are currently used in the processing of data in high-energy physics experiments. These include, for example, unfolding, methods based on the Kálmán filter and machine learning methods such as decision trees and neural networks. The theoretical foundations of regression and classification will be discussed. The student will acquire practical knowledge of methods related to data pre-processing, training of machine learning algorithms (supervised machine learning), reliability validation (bias) and overtraining (overtraining). The aim of the exercise is to analyze real experimental data from the open HEPData database and in practice to compare classifiers obtained by different methods.

Study materials:

Required literature:

[1] Bohm, Zech, Introduction to Statistics and Data Analysis for Physicist, DESY online library

[2] I. Goodfellow, Y. Bengio, and A. Courville: Deep Learning, MIT Press, 2016

[3] B. Ristic, S. Arulampalam, N. Gordon: Beyond the Kalman Filter: Particle Filters for Tracking Applications,

Artech House, 2004

Recommended literature:

[4]A. Geron: Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and

Techniques to Build Intelligent Systems, O’Reilly Media, 2019

[5] B. Hachman: A Living Review of Machine Learning for Particle Physics, github: HEPML-LivingReview

Note:
Further information:
No time-table has been prepared for this course
The course is a part of the following study plans:
Data valid to 2024-05-01
Aktualizace výše uvedených informací naleznete na adrese https://bilakniha.cvut.cz/en/predmet7705106.html