USCMS Researcher: Matteo Marchegiani
Postdoc dates: Jul 2025 - Jun 2026
Home Institution: Carnegie Mellon University
Project: GNN-based End-to-End Reconstruction in the CMS Phase 2 High-Granularity Calorimeter
The goal of the project is to develop new machine learning based algorithms for fast and efficient reconstruction of the High-Granularity Calorimeter at the High-Luminosity LHC.More information: My project proposal
Mentors:
-
Matteo Cremonesi - (Carnegie Mellon University)
- 26 Mar 2026 - "HGCAL reconstruction with Graph Neural Networks", Matteo Marchegiani, TICL Reconstruction Working Meeting
- 24 Mar 2026 - "HGCAL simulation with pileup and merging algorithm", Matteo Marchegiani, ML4RECO
- 10 Feb 2026 - "Maskformers for offline reconstruction", Matteo Marchegiani, ML4RECO
- 3 Dec 2025 - "Studies on time resolution in GNN-based reco", Matteo Marchegiani, HGCAL DPG
- 2 Dec 2025 - "Studies on time resolution with the latest HGCAL GNN model", Matteo Marchegiani, ML4RECO
- 18 Nov 2025 - "Update on performance of the latest HGCAL GNN model", Matteo Marchegiani, ML4RECO
- 13 Oct 2025 - "ML4RECO: GNN and Transformer Based HGCAL Reconstruction", Matteo Marchegiani, CMS Machine Learning Town Hall
- 17 Sep 2025 - "Single-particle energy resolution with latest GNN model", Matteo Marchegiani, ML4RECO
- 27 Aug 2025 - "Updated GNN training after bugfixes in CMSSW 15_0_X", Matteo Marchegiani, ML4RECO
- 16 Jul 2025 - "FineCalo SimTrack Reconstruction Error", Matteo Marchegiani, ML4RECO
Current Status
2026 Q1
- Progress on optimized GNN model
- Study impact of pileup on the training of the GNN-based reconstruction algorithm
- GPU Memory profiling of GNN model trained on dataset with 5x more hits
- Integration of CUDA kernels from FastGraphCompute to speed up KNN and object condensation loss
- Working on training of GNN model with 200 pileup simulation
- First HGCAL simulation with FineCalo + merging algorithm in CMSSW_15_1_0
- Simulated 10 single-electron events + 30 PU interactions from minimum bias events
- Implementation of custom NANO step to save true clusters containing SimHits from primary interaction and pileup
- At 30 PU, the average number of RecHits is ~60k, with ~8k true clusters
- Study a dedicated implementation of the merging algorithm which is compatible with pileup
- Study a dedicated pileup mixing library to save the event history for minimum bias events in the merging
- Study new alternative ML architectures for HGCAL reconstruction
- Masked transformers to reduce the quadratic complexity of self-attention
- Generated 0 PU dataset with RecHits and TICL objects: 2D LayerClusters, 3D Tracksters and TICL Candidates
- Computing the incidence matrix between RecHits and true particles using sparse tensor representation
- First proof-of-concept training using recHits features as input: incidence matrix regression and regression of cluster properties
2025 Q4
- Progress
- Study energy, position and time resolution of the GNN clustering using simulated photons, pions and tau leptons
- Study issue in time resolution in CMSSW_15_X: change in the timing simulation with respect to CMSSW_11_X
- Finalize publication on performance metrics of GNN clustering with 0 pileup simulation
- Study the impact of increased pileup on the GNN training
- Memory profiling of the training with a dataset including ~250k reconstructed hits
- Working on pileup simulation: generate hard process alongside minimum bias events
2025 Q3
- Progress
- Learned how to train the GNN employing GravConv layers in combination with the object condensation loss for the reconstruction of energy clusters in the HGCAL
- Studied the performance and energy resolution of the GNN-based reconstruction in zero-pileup environments, considering simulated photons, pions and tau leptons
- Ported the training datasets production to CMSSW_15_1_X, making use of the FineCalo simulation
- Ported the GNN training to Pytorch 2.6.0 and CUDA 12.4
- Study memory profiling of the model on GPU
Contact me: