Faster RooFitting: Automated parallel calculation of collaborative statistical models

Bos, E. G. Patrick; Patrick Bos, E. G.; Burgard, Carsten D.; Croft, Vincent A.; Hageboeck, Stephan; Moneta, Lorenzo; Pelupessy, Inti; Attema, Jisk J.; Verkerke, Wouter

Published in

IOP Publishing, Journal of Physics: Conference Series, 1(1525), p. 012041, 2020

DOI: 10.1088/1742-6596/1525/1/012041

EDP Sciences, EPJ Web of Conferences, (245), p. 06027, 2020

DOI: 10.1051/epjconf/202024506027

Tools

Export citation

Search in Google Scholar

Faster RooFitting: Automated parallel calculation of collaborative statistical models

Journal article published in 2020 by E. G. Patrick Bos, E. G. Patrick Bos, Carsten D. Burgard, Vincent A. Croft, Stephan Hageboeck, Lorenzo Moneta, Inti Pelupessy, Jisk J. Attema

, Wouter Verkerke

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving forbidden

Postprint: archiving forbidden

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

Abstract RooFit [1,2] is the main statistical modeling and fitting package used to extract physical parameters from reduced particle collision data, e.g. the Higgs boson experiments at the LHC [3,4]. RooFit aims to separate particle physics model building and fitting (the users’ goals) from their technical implementation and optimization in the back-end. In this paper, we outline our efforts to further optimize this back-end by automatically running parts of user models in parallel on multi-core machines. A major challenge is that RooFit allows users to define many different types of models, with different types of computational bottlenecks. Our automatic parallelization framework must then be flexible, while still reducing run-time by at least an order of magnitude, preferably more. We have performed extensive benchmarks and identified at least three bottlenecks that will benefit from parallelization. We designed a parallelization layer that allows us to parallelize existing classes with minimal effort, but with high performance and retaining as much of the existing class’s interface as possible. The high-level parallelization model is a task-stealing approach. Preliminary results show speed-ups of factor 2 to 20, depending on the exact model and parallelization strategy.

Published in

Links

Tools

Faster RooFitting: Automated parallel calculation of collaborative statistical models

Abstract