Symbolic emulators for cosmology: accelerating cosmological analyses without sacrificing precision

Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences The Royal Society 384:2317 (2026) 20240585

Authors:

Deaglan Bartlett, Shivam Pandey

Abstract:

Abstract In cosmology, emulators play a crucial role by providing fast and accurate predictions of complex physical models, enabling efficient exploration of high-dimensional parameter spaces that would be computationally prohibitive with direct numerical simulations. Symbolic emulators have emerged as promising alternatives to numerical approaches, delivering comparable accuracy with significantly faster evaluation times. While previous symbolic emulators were limited to relatively narrow prior ranges, we expand these to cover the parameter space relevant for current cosmological analyses. We introduce approximations to hypergeometric functions used for the Λ cold dark matter (ΛCDM) comoving distance and linear growth factor which are accurate to better than 0.001% and 0.05%, respectively, for all redshifts and for Ωm∈[0.1,0.5]. We show that integrating symbolic emulators into a Dark Energy Survey Year 1 (DES-Y1)-like 3×2 pt analysis produces cosmological constraints consistent with those obtained using standard numerical methods. Our symbolic emulators offer substantial improvements in speed and memory usage, demonstrating their practical potential for scalable, likelihood-based inference. This article is part of the discussion meeting issue ‘Symbolic regression in the physical sciences’.

Symbolic regression and differentiable fits in beyond the standard model physics

Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences The Royal Society 384:2317 (2026) 20240593

Authors:

Shehu AbdusSalam, Steven Abel, Deaglan Bartlett, Miguel Crispim Romao

Abstract:

Abstract We demonstrate the efficacy of symbolic regression (SR) to probe models of particle physics Beyond the Standard Model (BSM), by considering the so-called Constrained Minimal Supersymmetric Standard Model (CMSSM). Like many incarnations of BSM physics this model has a number (four) of arbitrary parameters, which determine the experimental signals, and cosmological observables such as the dark matter relic density. We show that analysis of the phenomenology can be greatly accelerated by using symbolic expressions derived for the observables in terms of the input parameters. Here we focus on the Higgs mass, the cold dark matter relic density and the contribution to the anomalous magnetic moment of the muon. We find that SR can produce remarkably accurate expressions. Using them we make global fits to derive the posterior probability densities of the CMSSM input parameters which are in good agreement with those performed using conventional methods. Moreover, we demonstrate a major advantage of SR, which is the ability to make fits using differentiable methods rather than sampling methods. We also compare the method with neural network (NN) regression. SR produces more globally robust results, while NNs require data that is focused on the promising regions in order to be equally performant. This article is part of the discussion meeting issue ‘Symbolic regression in the physical sciences’.

Identifying Transient Hosts in LSST’s Deep Drilling Fields with Galaxy Catalogs

The Astrophysical Journal American Astronomical Society 1000:2 (2026) 289

Authors:

JG Weston, DR Young, SJ Smartt, M Nicholl, MJ Jarvis, IH Whittam

Abstract:

The upcoming Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST) will enable astronomers to discover rare and distant astrophysical transients. Host-galaxy association is crucial for selecting the most scientifically interesting transients for follow-up. LSST deep drilling field (DDF) observations will detect distant transients occurring in galaxies below the detection limits of most all-sky catalogs. Here, we investigate the use of preexisting, field-specific catalogs for host identification in the DDFs and a ranking of their usefulness. We have compiled a database of 70 deep catalogs that overlap with the Rubin DDFs and constructed thin catalogs to be homogenized and combined for transient-host matching. A systematic ranking of their utility is discussed and applied based on the inclusion of information such as spectroscopic redshifts and morphological information. Utilizing this data against a Dark Energy Survey sample of supernovae with pre-identified hosts in the XMM-Large Scale Structure and the Extended Chandra Deep Field-South fields, we evaluate different methods for transient-host association in terms of both accuracy and processing speed. We also apply light data-cleaning techniques to identify and remove contaminants within our associations, such as diffraction spikes and blended galaxies where the correct host cannot be determined with confidence. We use a lightweight machine learning approach in the form of extreme gradient boosting to generate confidence scores in our contaminant selections and associated metrics. Finally, we discuss the computational expense of implementation within the LSST transient alert brokers, which will require efficient, fast-paced processing to handle the large stream of survey data.

Constraining dark matter halo profiles with symbolic regression.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences 384:2317 (2026) 20250090

Authors:

Alicia Martin, Tariq Yasin, Deaglan Bartlett, Harry Desmond, Pedro Ferreira

Abstract:

Dark matter haloes are typically characterized by radial density profiles with fixed forms motivated by simulations (e.g. Navarro-Frenk-White [NFW]). However, simulation predictions depend on uncertain dark matter physics and baryonic modelling. Here, we present a method to constrain halo density profiles directly from observations using Exhaustive Symbolic Regression (ESR), a technique that searches the space of analytic expressions for the function that best balances accuracy and simplicity for a given dataset. We test the approach on mock weak lensing excess surface density (ESD) data of synthetic clusters with NFW profiles. Motivated by real data, we assign each ESD data point a constant fractional uncertainty and vary this uncertainty and the number of clusters to probe how data precision and sample size affect model selection. For fractional errors around 5%, ESR recovers the NFW profile even from samples as small as approximately 20 clusters. At higher uncertainties representative of current surveys, simpler functions are favoured over NFW, though it remains competitive. This preference arises because weak lensing errors are smallest in the outskirts, causing the fits to be dominated by the outer profile. ESR therefore provides a robust, simulation-independent framework both for testing mass models and determining which features of a halo's density profile are genuinely constrained by the data. This article is part of the discussion meeting issue 'Symbolic regression in the physical sciences'.

MIGHTEE: The evolving radio luminosity functions of star-forming galaxies to z ∼ 4.5 and the cosmic history of star formation

Monthly Notices of the Royal Astronomical Society 91̽»¨ University Press (OUP) (2026) stag616

Authors:

Nijin J Thykkathu, Matt J Jarvis, Imogen H Whittam, CL Hale, AM Matthews, I Heywood, Eliab Malefahlo, RG Varadaraj, N Stylianou, Chris Pearson, Nick Seymour, Mattia Vaccari

Abstract:

Abstract A key question in extragalactic astronomy is how the star-formation rate density (SFRD) evolves over cosmic time. A powerful way of addressing this question is using radio-continuum observations, where the radio waves are unaffected by dust and are able to reach sufficient resolution to resolve individual galaxies. We present an investigation of the 1.4 GHz radio luminosity functions (RLFs) of star-forming galaxies (SFGs) and Active Galactic Nuclei (AGN) using deep radio continuum observations in the COSMOS and XMM–LSS fields, covering a combined area of ∼4 deg2. These data enable the most accurate measurement of the evolution in the SFRD from mid-frequency radio continuum observations. We model the total RLF as the sum of evolving SFG and AGN components, negating the need for individual source classification. We find that the SFGs have systematically higher space densities at fixed luminosity than found in previous radio studies, but consistent with more recent studies with MeerKAT. We attribute this to the excellent low-surface brightness sensitivity of MeerKAT. We then determine the evolution of the SFRD. Adopting the far-infrared – radio correlation results in a significantly higher SFRD at z > 1, compared to combined UV and far-infrared measurements. However, using more recent relations for the correlation between star-formation rate and radio luminosity, based on full spectral energy distribution modelling, can resolve this apparent discrepancy. Thus radio observations provide a powerful method of determining the total SFRD, in the absence of dust-sensitive far-infrared data.