Symbolic emulators for cosmology: accelerating cosmological analyses without sacrificing precision

Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences The Royal Society 384:2317 (2026) 20240585

Authors:

Deaglan Bartlett, Shivam Pandey

Abstract:

Abstract In cosmology, emulators play a crucial role by providing fast and accurate predictions of complex physical models, enabling efficient exploration of high-dimensional parameter spaces that would be computationally prohibitive with direct numerical simulations. Symbolic emulators have emerged as promising alternatives to numerical approaches, delivering comparable accuracy with significantly faster evaluation times. While previous symbolic emulators were limited to relatively narrow prior ranges, we expand these to cover the parameter space relevant for current cosmological analyses. We introduce approximations to hypergeometric functions used for the 螞 cold dark matter (螞CDM) comoving distance and linear growth factor which are accurate to better than 0.001% and 0.05%, respectively, for all redshifts and for 惟m鈭圼0.1,0.5]. We show that integrating symbolic emulators into a Dark Energy Survey Year 1 (DES-Y1)-like 3脳2 pt analysis produces cosmological constraints consistent with those obtained using standard numerical methods. Our symbolic emulators offer substantial improvements in speed and memory usage, demonstrating their practical potential for scalable, likelihood-based inference. This article is part of the discussion meeting issue 鈥楽ymbolic regression in the physical sciences鈥.

Symbolic regression and differentiable fits in beyond the standard model physics

Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences The Royal Society 384:2317 (2026) 20240593

Authors:

Shehu AbdusSalam, Steven Abel, Deaglan Bartlett, Miguel Crispim Romao

Abstract:

Abstract We demonstrate the efficacy of symbolic regression (SR) to probe models of particle physics Beyond the Standard Model (BSM), by considering the so-called Constrained Minimal Supersymmetric Standard Model (CMSSM). Like many incarnations of BSM physics this model has a number (four) of arbitrary parameters, which determine the experimental signals, and cosmological observables such as the dark matter relic density. We show that analysis of the phenomenology can be greatly accelerated by using symbolic expressions derived for the observables in terms of the input parameters. Here we focus on the Higgs mass, the cold dark matter relic density and the contribution to the anomalous magnetic moment of the muon. We find that SR can produce remarkably accurate expressions. Using them we make global fits to derive the posterior probability densities of the CMSSM input parameters which are in good agreement with those performed using conventional methods. Moreover, we demonstrate a major advantage of SR, which is the ability to make fits using differentiable methods rather than sampling methods. We also compare the method with neural network (NN) regression. SR produces more globally robust results, while NNs require data that is focused on the promising regions in order to be equally performant. This article is part of the discussion meeting issue 鈥楽ymbolic regression in the physical sciences鈥.

Clumps in High-redshift Galaxies: Mass Scaling and Radial Trends from JADES

The Astrophysical Journal 1000:2 (2026)

Authors:

Yongda Zhu, Marcia J Rieke, Zhiyuan Ji, Andrew J Bunker, Courtney Carreira, A Lola Danhaive, Qiao Duan, Eiichi Egami, Daniel J Eisenstein, Kevin Hainline, Benjamin D Johnson, Zheng Ma, D谩vid Pusk谩s, George H Rieke, Pierluigi Rinaldi, Brant Robertson, Sandro Tacchella, Hannah 脺bler, Natalia C Villanueva, Christina C Williams, Christopher NA Willmer, Zihao Wu, Junyu Zhang

Abstract:

Massive star-forming clumps are a prominent feature of high-redshift galaxies and are thought to trace gravitational fragmentation, feedback, and bulge growth in gas-rich disks. We present a statistical analysis of clumps in 鈭3600 galaxies spanning 2 鈮 z 鈮 8 from deep JWST/NIRCam imaging in the JADES GOODS鈥揝outh field. Clumps are identified as residual features after subtracting smooth S茅rsic profiles, enabling a uniform, rest-frame optical census of subgalactic structure. We characterize their physical properties, size鈥搈ass relations, and spatial distributions to constrain models of subgalactic structure formation and evolution. We find that clumps in our sample are typically low-mass (10鈭7鈭8M鈯), actively star-forming, and show diverse gas-phase metallicity, dust attenuation, and stellar population properties. Their sizes and average pairwise separations increase with cosmic time (toward lower redshift), consistent with inside-out disk growth. The clump mass function follows a power law with slope 伪=鈭1.50鈭0.17+0.19 , consistent with fragmentation in turbulent disks. We find a deficit of relatively young clumps near galaxy centers and a radial transition in the size鈥搈ass relation: outer clumps exhibit steeper, near-virial slopes ( Re鈭滿*鈭0.3 ), while inner clumps follow flatter trends ( Re鈭滿*鈭0.2 ), consistent with structural evolution via migration or disruption. These results provide new constraints on the formation, survival, and dynamical evolution of clumps, highlighting their role in shaping galaxy morphology during the peak of cosmic star formation.

Identifying Transient Hosts in LSST鈥檚 Deep Drilling Fields with Galaxy Catalogs

The Astrophysical Journal American Astronomical Society 1000:2 (2026) 289

Authors:

JG Weston, DR Young, SJ Smartt, M Nicholl, MJ Jarvis, IH Whittam

Abstract:

The upcoming Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST) will enable astronomers to discover rare and distant astrophysical transients. Host-galaxy association is crucial for selecting the most scientifically interesting transients for follow-up. LSST deep drilling field (DDF) observations will detect distant transients occurring in galaxies below the detection limits of most all-sky catalogs. Here, we investigate the use of preexisting, field-specific catalogs for host identification in the DDFs and a ranking of their usefulness. We have compiled a database of 70 deep catalogs that overlap with the Rubin DDFs and constructed thin catalogs to be homogenized and combined for transient-host matching. A systematic ranking of their utility is discussed and applied based on the inclusion of information such as spectroscopic redshifts and morphological information. Utilizing this data against a Dark Energy Survey sample of supernovae with pre-identified hosts in the XMM-Large Scale Structure and the Extended Chandra Deep Field-South fields, we evaluate different methods for transient-host association in terms of both accuracy and processing speed. We also apply light data-cleaning techniques to identify and remove contaminants within our associations, such as diffraction spikes and blended galaxies where the correct host cannot be determined with confidence. We use a lightweight machine learning approach in the form of extreme gradient boosting to generate confidence scores in our contaminant selections and associated metrics. Finally, we discuss the computational expense of implementation within the LSST transient alert brokers, which will require efficient, fast-paced processing to handle the large stream of survey data.

Constraining dark matter halo profiles with symbolic regression.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences 384:2317 (2026) 20250090

Authors:

Alicia Martin, Tariq Yasin, Deaglan Bartlett, Harry Desmond, Pedro Ferreira

Abstract:

Dark matter haloes are typically characterized by radial density profiles with fixed forms motivated by simulations (e.g. Navarro-Frenk-White [NFW]). However, simulation predictions depend on uncertain dark matter physics and baryonic modelling. Here, we present a method to constrain halo density profiles directly from observations using Exhaustive Symbolic Regression (ESR), a technique that searches the space of analytic expressions for the function that best balances accuracy and simplicity for a given dataset. We test the approach on mock weak lensing excess surface density (ESD) data of synthetic clusters with NFW profiles. Motivated by real data, we assign each ESD data point a constant fractional uncertainty and vary this uncertainty and the number of clusters to probe how data precision and sample size affect model selection. For fractional errors around 5%, ESR recovers the NFW profile even from samples as small as approximately 20 clusters. At higher uncertainties representative of current surveys, simpler functions are favoured over NFW, though it remains competitive. This preference arises because weak lensing errors are smallest in the outskirts, causing the fits to be dominated by the outer profile. ESR therefore provides a robust, simulation-independent framework both for testing mass models and determining which features of a halo's density profile are genuinely constrained by the data. This article is part of the discussion meeting issue 'Symbolic regression in the physical sciences'.