Constraining dark matter halo profiles with symbolic regression
Philosophical Transactions of the Royal Society A Mathematical Physical and Engineering Sciences The Royal Society 384:2317 (2026) 20250090
Abstract:
Dark matter haloes are typically characterized by radial density profiles with fixed forms motivated by simulations (e.g. Navarro-Frenk-White [NFW]). However, simulation predictions depend on uncertain dark matter physics and baryonic modelling. Here, we present a method to constrain halo density profiles directly from observations using Exhaustive Symbolic Regression (ESR), a technique that searches the space of analytic expressions for the function that best balances accuracy and simplicity for a given dataset. We test the approach on mock weak lensing excess surface density (ESD) data of synthetic clusters with NFW profiles. Motivated by real data, we assign each ESD data point a constant fractional uncertainty and vary this uncertainty and the number of clusters to probe how data precision and sample size affect model selection. For fractional errors around 5%, ESR recovers the NFW profile even from samples as small as approximately 20 clusters. At higher uncertainties representative of current surveys, simpler functions are favoured over NFW, though it remains competitive. This preference arises because weak lensing errors are smallest in the outskirts, causing the fits to be dominated by the outer profile. ESR therefore provides a robust, simulation-independent framework both for testing mass models and determining which features of a halo's density profile are genuinely constrained by the data. This article is part of the discussion meeting issue 'Symbolic regression in the physical sciences'.Statistical patterns in the equations of physics and the emergence of a meta-law of nature
Philosophical Transactions of the Royal Society A Mathematical Physical and Engineering Sciences The Royal Society 384:2317 (2026) 20250091
Abstract:
Physics seeks to uncover the laws of Nature and express them through mathematical equations . Despite the vast diversity of natural phenomena, physical equations exhibit structural regularities that set them apart from arbitrary mathematical expressions. While principles such as dimensional analysis have long guided the formulation of physical models, the exploration of more subtle statistical patterns within the equations of physics remains an open question. Here, by analysing four corpora of physics equations and applying advanced implicit-likelihood techniques, we find that the frequency of mathematical operators follows an exponential decay law, in contrast to Zipf's power law for word frequencies in natural languages. This reveals a statistical meta-law of physics, possibly reflecting a combination of communication efficiency and constraints imposed by Nature itself. The meta-law offers practical benefits for symbolic regression by drastically narrowing down the space of physically plausible expressions. More broadly, it may inform the development of language models that can generate coherent mathematical representations, advancing the automation of physical law discovery. This article is part of the discussion meeting issue 'Symbolic regression in the physical sciences'.(Exhaustive) symbolic regression and model selection by minimum description length.
Philosophical transactions. Series A, Mathematical, physical, and engineering sciences 384:2317 (2026) 20240584
Abstract:
Symbolic regression (SR) is the machine learning (ML) method for learning functions from data. After a brief overview of the SR landscape, I will describe the two main challenges that traditional algorithms face: they have an unknown (and probably significant) probability of failing to find any given good function, and they suffer from ambiguity and poorly justified assumptions in their function-selection procedure. To address these, I propose an exhaustive search and model selection by the minimum description length (MDL) principle, which allows accuracy and complexity to be directly traded off by measuring each in units of information. I showcase the resulting publicly available Exhaustive Symbolic Regression (ESR) algorithm on three open problems in astrophysics: the expansion history of the universe, the effective behaviour of gravity in galaxies and the potential of the inflaton field. In each case, the algorithm identifies many functions superior to the literature standards. This general-purpose methodology should find widespread utility in science and beyond. This article is part of the discussion meeting issue 'Symbolic regression in the physical sciences'.MIGHTEE-H I: Mass Models and Dark Matter properties
Monthly Notices of the Royal Astronomical Society (2026) stag531
Abstract:
Measuring galaxy rotation curves is critical for inferring the properties of dark-matter haloes in the Lambda Cold Dark Matter (ΛCDM) paradigm. We present H i rotation curves and mass models for 20 galaxies from the MIGHTEE survey. Using extended H i kinematics, we construct resolved mass models that include stellar, gaseous, and dark-matter components. Stellar masses are derived using 3.6 μm imaging under fixed mass-to-light ratio (ϒ* = M/L) assumptions and are complemented, for the first time for a H I-selected sample, by spatially resolved M/L, obtained from multi-wavelength SED fitting. We examine the ratio of baryonic to observed rotation velocity (Vbar/Vobs) at the characteristic radius R2.2. Adopting a fixed ϒ⋆ = 0.5 M⊙/L⊙ yields a clear dependence of V2.2/Vobs on galaxy luminosity, while adopting ϒ⋆ = 0.2 M⊙/L⊙ substantially weakens this trend. In contrast, the resolved M/L analysis preserves the luminosity dependence while modifying the stellar contribution on a galaxy-by-galaxy basis, providing a more accurate representation of the underlying relation. We model the dark-matter haloes using Navarro–Frenk–White profiles and find that the different assumptions for a fixed a M/L systematically shift galaxies relative to the theoretical stellar-to-halo mass and baryonic-to-halo mass relations, while the spatially varying M/L yields the closest agreement with theoretical benchmarks within ΛCDM. We therefore demonstrate that future investigations of the dark matter properties of galaxies using rotation curves need to account for varying M/L across individual galaxy profiles and between galaxies in order to obtain accurate measurements of the dark matter, and therefore test ΛCDM.No evidence for p- or d-wave dark matter annihilation from local large-scale structure
Physical Review D American Physical Society (APS) 113:6 (2026) 063539