Date of Degree

6-2026

Document Type

Master's Thesis

Degree Name

Master of Science

Program

Astrophysics

Advisor

Shy Genel

Advisor

Francisco Villaescusa-Navarro

Subject Categories

Artificial Intelligence and Robotics | Astrophysics and Astronomy | Cosmology, Relativity, and Gravity | Data Science | External Galaxies

Keywords

cosmological parameter inference, galaxy kinematics, large-scale structure, machine learning, statistical methods, velocity fields

Abstract

We perform field-level likelihood-free inference of the matter density parameter Ωm from simulated galaxy catalogs using machine learning models with differing inductive biases. Using features extracted from hydrodynamic simulations in the CAMELS suite, we investigate how both observable choice and model architecture govern the extraction of cosmological information. We consider galaxy positions and line-of-sight peculiar velocities, both separately and in combination, and compare permutation-invariant Deep Sets, implemented with either standard multilayer perceptrons (MLPs) or Kolmogorov–Arnold Networks (KANs), to graph neural networks (GNNs) implemented with MLPs, which explicitly encode spatial relations. We evaluate inference performance under both in-distribution and out-of-distribution (OOD) conditions by testing across simulations with different subgrid galaxy-formation prescriptions. We find that Deep Sets can infer Ωm from in-distribution data with a mean relative error of approximately 18% and from OOD data with a mean relative error of ∼25% using only line-of-sight peculiar velocities, with KANs and MLPs achieving comparable performance. This indicates that peculiar velocities encode nearly robust cosmological information accessible to permutation-invariant models. When positional information is included, Deep Set performance remains unchanged, while GNNs outperform them, inferring Ωm with a mean relative error of about 10% in-distribution and 10%–17% OOD. These results clarify how observable choice and architectural inductive bias jointly determine the robustness and effectiveness of machine-learning-based cosmological inference, with direct implications for the design of analysis pipelines for upcoming galaxy surveys.

Recommended Citation

Baldwin, James O'Connor, "Inductive Biases in Field-Level Cosmological Inference from Galaxy Catalogs" (2026). CUNY Academic Works.
https://academicworks.cuny.edu/gc_etds/6751

Download

Included in

Artificial Intelligence and Robotics Commons, Cosmology, Relativity, and Gravity Commons, Data Science Commons, External Galaxies Commons

COinS

Dissertations, Theses, and Capstone Projects

Inductive Biases in Field-Level Cosmological Inference from Galaxy Catalogs

Date of Degree

Document Type

Degree Name

Program

Advisor

Advisor

Subject Categories

Keywords

Abstract

Recommended Citation

Included in

Browse

Author Corner

Search

Links

Dissertations, Theses, and Capstone Projects

Inductive Biases in Field-Level Cosmological Inference from Galaxy Catalogs

Author

Date of Degree

Document Type

Degree Name

Program

Advisor

Advisor

Subject Categories

Keywords

Abstract

Recommended Citation

Included in

Share

Browse

Author Corner

Search

Links