Paper List

Bioinformatics

MCP-AI: Protocol-Driven Intelligence Framework for Autonomous Reasoning in Healthcare

2025-12-05

This paper addresses the critical gap in healthcare AI systems that lack contextual reasoning, long-term state management, and verifiable workflows by...
Bioinformatics

Model Gateway: Model Management Platform for Model-Driven Drug Discovery

2025-12-05

This paper addresses the critical bottleneck of fragmented, ad-hoc model management in pharmaceutical research by providing a centralized, scalable ML...
Bioinformatics

Tree Thinking in the Genomic Era: Unifying Models Across Cells, Populations, and Species

2025-12-05

This paper addresses the fragmentation of tree-based inference methods across biological scales by identifying shared algorithmic principles and stati...
Bioinformatics

SSDLabeler: Realistic semi-synthetic data generation for multi-label artifact classification in EEG

2025-12-05

This paper addresses the core challenge of training robust multi-label EEG artifact classifiers by overcoming the scarcity and limited diversity of ma...
Neuroscience

Decoding Selective Auditory Attention to Musical Elements in Ecologically Valid Music Listening

2025-12-05

This paper addresses the core challenge of objectively quantifying listeners' selective attention to specific musical components (e.g., vocals, drums,...
Bioengineering

Physics-Guided Surrogate Modeling for Machine Learning–Driven DLD Design Optimization

2025-12-05

This paper addresses the core bottleneck of translating microfluidic DLD devices from research prototypes to clinical applications by replacing weeks-...
Bioinformatics

Mechanistic Interpretability of Antibody Language Models Using SAEs

2025-12-05

This work addresses the core challenge of achieving both interpretability and controllable generation in domain-specific protein language models, spec...
Theoretical Biology

Fluctuating Environments Favor Extreme Dormancy Strategies and Penalize Intermediate Ones

2025-12-05

This paper addresses the core challenge of determining how organisms should tune dormancy duration to match the temporal autocorrelation of their envi...


Journal: ArXiv Preprint

Publication date: 2026-03-11

Computational Modeling | Causal Inference

Realizing Common Random Numbers: Event-Keyed Hashing for Causally Valid Stochastic Models

Institute for Disease Modeling, Gates Foundation | Department of Epidemiology, University of North Carolina | Institute for Disease Modeling, Gates Foundation

Vince Buffalo, Carl A. B. Pearson, Daniel Klein

30-Second Summary

IN SHORT: This paper addresses the critical problem that standard stateful PRNG implementations in agent-based models violate causal validity by making random draws execution-path-dependent, thereby breaking the fundamental assumption of common random numbers needed for valid counterfactual comparisons.
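
The draw-index problem described above can be reproduced in a few lines. The following is a minimal sketch, not the authors' code: the toy model, the function name `run_stateful`, and the event labels "infection" and "recovery" are all hypothetical. Both scenarios reuse the same seed, but vaccinating one agent removes a single draw and shifts every later draw by one position.

```python
import random


def run_stateful(seed, n_agents, vaccinated=frozenset()):
    """Toy ABM loop (hypothetical): one shared stateful generator for all events."""
    rng = random.Random(seed)
    draws = {}
    for agent in range(n_agents):
        # Vaccinated agents skip the infection draw, so the generator
        # advances one step less on this execution path.
        if agent not in vaccinated:
            draws[(agent, "infection")] = rng.random()
        draws[(agent, "recovery")] = rng.random()
    return draws


baseline = run_stateful(seed=42, n_agents=4)
intervened = run_stateful(seed=42, n_agents=4, vaccinated={1})

# Events present in both scenarios whose random input nevertheless changed.
shared = baseline.keys() & intervened.keys()
changed = sorted(k for k in shared if baseline[k] != intervened[k])
print("events with a different draw despite the same seed:", changed)

# The recovery draw of agent 1 under intervention is the draw that agent 1's
# infection event consumed in the baseline: the index shifted by one.
print(intervened[(1, "recovery")] == baseline[(1, "infection")])
```

Agent 0 is processed before the intervention takes effect, so its draws match across scenarios. Every event after agent 1's skipped draw receives a random input that belonged to a different event in the baseline run.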

Key Innovations

Methodology: Identifies and formalizes the fundamental mismatch between scientific causal structure in ABMs and program-level causal structure induced by stateful PRNGs through the lens of Structural Causal Models (SCMs)
Methodology: Introduces the concept of 'execution invariance' as a necessary property for causally valid ABM counterfactuals, requiring that exogenous noise terms remain stable across intervention scenarios
Methodology: Proposes event-keyed random number generation combining counter-based PRNGs (Philox/Threefry) with event identifiers to decouple random draws from simulation execution order
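
The event-keyed idea in the last item can be sketched with the Python standard library alone. This is an illustration under stated assumptions, not the paper's implementation: a keyed BLAKE2b hash stands in for a counter-based generator such as Philox or Threefry, and the names `event_uniform` and `run_event_keyed`, the event labels, and the `(agent_id, event, step)` key layout are hypothetical.

```python
import hashlib
import struct


def event_uniform(seed: int, agent_id: int, event: str, step: int) -> float:
    """Uniform draw in [0, 1) that is a pure function of (seed, agent, event, step).

    Assumption: keyed BLAKE2b is used here as a stand-in for a counter-based
    PRNG (Philox/Threefry); the seed must fit in 8 unsigned bytes.
    """
    key = seed.to_bytes(8, "little")
    message = f"{agent_id}|{event}|{step}".encode("utf-8")
    digest = hashlib.blake2b(message, key=key, digest_size=8).digest()
    (value,) = struct.unpack("<Q", digest)
    # Keep the top 53 bits so the result is an exact double in [0, 1).
    return (value >> 11) / float(1 << 53)


def run_event_keyed(seed, n_agents, vaccinated=frozenset(), step=0):
    """Toy ABM loop (hypothetical): every draw is looked up by its event key."""
    draws = {}
    for agent in range(n_agents):
        if agent not in vaccinated:
            draws[(agent, "infection")] = event_uniform(seed, agent, "infection", step)
        draws[(agent, "recovery")] = event_uniform(seed, agent, "recovery", step)
    return draws


baseline = run_event_keyed(seed=42, n_agents=4)
intervened = run_event_keyed(seed=42, n_agents=4, vaccinated={1})

# Every event that occurs in both scenarios receives the same random input,
# regardless of which draws the intervention removed.
shared = baseline.keys() & intervened.keys()
print(all(baseline[k] == intervened[k] for k in shared))
```

Because each draw depends only on its key, skipping, adding, or reordering events on one execution path leaves the inputs of all other events unchanged. A production model would more likely use a vetted counter-based generator than a general-purpose hash.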

Main Conclusions

Standard stateful PRNG practices violate the execution invariance required for valid SCM-style interventions, as demonstrated through formal analysis of the structural causal model framework
Event-keyed hashing with counter-based PRNGs restores the stable event-indexed exogenous structure assumed by SCMs, enabling proper counterfactual comparisons with variance reduction benefits
The proposed approach allows ABMs to function as valid structural causal models under interventions, maintaining the critical property that interventions change only structural equations while holding exogenous noise terms fixed
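
The conclusions above can be summarized in generic SCM notation. The symbols below are assumptions chosen for illustration and are not taken from the paper: $e$ indexes a modeled event, $f_e$ is its structural equation, $\mathrm{pa}_e$ its endogenous parents, $U_e$ its exogenous noise, $G_s(k)$ the $k$-th output of a stateful generator seeded with $s$, $k_e(\pi)$ the draw index at which event $e$ is reached on execution path $\pi$, and $h$ a keyed hash or counter-based generator.

```latex
\begin{align*}
\text{baseline:} \quad & Y_e = f_e\bigl(\mathrm{pa}_e,\; U_e\bigr) \\
\text{intervention } a\text{:} \quad & Y_e^{(a)} = f_e^{(a)}\bigl(\mathrm{pa}_e^{(a)},\; U_e\bigr) \\
\text{stateful PRNG:} \quad & U_e = G_s\bigl(k_e(\pi)\bigr), \quad k_e(\pi) \neq k_e(\pi^{(a)}) \text{ in general} \\
\text{event-keyed:} \quad & U_e = h(s, e) \text{ for every execution path}
\end{align*}
```

The first two lines state the SCM requirement that an intervention may change $f_e$ and its inputs but not $U_e$. The third line shows why a shared seed alone does not guarantee this, and the fourth shows how keying on the event removes the dependence on the path.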

Research gap: Current ABM implementations using stateful PRNGs fail to maintain proper coupling of exogenous noise terms across intervention scenarios due to execution-path-dependent draw indexing, making individual counterfactual comparisons causally incoherent even when mechanistic specifications are sound.

Abstract: Agent-based models (ABMs) are widely used to estimate causal treatment effects via paired counterfactual simulation. A standard variance reduction technique is common random numbers (CRNs), which couples replicates across intervention scenarios by sharing the same random inputs. In practice, CRNs are implemented by reusing the same base seed, but this relies on a critical assumption: that the same draw index corresponds to the same modeled event across scenarios. Stateful pseudorandom number generators (PRNGs) violate this assumption whenever interventions alter the simulation's execution path, because any change in control flow shifts the draw index used for all downstream events. We argue that this execution-path-dependent draw indexing is not only a variance-reduction nuisance, but represents a fundamental mismatch between the scientific causal structure ABMs are intended to encode and the program-level causal structure induced by stateful PRNG implementations. Formalizing this through the lens of structural causal models (SCMs), we show that standard PRNG practices yield causally incoherent paired counterfactual comparisons even when the mechanistic specification is otherwise sound. We show that a remedy is to combine counter-based random number generators (e.g., Philox/Threefry) with event identifiers. This decouples random number generation from simulation execution order by making random draws explicit functions of the particular modeled event that called them, restoring the stable event-indexed exogenous structure assumed by SCMs.