Paper List
-
Translating Measures onto Mechanisms: The Cognitive Relevance of Higher-Order Information
This review addresses the core challenge of translating abstract higher-order information theory metrics (e.g., synergy, redundancy) into defensible, ...
-
Emergent Bayesian Behaviour and Optimal Cue Combination in LLMs
This paper addresses the critical gap in understanding whether LLMs spontaneously develop human-like Bayesian strategies for processing uncertain info...
-
Vessel Network Topology in Molecular Communication: Insights from Experiments and Theory
This work addresses the critical lack of experimentally validated channel models for molecular communication within complex vessel networks, which is ...
-
Modulation of DNA rheology by a transcription factor that forms aging microgels
This work addresses the fundamental question of how the transcription factor NANOG, essential for embryonic stem cell pluripotency, physically regulat...
-
Imperfect molecular detection renormalizes apparent kinetic rates in stochastic gene regulatory networks
This paper addresses the core challenge of distinguishing genuine stochastic dynamics of gene regulatory networks from artifacts introduced by imperfe...
-
PanFoMa: A Lightweight Foundation Model and Benchmark for Pan-Cancer
This paper addresses the dual challenge of achieving computational efficiency without sacrificing accuracy in whole-transcriptome single-cell represen...
-
Beyond Bayesian Inference: The Correlation Integral Likelihood Framework and Gradient Flow Methods for Deterministic Sampling
This paper addresses the core challenge of calibrating complex biological models (e.g., PDEs, agent-based models) with incomplete, noisy, or heterogen...
-
Contrastive Deep Learning for Variant Detection in Wastewater Genomic Sequencing
This paper addresses the core challenge of detecting viral variants in wastewater sequencing data without reference genomes or labeled annotations, ov...
Single Molecule Localization Microscopy Challenge: A Biologically Inspired Benchmark for Long-Sequence Modeling
Technische Universität Wien
30秒速读
IN SHORT: This paper addresses the core challenge of evaluating state-space models on biologically realistic, sparse, and stochastic temporal processes, which are not captured by existing benchmarks focused on dense, regularly sampled data.
核心创新
- Methodology Introduces SMLM-C, the first benchmark dataset specifically designed to evaluate long-sequence models on sparse spatiotemporal localization data with known ground truth, spanning dSTORM and DNA-PAINT modalities.
- Methodology Formulates SMLM reconstruction as a sequence-to-set prediction task, requiring models to disentangle overlapping localization clouds by jointly exploiting spatial and temporal context over up to 10,000 frames.
- Biology Reveals that state-space model performance degrades substantially as temporal discontinuity increases (e.g., detection accuracy drops from ~73% to ~62% when average off-time increases from 100 to 1000 frames), highlighting fundamental challenges in modeling heavy-tailed blinking dynamics.
主要结论
- State-space models show limited absolute performance on SMLM reconstruction, with the highest detection accuracy reaching only 73.4% ± 1.23% (S5-L on μ_off=100 frames) and dropping to 69.6% ± 0.21% (Mamba-2-L on μ_off=1000 frames) under a 20 nm matching threshold.
- Model performance is strongly influenced by temporal sparsity, with all evaluated architectures (S5 and Mamba-2) showing degraded performance as average off-time increases from 100 to 1000 frames, indicating fundamental challenges in handling long-range temporal dependencies.
- Mamba-2 demonstrates better robustness to long temporal gaps, outperforming S5 in the long off-time regime (μ_off=1000 frames), while S5 performs better under shorter dark states (μ_off=100 frames), suggesting architectural differences in handling temporal discontinuity.
摘要: State space models (SSMs) have recently achieved strong performance on long-sequence modeling tasks while offering improved memory and computational efficiency compared to transformer-based architectures. However, their evaluation has been largely limited to synthetic benchmarks and application domains such as language and audio, leaving their behavior on sparse and stochastic temporal processes in biological imaging unexplored. In this work, we introduce the Single Molecule Localization Microscopy Challenge (SMLM-C), a benchmark dataset consisting of ten SMLM simulations—spanning dSTORM and DNA-PAINT modalities with varying hyperparameter—designed to evaluate state-space models on biologically realistic spatiotemporal point-process data with known ground truth. Using a controlled subset of these simulations, we evaluate state space models and find that performance degrades substantially as temporal discontinuity increases, revealing fundamental challenges in modeling heavy-tailed blinking dynamics. These results highlight the need for sequence models better suited to sparse, irregular temporal processes encountered in real-world scientific imaging data.