Paper List
-
A Theoretical Framework for the Formation of Large Animal Groups: Topological Coordination, Subgroup Merging, and Velocity Inheritance
This paper addresses the core problem of how large, coordinated animal groups form in nature, challenging the classical view of gradual aggregation by...
-
CONFIDE: Hallucination Assessment for Reliable Biomolecular Structure Prediction and Design
This paper addresses the critical limitation of current protein structure prediction models (like AlphaFold3) where high-confidence scores (pLDDT) can...
-
Generative design and validation of therapeutic peptides for glioblastoma based on a potential target ATP5A
This paper addresses the critical bottleneck in therapeutic peptide design: how to efficiently optimize lead peptides with geometric constraints while...
-
Pharmacophore-based design by learning on voxel grids
This paper addresses the computational bottleneck and limited novelty in conventional pharmacophore-based virtual screening by introducing a voxel cap...
-
Human-Centred Evaluation of Text-to-Image Generation Models for Self-expression of Mental Distress: A Dataset Based on GPT-4o
This paper addresses the critical gap in evaluating how AI-generated images can effectively support cross-cultural mental distress communication, part...
-
ANNE Apnea Paper
This paper addresses the core challenge of achieving accurate, event-level sleep apnea detection and characterization using a non-intrusive, multimoda...
-
DeeDeeExperiment: Building an infrastructure for integrating and managing omics data analysis results in R/Bioconductor
This paper addresses the critical bottleneck of managing and organizing the growing volume of differential expression and functional enrichment analys...
-
Cross-Species Antimicrobial Resistance Prediction from Genomic Foundation Models
This paper addresses the core challenge of predicting antimicrobial resistance across phylogenetically distinct bacterial species, where traditional m...
Single Molecule Localization Microscopy Challenge: A Biologically Inspired Benchmark for Long-Sequence Modeling
Technische Universität Wien
30秒速读
IN SHORT: This paper addresses the core challenge of evaluating state-space models on biologically realistic, sparse, and stochastic temporal processes, which are not captured by existing benchmarks focused on dense, regularly sampled data.
核心创新
- Methodology Introduces SMLM-C, the first benchmark dataset specifically designed to evaluate long-sequence models on sparse spatiotemporal localization data with known ground truth, spanning dSTORM and DNA-PAINT modalities.
- Methodology Formulates SMLM reconstruction as a sequence-to-set prediction task, requiring models to disentangle overlapping localization clouds by jointly exploiting spatial and temporal context over up to 10,000 frames.
- Biology Reveals that state-space model performance degrades substantially as temporal discontinuity increases (e.g., detection accuracy drops from ~73% to ~62% when average off-time increases from 100 to 1000 frames), highlighting fundamental challenges in modeling heavy-tailed blinking dynamics.
主要结论
- State-space models show limited absolute performance on SMLM reconstruction, with the highest detection accuracy reaching only 73.4% ± 1.23% (S5-L on μ_off=100 frames) and dropping to 69.6% ± 0.21% (Mamba-2-L on μ_off=1000 frames) under a 20 nm matching threshold.
- Model performance is strongly influenced by temporal sparsity, with all evaluated architectures (S5 and Mamba-2) showing degraded performance as average off-time increases from 100 to 1000 frames, indicating fundamental challenges in handling long-range temporal dependencies.
- Mamba-2 demonstrates better robustness to long temporal gaps, outperforming S5 in the long off-time regime (μ_off=1000 frames), while S5 performs better under shorter dark states (μ_off=100 frames), suggesting architectural differences in handling temporal discontinuity.
摘要: State space models (SSMs) have recently achieved strong performance on long-sequence modeling tasks while offering improved memory and computational efficiency compared to transformer-based architectures. However, their evaluation has been largely limited to synthetic benchmarks and application domains such as language and audio, leaving their behavior on sparse and stochastic temporal processes in biological imaging unexplored. In this work, we introduce the Single Molecule Localization Microscopy Challenge (SMLM-C), a benchmark dataset consisting of ten SMLM simulations—spanning dSTORM and DNA-PAINT modalities with varying hyperparameter—designed to evaluate state-space models on biologically realistic spatiotemporal point-process data with known ground truth. Using a controlled subset of these simulations, we evaluate state space models and find that performance degrades substantially as temporal discontinuity increases, revealing fundamental challenges in modeling heavy-tailed blinking dynamics. These results highlight the need for sequence models better suited to sparse, irregular temporal processes encountered in real-world scientific imaging data.