Paper List
-
MCP-AI: Protocol-Driven Intelligence Framework for Autonomous Reasoning in Healthcare
This paper addresses the critical gap in healthcare AI systems that lack contextual reasoning, long-term state management, and verifiable workflows by...
-
Model Gateway: Model Management Platform for Model-Driven Drug Discovery
This paper addresses the critical bottleneck of fragmented, ad-hoc model management in pharmaceutical research by providing a centralized, scalable ML...
-
Tree Thinking in the Genomic Era: Unifying Models Across Cells, Populations, and Species
This paper addresses the fragmentation of tree-based inference methods across biological scales by identifying shared algorithmic principles and stati...
-
SSDLabeler: Realistic semi-synthetic data generation for multi-label artifact classification in EEG
This paper addresses the core challenge of training robust multi-label EEG artifact classifiers by overcoming the scarcity and limited diversity of ma...
-
Decoding Selective Auditory Attention to Musical Elements in Ecologically Valid Music Listening
This paper addresses the core challenge of objectively quantifying listeners' selective attention to specific musical components (e.g., vocals, drums,...
-
Physics-Guided Surrogate Modeling for Machine Learning–Driven DLD Design Optimization
This paper addresses the core bottleneck of translating microfluidic DLD devices from research prototypes to clinical applications by replacing weeks-...
-
Mechanistic Interpretability of Antibody Language Models Using SAEs
This work addresses the core challenge of achieving both interpretability and controllable generation in domain-specific protein language models, spec...
-
Fluctuating Environments Favor Extreme Dormancy Strategies and Penalize Intermediate Ones
This paper addresses the core challenge of determining how organisms should tune dormancy duration to match the temporal autocorrelation of their envi...
Hybrid eTFCE–GRF: Exact Cluster-Size Retrieval with Analytical pp-Values for Voxel-Based Morphometry
University of Cambridge | National University of Singapore | University of Dundee
30秒速读
IN SHORT: This paper addresses the computational bottleneck in voxel-based neuroimaging analysis by providing a method that delivers exact cluster-size retrieval and analytical inference simultaneously, eliminating the need for slow permutation testing while removing discretization errors.
核心创新
- Methodology Proposes a hybrid algorithm that combines eTFCE's union-find data structure for exact cluster-size retrieval with pTFCE's analytical Gaussian Random Field (GRF) inference, achieving both properties for the first time.
- Methodology Introduces a six-experiment Monte Carlo validation protocol demonstrating nominal family-wise error rate (FWER) control (0/200 rejections, 95% CI [0.0%, 1.9%]), no power loss (Dice ≥0.999), and high cross-variant concordance (r > 0.99).
- Software Develops and releases pytfce, an open-source, pure-Python package that achieves 4.6x to 75x speedup over the reference R implementation and is more than three orders of magnitude faster than permutation-based TFCE.
主要结论
- The hybrid eTFCE-GRF method successfully controls the family-wise error rate at the nominal level, with 0 false positives out of 200 tests (95% CI [0.0%, 1.9%]).
- Statistical power is preserved with Dice coefficients ≥0.999 compared to baseline pTFCE at sufficient signal strength, and cross-variant concordance exceeds r=0.99.
- Runtime improvements are substantial: the baseline implementation is 75x faster than R pTFCE (~5 seconds vs. ~375 seconds), while the hybrid variant is 4.6x faster (~85 seconds) with the added benefit of exact cluster-size retrieval.
摘要: Threshold-free cluster enhancement (TFCE) improves sensitivity in voxel-wise neuroimaging inference by integrating cluster extent across all thresholds, but its reliance on permutation testing makes it prohibitively slow for large datasets. Probabilistic TFCE (pTFCE) replaces permutations with analytical Gaussian random field (GRF) pp-values, which reduces runtime by more than an order of magnitude, yet relies on a fixed threshold grid that introduces discretisation error. Exact TFCE (eTFCE) eliminates this discretisation by computing the integral exactly via a union-find data structure, but still requires permutations for inference. We propose a hybrid method that combines eTFCE’s union-find data structure for exact cluster-size retrieval with pTFCE’s analytical GRF inference. The union-find builds the full cluster hierarchy in a single pass over sorted voxels and enables exact cluster-size queries at any threshold in near-constant time; GRF theory then converts these sizes into analytical pp-values without permutations. We validate the method through a six-experiment Monte Carlo study on synthetic phantoms (64364^{3}, 80 subjects): null family-wise error rate is controlled at the nominal level (0/200 rejections, 95% CI [0.0%,1.9%][0.0\%,1.9\%]); power curves match baseline pTFCE (Dice ≥0.999\geq 0.999 at sufficient signal); smoothness estimation error is below 1%; and cross-variant concordance exceeds r=0.99r=0.99. On real brain data from UK Biobank (N=500N=500, within-vendor) and IXI (N=563N=563, cross-vendor), the method detects biologically plausible scanner, age, and sex effects; on IXI, significance maps form strict subsets of the reference R pTFCE output, which supports conservative family-wise error control. Both methods are implemented in pytfce, a pure-Python package with no R or FSL dependencies, available on PyPI. The baseline reimplementation completes whole-brain voxel-based morphometry in ∼5{\sim}5 s (75×75\times faster than R pTFCE), while the hybrid variant completes in ∼85{\sim}85 s (4.6×4.6\times faster) with the advantage of exact cluster-size retrieval; both are more than three orders of magnitude faster than permutation-based TFCE.