Paper List
-
Ill-Conditioning in Dictionary-Based Dynamic-Equation Learning: A Systems Biology Case Study
This paper addresses the critical challenge of numerical ill-conditioning and multicollinearity in library-based sparse regression methods (e.g., SIND...
-
Hybrid eTFCE–GRF: Exact Cluster-Size Retrieval with Analytical pp-Values for Voxel-Based Morphometry
This paper addresses the computational bottleneck in voxel-based neuroimaging analysis by providing a method that delivers exact cluster-size retrieva...
-
abx_amr_simulator: A simulation environment for antibiotic prescribing policy optimization under antimicrobial resistance
This paper addresses the critical challenge of quantitatively evaluating antibiotic prescribing policies under realistic uncertainty and partial obser...
-
PesTwin: a biology-informed Digital Twin for enabling precision farming
This paper addresses the critical bottleneck in precision agriculture: the inability to accurately forecast pest outbreaks in real-time, leading to su...
-
Equivariant Asynchronous Diffusion: An Adaptive Denoising Schedule for Accelerated Molecular Conformation Generation
This paper addresses the core challenge of generating physically plausible 3D molecular structures by bridging the gap between autoregressive methods ...
-
Omics Data Discovery Agents
This paper addresses the core challenge of making published omics data computationally reusable by automating the extraction, quantification, and inte...
-
Single-cell directional sensing at ultra-low chemoattractant concentrations from extreme first-passage events
This work addresses the core challenge of how a cell can rapidly and accurately determine the direction of a chemoattractant source when the signal is...
-
SDSR: A Spectral Divide-and-Conquer Approach for Species Tree Reconstruction
This paper addresses the computational bottleneck in reconstructing species trees from thousands of species and multiple genes by introducing a scalab...
在强生物域偏移下药物反应模型对患者肿瘤的样本高效适应
Université Grenoble Alpes (UGA)
30秒速读
IN SHORT: 通过从无标记分子谱中学习可迁移表征,利用最少的临床数据实现患者药物反应的有效预测。
核心创新
- Methodology Proposes STaR-DR, a staged transfer-learning framework that explicitly separates unsupervised representation learning, task-specific alignment, and few-shot clinical adaptation.
- Methodology Demonstrates that unsupervised pretraining yields limited gains for in vitro prediction but substantially improves few-shot adaptation to patient tumors under strong domain shift.
- Biology Links performance patterns to latent-space geometry, providing mechanistic insight into when representation learning is beneficial under biological domain shift.
主要结论
- 无监督预训练对域内预测(平衡准确率约0.85)和跨数据集泛化(ROC-AUC约0.75)的益处有限,但在适应具有非常有限标记数据的患者肿瘤时能带来明显收益。
- 分阶段框架在少样本患者水平适应期间实现了更快的性能提升,与单阶段基线相比,有效迁移所需的标记目标样本数量减少了约30-40%。
- 在Leave-Drug-Out协议下性能下降最为显著(AUPRC下降约0.15),突显了将药物反应预测外推到先前未见化合物的内在困难。
摘要: 由于体外细胞系与患者肿瘤之间存在显著的生物学差距,从临床前数据预测患者的药物反应仍然是精准肿瘤学的主要挑战。本研究不追求提高绝对的体外预测准确性,而是探讨在强生物域偏移下,明确分离表征学习与任务监督是否能使药物反应模型对患者数据实现更高效的样本适应。我们提出了一个分阶段的迁移学习框架,其中细胞和药物表征首先通过基于自动编码器的表征学习从大量未标记的药物基因组数据中独立学习。这些表征随后在细胞系数据上与药物反应标签对齐,并最终通过少样本监督适应到患者肿瘤。通过涵盖域内、跨数据集和患者水平设置的系统评估,我们发现当源域和目标域显著重叠时,无监督预训练提供的益处有限,但在适应具有非常有限标记数据的患者肿瘤时能带来明显收益。具体而言,所提出的框架在少样本患者水平适应期间实现了更快的性能提升,同时在标准细胞系基准测试中保持与单阶段基线相当的准确性。总体而言,这些结果表明,从无标记分子谱中学习结构化和可迁移的表征可以显著减少有效药物反应预测所需的临床监督量,为数据高效的临床前到临床转化提供了一条实用途径。