Paper List

Bioinformatics

SpikGPT: A High-Accuracy and Interpretable Spiking Attention Framework for Single-Cell Annotation

2025-12-02

This paper addresses the core challenge of robust single-cell annotation across heterogeneous datasets with batch effects and the critical need to ide...
Bioinformatics

Unlocking hidden biomolecular conformational landscapes in diffusion models at inference time

2025-12-02

This paper addresses the core challenge of efficiently and accurately sampling the conformational landscape of biomolecules from diffusion-based struc...
Computational Neuroscience

Personalized optimization of pediatric HD-tDCS for dose consistency and target engagement

2025-12-01

This paper addresses the critical limitation of one-size-fits-all HD-tDCS protocols in pediatric populations by developing a personalized optimization...
Computational Biophysics

Realistic Transition Paths for Large Biomolecular Systems: A Langevin Bridge Approach

2025-12-01

This paper addresses the core challenge of generating physically realistic and computationally efficient transition paths between distinct protein con...
Bioinformatics

Consistent Synthetic Sequences Unlock Structural Diversity in Fully Atomistic De Novo Protein Design

2025-12-01

This paper addresses the core pain point of low sequence-structure alignment in existing synthetic datasets (e.g., AFDB), which severely limits the pe...
Bioinformatics

MoRSAIK: Sequence Motif Reactor Simulation, Analysis and Inference Kit in Python

2025-12-01

This work addresses the computational bottleneck in simulating prebiotic RNA reactor dynamics by developing a Python package that tracks sequence moti...
Bioinformatics

On the Approximation of Phylogenetic Distance Functions by Artificial Neural Networks

2025-12-01

This paper addresses the core challenge of developing computationally efficient and scalable neural network architectures that can learn accurate phyl...
Bioinformatics

EcoCast: A Spatio-Temporal Model for Continual Biodiversity and Climate Risk Forecasting

2025-12-01

This paper addresses the critical bottleneck in conservation: the lack of timely, high-resolution, near-term forecasts of species distribution shifts ...

15 / 18

期刊: ArXiv Preprint

发布日期: 2018-02-09

Bio-inspired RoboticsReinforcement Learning

通过虚拟鱼强化学习控制鱼群

Faculty of Engineering, Kyoto University | Graduate School of Information Science, University of Hyogo

Yusuke Nishii, Hiroaki Kawashima

30秒速读

IN SHORT: 证明了无模型强化学习可以利用虚拟视觉刺激有效引导鱼群，克服了缺乏精确行为模型的问题。

核心创新

Methodology First application of model-free Q-learning to control collective animal behavior via virtual agents, bypassing the need for complex fish school models.
Methodology Introduces a practical camera-display interaction system with coordinate mapping, enabling real-time state observation and virtual stimulus presentation.
Biology Leverages fundamental biological reactions (attraction, alignment, optomotor response) for control, validated with Rummy-nose tetra (Hemigrammus bleheri).

主要结论

模拟结果证实，即使鱼有50%的概率忽略虚拟刺激，强化学习也能学习到有效的策略（奖励接近+1），证明了其对间歇性反应的鲁棒性。
现实世界实验表明，学习到的策略显著优于无刺激基线（p < 0.01）和启发式“停留在边缘”策略，成功将鱼群质心引导至目标边缘。
该研究成功将模拟训练的Q函数迁移到真实环境，减少了所需学习时间，并验证了强化学习方法的可迁移性。

研究空白： 现有影响鱼群的方法依赖于物理机器人（存在耐久性/控制问题）或需要精确的集体行为预测模型，而后者目前尚不可用。这在理论模型与实用、自适应控制之间造成了空白。

摘要: 本研究探索了一种利用强化学习训练的虚拟鱼来引导和控制鱼群的方法。我们使用屏幕上显示的2D虚拟鱼来克服物理机器人代理固有的耐久性和运动限制等技术挑战。为了解决缺乏真实鱼详细行为模型的问题，我们采用了无模型强化学习方法。首先，模拟结果表明，即使模拟的真实鱼经常忽略虚拟刺激，强化学习也能获得有效的运动策略。其次，活鱼的现实世界实验证实，学习到的策略成功地将鱼群引导至指定的目标方向。统计分析表明，所提出的方法显著优于基线条件，包括无刺激和启发式“停留在边缘”策略。这项研究为如何通过人工代理利用强化学习影响集体动物行为提供了早期示范。