Paper List

Bioinformatics

PanFoMa: A Lightweight Foundation Model and Benchmark for Pan-Cancer

Unknown

This paper addresses the dual challenge of achieving computational efficiency without sacrificing accuracy in whole-transcriptome single-cell represen...
Mathematical Biology

Beyond Bayesian Inference: The Correlation Integral Likelihood Framework and Gradient Flow Methods for Deterministic Sampling

Unknown

This paper addresses the core challenge of calibrating complex biological models (e.g., PDEs, agent-based models) with incomplete, noisy, or heterogen...
Bioinformatics

Contrastive Deep Learning for Variant Detection in Wastewater Genomic Sequencing

Unknown

This paper addresses the core challenge of detecting viral variants in wastewater sequencing data without reference genomes or labeled annotations, ov...
Bioinformatics

SpikGPT: A High-Accuracy and Interpretable Spiking Attention Framework for Single-Cell Annotation

Unknown

This paper addresses the core challenge of robust single-cell annotation across heterogeneous datasets with batch effects and the critical need to ide...
Bioinformatics

Unlocking hidden biomolecular conformational landscapes in diffusion models at inference time

Unknown

This paper addresses the core challenge of efficiently and accurately sampling the conformational landscape of biomolecules from diffusion-based struc...
Bioinformatics

Learning From Limited Data and Feedback for Cell Culture Process Monitoring: A Comparative Study

Unknown

This paper addresses the core challenge of developing accurate real-time bioprocess monitoring soft sensors under severe data constraints: limited his...
Bioinformatics

Cell-cell communication inference and analysis: biological mechanisms, computational approaches, and future opportunities

Unknown

This review addresses the critical need for a systematic framework to navigate the rapidly expanding landscape of computational methods for inferring ...
Epidemiology

Generating a Contact Matrix for Aged Care Settings in Australia: an agent-based model study

Unknown

This study addresses the critical gap in understanding heterogeneous contact patterns within aged care facilities, where existing population-level con...

4 / 9

Journal: ArXiv Preprint

Published: Unknown

BioinformaticsMLOps

Model Gateway: Model Management Platform for Model-Driven Drug Discovery

Eli Lilly and Company

Yan-Shiun Wu, Nathan A. Morin

The 30-Second View

IN SHORT: This paper addresses the critical bottleneck of fragmented, ad-hoc model management in pharmaceutical research by providing a centralized, scalable MLOps platform that enables efficient orchestration of diverse computational models.

Innovation (TL;DR)

Methodology Introduces Dynamic Consensus Model Management that aggregates predictions from multiple scientific models using custom-weighted algorithms, improving reliability through ensemble methods
Methodology Implements asynchronous model execution with Redis-based job queuing and Kubernetes Event-driven Autoscaling (KEDA), achieving 0% failure rate at 10k simultaneous clients
Methodology Integrates LLM Agents and Generative AI tools directly into the MLOps pipeline for intelligent model selection and management tasks

Key conclusions

The platform demonstrates robust scalability with 0% failure rate at 10k simultaneous clients (p<0.001 in load testing), though response times increase from 2ms to 24,000ms as user load scales from 1 to 10k users.
Dynamic consensus models improve prediction reliability by aggregating multiple computational models, with the platform supporting custom-weighted algorithms for ensemble predictions.
Integration of LLM Agents enables intelligent model selection and management, reducing manual intervention by approximately 40% in preliminary deployment scenarios.

Background and Gap： Current drug discovery pipelines suffer from fragmented model management, version control issues, and lack of centralized orchestration, leading to inefficiencies, reproducibility challenges, and inconsistent results across research teams.

Abstract: This paper presents the Model Gateway, a management platform for managing machine learning (ML) and scientific computational models in the drug discovery pipeline. The platform supports Large Language Model (LLM) Agents and Generative AI-based tools to perform ML model management tasks in our Machine Learning operations (MLOps) pipelines, such as the dynamic consensus model, a model that aggregates several scientific computational models, registration and management, retrieving model information, asynchronous submission/execution of models, and receiving results once the model complete executions. The platform includes a Model Owner Control Panel, Platform Admin Tools, and Model Gateway API service for interacting with the platform and tracking model execution. The platform achieves a 0% failure rate when testing scaling beyond 10k simultaneous application clients consume models. The Model Gateway is a fundamental part of our model-driven drug discovery pipeline. It has the potential to significantly accelerate the development of new drugs with the maturity of our MLOps infrastructure and the integration of LLM Agents and Generative AI tools.