https://arxiv.org/api/22Acxonif7MW2jfw+bIu6E7Cmmc 2026-03-16T12:43:30Z 6635 60 15 http://arxiv.org/abs/2507.02883v2 DISPROTBENCH: Uncovering the Functional Limits of Protein Structure Prediction Models in Intrinsically Disordered Regions 2026-02-10T15:40:21Z

Intrinsically disordered regions (IDRs) play central roles in cellular function, yet remain poorly evaluated by existing protein structure prediction benchmarks. Current evaluations largely focus on well-folded domains, overlooking three fundamental challenges in realistic biological settings: the structural complexity of proteins, the resulting low availability of reliable ground truth, and prediction uncertainty that can propagate into high-risk downstream failures, such as in drug discovery, protein-protein interaction modeling, and functional annotation. We present DisProtBench, an IDR-centric benchmark that explicitly incorporates prediction uncertainty into the evaluation of protein structure prediction models (PSPMs). To address structural complexity and ground-truth scarcity, we curate and unify a large-scale, multi-modal dataset spanning disease-relevant IDRs, GPCR-ligand interactions, and multimeric protein complexes. To assess predictive uncertainty, we introduce Functional Uncertainty Sensitivity (FUS), a novel prediction uncertainty-stratified metric that quantifies downstream task performance under prediction uncertainty. Using this benchmark, we conduct a systematic evaluation of state-of-the-art PSPMs and reveal clear, task-dependent failure modes. Protein-protein interaction prediction degrades sharply in IDRs, while structure-based drug discovery remains comparatively robust. These effects are largely invisible to standard global accuracy metrics, which overestimate functional reliability under prediction uncertainty. We have open-sourced our benchmark and the codebase at https://github.com/Susan571/DisProtBench.

2025-06-18T23:58:22Z Xinyue Zeng Tuo Wang Adithya Kulkarni Alexander Lu Alexandra Ni Phoebe Xing Junhan Zhao Siwei Chen Dawei Zhou http://arxiv.org/abs/2602.05451v2 CPTCs Drive Somatic-Visceral Communication via the Wnt Axis in Somatic Mechanotherapy: A Single-Cell Deep Learning Study 2026-02-10T14:29:24Z

Somatic mechanical stimulation (e.g., acupuncture) exerts systemic immunomodulatory effects, yet the cellular bridge translating peripheral physical force into visceral repair remains elusive. Here, employing a custom interpretable deep learning framework (CARSS) on single-cell RNA sequencing data, we identify CD34$^{+}$PDGFR$α$$^{+}$ telocytes (CPTCs) as the primary mechanosensors in both fascia and colon during bacterial colitis. We show that somatic mechanotherapy triggers an AP-1/Hsp70-dependent transcriptional program in fascial CPTCs, inducing systemic Wnt elevation, which elicits a "transcriptional resonance" in colonic CPTCs, reprogramming their communication network from an inflammatory amplifier to a Wnt-driven regenerative hub. Mechanistically, this axis activates epithelial $β$-catenin/Myc signaling, suppressing apoptosis and restoring barrier integrity independent of immune cells. Our findings define a CPTC-Driven Mechano-Resonance Axis, where CPTCs serve as synchronized relay stations that convert local mechanical cues into systemic regenerative microenvironments.

2026-02-05T08:50:46Z 7 Main Figures + 7 Supplementary Figures Haixiang Huang Zhenwei Zhang BingBing Shen Jianming Yue Lu Mei Xudong Zhu Yonghong Shi Qianmei Zhu Yeping Shi Yifan Luo Yitong Xing Meng Dai Qiusheng Chen http://arxiv.org/abs/2602.02320v2 A Large-Scale Dataset for Molecular Structure-Language Description via a Rule-Regularized Method 2026-02-10T13:28:30Z

Molecular function is largely determined by structure. Accurately aligning molecular structure with natural language is therefore essential for enabling large language models (LLMs) to reason about downstream chemical tasks. However, the substantial cost of human annotation makes it infeasible to construct large-scale, high-quality datasets of structure-grounded descriptions. In this work, we propose a fully automated annotation framework for generating precise molecular structure descriptions at scale. Our approach builds upon and extends a rule-based chemical nomenclature parser to interpret IUPAC names and construct enriched, structured XML metadata that explicitly encodes molecular structure. This metadata is then used to guide LLMs in producing accurate natural-language descriptions. Using this framework, we curate a large-scale dataset of approximately $163$k molecule-description pairs. A rigorous validation protocol combining LLM-based and expert human evaluation on a subset of $2,000$ molecules demonstrates a high description precision of $98.6\%$. The resulting dataset provides a reliable foundation for future molecule-language alignment, and the proposed annotation method is readily extensible to larger datasets and broader chemical tasks that rely on structural descriptions.

2026-02-02T16:49:19Z Feiyang Cai Guijuan He Yi Hu Jingjing Wang Joshua Luo Tianyu Zhu Srikanth Pilla Gang Li Ling Liu Feng Luo http://arxiv.org/abs/2407.13981v3 Decomposed Direct Preference Optimization for Structure-Based Drug Design 2026-02-10T08:38:33Z

Diffusion models have achieved promising results for Structure-Based Drug Design (SBDD). Nevertheless, high-quality protein subpocket and ligand data are relatively scarce, which hinders the models' generation capabilities. Recently, Direct Preference Optimization (DPO) has emerged as a pivotal tool for aligning generative models with human preferences. In this paper, we propose DecompDPO, a structure-based optimization method aligns diffusion models with pharmaceutical needs using multi-granularity preference pairs. DecompDPO introduces decomposition into the optimization objectives and obtains preference pairs at the molecule or decomposed substructure level based on each objective's decomposability. Additionally, DecompDPO introduces a physics-informed energy term to ensure reasonable molecular conformations in the optimization results. Notably, DecompDPO can be effectively used for two main purposes: (1) fine-tuning pretrained diffusion models for molecule generation across various protein families, and (2) molecular optimization given a specific protein subpocket after generation. Extensive experiments on the CrossDocked2020 benchmark show that DecompDPO significantly improves model performance, achieving up to 95.2% Med. High Affinity and a 36.2% success rate for molecule generation, and 100% Med. High Affinity and a 52.1% success rate for molecular optimization. Code is available at https://github.com/laviaf/DecompDPO.

2024-07-19T02:12:25Z Accepted by TMLR Xiwei Cheng Xiangxin Zhou Yuwei Yang Yu Bao Quanquan Gu http://arxiv.org/abs/2406.16821v3 General Binding Affinity Guidance for Diffusion Models in Structure-Based Drug Design 2026-02-10T00:07:30Z

Structure-based drug design (SBDD) aims to generate ligands that bind strongly and specifically to target protein pockets. Recent diffusion models have advanced SBDD by capturing the distributions of atomic positions and types, yet they often underemphasize binding affinity control during generation. To address this limitation, we introduce \textbf{\textnormal{\textbf{BADGER}}}, a general \textbf{binding-affinity guidance framework for diffusion models in SBDD}. \textnormal{\textbf{BADGER} }incorporates binding affinity awareness through two complementary strategies: (1) \textit{classifier guidance}, which applies gradient-based affinity signals during sampling in a plug-and-play fashion, and (2) \textit{classifier-free guidance}, which integrates affinity conditioning directly into diffusion model training. Together, these approaches enable controllable ligand generation guided by binding affinity. \textnormal{\textbf{BADGER} } can be added to any diffusion model and achieves up to a \textbf{60\% improvement in ligand--protein binding affinity} of sampled molecules over prior methods. Furthermore, we extend the framework to \textbf{multi-constraint diffusion guidance}, jointly optimizing for binding affinity, drug-likeness (QED), and synthetic accessibility (SA) to design realistic and synthesizable drug candidates.

2024-06-24T17:31:41Z Yue Jian Curtis Wu Danny Reidenbach Aditi S. Krishnapriyan 10.1021/acs.jcim.5c01166 http://arxiv.org/abs/2505.15054v3 MolLangBench: A Comprehensive Benchmark for Language-Prompted Molecular Structure Recognition, Editing, and Generation 2026-02-09T21:32:30Z

Precise recognition, editing, and generation of molecules are essential prerequisites for both chemists and AI systems tackling various chemical tasks. We present MolLangBench, a comprehensive benchmark designed to evaluate fundamental molecule-language interface tasks: language-prompted molecular structure recognition, editing, and generation. To ensure high-quality, unambiguous, and deterministic outputs, we construct the recognition tasks using automated cheminformatics tools, and curate editing and generation tasks through rigorous expert annotation and validation. MolLangBench supports the evaluation of models that interface language with different molecular representations, including linear strings, molecular images, and molecular graphs. Evaluations of state-of-the-art models reveal significant limitations: the strongest model (GPT-5) achieves $86.2\%$ and $85.5\%$ accuracy on recognition and editing tasks, which are intuitively simple for humans, and performs even worse on the generation task, reaching only $43.0\%$ accuracy. These results highlight the shortcomings of current AI systems in handling even preliminary molecular recognition and manipulation tasks. We hope MolLangBench will catalyze further research toward more effective and reliable AI systems for chemical applications.The dataset and code can be accessed at https://huggingface.co/datasets/ChemFM/MolLangBench and https://github.com/TheLuoFengLab/MolLangBench, respectively.

2025-05-21T03:22:01Z ICLR-2026 Camera-Ready version Feiyang Cai Jiahui Bai Tao Tang Guijuan He Joshua Luo Tianyu Zhu Srikanth Pilla Gang Li Ling Liu Feng Luo http://arxiv.org/abs/2602.08897v1 A Mathematical Theory of Redox Biology 2026-02-09T16:58:26Z

Redox biology underpins signalling, metabolism, immunity, and adaptation, yet lacks a unifying theoretical framework capable of formalising structure, function, and dynamics. Current interpretations rely on descriptive catalogues of molecules and reactions, obscuring how redox behaviour emerges from constrained biochemical organisation. Here, we present a mathematical theory of redox biology that resolves this gap by treating redox systems as finite, compositional, dynamical, and spatially embedded objects. We define a structured redox state space in which admissible molecular transformations form a neutral algebra of possibilities. Biological function emerges when this structure is embedded within a wider molecular network and interpreted through weighted flux distributions. Time-dependent reweighting of these transformations generates redox dynamics, while spatial embedding enforces locality and causality, yielding a distributed redox field. Within this framework, context dependence, nonlinearity, hysteresis, and memory arise naturally from bounded state spaces and irreversible transformations, without requiring ad hoc assumptions. This theory provides a working, predictive interpretative basis for redox biology: it constrains admissible states and trajectories, clarifies the meaning of redox measurements, and links chemical transformation to biological behaviour. Redox biology emerges as a geometric, dynamical process governed by lawful organisation.

2026-02-09T16:58:26Z 41 pages, 4 figures, 2 boxes James N. Cobley Michalis G. Nikolaidis http://arxiv.org/abs/2602.08641v1 Modeling Protein Evolution via Generative Inference From Monte Carlo Chains to Population Genetics 2026-02-09T13:35:17Z

Generative models derived from large protein sequence alignments define complex fitness landscapes, but their utility for accurately modeling non-equilibrium evolutionary dynamics remains unclear. In this work, we perform a rigorous comparative analysis of three simulation schemes, designed to mimic evolution in silico by local sampling of the probability distribution defined by a generative model. We compare standard independent Markov Chain Monte Carlo, Monte Carlo on a phylogenetic tree, and a population genetics dynamics, benchmarking their outputs against deep sequencing data from four distinct in vitro evolution experiments. We find that standard Monte Carlo fails to reproduce the correct phylogenetic structure and generates unrealistic, gradual mutational sweeps. Performing Monte Carlo on a tree inferred from data improves phylogenetic fidelity and historical accuracy. The population genetics scheme successfully captures phylogenetic correlations, mutational abundances, and selective sweeps as emergent properties, without the need to infer additional information from data. However, the latter choice come at the price of not sampling the proper generative model distribution at long times. Our findings highlight the crucial role of phylogenetic correlations and finite-population effects in shaping evolutionary trajectories on fitness landscapes. These models therefore provide powerful tools for predicting complex adaptive paths and for reliably extrapolating evolutionary dynamics beyond current experimental limitations.

2026-02-09T13:35:17Z Leonardo Di Bari Thierry Mora Andrea Pagnani Aleksandra M. Walczak Francesco Zamponi Saverio Rossi http://arxiv.org/abs/2602.18476v1 BioLM-Score: Language-Prior Conditioned Probabilistic Geometric Potentials for Protein-Ligand Scoring 2026-02-09T12:31:49Z

Protein-ligand scoring is a central component of structure-based drug design, underpinning molecular docking, virtual screening, and pose optimization. Conventional physics-based energy functions are often computationally expensive, limiting their utility in large-scale screening. In contrast, deep learning-based scoring models offer improved computational efficiency but frequently suffer from limited cross-target generalization and poor interpretability, which restrict their practical applicability. Here we present BioLM-Score, a simple yet generalizable protein-ligand scoring model that couples geometric modeling with representation learning. Specifically, it employs modality-specific and structure-aware encoders for proteins and ligands, each augmented with biomolecular language models to enrich structural and chemical representations. Subsequently, these representations are integrated through a mixture density network to predict multimodal interatomic distance distributions, from which statistically grounded likelihood-based scores are derived. Evaluations on the CASF-2016 benchmark demonstrate that BioLM-Score achieves significant improvements across docking, scoring, ranking, and screening tasks. Moreover, the proposed scoring function serves as an effective optimization objective for guiding docking protocols and conformational search. In summary, BioLM-Score provides a principled and practical alternative to existing scoring functions, combining efficiency, generalization, and interpretability for structure-based drug discovery.

2026-02-09T12:31:49Z 9 pages, 2 figures Zhangfan Yang Baoyun Chen Dong Xu Jia Wang Ruibin Bai Junkai Ji Zexuan Zhu http://arxiv.org/abs/2602.06020v2 Mechanisms of AI Protein Folding in ESMFold 2026-02-08T20:38:53Z

How do protein structure prediction models fold proteins? We investigate this question by tracing how ESMFold folds a beta hairpin, a prevalent structural motif. Through counterfactual interventions on model latents, we identify two computational stages in the folding trunk. In the first stage, early blocks initialize pairwise biochemical signals: residue identities and associated biochemical features such as charge flow from sequence representations into pairwise representations. In the second stage, late blocks develop pairwise spatial features: distance and contact information accumulate in the pairwise representation. We demonstrate that the mechanisms underlying structural decisions of ESMFold can be localized, traced through interpretable representations, and manipulated with strong causal effects.

2026-02-05T18:54:54Z Our code, data, and results are available at https://folding.baulab.info Kevin Lu Jannik Brinkmann Stefan Huber Aaron Mueller Yonatan Belinkov David Bau Chris Wendler http://arxiv.org/abs/2601.18435v2 The Quantum Cliff: A Critical Proton Tunneling Threshold Determines Clinical Severity in RPE65-Mediated Retinal Disease 2026-02-08T10:36:55Z

Predicting clinical severity from genotype remains a fundamental challenge in molecular medicine, particularly for enzymes whose function depends on sub-atomic-scale geometry. Mutations in the \textit{RPE65} isomerohydrolase cause Leber Congenital Amaurosis (LCA) and related retinal diseases; however, the kinetic mechanisms connecting sub-atomic-scale perturbations to blindness remain unclear. In this study, we demonstrate that mutations in the human visual isomerase RPE65 are governed by a quantum-mechanical threshold effect arising from proton tunneling in the active site. We established a hybrid quantum-classical structure-to-phenotype pipeline combining AlphaFold structure prediction with \textit{ab initio} quantum simulation using the Variational Quantum Eigensolver (VQE) to analyze minimal proton-coupled electron transfer in the visual cycle. Our analysis reveals that many pathogenic mutations do not merely occlude the active site, but rather strongly reduce the quantum probability of proton tunneling. We observed a sharp non-linear effect, termed the "Quantum Cliff," where minute structural changes (below 0.1 Å) reduce the reaction rate by multiple orders of magnitude. Based on these findings, we introduce a dimensionless Relative Quantum Activity Score (RQAS) that isolates the geometry-controlled exponential sensitivity of the reaction rate and successfully distinguishes between mild and severe patient phenotypes. These results suggest that RPE65 operates near a quantum-critical point, where sub-Angstrom structural perturbations induce a catastrophic loss of function. Furthermore, our findings establish quantum tunneling as a predictive mechanistic link between atomic structure and clinical phenotype, proposing a general framework for quantum-structural disease modeling.

2026-01-26T12:50:27Z Biraja Ghoshal William Woof Bhargab Ghoshal Nikolas Pontikos http://arxiv.org/abs/2602.07735v1 TerraBind: Fast and Accurate Binding Affinity Prediction through Coarse Structural Representations 2026-02-08T00:01:43Z

We present TerraBind, a foundation model for protein-ligand structure and binding affinity prediction that achieves 26-fold faster inference than state-of-the-art methods while improving affinity prediction accuracy by $\sim$20\%. Current deep learning approaches to structure-based drug design rely on expensive all-atom diffusion to generate 3D coordinates, creating inference bottlenecks that render large-scale compound screening computationally intractable. We challenge this paradigm with a critical hypothesis: full all-atom resolution is unnecessary for accurate small molecule pose and binding affinity prediction. TerraBind tests this hypothesis through a coarse pocket-level representation (protein C$_β$ atoms and ligand heavy atoms only) within a multimodal architecture combining COATI-3 molecular encodings and ESM-2 protein embeddings that learns rich structural representations, which are used in a diffusion-free optimization module for pose generation and a binding affinity likelihood prediction module. On structure prediction benchmarks (FoldBench, PoseBusters, Runs N' Poses), TerraBind matches diffusion-based baselines in ligand pose accuracy. Crucially, TerraBind outperforms Boltz-2 by $\sim$20\% in Pearson correlation for binding affinity prediction on both a public benchmark (CASP16) and a diverse proprietary dataset (18 biochemical/cell assays). We show that the affinity prediction module also provides well-calibrated affinity uncertainty estimates, addressing a critical gap in reliable compound prioritization for drug discovery. Furthermore, this module enables a continual learning framework and a hedged batch selection strategy that, in simulated drug discovery cycles, achieves 6$\times$ greater affinity improvement of selected molecules over greedy-based approaches.

2026-02-08T00:01:43Z 31 pages, 14 figures Matteo Rossi Ryan Pederson Miles Wang-Henderson Ben Kaufman Edward C. Williams Carl Underkoffler Owen Lewis Howell Adrian Layer Stephan Thaler Narbe Mardirossian John Anthony Parkhill http://arxiv.org/abs/2602.06418v1 Adaptive Protein Tokenization 2026-02-06T06:15:14Z

Tokenization is a promising path to multi-modal models capable of jointly understanding protein sequences, structure, and function. Existing protein structure tokenizers create tokens by pooling information from local neighborhoods, an approach that limits their performance on generative and representation tasks. In this work, we present a method for global tokenization of protein structures in which successive tokens contribute increasing levels of detail to a global representation. This change resolves several issues with generative models based on local protein tokenization: it mitigates error accumulation, provides embeddings without sequence-reduction operations, and allows task-specific adaptation of a tokenized sequence's information content. We validate our method on reconstruction, generative, and representation tasks and demonstrate that it matches or outperforms existing models based on local protein structure tokenizers. We show how adaptive tokens enable inference criteria based on information content, which boosts designability. We validate representations generated from our tokenizer on CATH classification tasks and demonstrate that non-linear probing on our tokenized sequences outperforms equivalent probing on representations from other tokenizers. Finally, we demonstrate how our method supports zero-shot protein shrinking and affinity maturation.

2026-02-06T06:15:14Z Rohit Dilip Ayush Varshney David Van Valen http://arxiv.org/abs/2602.04883v1 Protein Autoregressive Modeling via Multiscale Structure Generation 2026-02-04T18:59:49Z

We present protein autoregressive modeling (PAR), the first multi-scale autoregressive framework for protein backbone generation via coarse-to-fine next-scale prediction. Using the hierarchical nature of proteins, PAR generates structures that mimic sculpting a statue, forming a coarse topology and refining structural details over scales. To achieve this, PAR consists of three key components: (i) multi-scale downsampling operations that represent protein structures across multiple scales during training; (ii) an autoregressive transformer that encodes multi-scale information and produces conditional embeddings to guide structure generation; (iii) a flow-based backbone decoder that generates backbone atoms conditioned on these embeddings. Moreover, autoregressive models suffer from exposure bias, caused by the training and the generation procedure mismatch, and substantially degrades structure generation quality. We effectively alleviate this issue by adopting noisy context learning and scheduled sampling, enabling robust backbone generation. Notably, PAR exhibits strong zero-shot generalization, supporting flexible human-prompted conditional generation and motif scaffolding without requiring fine-tuning. On the unconditional generation benchmark, PAR effectively learns protein distributions and produces backbones of high design quality, and exhibits favorable scaling behavior. Together, these properties establish PAR as a promising framework for protein structure generation.

2026-02-04T18:59:49Z ByteDance Seed Tech Report; Page: https://par-protein.github.io/ Yanru Qu Cheng-Yen Hsieh Zaixiang Zheng Ge Liu Quanquan Gu http://arxiv.org/abs/2512.03312v2 Unlocking hidden biomolecular conformational landscapes in diffusion models at inference time 2026-02-04T16:54:34Z

The function of biomolecules such as proteins depends on their ability to interconvert between a wide range of structures or "conformations." Researchers have endeavored for decades to develop computational methods to predict the distribution of conformations, which is far harder to determine experimentally than a static folded structure. We present ConforMix, an inference-time algorithm that enhances sampling of conformational distributions using a combination of classifier guidance, filtering, and free energy estimation. Our approach upgrades diffusion models -- whether trained for static structure prediction or conformational generation -- to enable more efficient discovery of conformational variability without requiring prior knowledge of major degrees of freedom. ConforMix is orthogonal to improvements in model pretraining and would benefit even a hypothetical model that perfectly reproduced the Boltzmann distribution. Remarkably, when applied to a diffusion model trained for static structure prediction, ConforMix captures structural changes including domain motion, cryptic pocket flexibility, and transporter cycling, while avoiding unphysical states. Case studies of biologically critical proteins demonstrate the scalability, accuracy, and utility of this method.

2025-12-02T23:52:05Z Project page: https://github.com/drorlab/conformix NeurIPS 2025 Daniel D. Richman Jessica Karaguesian Carl-Mikael Suomivuori Ron O. Dror