Software-Aided Prediction of Key Peptide Properties Using LC–MS Data
June 2026, ASMS Conference
Paula Cifuentes1, 2, 3; Ramon Adàlia2, 3, 4; Lisa A.Vasicek5; Richard Gundersdorf5; Abigail Wheeler5; Paul Harradine5; Ismael Zamora3
1Universitat Pompeu Fabra, Barcelona, Spain; 2Lead Molecular Design, SL, Sant Cugat del Vallès, Spain; 3Mass Analytica, S.L., Sant Cugat del Vallés, Spain; 4Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain; 5Merck & Co., Inc., West Point, PA
Abstract
Introduction
Peptides have emerged as promising therapeutic agents due to their high specificity, favorable safety profiles, and cost-effective synthesis. However, their clinical development is limited by low oral bioavailability and short half-lives. These challenges arise from high clearance rates, poor solubility, limited membrane permeability, and reduced metabolic stability caused by peptidase activity and modulated by post-translational modifications. Deficiencies in any of these properties can significantly impact peptide’s therapeutic efficacy. Consequently, in silico prediction tools have become increasingly important in the pharmaceutical industry, enabling early identification and elimination of unsuitable peptide drug candidates. Despite recent advances, existing tools are often limited to natural amino acids, cannot process cyclic peptides, and lack customization to user-specific experimental data, highlighting the need for further development.
Methods
The methodology defines a new workflow that integrates LC-MS data from peptide metabolism studies with a Graphormer-based machine learning model to predict five key peptide properties: potential cleavage sites, half-life, permeability, solvent accessibility, and post-translational modifications. The methodology operates without structural constraints, allowing cyclic peptides, and modified amino acids. The models employ transformer architecture with added mechanisms to encode graph structural information. Users can train models with their own LC-MS experimental data for improved alignment with specific peptides and continuously update them via a self-learning approach. The five selected end points predictive models have been compared to the state-of-the art tools. Additionally, the site of cleavage model and half-life models were validated using experimental MetID data from a pharmaceutical company.
You must be logged in to access this content. Not yet registered? Create a new account
