<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.0 20040830//EN" "journalpublishing.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="2.0" xml:lang="en" article-type="news"><front><journal-meta><journal-id journal-id-type="nlm-ta">J Med Internet Res</journal-id><journal-id journal-id-type="publisher-id">jmir</journal-id><journal-id journal-id-type="index">1</journal-id><journal-title>Journal of Medical Internet Research</journal-title><abbrev-journal-title>J Med Internet Res</abbrev-journal-title><issn pub-type="epub">1438-8871</issn><publisher><publisher-name>JMIR Publications</publisher-name><publisher-loc>Toronto, Canada</publisher-loc></publisher></journal-meta><article-meta><article-id pub-id-type="publisher-id">v28i1e101366</article-id><article-id pub-id-type="doi">10.2196/101366</article-id><article-categories><subj-group subj-group-type="heading"><subject>News and Perspectives</subject></subj-group></article-categories><title-group><article-title>From Virtual Molecules to Clinical Trials: How AI Is Reshaping Preclinical Drug Discovery</article-title></title-group><contrib-group><contrib contrib-type="author"><name name-style="western"><surname>Cuffari</surname><given-names>Benedette</given-names></name><role>JMIR correspondent</role></contrib></contrib-group><contrib-group><contrib contrib-type="editor"><name name-style="western"><surname>Clegg</surname><given-names>Kayleigh-Ann</given-names></name></contrib></contrib-group><pub-date pub-type="collection"><year>2026</year></pub-date><pub-date pub-type="epub"><day>29</day><month>5</month><year>2026</year></pub-date><volume>28</volume><elocation-id>e101366</elocation-id><history><date date-type="received"><day>14</day><month>05</month><year>2026</year></date><date date-type="accepted"><day>14</day><month>05</month><year>2026</year></date></history><copyright-statement>&#x00A9; JMIR Publications. Originally published in the Journal of Medical Internet Research (<ext-link ext-link-type="uri" xlink:href="https://www.jmir.org">https://www.jmir.org</ext-link>), 29.5.2026. </copyright-statement><copyright-year>2026</copyright-year><license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (<ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on <ext-link ext-link-type="uri" xlink:href="https://www.jmir.org/">https://www.jmir.org/</ext-link>, as well as this copyright and license information must be included.</p></license><self-uri xlink:type="simple" xlink:href="https://www.jmir.org/2026/1/e101366"/><abstract><p>Pharmaceutical development is typically a long and arduous process, but advances in artificial intelligence may help streamline each stage of this pipeline. In this <italic>News and Perspectives</italic> article, JMIR Correspondent Benedette Cuffari reports on recent innovations and their potential implications for drug discovery.</p></abstract><kwd-group><kwd>artificial intelligence in drug discovery</kwd><kwd>machine learning in drug design</kwd><kwd>generative chemistry</kwd><kwd>de novo drug design</kwd><kwd>computational drug discovery</kwd></kwd-group></article-meta></front><body><boxed-text id="IB1"><p><bold>Key Takeaways:</bold></p><list list-type="bullet"><list-item><p>Artificial intelligence is transforming preclinical drug discovery by enabling more efficient identification, design, and optimization of drug candidates through machine learning, deep learning, and generative modeling approaches.</p></list-item><list-item><p>Virtual screening, molecular docking, and de novo design methods are increasingly integrated into drug discovery pipelines to improve prediction of drug-target interactions and accelerate lead optimization.</p></list-item></list></boxed-text><p>From &#x201C;bench to bedside,&#x201D; drug discovery and development is a <ext-link ext-link-type="uri" xlink:href="https://pharmrev.aspetjournals.org/article/S0031-6997(25)07511-8/abstract">highly complex process </ext-link>that typically takes 10 to 15 years or more before a new molecule reaches the market. Drug development typically begins with the identification and validation of a biological target, such as a disease-associated receptor, enzyme, or signaling protein. <ext-link ext-link-type="uri" xlink:href="https://www.cell.com/molecular-therapy-family/nucleic-acids/fulltext/S2162-2531(24)00182-3">Thereafter</ext-link>, candidate molecules are subjected to in vitro high-throughput screens (HTS) to optimize drug activity, target affinity, and toxicity profiles before a compound undergoes extensive preclinical and clinical testing. Collectively, the development of a single drug molecule is valued at an average of US $2.6 billion, with these high costs attributed to the sequential nature of each stage.</p><p>Recent advancements in artificial intelligence (AI) technologies are enabling faster, data-driven identification and optimization of drug candidates, with the potential to reduce both development time and costs. These advancements are transforming every stage of drug discovery, from target identification to molecule design and optimization.</p><sec id="s1"><title>Machine Learning in Drug Discovery</title><p>Machine learning (ML) is a subset of AI that enables systems to learn labeled or unlabeled data from chemical libraries, biological assays, and clinical trials; predict outcomes; and generate actionable insights. <ext-link ext-link-type="uri" xlink:href="https://pubs.acs.org/doi/10.1021/acsomega.5c00549">ML methods</ext-link> were originally introduced into the drug discovery process in the 1980s and 1990s, during which random forests, support vector machines (SVMs), and other ML algorithms were utilized to model layered relationships between molecular structure and biological activity based on learned datasets.</p><p>By the early 2000s, deep learning (DL) algorithms capable of processing vast amounts of data enhanced the ability of these systems to predict structural and molecular properties. Specifically, biological and chemical data acquired from HTS and other multiparameter analyses are analyzed to model and predict drug-target interactions. Modern DL tools involved in drug discovery include graph neural networks&#x2014;a powerful molecular modeling system that predicts solubility and protein-ligand interactions&#x2014;whereas transformer architectures model long-range dependencies and contextual relationships during target/drug prioritization.</p><p>About 90% of drug candidates fail during phase I, II, or III clinical trials, with unsuccessful outcomes primarily attributed to a lack of clinical efficacy, as well as unmanageable toxicity, suboptimal drug-like properties, and poor strategic planning. To improve the safety of new medications, AI models, like quantitative structure-activity relationship (QSAR) modeling and DL, have been developed to assess a wide range of absorption, distribution, metabolism, excretion, and toxicity features. <ext-link ext-link-type="uri" xlink:href="https://academic.oup.com/bib/article/25/2/bbae008/7611273?login=false&#x0026;guestAccessKey=">ChemMORT</ext-link>, for example, is a recent DL-based analytical platform that relies on a simplified molecular input line entry system (SMILES) encoder, descriptive decoder, and molecular optimizer to better predict how drug candidates will behave within biological systems.</p></sec><sec id="s2"><title>Why Drug Development Remains Expensive</title><p>Despite recent developments in computing power, pharmaceutical innovation has not become proportionally faster or cheaper. This contrast is often described as Eroom&#x2019;s Law&#x2014;the observation that drug development becomes more expensive over time, even as digital technologies rapidly advance.</p><p>Mark Gerstein, PhD&#x2014;professor of biomedical informatics, molecular biophysics, and computer science at Yale University&#x2014;said that this paradox helps explain why enthusiasm around AI should be tempered with realism. &#x201C;There is a lot of excitement with AI in drug discovery,&#x201D; Gerstein said, &#x201C;but the reality is that we have not been as successful at discovering drugs as we would have hoped.&#x201D;</p><p>Computers continue to improve under Moore&#x2019;s Law&#x2014;the approximate doubling of transistors on microchips every 2 years&#x2014;and the cost of genome sequencing has fallen dramatically. Nevertheless, these efficiencies have not eliminated the biological and clinical challenges that drive failure rates and development costs.</p><fig position="float" id="figureWL1"><graphic alt-version="no" mimetype="image" position="float" xlink:type="simple" xlink:href="jmir_v28i1e101366_fig01.png"/></fig></sec><sec id="s3"><title>Structure-Based AI Design</title><p>Generative chemistry, otherwise known as &#x201C;drug design,&#x201D; involves the creation of novel molecular compounds intended to be stable, valid, and unique drugs with pharmaceutical potential. Using both ML and DL algorithms, virtual screening is frequently involved in the early stages of drug development to learn the structures and properties of enormous compound libraries and predict their potential interactions with a specified drug target.</p><p>Structure-based virtual screening (SBVS), for example, leverages molecular docking to identify drug candidates based on their predicted interactions with a target protein. <ext-link ext-link-type="uri" xlink:href="https://jopcr.com/articles/revolutionizing-drug-design-with-artificial-intelligence-a-comprehensive-review-of-techniques-applications-and-case-studies">ML algorithms</ext-link>, like SVMs and random forest models, predict ligand binding within the protein&#x2019;s active site, allowing medicinal chemists to rationally modify functional groups to improve binding affinity.</p><p>Gerstein also points to equivariant neural networks as a promising advance in AI-driven drug discovery. Since molecules obey physical laws and can rotate, stretch, or change their conformation, predictive models must account for these transformations. Equivariant architectures are designed to preserve these relationships, making them especially valuable for molecular modeling and binding prediction.</p><p>Compared with ML-based methods used to create drug-like candidates based on existing chemical libraries, <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/bib/bbae338">de novo</ext-link><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/bib/bbae338"> design</ext-link> can be applied to generate new compounds with specific potency and selectivity properties. To this end, generative models, like variational autoencoders and generative adversarial networks (GANs), can be trained on a dataset of known molecules to learn their structural properties and generate molecules with similar properties. Reinforcement learning algorithms are also used during de novo molecule design to optimize the chemical structure of a new drug based on its pharmacological properties, with several hybrid approaches emerging that integrate different aspects of each model.</p></sec><sec id="s4"><title>AI-Driven Drug Discovery Platforms and Clinical Translation Partnerships</title><p>Exscientia, Insilico Medicine, Recursion, BenevolentAI, and Schr&#x00F6;dinger are five leading AI-derived drug discovery companies that have <ext-link ext-link-type="uri" xlink:href="https://www.sciencedirect.com/science/article/pii/S2211383522000521?via%3Dihub">successfully advanced novel small-molecule drug candidates into the clinic</ext-link>. Both rentosertib (ISM001-055)&#x2014;a pulmonary fibrosis candidate developed by Insilico Medicine&#x2014;and DSP-1181&#x2014;a serotonin (5-HT1A) receptor agonist developed by Exscientia&#x2014;were designed by using AI-driven platforms before advancing to early clinical testing.</p><p>BenevolentAI and AstraZeneca <ext-link ext-link-type="uri" xlink:href="https://www.astrazeneca.com/media-centre/press-releases/2019/astrazeneca-starts-artificial-intelligence-collaboration-to-accelerate-drug-discovery-30042019.html#">joined forces in 2019</ext-link> to identify novel targets in chronic kidney disease and idiopathic pulmonary fibrosis by using AI-driven knowledge graphs. Collectively, these partnerships highlight that drug discovery is shifting from isolated computational innovation toward integrated pipelines in which AI platforms and pharmaceutical companies drive target identification, molecule design, and clinical translation.</p></sec><sec id="s5"><title>Challenges and Limitations</title><p>The integration of AI techniques into drug design has several advantages over traditional discovery methods, including reduced time and costs, as well as better accuracy in predicting the safety and efficacy of new drugs. Nevertheless, the lack of diverse, representative, and high-quality datasets may introduce bias that leads to inaccurate predictions. In fact, many available chemical and biological datasets are incomplete or are acquired from heterogeneous experimental conditions, thereby reducing model generalizability and applicability to new chemical structures.</p><p>Trust is also imperative, underscoring the importance of transparency through open-source models, shared datasets, and explainable frameworks. However, AI does not eliminate downstream failures in clinical development. Gerstein cautions that once a promising molecule is identified, substantial challenges remain in optimizing delivery, systemic effects, and safety. Biological complexity, patient heterogeneity, and unforeseen toxicity mechanisms perpetuate high attrition rates, highlighting the need for AI systems that integrate more robust biological clinical context.</p></sec></body><back/></article>