Danielle Belgrave on Generative AI in Pharma and Drugs – O’Reilly

May 29, 2025

8

Danielle Belgrave on Generative AI in Pharma and Drugs – O’Reilly

Be part of Danielle Belgrave and Ben Lorica for a dialogue of AI in healthcare. Danielle is VP of AI and machine studying at GSK (previously GlaxoSmithKline). She and Ben talk about utilizing AI and machine studying to get higher diagnoses that replicate the variations between sufferers. Hear in to be taught concerning the challenges of working with well being information—a discipline the place there’s each an excessive amount of information and too little, and the place hallucinations have severe penalties. And for those who’re enthusiastic about healthcare, you’ll additionally learn the way AI builders can get into the sector.

Try different episodes of this podcast on the O’Reilly studying platform.

Concerning the Generative AI within the Actual World podcast: In 2023, ChatGPT put AI on everybody’s agenda. In 2025, the problem shall be turning these agendas into actuality. In Generative AI within the Actual World, Ben Lorica interviews leaders who’re constructing with AI. Study from their expertise to assist put AI to work in your enterprise.

Factors of Curiosity

0:00: Introduction to Danielle Belgrave, VP of AI and machine studying at GSK. Danielle is our first visitor representing Large Pharma. Will probably be attention-grabbing to see how individuals in pharma are utilizing AI applied sciences.
0:49: My curiosity in machine studying for healthcare started 15 years in the past. My PhD was on understanding affected person heterogeneity in asthma-related illness. This was earlier than digital healthcare information. By leveraging completely different varieties of knowledge, genomics information and biomarkers from youngsters, and seeing how they developed bronchial asthma and allergic ailments, I developed causal modeling frameworks and graphical fashions to see if we might establish who would reply to what therapies. This was fairly novel on the time. We recognized 5 various kinds of bronchial asthma. If we will perceive heterogeneity in bronchial asthma, an even bigger problem is knowing heterogeneity in psychological well being. The thought was making an attempt to know heterogeneity over time in sufferers with anxiousness.
4:12: After I went to DeepMind, I labored on the healthcare portfolio. I grew to become very interested by learn how to perceive issues like MIMIC, which had digital healthcare information, and picture information. The thought was to leverage instruments like energetic studying to reduce the quantity of knowledge you’re taking from sufferers. We additionally revealed work on bettering the range of datasets.
5:19: After I got here to GSK, it was an thrilling alternative to do each tech and well being. Well being is without doubt one of the most difficult landscapes we will work on. Human biology could be very difficult. There’s a lot random variation. To grasp biology, genomics, illness development, and have an effect on how medication are given to sufferers is superb.
6:15: My function is main AI/ML for scientific improvement. How can we perceive heterogeneity in sufferers to optimize scientific trial recruitment and ensure the proper sufferers have the proper remedy?
6:56: The place does AI create essentially the most worth throughout GSK at present? That may be each conventional AI and generative AI.
7:23: I take advantage of all the things interchangeably, although there are distinctions. The true vital factor is specializing in the issue we are attempting to resolve, and specializing in the information. How will we generate information that’s significant? How will we take into consideration deployment?
8:07: And all of the Q&A and pink teaming.
8:20: It’s arduous to place my finger on what’s essentially the most impactful use case. After I consider the issues I care about, I take into consideration oncology, pulmonary illness, hepatitis—these are all very impactful issues, and so they’re issues that we actively work on. If I had been to spotlight one factor, it’s the interaction between after we are taking a look at entire genome sequencing information and taking a look at molecular information and making an attempt to translate that into computational pathology. By taking a look at these information sorts and understanding heterogeneity at that degree, we get a deeper organic illustration of various subgroups and perceive mechanisms of motion for response to medication.
9:35: It’s not scalable doing that for people, so I’m concerned about how we translate throughout differing types or modalities of knowledge. Taking a biopsy—that’s the place we’re getting into the sector of synthetic intelligence. How will we translate between genomics and taking a look at a tissue pattern?
10:25: If we consider the impression of the scientific pipeline, the second instance could be utilizing generative AI to find medication, goal identification. These are sometimes in silico experiments. Now we have perturbation fashions. Can we perturb the cells? Can we create embeddings that may give us representations of affected person response?
11:13: We’re producing information at scale. We need to establish targets extra shortly for experimentation by rating likelihood of success.
11:36: You’ve talked about multimodality quite a bit. This contains pc imaginative and prescient, photos. What different modalities?
11:53: Textual content information, well being information, responses over time, blood biomarkers, RNA-Seq information. The quantity of knowledge that has been generated is kind of unimaginable. These are all completely different information modalities with completely different constructions, other ways of correcting for noise, batch results, and understanding human techniques.
12:51: Whenever you run into your former colleagues at DeepMind, what sorts of requests do you give them?
13:14: Neglect concerning the chatbots. Loads of the work that’s occurring round massive language fashions—pondering of LLMs as productiveness instruments that may assist. However there has additionally been a whole lot of exploration round constructing bigger frameworks the place we will do inference. The problem is round information. Well being information could be very sparse. That’s one of many challenges. How will we fine-tune fashions to particular options or particular illness areas or particular modalities of knowledge? There’s been a whole lot of work on basis fashions for computational pathology or foundations for single cell construction. If I had one want, it will be taking a look at small information and the way do you’ve gotten sturdy affected person representations when you’ve gotten small datasets? We’re producing massive quantities of knowledge on small numbers of sufferers. It is a massive methodological problem. That’s the North Star.
15:12: Whenever you describe utilizing these basis fashions to generate artificial information, what guardrails do you place in place to stop hallucination?
15:30: We’ve had a accountable AI workforce since 2019. It’s vital to think about these guardrails particularly in well being, the place the rewards are excessive however so are the stakes. One of many issues the workforce has applied is AI rules, however we additionally use mannequin playing cards. Now we have policymakers understanding the results of the work; we even have engineering groups. There’s a workforce that appears exactly at understanding hallucinations with the language mannequin we’ve constructed internally, known as Jules.¹ There’s been a whole lot of work taking a look at metrics of hallucination and accuracy for these fashions. We additionally collaborate on issues like interpretability and constructing reusable pipelines for accountable AI. How can we establish the blind spots in our evaluation?
17:42: Final yr, lots of people began doing fine-tuning, RAG, and GraphRAG; I assume you do all of those?
18:05: RAG occurs quite a bit within the accountable AI workforce. Now we have constructed a information graph. That was one of many earliest information graphs—earlier than I joined. It’s maintained by one other workforce in the intervening time. Now we have a platforms workforce that offers with all of the scaling and deploying throughout the corporate. Instruments like information graph aren’t simply AI/ML. Additionally Jules—it’s maintained outdoors AI/ML. It’s thrilling once you see these options scale.
20:02: The buzzy time period this yr is brokers and even multi-agents. What’s the state of agentic AI inside GSK?
20:18: We’ve been engaged on this for fairly some time, particularly throughout the context of huge language fashions. It permits us to leverage a whole lot of the information that now we have internally, like scientific information. Brokers are constructed round these datatypes and the completely different modalities of questions that now we have. We’ve constructed brokers for genetic information or lab experimental information. An orchestral agent in Jules can mix these completely different brokers in an effort to draw inferences. That panorama of brokers is basically vital and related. It provides us refined fashions on particular person questions and varieties of modalities.
21:28: You alluded to customized medication. We’ve been speaking about that for a very long time. Are you able to give us an replace? How will AI speed up that?
21:54: It is a discipline I’m actually optimistic about. Now we have had a whole lot of impression; generally when you’ve gotten your nostril to the glass, you don’t see it. However we’ve come a great distance. First, by information: Now we have exponentially extra information than we had 15 years in the past. Second, compute energy: After I began my PhD, the truth that I had a GPU was superb. The size of computation has accelerated. And there was a whole lot of affect from science as effectively. There was a Nobel Prize for protein folding. Understanding of human biology is one thing we’ve pushed the needle on. Loads of the Nobel Prizes had been about understanding organic mechanisms, understanding primary science. We’re presently on constructing blocks in the direction of that. It took years to get from understanding the ribosome to understanding the mechanism for HIV.
23:55: In AI for healthcare, we’ve seen extra instant impacts. Simply the very fact of understanding one thing heterogeneous: If we each get a analysis of bronchial asthma, that may have completely different manifestations, completely different triggers. That understanding of heterogeneity in issues like psychological well being: We’re completely different; issues must be handled in a different way. We even have the ecosystem, the place we will have an effect. We are able to impression scientific trials. We’re within the pipeline for medication.
25:39: One of many items of labor we’ve revealed has been round understanding variations in response to the drug for hepatitis B.
26:01: You’re within the UK, you’ve gotten the NHS. Within the US, we nonetheless have the information silo drawback: You go to your main care, after which a specialist, and so they have to speak utilizing information and fax. How can I be optimistic when techniques don’t even discuss to one another?
26:36: That’s an space the place AI will help. It’s not an issue I work on, however how can we optimize workflow? It’s a techniques drawback.
26:59: All of us affiliate information privateness with healthcare. When individuals discuss information privateness, they get sci-fi, with homomorphic encryption and federated studying. What’s actuality? What’s in your every day toolbox?
27:34: These instruments aren’t essentially in my every day toolbox. Pharma is closely regulated; there’s a whole lot of transparency across the information we gather, the fashions we constructed. There are platforms and techniques and methods of ingesting information. When you’ve got a collaboration, you typically work with a trusted analysis atmosphere. Information doesn’t essentially depart. We do evaluation of knowledge of their trusted analysis atmosphere, we make sure that all the things is privateness preserving and we’re respecting the guardrails.
29:11: Our listeners are primarily software program builders. They might surprise how they enter this discipline with none background in science. Can they simply use LLMs to hurry up studying? When you had been making an attempt to promote an ML developer on becoming a member of your workforce, what sort of background do they want?
29:51: You want a ardour for the issues that you simply’re fixing. That’s one of many issues I like about GSK. We don’t know all the things about biology, however now we have excellent collaborators.
30:20: Do our listeners must take biochemistry? Natural chemistry?
30:24: No, you simply want to speak to scientists. Get to know the scientists, hear their issues. We don’t work in silos as AI researchers. We work with the scientists. Loads of our collaborators are medical doctors, and have joined GSK as a result of they need to have an even bigger impression.

Footnotes

To not be confused with Google’s latest agentic coding announcement.

Danielle Belgrave on Generative AI in Pharma and Drugs – O’Reilly

Factors of Curiosity

Footnotes

Related Articles

Optimizing LLM-based journey planning

14 AI Conferences that You Cannot Afford to Miss in 2025

Licensed and Unstoppable: Recertify with Rev Up

LEAVE A REPLY Cancel reply

Latest Articles

Optimizing LLM-based journey planning

14 AI Conferences that You Cannot Afford to Miss in 2025

Licensed and Unstoppable: Recertify with Rev Up

🚁 Drone-Photos presents: Chic Provence – Opus 1 🇫🇷🌅

Unlocking exact composition evaluation of nanomedicines

ABOUT US