Towards Conversational AI for Disease Management

Abstract

While large language models (LLMs) have shown promise in diagnostic dialogue¹, their capabilities for effective management reasoning—including disease progression, therapeutic response, and safe medication prescription—remain under-explored. We advance the previously demonstrated diagnostic capabilities of the Articulate Medical Intelligence Explorer (AMIE)¹⁻³ through a new LLM-based agentic system optimized for multi-visit clinical management and dialogue. To ground its reasoning in authoritative clinical knowledge, AMIE leverages Gemini’s long-context capabilities⁴, combining in-context retrieval with structured reasoning to align its output with up-to-date clinical practice guidelines and drug formularies. In a randomized, blinded virtual Objective Structured Clinical Examination (OSCE) study, AMIE was compared to 21 primary care physicians (PCPs) across 100 multi-visit case scenarios designed to reflect UK NICE Guidance and BMJ Best Practice guidelines. AMIE was non-inferior to PCPs in management reasoning as assessed by specialists and scored better in both preciseness of treatments and investigations, and in its alignment with and grounding in clinical guidelines. To benchmark medication reasoning, we developed RxQA, a multiple-choice question benchmark derived from two national drug formularies (US, UK) and validated by board-certified pharmacists. Though AMIE and PCPs both benefited from the ability to access external drug information, AMIE outperformed PCPs on higher difficulty questions. While further research would be needed before real-world translation, AMIE’s strong performance across evaluations marks a significant step towards conversational AI as a tool in disease management.

Access through your institution

Buy or subscribe

Access through your institution

Access Nature and 54 other Nature Portfolio journals

Get Nature+, our best-value online-access subscription

$32.99 / 30 days

cancel any time

Learn more

Subscribe to this journal

Receive 52 print issues and online access

$199.00 per year

only $3.83 per issue

Learn more

Rent or buy this article

Prices vary by article type

from$1.95

to$39.95

Learn more

Prices may be subject to local taxes which are calculated during checkout

Author information

Author notes

These authors jointly supervised this work: Alan Karthikesalingam, Mike Schaekermann
These authors contributed equally: Valentin Liévin, Anil Palepu

Authors and Affiliations

Google DeepMind, Mountain View, California, USA
Valentin Liévin, Khaled Saab, David Stutz, Yong Cheng, S. Sara Mahdavi, Joëlle Barral, Ryutaro Tanno & Tao Tu
Google Research, Mountain View, California, USA
Anil Palepu, Wei-Hung Weng, Kavita Kulkarni, Dale R. Webster, Katherine Chou, Avinatan Hassidim, Yossi Matias, James Manyika, Vivek Natarajan, Adam Rodman, Alan Karthikesalingam & Mike Schaekermann

Authors

Valentin Liévin
View author publications
Search author on:PubMed Google Scholar
Anil Palepu
View author publications
Search author on:PubMed Google Scholar
Wei-Hung Weng
View author publications
Search author on:PubMed Google Scholar
Khaled Saab
View author publications
Search author on:PubMed Google Scholar
David Stutz
View author publications
Search author on:PubMed Google Scholar
Yong Cheng
View author publications
Search author on:PubMed Google Scholar
Kavita Kulkarni
View author publications
Search author on:PubMed Google Scholar
S. Sara Mahdavi
View author publications
Search author on:PubMed Google Scholar
Joëlle Barral
View author publications
Search author on:PubMed Google Scholar
Dale R. Webster
View author publications
Search author on:PubMed Google Scholar
Katherine Chou
View author publications
Search author on:PubMed Google Scholar
Avinatan Hassidim
View author publications
Search author on:PubMed Google Scholar
Yossi Matias
View author publications
Search author on:PubMed Google Scholar
James Manyika
View author publications
Search author on:PubMed