Artificial Intelligence in Healthcare and Education Open access Peer reviewed

Ambient AI Scribes in Clinical Practice: A Randomized Trial

Paul J. Lukac, Sitaram Vangala, Aaron Chin, Joshua Khalili and 4 more

NEJM AI | Nov 26, 2025 | 35 citations

Scollr summary

What this paper is about

Both DAX and Nabla resulted in potential improvements in burnout, task load, and work exhaustion, but these secondary end point findings need confirmation in larger, multicenter trials.

Full abstract

Read the full abstract

BACKGROUND: Ambient artificial intelligence (AI) scribes record patient encounters and rapidly generate visit notes, representing a promising solution to documentation burden and physician burnout. However, the scribes' impacts have not been examined in randomized clinical trials. METHODS: In this parallel three-group pragmatic randomized clinical trial, 238 outpatient physicians, representing 14 specialties, were assigned 1:1:1 via covariate-constrained randomization (balancing on time-in-note, baseline burnout score, and clinic days per week) to either one of two AI scribe applications - Microsoft Dragon Ambient eXperience (DAX) Copilot or Nabla - or a usual-care control group from November 4, 2024, to January 3, 2025. The primary outcome was the change from baseline log writing time-in-note. Secondary end points measured by surveys included the Mini-Z 2.0, a four-item physician task load (PTL), and Professional Fulfillment Index - Work Exhaustion (PFI-WE) scores to evaluate aspects of burnout; work environment; stress; and targeted questions addressing safety, accuracy, and usability. RESULTS: DAX was used in 33.5% of 24,696 visits; Nabla was used in 29.5% of 23,653 visits. Nabla users experienced a 9.5% (95% confidence interval [CI], -17.2% to -1.8%; P=0.02) decrease in time-in-note versus the control group, whereas DAX users exhibited no significant change versus the control group (-1.7%; 95% CI, -9.4% to +5.9%; P=0.66). Increases in total Mini-Z (scale 10-50; DAX 2.83 [95% CI, +1.28 to +4.37]; Nabla +2.69 [95% CI, +1.14 to +4.23]) and reductions in PTL (scale 0-400; DAX -39.9 [95% CI, -71.9 to -7.9]; Nabla -31.7 [95% CI, -63.8 to +0.4]), and PFI-WE (scale 0-4; DAX 0.32 [95% CI,-0.55 to -0.08]; Nabla -0.23 [95% CI, -0.46 to +0.01]) scores suggest improvement for users of either scribe versus the control. One grade 1 (mild) adverse event was reported, while clinically significant inaccuracies were noted "occasionally" on five-point Likert questions (DAX 2.7 [95% CI, 2.4 to 3.0]; Nabla 2.8 [95% CI, 2.6 to 3.0]). CONCLUSIONS: Nabla reduced time-in-note versus the control. Both DAX and Nabla resulted in potential improvements in burnout, task load, and work exhaustion, but these secondary end point findings need confirmation in larger, multicenter trials. Clinicians reported that performance was similar across the two distinct platforms, and occasional inaccuracies observed in either scribe require ongoing vigilance. (Funded by the University of California, Los Angeles, Department of Medicine and others; ClinicalTrials.gov number, NCT06792890.).

Direct answer

What can I do from this paper page?

Use this page to scan "Ambient AI Scribes in Clinical Practice: A Randomized Trial" quickly: start with the summary and abstract, then check the authors, source, topics, and related papers. From here, open Scollr to follow Artificial Intelligence in Healthcare and Education research, save the paper, or map adjacent work.

Authors

Researchers on this paper

Paul J. Lukac

first | University of California, Los Angeles | ORCID 0000-0002-3315-6613

Sitaram Vangala

middle | University of California, Los Angeles

Aaron Chin

middle | University of California, Los Angeles | ORCID 0000-0001-9490-6580

Joshua Khalili

middle | University of California, Los Angeles | ORCID 0009-0009-3559-4332

Ya‐Chen Tina Shih

middle | University of California, Los Angeles | ORCID 0000-0001-7290-3864

Catherine A. Sarkisian

middle | West Los Angeles College

Eric M. Cheng

middle | University of California, Los Angeles

John N. Mafi

last | RAND Corporation | ORCID 0000-0002-0322-7636

Research areas

Follow related topics

Latest Digital Mental Health Interventions research Latest Artificial Intelligence in Healthcare and Education research Latest Explainable Artificial Intelligence (XAI) research

Citation

BibTeX

@article{Lukac2025Ambient,
  title = {Ambient AI Scribes in Clinical Practice: A Randomized Trial},
  author = {Paul J. Lukac and Sitaram Vangala and Aaron Chin and Joshua Khalili and Ya‐Chen Tina Shih and Catherine A. Sarkisian and Eric M. Cheng and John N. Mafi},
  journal = {NEJM AI},
  year = {2025},
  doi = {10.1056/aioa2501000},
  url = {https://doi.org/10.1056/aioa2501000}
}

FAQ

Using this paper in a discovery workflow

How do I find related work for this paper?

Use the related papers and topic links on this page as starting points. In Scollr, you can also open the paper and build a literature map around its references, citing papers, and related work.

How can I keep up with new Artificial Intelligence in Healthcare and Education research papers?

Follow Artificial Intelligence in Healthcare and Education research in Scollr. New papers from the topic flow into a personalized feed, and you can save useful studies to revisit later.

Can I cite this paper from this page?

This page includes a static BibTeX block for Ambient AI Scribes in Clinical Practice: A Randomized Trial. Always verify the DOI, source, and publication details against the publisher record before submitting a manuscript.

Follow this research in Scollr

Follow the topics and authors behind this paper, save useful studies, and build a literature map when you are ready to go deeper.

Get the app