Protocol
Abstract
Background: Generative artificial intelligence (GenAI) leverages large language models (LLMs) that are transforming health care. Specialized ambient GenAI tools, like Nuance Dax, Speke, and Tandem Health, “listen” to consultations and generate clinical notes. Medical-focused models, like Med-PaLM, provide tailored health care insights. GenAI’s capability to summarize complex data and generate responses in various conversational styles or literacy levels makes it particularly valuable since it has the potential to alleviate the burden of clinical documentation on health care professionals (HCPs). While GenAI may prove to be helpful, offering novel benefits, it comes with its own set of challenges. The quality of the source data can introduce biases, leading to skewed recommendations or outright false information (so-called hallucinations). In addition, due to the conversational nature of chatbot responses, users may be susceptible to misinformation, posing risks to both safety and privacy. Therefore, careful implementation and rigorous oversight are essential to ensure accuracy, ethical integrity, and alignment with clinical standards. Despite these advances, currently, no review has investigated HCPs’ experiences and opinions about GenAI in clinical documentation. Yet, such a perspective is crucial to better understand how these technologies can be safely and ethically adopted and implemented in clinical practice.
Objective: We aim to present the protocol for a scoping review exploring HCPs’ experiences and opinions about GenAI and ambient scribes in clinical documentation.
Methods: This scoping review will be carried out following the methodological framework of Arksey and O’Malley and the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses for Scoping Reviews) checklist. Relevant papers will be searched for in PubMed, IEEE Xplore, APA PsycInfo, CINAHL, and Web of Science. The review will include studies published between January 2023 and September 2025. Studies will be included that represent original peer-reviewed work that explores HCPs’ experiences and opinions about the use of GenAI or ambient scribes for clinical documentation. Data extraction will include publication type, country, sample characteristics, clinical setting, study aim, study design, research question, and key findings. Study quality will be assessed using the Mixed Methods Appraisal Tool.
Results: The results will be presented as a narrative synthesis structured along the key themes of the evidence mapped. Data will be collated and presented in charts and tabular format. Findings will be reported in a peer-reviewed scoping review.
Conclusions: This will be the first scoping review that considers HCPs’ experiences and opinions about GenAI and ambient scribes in clinical documentation. The results will clarify how HCPs use—or avoid using—GenAI in daily health care work. This insight will help address perceived benefits, risks, expectations, and uncertainties. It may also reveal key research gaps in the field.
International Registered Report Identifier (IRRID): PRR1-10.2196/73602
doi:10.2196/73602
Keywords
Introduction
Background
Generative artificial intelligence (GenAI), a subset of artificial intelligence (AI), leverages large language models (LLMs) to learn patterns from vast datasets and generate novel content [-].
In health care, the use of GenAI tools for clinical documentation is growing, primarily scribes and chatbots. Examples of ambient AI scribes, also known as ambient voice technology, in the market include Nuance Dragon Ambient eXperience by Microsoft [], Speke by ScribeAmerica [] and Tandem Health [], among others.
Chatbots range from general purpose models that are not clinically validated, such as OpenAI’s ChatGPT, to medical-specific ones, like Med-Palm, developed by Google Research [], which is pretrained on medical data and has undergone clinical benchmarking in clinical settings.
Applied to health care, these tools may help clinicians with documentation tasks and reduce administrative burdens, which are known drivers of burnout [-]. These tools may also improve accessibility for patients [,] by synthesizing complex data and writing responses in a desired conversational style or literacy level [,], ultimately improving the overall efficiency of health care services [,].
The adoption of GenAI in health care, however, also raises significant concerns [,]. AI outputs can be biased, potentially exacerbating ageism, racism, or gender discrimination in health care due to omissions and biases present in the training data used [-]. Such “algorithmic biases” can result in unfair suggestions and discriminatory advice. Even in passive use cases, such as ambient scribes that transcribe clinician-patient interactions, bias may emerge in how speech is transcribed, summarized, or prioritized, potentially misrepresenting certain dialects, accents, or phrasing more commonly used by specific demographic groups. This could affect the completeness or accuracy of documentation and introduce disparities into medical records. These tools may also generate dangerous medical advice and can produce incorrect information called “hallucinations” [,]. Users may be vulnerable to misinformation because of the compelling, rapid responses offered, which could further endanger privacy and safety [,].
Previous reviews have examined the perspectives of health care professionals (HCPs) on AI use in clinical settings, focusing on its roles in decision-making, efficiency, quality of care, and documentation systems [,]. Others have explored perceived benefits and concerns, such as bias, transparency, data privacy, and safety, as well as factors influencing AI adoption, including barriers and facilitators influencing AI adoption [-]. Additional review studies have investigated HCPs’ overall understanding and experiences with AI, mapping the broader implications of emerging AI-powered technologies across the health care landscape [,]. However, none of these reviews have specifically examined the lived experiences, perceptions, or opinions of HCPs, particularly frontline clinicians, regarding the use of GenAI in clinical documentation and decision-making.
Added to this gap, survey research indicates important trends and insights in relation to the adoption of technological tools for clinical documentation. Namely, HCPs both desire and expect technological tools to alleviate their administrative burdens [-].
Theoretical framing is also required to better explore barriers and facilitators to GenAI uptake in clinical documentation. For example, the technology acceptance model (TAM) provides one useful framework to examine this gap, highlighting perceived usefulness and ease of use as key factors influencing technology adoption [].
To address this theoretical and empirical gap, we have planned a scoping review to systematically map the current evidence on HCPs’ experiences and opinions of GenAI use in clinical documentation. This manuscript presents the protocol detailing the methodology and approach that will guide the review.
Study Objectives
This scoping review aims to explore HCPs’ perspectives on the use of GenAI tools, such as chatbots and ambient AI, specifically for clinical documentation, by synthesizing existing evidence on their experiences and opinions and identifying gaps to guide future research.
To achieve this aim, the following objectives have been developed: (1) to identify, collate, and summarize existing evidence on HCPs’ experiences of GenAI tools, including chatbots and ambient AI, on clinical documentation; (2) to assess HCPs’ opinions about, and perceived impact of, GenAI on documentation efficiency, quality, privacy, and the potential harms and benefits for HCPs and patients; and (3) to identify research gaps to inform future investigations into HCPs’ experiences and opinions on the use of GenAI and ambient AI for clinical documentation.
Methods
Scoping Review
A scoping review is well-suited for exploring emerging fields like GenAI in clinical documentation, as it maps the existing literature, identifies key themes, and highlights gaps in knowledge. Unlike a systematic review, which focuses on a narrowly defined question with strict inclusion criteria, a scoping review provides a broader overview, capturing diverse perspectives, methodologies, and contexts []. This approach is valuable for informing future research directions and policy considerations in a rapidly evolving technological landscape []. We will carry out a scoping review using the framework proposed by Arksey and O’Malley [], consisting of the following five stages: (1) identifying the research question, (2) identifying the relevant studies, (3) selecting eligible studies, (4) collecting data, and (5) summarizing data and synthesizing results. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews (PRISMA ScR) checklist will be used to report the review [,]. The final publication will emphasize any further changes made to the study design.
Stage 1: Identifying the Research Question
Based on the objectives and iterative discussions within the research team, we agreed on the primary research question (RQ) and 8 secondary RQs.
Primary RQ
What is known from the literature about the uptake, experiences, and opinions among HCPs regarding the use of GenAI, including ambient AI, in clinical documentation?
Secondary RQs
The eight secondary RQs are as follows:
- To what extent have HCPs adopted GenAI chatbots and ambient scribes for clinical documentation?
- What drives HCPs’ adoption of GenAI chatbots and ambient scribes?
- What are the key themes and recurring patterns in HCPs’ interactions with GenAI and ambient scribes in clinical documentation?
- How do HCPs perceive the benefits and challenges of using GenAI and ambient scribes in documentation?
- What ethical considerations and concerns do HCPs raise regarding the integration of GenAI and ambient scribes in documentation?
- Is the use of GenAI and ambient scribes for documentation associated with HCPs’ perceptions about efficiency, workflow, and job satisfaction?
- What gaps exist in the literature regarding HCPs’ experiences and opinions about GenAI and ambient AI in clinical documentation?
- How do HCPs’ experiences or opinions vary depending on the type of GenAI system or architecture (eg, consumer LLMs, ambient AI scribes, speech-to text, and voice-activated, domain-specific models) used in clinical documentation?
What Are Experiences and Opinions?
For the purposes of this review, we denote experiences as any direct, practical interaction with a GenAI tool available on the market that is used by an HCP for clinical documentation. We interpret opinions to refer to any expressed thoughts or perceptions about these tools, including views formed without direct use, as well as perceived implications or consequences. The initial definitions of “experience” and “opinion” are provided in .
Experiences
This refers to health care professionals’ direct, practical interactions with generative artificial intelligence tools: what they did, observed, or encountered while using them.
Opinions
This refers to health care professionals’ personal judgments, beliefs, or attitudes about the usefulness, effectiveness, or impact of these tools.
We used the SPIDER (sample, phenomenon of interest, design, evaluation, research type) [] framework to systematically formulate our research questions and guide the selection of relevant studies, as summarized in . During the preliminary test searches conducted for the scoping review, the research team observed that setting limitations on the study design within the SPIDER framework excluded relevant articles. These exclusions were due to several studies’ metadata either not identifying the design or providing insufficient detail. As a result, we decided to remove the “design” element from the search strategy. This modification maximizes the retrieval of relevant data by ensuring no design constraints are applied in the search string. The “design” element will be retained during screening and synthesis to manage heterogeneity and preserve focus.
| SPIDER dimension | Description |
| Sample | Health care professionals (eg, doctors, nurses, and dentists) |
| Phenomenon of interest | Use of GenAIa, including apps and ambient scribes, to write documentation |
| Design | No limitations during the search stage |
| Evaluation | Experiences and opinions |
| Research type | Qualitative methods, quantitative methods, or mixed methods |
aGenAI: generative artificial intelligence.
Stage 2: Identifying Relevant Studies
The PRISMA flow diagram [] outlines the planned multistage screening process for identifying relevant studies (). As this is a protocol, the flowchart includes placeholders marked “n=0” to indicate that specific data (eg, number of records screened or excluded) are not yet available, since the formal search and selection process has not been conducted. An experienced research librarian from Uppsala University, Sweden, will handle the deduplication procedure.

The search strategy was developed through an iterative process by CGS and the Uppsala University library. The first list of search phrases was compiled using seed references, and it was further refined through conversations between the library and CGS. It is composed of four main concepts: (1) HCPs, (2) GenAI, (3) clinical documentation, and (4) experiences and opinions. These concepts were combined with the Boolean AND and OR operators, as shown in . The literature search will be done in the following five databases: PubMed, IEEE Xplore, APA PsycInfo, CINAHL, and Web of Science. The search terms were tailored to different databases. The whole search query is available in .
Health care professional search string
“Health Personnel”[Mesh] OR “Medical staff”[Mesh] OR “health personnel”[Title/Abstract] OR “medical staff”[Title/Abstract] OR “healthcare professional*”[Title/Abstract] OR “health care professional*”[Title/Abstract] OR HCPs[Title/Abstract] OR “healthcare worker*”[Title/Abstract] OR “health care worker*”[Title/Abstract] OR “health care provider*”[Title/Abstract] OR “healthcare provider*”[Title/Abstract] OR physician*[Title/Abstract] OR nurs*[Title/Abstract] OR clinican*[Title/Abstract]
Generative artificial intelligence search string
“Artificial Intelligence”[Mesh] OR “Natural language processing”[Mesh] OR GenAI[Title/Abstract] OR chatbots[Title/Abstract] OR ChatGPT[Title/Abstract] OR “artificial intelligence”[Title/Abstract] OR AI-based[Title/Abstract] OR “natural language processing*”[Title/Abstract] OR “summarization tools”[Title/Abstract] OR “machine intelligence”[Title/Abstract] OR AI[Title/Abstract] OR “ambient intelligence”[Title/Abstract] OR “ambient scribes”[Title/Abstract]
Clinical documentation search string
“Documentation”[Mesh] OR “Medical records systems, computerized”[Mesh] OR “Electronic Health Records”[Mesh] OR “Health Records, Personal”[Mesh] OR “Patient Discharge”[Mesh] OR “Patient Discharge Summaries”[Mesh] OR documentation*[Title/Abstract] OR “medical record*”[Title/Abstract] OR “clinical note*”[Title/Abstract] OR “medical note*”[Title/Abstract] OR “clinical record*”[Title/Abstract] OR “patient discharge*”[Title/Abstract] OR “health record*”[Title/Abstract] OR “patient record*”[Title/Abstract]
Experiences and opinions search string
“Attitude of Health Personnel”[Mesh] OR “Attitude”[Mesh] OR experience*[Title/Abstract] OR opinion*[Title/Abstract] OR attitude*[Title/Abstract] OR efficiency[Title/Abstract] OR quality[Title/Abstract] OR “patient understanding”[Title/Abstract] OR “ethical implication*”[Title/Abstract] OR “privacy implication*”[Title/Abstract] OR perception*[Title/Abstract] OR reaction*[Title/Abstract] OR response*[Title/Abstract
Stage 3: Selecting Eligible Studies
Inclusion and Exclusion Criteria
The inclusion and exclusion criteria, as shown in , were defined by the entire research team and will be applied in the study selection process. The SPIDER framework was used to define the eligibility criteria for study inclusion and exclusion, organized by sample, phenomenon of interest, design, evaluation, and research type. Additional criteria included setting, time period, and language. This review will include formally published peer-reviewed journal articles and conference papers that have undergone peer-review, and it will exclude gray literature, opinion pieces, letters, protocols, reviews, and other publications that do not represent original research. Exceptions will be made for cases where a source includes original data along with adequate methodological and analytical details.
Inclusion criteria
- Sample: health care professionals (HCPs) as participants, including but not limited to physicians, nurses, dentists, and other clinical staff
- Phenomenon of interest: use of generative artificial intelligence (GenAI) and ambient scribes for clinical documentation
- Design: case studies, surveys, focus groups, and interviews; pilot programs, ethical and legal considerations, and longitudinal studies
- Evaluation: studies that explore HCPs’ experiences and opinions regarding the use of GenAI or ambient scribes for clinical documentation
- Research type: original, peer-reviewed studies presenting empirical data, including qualitative, quantitative, or mixed methods designs
- Setting: all medical disciplines, all health care settings, no location restrictions
- Time period: published between January 2023 and September 2025
- Language: studies published and available in English
Exclusion criteria
- Sample: HCPs not included as participants
- Phenomenon of interest: no use of GenAI or ambient scribes for clinical documentation
- Design: studies that do not use quantitative, qualitative, or mixed methods designs, or that focus solely on ethical, legal, or policy considerations without reporting HCPs’ experiences or opinions
- Evaluation: studies focused solely on the quantitative measurement or assessment of the effectiveness, accuracy, or performance of GenAI tools, without addressing HCPs’ experiences or opinions
- Research type: studies that are not empirical research, including reviews, such as systematic, scoping, and meta-analyses; gray literature, including opinion pieces, protocols, and reviews; and gray data, such as websites, tweets, and blogs
- Setting: not medical discipline or health care setting
- Time period: published before January 2023
- Language: studies published in a language other than English
Mixed methods studies will be included if they contain quantitative and/or qualitative data relevant to HCPs’ experiences or opinions on the use of GenAI tools for clinical documentation.
No geographic limitations will be applied to ensure a comprehensive overview of global perspectives. Due to resource constraints, the study will exclusively include articles published in English. The research team acknowledges that this approach limits the inclusion of studies conducted in different parts of the world and published in other languages. This will potentially introduce language bias and lead to the underrepresentation of diverse global perspectives. Efforts to mitigate this are described in the Strengths and Limitations subsection.
Study Selection Process
We will perform collaborative, blinded screening of abstracts and titles using Covidence (Veritas Health Innovation) []. Every member of the research team will take part in this procedure, and at least two people will review each record in the result set. The entire texts of the relevant studies will be considered when discussing discrepancies. A third reviewer will be involved and given the authority to decide whether to include or exclude the study in the event that differences cannot be settled.
Stage 4: Collecting Data
After selecting the studies to include, metadata (eg, title, authors, and publication year) of the remaining records will be exported and summarized to Google Sheets for further processing. The spreadsheet will be extended with the following parameters, based on the studies’ full texts: publication type, country, sample characteristics, clinical setting, study aim, study design, research question, and key findings. This will allow us to extract and organize relevant data from included studies. The results will be presented in diagrams or tabular format for a clear and comprehensive visualization. While no design restrictions will be applied during the search phase to ensure broad retrieval of relevant studies, study design will be evaluated during the screening and data charting processes. To manage potential heterogeneity, included studies will be grouped according to their methodological design (eg, qualitative, quantitative, or mixed methods) and analyzed within these categories during the synthesis phase. This approach allows us to preserve the comprehensiveness of the search while maintaining analytical clarity and relevance in reporting.
All members of the study team will participate in the data extraction process. Additionally, the first author (CGS) will verify that the data extraction is accurate and comprehensive. The Mixed Methods Appraisal Tool (MMAT) [] will be used to evaluate the studies’ quality and methodological rigor. All studies included will be graded by two researchers separately, and their findings will be approved by a third independent researcher who will be consulted if no consensus can be obtained.
Stage 5: Summarizing Data and Synthesizing Results
Narrative Synthesis
Study results will be extracted from the full texts by the lead author (CGS) and summarized into two formats: (1) a reduced format within a textbox, providing an overview of the findings from all included studies, and (2) a detailed narrative synthesis. The narrative data will be analyzed independently by at least two researchers using thematic analysis following the 6-phase framework of Braun and Clark [].
We will adopt a hybrid approach to theme development: initial deductive codes will be informed by the research questions, particularly focusing on HCPs’ experiences and opinions, but we will also remain open to inductive themes that emerge from the data during analysis.
At least two researchers will code the data independently using Google Sheets to facilitate coding, organization, and comparison. An initial coding framework will be developed and refined through iterative discussion. Intercoder reliability will be enhanced through regular consensus meetings, and discrepancies will be resolved collaboratively. Final themes will be reviewed and approved by the full research team to ensure consistency and analytical rigor.
We aim to identify patterns and relationships within and across studies, as suggested by Levac et al [], in order to explore HCPs’ perspectives on the potential of both GenAI apps and ambient AI tools to reduce documentation burdens in creating clinical notes. This analysis will examine associated benefits (eg, clarity, speed, and ability to translate complex medical information), risks (eg, mistakes, biases, and patient privacy), and opinions based on HCPs’ experiences. Ethical, legal, and privacy concerns will be coded as distinct themes to ensure focused attention on these sensitive issues. In assessing the methodological rigor of the studies, we also envisage the potential to identify research gaps; for example, we predict there may be perceived privacy and ethical concerns pertaining to the potential for errors and liability associated with these tools.
Assessing the Robustness of the Synthesis
The robustness of the narrative synthesis depends on the quality of the included studies as well as on the trustworthiness of the synthesis []. In order to minimize bias, we will conduct the study quality appraisal through MMAT [] to ensure that studies of equal technical quality are given equal weight.
Ethical Considerations
This study is not subject to ethical approval since only publicly available data material will be used within the scoping review methodology.
Results
The results of this scoping review will be presented in both narrative and tabular formats. The narrative synthesis will reflect key themes emerging from the literature on HCPs’ experiences and opinions regarding the use of GenAI and ambient AI tools in clinical documentation. These themes will include perceived benefits (eg, clarity, efficiency, and support for complex tasks), risks (eg, errors, bias, and privacy concerns), and practical or ethical considerations as reported by the participants. Descriptive study characteristics (eg, publication type, country, sample characteristics, clinical setting, study aim, study design, research question, and key findings) will be summarized in tables or diagrams.
In addition, findings will be organized using the TAM to interpret patterns related to perceived usefulness, ease of use, and behavioral intention. Where relevant, ethical, legal, and privacy considerations reported in the literature will be highlighted and mapped in relation to technology adoption and implementation dynamics.
Discussion
Anticipated Findings
This scoping review is expected to provide a comprehensive overview of the current evidence on HCPs’ experiences and opinions regarding the use of GenAI tools in clinical documentation. By systematically mapping this literature, we aim to identify prevailing themes as well as significant knowledge gaps. For example, we anticipate insights into perceived risks related to patient safety, bias, and data protection, as well as ethical and privacy concerns that may influence technology acceptance and use in clinical settings.
By applying the TAM as a guiding framework, we will interpret patterns across studies related to perceived usefulness, ease of use, and behavioral intention to adopt GenAI tools. This approach may reveal aspects such as usability challenges or contextual factors shaping HCPs’ readiness to integrate these technologies.
Furthermore, our review could help clarify how ethical considerations overlap with user acceptance, practical implementation, and broader sociotechnical dynamics that influence GenAI uptake in health care. This integrated perspective can inform targeted strategies to support safe, ethical, and effective adoption of GenAI tools in clinical practice.
Finally, applying the MMAT to assess study quality will support a more robust and trustworthy narrative synthesis by contextualizing findings within the methodological strength of the available evidence.
Comparison to Prior Work
Several reviews have explored HCPs’ perspectives on AI in clinical settings. Ayorinde et al [] conducted a systematic review synthesizing experiences with non–knowledge-based AI tools in decision-making. Bracken et al [] focused on the efficiency, quality, and stakeholder views of AI-driven documentation systems using measurable outcomes. Buchanan et al [] mapped emerging AI-powered health technologies and their implications for nursing across clinical, administrative, policy, and research domains. Chustecki [] used a narrative review to examine benefits and concerns related to AI in health care, including bias, transparency, data privacy, and safety. Hassan et al [] and Lambert et al [] conducted scoping and integrative reviews, respectively, to identify barriers and facilitators to AI adoption and acceptance among HCPs. Sahoo et al [] highlighted perceived pros and cons of AI in health care, while Shinners et al [] explored HCPs’ understanding of and experiences with AI in care delivery. While these reviews broadly explore HCPs’ perspectives on AI, none specifically address their experiences and opinions on the use of GenAI for clinical documentation.
Strengths and Limitations
Given that GenAI tools are increasingly being heralded as valuable in clinical documentation, this scoping review examines an urgent topic. By synthesizing HCPs’ experiences and opinions, it strives to obtain insights on implementation and uptake of these tools. These findings can guide future research and support the safe integration of GenAI in health care.
As GenAI rapidly evolves, a potential limitation of the study is that the findings may soon become outdated. Additionally, the exclusion of non–English-language studies might lead to the omission of relevant research. This may limit the generalizability of the findings and introduce a Western-centric bias, particularly given the global scale of GenAI adoption in health care. This constraint may overlook relevant insights from non–English-speaking countries. While resource limitations prevent full inclusion of non–English-language literature, studies with English-language abstracts will be screened when available to partially mitigate this limitation. Furthermore, the exclusion of gray literature means that some studies will be overlooked. Nevertheless, our search strategy is designed to identify the most comprehensive and high-quality evidence.
Future Directions
Our scoping review aims to map current research on this issue. This approach aims to clarify how HCPs are—or are not—integrating GenAI into their daily work, helping to identify perceived benefits, risks, expectations, and uncertainties. Results also may help to identify research gaps in this field.
Dissemination Plan
To ensure the findings of this scoping review reach relevant stakeholders and contribute meaningfully to practice, policy, and future research, we plan to implement a comprehensive dissemination strategy. This will include publishing the results in a peer-reviewed, open-access journal and presenting them at relevant academic conferences and professional meetings.
Conclusion
This scoping review will systematically map the existing literature on HCPs’ experiences and opinions regarding the use of GenAI tools in clinical documentation. By identifying key themes, knowledge gaps, and patterns related to technology acceptance, the findings will help inform future research, policy, and practice. The integration of the TAM as a guiding framework, along with focused attention on ethical and practical considerations, will provide a structured lens for interpreting the literature. Ultimately, this review aims to support the safe, effective, and ethical implementation of GenAI in health care settings.
Acknowledgments
This work was supported by the Forte (2024-00039) grant AI (Health care Unleashed: Responsible and Ethical Implementation of Large Language Model Chatbots in Clinical Workflows and Patient Care). We greatly appreciate the valuable assistance of Uppsala University librarian Emma Kristina Carlsson with our scoping review. Her expertise in research databases and guidance in refining our search strategy were crucial in the design of a robust study protocol.
Data Availability
As this is a protocol, no data have been collected or analyzed to date. The scoping review will use data from publicly available sources, retrievable through the following databases: PubMed, IEEE Xplore, APA PsycInfo, CINAHL, and Web of Science.
Authors' Contributions
CGS drafted the manuscript. CGS and CB were responsible for the conception of the study. All authors read the paper, provided feedback on it, and approved it for submission.
Conflicts of Interest
None declared.
Adapted search strategies for each database.
DOCX File , 26 KBReferences
- Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez A, et al. Attention is all you need [preprint]. arXiv. [FREE Full text] [CrossRef]
- Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, et al. Generative adversarial nets. In: Advances in Neural Information Processing Systems. Curran Associates, Inc; 2020. Presented at: Neural Information Processing Systems; December 8–13, 2014; Montreal, Canada. URL: https://proceedings.neurips.cc/paper_files/paper/2014/file/f033ed80deb0234979a61f95710dbe25-Paper.pdf [CrossRef]
- Kingma DP, Welling M. Auto-encoding variational Bayes. arXiv. Preprint posted online 2013 Dec 20. [FREE Full text] [CrossRef]
- Nuance DAX. 2025. URL: https://www.nuance.com/healthcare/dragon-ai-clinical-solutions/dax-copilot/infographic/move-beyond-scribes-to-automatically-document-care.html [accessed 2025-07-25]
- Speke. ScribeAmerica. 2025. URL: https://www.scribeamerica.com/speke/ [accessed 2025-02-20]
- AI medical scribe. Tandem. 2025. URL: https://www.tandemhealth.ai/ [accessed 2025-02-20]
- Med-PaLM. Google Research. 2025. URL: https://sites.research.google/med-palm/ [accessed 2025-02-20]
- Blease C, Kaptchuk TJ, Bernstein MH, Mandl KD, Halamka JD, DesRoches CM. Artificial intelligence and the future of primary care: exploratory qualitative study of UK general practitioners' views. J Med Internet Res. Mar 20, 2019;21(3):e12802. [FREE Full text] [CrossRef] [Medline]
- Blease C, Locher C, Leon-Carlyle M, Doraiswamy M. Artificial intelligence and the future of psychiatry: qualitative findings from a global physician survey. Digit Health. Oct 27, 2020;6:2055207620968355. [FREE Full text] [CrossRef] [Medline]
- Blease C, Kharko A, Locher C, Torous J, Rosch S, Hägglund M, et al. Generative artificial intelligence in medicine: a mixed methods survey of UK general practitioners. JMIR Preprints. [FREE Full text] [CrossRef]
- Kharko A, McMillan B, Hagström J, Muli I, Davidge G, Hägglund M, et al. Generative artificial intelligence writing open notes: A mixed methods assessment of the functionality of GPT 3.5 and GPT 4.0. Digit Health. Oct 29, 2024;10:20552076241291384. [FREE Full text] [CrossRef] [Medline]
- Ayers JW, Poliak A, Dredze M, Leas EC, Zhu Z, Kelley JB, et al. Comparing physician and artificial intelligence chatbot responses to patient questions posted to a public social media forum. JAMA Intern Med. Jun 01, 2023;183(6):589-596. [FREE Full text] [CrossRef] [Medline]
- Blease C, Torous J, McMillan B, Hägglund M, Mandl KD. Generative language models and open notes: exploring the promise and limitations. JMIR Med Educ. Jan 04, 2024;10:e51183. [FREE Full text] [CrossRef] [Medline]
- Baker HP, Dwyer E, Kalidoss S, Hynes K, Wolf J, Strelzow JA. ChatGPT's ability to assist with clinical documentation: a randomized controlled trial. J Am Acad Orthop Surg. Feb 01, 2024;32(3):123-129. [FREE Full text] [CrossRef] [Medline]
- Blease C, Torous J. ChatGPT and mental healthcare: balancing benefits with risks of harms. BMJ Ment Health. Nov 10, 2023;26(1):e300884. [FREE Full text] [CrossRef] [Medline]
- Blease C. Open AI meets open notes: surveillance capitalism, patient privacy and online record access. J Med Ethics. Jan 23, 2024;50(2):84-89. [FREE Full text] [CrossRef] [Medline]
- Gross N. What ChatGPT tells us about gender: a cautionary tale about performativity and gender biases in AI. Social Sciences. Aug 01, 2023;12(8):435. [CrossRef]
- King M. Harmful biases in artificial intelligence. Lancet Psychiatry. Nov 2022;9(11):e48. [FREE Full text] [CrossRef] [Medline]
- Ferrara E. Should ChatGPT be biased? Challenges and risks of bias in large language models. First Monday. Nov 07, 2023;28(11):1-39. [FREE Full text] [CrossRef]
- Kotek H, Dockum R, Sun D. Gender bias and stereotypes in large language models. In: CI '23: Proceedings of The ACM Collective Intelligence Conference. New York, NY. Association for Computing Machinery; 2023. Presented at: ACM Collective Intelligence Conference; November 6–9, 2023:12-24; Delft, Netherlands. [CrossRef]
- Ji Z, Lee N, Frieske R, Yu T, Su D, Xu Y, et al. Survey of hallucination in natural language generation. ACM Comput Surv. Mar 03, 2023;55(12):1-38. [CrossRef]
- Chen S, Kann BH, Foote MB, Aerts HJWL, Savova GK, Mak RH, et al. Use of artificial intelligence chatbots for cancer treatment information. JAMA Oncol. Oct 01, 2023;9(10):1459-1462. [FREE Full text] [CrossRef] [Medline]
- Ayorinde A, Mensah DO, Walsh J, Ghosh I, Ibrahim SA, Hogg J, et al. Health care professionals' experience of using AI: systematic review with narrative synthesis. J Med Internet Res. Oct 30, 2024;26:e55766. [FREE Full text] [CrossRef] [Medline]
- Bracken A, Reilly C, Feeley A, Sheehan E, Merghani K, Feeley I. Artificial intelligence (AI) - powered documentation systems in healthcare: a systematic review. J Med Syst. Feb 18, 2025;49(1):28. [FREE Full text] [CrossRef] [Medline]
- Chustecki M. Benefits and risks of AI in health care: narrative review. Interact J Med Res. Nov 18, 2024;13:e53616. [FREE Full text] [CrossRef] [Medline]
- Hassan M, Kushniruk A, Borycki E. Barriers to and facilitators of artificial intelligence adoption in health care: scoping review. JMIR Hum Factors. Aug 29, 2024;11:e48633. [FREE Full text] [CrossRef] [Medline]
- Lambert SI, Madi M, Sopka S, Lenes A, Stange H, Buszello C, et al. An integrative review on the acceptance of artificial intelligence among healthcare professionals in hospitals. NPJ Digit Med. Jun 10, 2023;6(1):111. [FREE Full text] [CrossRef] [Medline]
- Sahoo RK, Sahoo KC, Negi S, Baliarsingh SK, Panda B, Pati S. Health professionals' perspectives on the use of Artificial Intelligence in healthcare: A systematic review. Patient Educ Couns. Jan 27, 2025;134:108680. [CrossRef] [Medline]
- Buchanan C, Howitt ML, Wilson R, Booth RG, Risling T, Bamford M. Predicted influences of artificial intelligence on the domains of nursing: scoping review. JMIR Nurs. Dec 17, 2020;3(1):e23939. [FREE Full text] [CrossRef] [Medline]
- Shinners L, Aggar C, Grace S, Smith S. Exploring healthcare professionals' understanding and experiences of artificial intelligence technology use in the delivery of healthcare: an integrative review. Health Informatics J. Jun 26, 2020;26(2):1225-1236. [FREE Full text] [CrossRef] [Medline]
- Davis FD. A technology acceptance model for empirically testing new end-user information systems: theory and results [thesis]. Massachusetts Institute of Technology. 1985. URL: https://dspace.mit.edu/handle/1721.1/15192 [accessed 2025-06-21]
- Arksey H, O'Malley L. Scoping studies: towards a methodological framework. Int J Soc Res Methodol. Feb 2005;8(1):19-32. [CrossRef]
- Levac D, Colquhoun H, O'Brien KK. Scoping studies: advancing the methodology. Implement Sci. Sep 20, 2010;5:69. [FREE Full text] [CrossRef] [Medline]
- Tricco AC, Lillie E, Zarin W, O'Brien KK, Colquhoun H, Levac D, et al. PRISMA Extension for Scoping Reviews (PRISMA-ScR): checklist and explanation. Ann Intern Med. Oct 02, 2018;169(7):467-473. [FREE Full text] [CrossRef] [Medline]
- Cooke A, Smith D, Booth A. Beyond PICO: the SPIDER tool for qualitative evidence synthesis. Qual Health Res. Oct 2012;22(10):1435-1443. [CrossRef] [Medline]
- Beeley C. Web Application Development With R Using Shiny. Birmingham, UK. Packt Publishing; 2013.
- Haddaway NR, Page MJ, Pritchard CC, McGuinness LA. PRISMA2020: an R package and Shiny app for producing PRISMA 2020-compliant flow diagrams, with interactivity for optimised digital transparency and open synthesis. Campbell Syst Rev. Jun 2022;18(2):e1230. [FREE Full text] [CrossRef] [Medline]
- Covidence. 2025. URL: https://www.covidence.org/ [accessed 2025-07-22]
- Hong QN, Fàbregues S, Bartlett G, Boardman F, Cargo M, Dagenais P, et al. The Mixed Methods Appraisal Tool (MMAT) version 2018. Registration of Copyright (#1148552). Canadian Intellectual Property Office, Industry Canada. URL: http://mixedmethodsappraisaltoolpublic.pbworks.com/w/file/fetch/127916259/MMAT__criteria-manual_2018-08-01_ENG.pdf [accessed 2025-07-30]
- Terry G, Hayfield N, Clarke V, Braun V. Thematic analysis. In: The SAGE Handbook of Qualitative Research in Psychology. Thousand Oaks, CA. SAGE Publications; 2017:25.
- Popay J, Roberts H, Petticrew M, Britten N, Arai L, Roen K, et al. Guidance on the conduct of narrative synthesis in systematic reviews. J Epidemiol Community Health. Jan 2005;59(Suppl 1):A7. [FREE Full text]
Abbreviations
| AI: artificial intelligence |
| GenAI: generative artificial intelligence |
| HCP: health care professional |
| LLM: large language model |
| MMAT: Mixed Methods Appraisal Tool |
| PRISMA ScR: Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews |
| PRISMA: Preferred Reporting Items for Systematic Reviews and Meta-Analyses |
| RQ: research question |
| SPIDER: sample, phenomenon of interest, design, evaluation, research type |
| TAM: technology acceptance model |
Edited by J Sarvestan; submitted 10.03.25; peer-reviewed by MK Milic, J Sena, M Veliyathnadu Ebrahim; comments to author 02.06.25; revised version received 01.07.25; accepted 09.07.25; published 08.08.25.
Copyright©Carolina Garcia Sanchez, Anna Kharko, Maria Hägglund, Sara Riggare, Charlotte Blease. Originally published in JMIR Research Protocols (https://www.researchprotocols.org), 08.08.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Research Protocols, is properly cited. The complete bibliographic information, a link to the original publication on https://www.researchprotocols.org, as well as this copyright and license information must be included.

