Using LLMs like ChatGPT for Real-World Evidence

12 Sep 2023 3 min read News

Large Language Models (LLMs), including ChatGPT, have gained significant traction since late 2022, sparking debates about whether they’re mere hype or a genuine revolution. However, their undeniable utility spans personal and business activities, including but not limited to proofreading and editing, language translation, coding and debugging, and content creation. At Polygon Health Analytics, we recognize several areas where LLMs can make a substantial positive impact on real-world evidence (RWE) generation and Health Economics and Outcome Research (HEOR), as discussed below.

LLMs Enable RWE Generation from Novel Data Types

While most current RWE is derived from claims data or structured Electronic Health Record (EHRs), a wealth of untapped potential lies in unstructured real-world data (RWD) sources. These include unstructured EHRs (e.g., clinician’s notes), surveys, social media (including online patient forums), news articles, and scientific literature. Often overlooked due to challenges like missing entries, typos, specialized jargons, and context-dependent acronyms, these unstructured free-text data can now be harnessed efficiently with LLMs. These models can efficiently extract clinical terms (e.g., diagnoses, symptoms, and medications), transform them into structured formats, fix misspellings, and contextualize acronyms and jargons^1-4 (Figure 1). LLM will unlock previously overlooked RWE insights, driving the evolution of evidence-based medicine.

**Figure 1**. ChatGPT deciphers the clinical notes and extracts medical terms into a structured format (adapted from [4])

LLMs Democratize RWD Exploration and Analysis

RWE generation often involves navigating vast datasets, demanding advanced technical skills. LLMs hold the promise of democratizing data analysis, making it accessible to a broader audience, regardless of programming or statistical expertise. Users can interact with data naturally by posing questions in everyday language, and LLMs can translate these inquiries into structured queries for data retrieval^5,6 (Figure 2). Furthermore, LLMs can assist users in generating code for complex analyses, and interpreting and summarizing findings presented in tables and figures, thereby lowering barriers to RWD analysis and facilitating data-driven decision-making^7,8.

**Figure 2**. LLM converts free-text questions to SQL queries for data retrieving from a relational database storing RWD ([6]).

LLMs Automate Scientific Literature Synthesis

Systematic literature review is about analyzing and synthesizing existing scientific literature for evidence-based decision-making and novel research and development opportunities identification. Traditionally, manual reviews are time-consuming and expensive. LLMs can perform systematic literature review tasks, from defining search terms, to summarizing and extracting information from articles⁹. By deploying multiple AI agents, LLMs can streamline the review process, offering timely insights amidst the growing body of literature¹⁰.

In conclusion, integrating LLMs into RWE generation promises to advance HEOR, patient care, and health policy. LLMs are poised to reshape the landscape of RWD and RWE. As these models continue evolving, their impact on the future of medicine and patient care will be profound. It is crucial to approach these advancements critically, addressing challenges like customized models, data privacy, biasness and fairness, and regulatory compliance.

References:

Jethani, Neil, et al. “Evaluating ChatGPT in Information Extraction: A Case Study of Extracting Cognitive Exam Dates and Scores.” medRxiv (2023): 2023-07.
Huang, Jingwei, et al. “A Critical Assessment of Using ChatGPT for Extracting Structured Data from Clinical Notes.” Available at SSRN 4488945.
Hu, Yan, et al. “Zero-shot clinical entity recognition using chatgpt.” arXiv preprint arXiv:2303.16416 (2023).
“Large language models help decipher clinical notes.” MIT.edu (2022)
Pan, Youcheng, et al. “A BERT-based generation model to transform medical texts to SQL queries for electronic medical records: model development and validation.” JMIR Medical Informatics 9.12 (2021): e32698.
Lee, Gyubok, et al. “EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records.” Advances in Neural Information Processing Systems 35 (2022): 15589-15601.
“How to Use ChatGPT Code Interpreter.” Datacamp.com (2023)
Maddigan, Paula, and Teo Susnjak. “Chat2vis: Generating data visualisations via natural language using chatgpt, codex and gpt-3 large language models.” IEEE Access (2023).
Alshami, Ahmad, et al. “Harnessing the Power of ChatGPT for Automating Systematic Review Process: Methodology, Case Study, Limitations, and Future Directions.” Systems 11.7 (2023): 351.
Talebirad, Yashar, and Amirhossein Nadiri. “Multi-Agent Collaboration: Harnessing the Power of Intelligent LLM Agents.” arXiv preprint arXiv:2306.03314 (2023).

Other Posts You Might Like

Polygon Health Analytics Showcases AI-Driven SLE Research and Social Media Evidence at ISPOR 2026

May 03, 2026

Philadelphia, PA — Polygon Health Analytics LLC will present new research and lead an interactive workshop at the ISPOR 2026 Annual Conference, May 17–20, 2026, at...

AI in HEOR, RWD & Medical Affairs: What 133 Professionals Told Us—and What It Means for the Industry

Apr 14, 2026

Artificial intelligence is gaining traction across many disciplines, and health economics and outcomes research (HEOR), real-world data (RWD), and medical affairs are no exception. To understand...

Will AI Replace Pathologists? -Notes From the 2026 USCAP Floor

Mar 28, 2026

“People should stop training radiologists now.” — Geoffrey Hinton (2016; he later conceded the timeline was wrong) “Within 10 years, AI will replace many doctors…” — Bill Gates,...

Polygon Health Analytics Research to Be Presented at the 2026 USCAP Annual Meeting

Mar 17, 2026

San Antonio, TX — March 18, 2026 — Polygon Health Analytics LLC announced today that its research has been accepted for a platform presentation at the USCAP 115th...

PHA LaunchPad Program — Now Recruiting for the 2026 Summer Cohort

Jan 25, 2026

Location: Remote Duration: 3–6 months (part-time or full-time) Start Date: TBA (based on student team availability in the summer) Now entering its third year, the...

Celebrating 3 Years of Polygon Health Analytics

Jan 13, 2026

From corporate scientist to health tech founder: a candid three-year journey of building Polygon Health Analytics, transforming data, and redefining leadership....

Synthetic Data vs. Real-World Data: A Reality Check for Healthcare AI

Dec 15, 2025

I first encountered the concept of synthetic data back in 2013, while teaching a health informatics course as a tenure-track assistant professor at UNC Charlotte. To...

Drug Development Program Done Right: A Practical Checklist to Prevent Strategic Blind Spots

Nov 28, 2025

In the high-stakes world of pharmaceutical R&D, thousands of drug candidates are abandoned every year long before reaching patients. The harsh reality: fewer than...

QALYs Explained: The Metric That’s Shaping—and Dividing—Healthcare Policy

Nov 10, 2025

Quality-Adjusted Life Years (QALYs) are a cornerstone concept in health economics. They measure the value of medical treatments by considering both how long people live and...

Value-Based Health Care: Shifting the Focus from Quantity to Quality

Oct 23, 2025

Understand how value-based health care shifts focus from volume to outcomes, rewarding better results, reducing costs and improving patient care....

View all

Leveraging Large Language Models like ChatGPT for Real-World Evidence Generation

Other Posts You Might Like