Subtopic Deep Dive

← Artificial Intelligence in Healthcare and Education

Large Language Models in Education
Research Guide

What is Large Language Models in Education?

Large Language Models in Education applies LLMs like ChatGPT for personalized tutoring, automated assessment, curriculum design, and ethical evaluation in pedagogical contexts.

Researchers assess LLMs' performance on exams like USMLE (Kung et al., 2023, 3215 citations) and explore impacts on higher education integrity (Cotton et al., 2023, 1644 citations). Rapid reviews synthesize ChatGPT's effects on teaching and learning (Lo, 2023, 1560 citations). Over 10 key papers from 2023 analyze benefits, challenges, and student perceptions (Chan and Hu, 2023, 1369 citations).

Curated Papers

Key Challenges

Why It Matters

LLMs enable AI-assisted medical education by passing USMLE thresholds (Kung et al., 2023). They raise academic integrity issues, prompting policies against cheating (Cotton et al., 2023; Rudolph et al., 2023). Student surveys reveal benefits like engagement alongside challenges in bias and equity (Chan and Hu, 2023). Systematic reviews map AI's state in higher education, guiding equitable integration (Crompton and Burke, 2023).

Key Research Challenges

Academic Integrity Threats

ChatGPT enables undetectable cheating, undermining traditional assessments (Cotton et al., 2023, 1644 citations; Rudolph et al., 2023, 1549 citations). Detection tools lag behind generative capabilities. Policies must balance access and enforcement.

Ethical Bias Risks

Generative AI perpetuates biases in educational content and assessments (Dwivedi et al., 2023, 3140 citations). Equity issues arise in diverse learner populations. Mitigation strategies remain underdeveloped.

Learning Outcomes Uncertainty

Impacts on critical thinking and skill retention are unclear despite engagement gains (Lo, 2023, 1560 citations; Chan and Hu, 2023, 1369 citations). Longitudinal studies are scarce. Overreliance may hinder deep learning.

Essential Papers

Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models

Tiffany H. Kung, Morgan Cheatham, Arielle Medenilla et al. · 2023 · PLOS Digital Health · 3.2K citations

We evaluated the performance of a large language model called ChatGPT on the United States Medical Licensing Exam (USMLE), which consists of three exams: Step 1, Step 2CK, and Step 3. ChatGPT perfo...

Opinion Paper: “So what if ChatGPT wrote it?” Multidisciplinary perspectives on opportunities, challenges and implications of generative conversational AI for research, practice and policy

Yogesh K. Dwivedi, Nir Kshetri, Laurie Hughes et al. · 2023 · International Journal of Information Management · 3.1K citations

Transformative artificially intelligent tools, such as ChatGPT, designed to generate sophisticated text indistinguishable from that produced by a human, are applicable across a wide range of contex...

Revolutionizing healthcare: the role of artificial intelligence in clinical practice

Shuroug A. Alowais, Sahar S. Alghamdi, Nada Alsuhebany et al. · 2023 · BMC Medical Education · 2.4K citations

Chatting and cheating: Ensuring academic integrity in the era of ChatGPT

Debby Cotton, Peter A. Cotton, J. Reuben Shipway · 2023 · Innovations in Education and Teaching International · 1.6K citations

The use of artificial intelligence in academia is a hot topic in the education field. ChatGPT is an AI tool that offers a range of benefits, including increased student engagement, collaboration, a...

What Is the Impact of ChatGPT on Education? A Rapid Review of the Literature

Chung Kwan Lo · 2023 · Education Sciences · 1.6K citations

An artificial intelligence-based chatbot, ChatGPT, was launched in November 2022 and is capable of generating cohesive and informative human-like responses to user input. This rapid review of the l...

ChatGPT: Bullshit spewer or the end of traditional assessments in higher education?

Jürgen Rudolph, Samson Tan, Shannon Tan · 2023 · Journal of Applied Learning & Teaching · 1.5K citations

ChatGPT is the world’s most advanced chatbot thus far. Unlike other chatbots, it can create impressive prose within seconds, and it has created much hype and doomsday predictions when it comes to s...

Students’ voices on generative AI: perceptions, benefits, and challenges in higher education

Cecilia Ka Yuk Chan, Wenjie Hu · 2023 · International Journal of Educational Technology in Higher Education · 1.4K citations

Abstract This study explores university students’ perceptions of generative AI (GenAI) technologies, such as ChatGPT, in higher education, focusing on familiarity, their willingness to engage, pote...

Reading Guide

Foundational Papers

No pre-2015 foundational papers available; start with highest-cited 2023 works like Kung et al. for performance baselines.

Recent Advances

Prioritize Kung et al. (2023, 3215 citations), Dwivedi et al. (2023, 3140 citations), Lo (2023, 1560 citations) for impacts and reviews.

Core Methods

Core methods: exam benchmarking (USMLE, Kung et al., 2023), rapid reviews (Lo, 2023), surveys (Chan and Hu, 2023), systematic lit mapping (Crompton and Burke, 2023).

How PapersFlow Helps You Research Large Language Models in Education

Discover & Search

Research Agent uses searchPapers and exaSearch to find top-cited works like 'Performance of ChatGPT on USMLE' (Kung et al., 2023), then citationGraph reveals clusters on integrity (Cotton et al., 2023) and findSimilarPapers uncovers student perception studies (Chan and Hu, 2023).

Analyze & Verify

Analysis Agent employs readPaperContent on Kung et al. (2023) for USMLE pass rates, verifyResponse with CoVe to check hallucinated claims against abstracts, runPythonAnalysis for citation trend stats via pandas, and GRADE grading to score evidence strength in Lo's (2023) rapid review.

Synthesize & Write

Synthesis Agent detects gaps in bias mitigation across Dwivedi et al. (2023) and Chan and Hu (2023), flags contradictions in assessment impacts; Writing Agent uses latexEditText for revisions, latexSyncCitations to integrate 10+ papers, latexCompile for camera-ready manuscripts, and exportMermaid for workflow diagrams.

Use Cases

"Extract and analyze USMLE performance stats from ChatGPT papers for education benchmarks."

Research Agent → searchPapers('ChatGPT USMLE') → Analysis Agent → readPaperContent(Kung 2023) → runPythonAnalysis(pandas parse pass rates, matplotlib plot) → CSV export of thresholds vs. human scores.

"Draft a review paper section on ChatGPT ethics in higher ed with citations."

Synthesis Agent → gap detection(Dwivedi 2023, Cotton 2023) → Writing Agent → latexEditText(structure section) → latexSyncCitations(10 papers) → latexCompile(PDF) → peer-ready draft with integrated refs.

"Find GitHub repos with code for detecting ChatGPT in student essays."

Research Agent → searchPapers('ChatGPT detection education') → Code Discovery → paperExtractUrls → paperFindGithubRepo → githubRepoInspect(code quality, metrics) → runnable detector scripts for integrity tools.

Automated Workflows

Deep Research workflow conducts systematic reviews of 50+ ChatGPT education papers, chaining searchPapers → citationGraph → GRADE grading for structured reports on outcomes (Lo, 2023). DeepScan applies 7-step analysis with CoVe checkpoints to verify integrity claims (Cotton et al., 2023). Theorizer generates hypotheses on LLM tutoring efficacy from synthesized lit (Kung et al., 2023).

Try Doxa for Large Language Models in Education Research

Frequently Asked Questions

What defines Large Language Models in Education?

LLMs in education use tools like ChatGPT for tutoring, assessment, and curriculum design, evaluated for ethics and outcomes (Lo, 2023).

What methods assess LLM educational impact?

Methods include USMLE benchmarking (Kung et al., 2023), rapid literature reviews (Lo, 2023), and student perception surveys (Chan and Hu, 2023).

What are key papers on this topic?

Top papers: Kung et al. (2023, 3215 citations) on USMLE; Dwivedi et al. (2023, 3140) on multidisciplinary implications; Cotton et al. (2023, 1644) on cheating.

What open problems exist?

Challenges include bias mitigation, long-term learning effects, and scalable detection of AI cheating (Rudolph et al., 2023; Chan and Hu, 2023).

Research Artificial Intelligence in Healthcare and Education with AI

PapersFlow provides specialized AI tools for your field researchers. Here are the most relevant for this topic:

AI Literature Review

Automate paper discovery and synthesis across 474M+ papers

Deep Research Reports

Multi-source evidence synthesis with counter-evidence

Paper Summarizer

Get structured summaries of any paper in seconds

AI Academic Writing

Write research papers with AI assistance and LaTeX support

Start Researching Large Language Models in Education with AI

Search 474M+ papers, run AI-powered literature reviews, and write with integrated citations — all in one workspace.

Try PapersFlow Free See AI Literature Review

Part of the Artificial Intelligence in Healthcare and Education Research Guide