There is research that AI can both improve and impede learning. The same research suggests important design features that can allow AI to support learning.

1. Student use of AI continues to climb

Multiple surveys confirm steady increases:
- Digital Education Council. (2024). Digital Education Council global AI student survey 2024. https://www.digitaleducationcouncil.com/resource-library-items/digital-education-council-global-ai-student-survey-2024
- Freeman, J. (2025). Student generative AI survey 2025 (HEPI Policy Note 61). Higher Education Policy Institute. https://www.hepi.ac.uk/reports/student-generative-ai-survey-2025/
A recent study of 300 students over four semesters found usage rates rising both for assessment preparation (57%- 83%) and studying (44%-76%,) respectively.
- Parker, L. A., Loper, J., et al (2026) Longitudinal insights into AI in education: Usage, ethics, and policy development in higher education, Computers and Education Open, Volume 10, 100329, ISSN 2666-5573, https://doi.org/10.1016/j.caeo.2025.100329

2. Students SAY they use it for learning (and understanding instructions)

Students SAY they mostly use AI for
- Asking follow-up questions from a lecture
- Unpacking assignment instructions
- Getting guided help on a single homework problem
- Reviewing or editing writing
- Sanity-checking important emails
- Organizing study plans in their calendar
- Generating study materials before an exam
- Pre-grading projects before they turn them in.
  - Jones, Parker [a CalPoly student] (2026, March 26) The AI cheating panic is loud. The way students actually use ChatGPT is much quieter. https://edunewsletter.openai.com/p/the-ai-cheating-panic-is-loud-the
Explaining concepts, summarizing articles, and suggesting research ideas
- Freeman, J. (2025). Student generative AI survey 2025 (HEPI Policy Note 61). Higher Education Policy Institute. https://www.hepi.ac.uk/reports/student-generative-ai-survey-2025/
“Students rarely accept AI text without editing it.”
- https://theconversation.com/college-students-are-writing-with-ai-but-a-pilot-study-finds-theyre-not-simply-letting-it-write-for-them-276856

3. Students use of AI to do academic work is also on the rise.

A large 2025 Anthropic study of actual student usage found a fairly even split between collaborative uses (guidance and refinement) and direct output creation. Usage varies by discipline with slightly less direct output in STEM.
- College Board. (2025, May). New research: Majority of high school students use generative AI for schoolwork.https://newsroom.collegeboard.org/new-research-majority-high-school-students-use-generative-ai-schoolwork
- Freeman, J. (2025). Student generative AI survey 2025 (HEPI Policy Note 61). Higher Education Policy Institute. https://www.hepi.ac.uk/reports/student-generative-ai-survey-2025/
- Handa, K, Bent, D. et al (2025, April 8) Anthropic Education Report: How University Students Use Claude https://www.anthropic.com/news/anthropic-education-report-how-university-students-use-claude

4. Students believe that AI has improved their academic performance

5. Students who use AI find it more ethical than those who use it less.

Students who used AI for studying (M = 3.66, SD = 0.95) rated the use of AI to complete academic work as more ethical than those who did not (M = 2.76, SD = 1.05), t(317) = −7.04, p < .001. Similarly, students who used AI for assignments (M = 3.55, SD = 1.00) rated its ethicality higher than those who did not (M = 2.88, SD = 1.08), t(317) = −6.10, p < .001.”
- Parker, L. A., Loper, J., et al (2026) Longitudinal insights into AI in education: Usage, ethics, and policy development in higher education, Computers and Education Open, Volume 10, 100329, ISSN 2666-5573, https://doi.org/10.1016/j.caeo.2025.100329

6. When student use is UNGUIDED, homework improves and exam scores fall.

This study of 6,811 Chinese Students in grades 7-12 found
- – 30% Time on Homework
- +18% Homework Grades
- – 20% Exam Scores
With the largest learning loss in social science, STEM and languages and for younger and high-achieving students and for boys. The shorter homework completion time, the lower the exam scores. Learning requires friction and discomfort.
- Stromberg, David and Lei, Victor and Wu, Yanhui, The Generative AI Learning Penalty: Evidence from Chinese Secondary Education (June 02, 2026). SSRN: https://ssrn.com/abstract=6868618 or http://dx.doi.org/10.2139/ssrn.6868618
- Abbas, M., Jam, F.A. & Khan, T.I. Is it harmful or helpful? Examining the causes and consequences of generative AI usage among university students. Int J Educ Technol High Educ 21, 10 (2024).

This RCT of roughly 1,000 Turkish HS math students found that using AI improved practice (48%), but when AI (without guardrails, see below) was removed, students scored 17% below the no-AI control. Students did not perceive the damage.
- Bastani, H., Bastani, O., Sungu, A., Ge, H., Kabakcı, Ö., & Mariman, R. (2025). Generative AI without guardrails can harm learning: Evidence from high school mathematics. Proceedings of the National Academy of Sciences, 122, Article e2422633122. https://doi.org/10.1073/pnas.2422633122

A randomized lab study of 117 university students found that AI improved essay scores, but produced no significant gain in knowledge or transfer. Theis study coined the term “metacognitive laziness” for this displacement of self-regulated-learning processes.
- Fan, Y., Tang, L., Le, H., Shen, K., Tan, S., Zhao, Y., Shen, Y., Li, X., & Gašević, D. (2025). Beware of metacognitive laziness: Effects of generative artificial intelligence on learning motivation, processes, and performance. British Journal of Educational Technology, 56(2), 489–530. https://doi.org/10.1111/bjet.13544
- Kosmyna, N., Hauptmann, E., Yuan, Y. T., Situ, J., Liao, X.-H., Beresnitzky, A. V., Braunstein, I., & Maes, P. (2025). Your brain on ChatGPT: Accumulation of cognitive debt when using an AI assistant for essay writing task. arXiv. https://doi.org/10.48550/arXiv.2506.08872

This confirming study deals only with peer feedback.
- Darvishi, A., Khosravi, H., Sadiq, S., Gašević, D., & Siemens, G. (2024). Impact of AI assistance on student agency. Computers & Education, 210, Article 104967. https://doi.org/10.1016/j.compedu.2023.104967

A now famous MIT EEG study found that AI users had weaker and less distributed neural connectivity, the lowest sense of essay ownership and the poorest recall of their own text. These effects persisted and they called this “cognitive debt.” It was a highly-cited preprint, but after criticisms (Stanković et al., 2026) of the study design, limited sample size, reproducibility, methodological issues, inconsistencies in the reporting of results, and limited transparency in several aspects of the study’s procedures and findings, the original authors published a similar list of caveats and revisions: https://www.media.mit.edu/projects/your-brain-on-chatgpt/overview/
- Kosmyna, N., Hauptmann, E., Yuan, Y. T., Situ, J., Liao, X.-H., Beresnitzky, A. V., Braunstein, I., & Maes, P. (2025). Your brain on ChatGPT: Accumulation of cognitive debt when using an AI assistant for essay writing task. https://arxiv.org/abs/2506.08872
- Stanković, Miloš & Hirche, Ella & Kollatzsch, Sarah & Doetsch, Julia. (2025). Comment on: Your Brain on ChatGPT: Accumulation of Cognitive Debt When Using an AI Assistant for Essay Writing Tasks. https://arxiv.org/pdf/2601.00856
- Here is a meta-analysis blog of 30+ studies of AI and The Brain https://www.thealgorithmicbridge.com/p/what-the-studies-say-about-how-ai

All of this probably means that unguided student us of AI leads to less learning, but these studies mostly measure exam scores and we are need longer-term studies that measure durable and transferable learning and under what conditions, BUT…

7. Pre-AI Tutoring Systems were effective when well-designed.

Robust and replicated studies have found that pre-AI system designed for tutoring (and not just answers) were often as good as human tutoring, but that the context and design matters.
- Kulik, J. A., & Fletcher, J. D. (2016). Effectiveness of intelligent tutoring systems: A meta-analytic review. Review of Educational Research, 86(1), 42–78. https://doi.org/10.3102/0034654315581420
- VanLehn, K. (2011). The relative effectiveness of human tutoring, intelligent tutoring systems, and other tutoring systems. Educational Psychologist, 46(4), 197–221. https://doi.org/10.1080/00461520.2011.611369

8. Gen AI tools can improve learning when they are well-designed.

A Harvard RCT of 194 students in Physical Sciences 2 used a pre- and post-class quiz to compare an active in-class activity against an AI tutor that was expert-designed with step-by-step guidance, mastery pacing, hallucination guardrails, and deferral of answers. The instructions, content and worksheet remained identical in both conditions and they ran the experiment twice so all students experienced both conditions. Students with the AI tutor learned significantly more while spending less time on task and felt more engaged and more motivated.
- Kestin, G., Miller, K., Klales, A., Milbourne, T., & Ponti, G. (2025). AI tutoring outperforms in-class active learning: An RCT introducing a novel research-based design in an authentic educational setting. Scientific Reports, 15, Article 17458. https://doi.org/10.1038/s41598-025-97652-6

This review of 69 studies (84% of which involved higher ed students with the largest subject area being languages) from 2022–2024 found that ChatGPT may “genuinely enhance learning performance” because learning was more personal (Wang et al., 2024), access to information and diverse perspectives was more immediate (Urban et al., 2024), and students could engage more deeply with material (Meyer et al., 2024). There was a positive impact on students’ affective-motivational states but mixed evidence for improved self-efficacy and increased higher-order thinking and risks that AI could reduce mental effort. The design of AI was essential.
- Deng, R., Jiang, M., Yu, X., Lu, Y., & Liu, S. (2024). Does ChatGPT enhance student learning? A systematic review and meta-analysis of experimental studies. Computers & Education, 227, Article 105224. https://doi.org/10.1016/j.compedu.2024.105224
- Wang, X., Zhong, Y., et al. (2024). ChatPRCS: A Personalized Support System for English Reading Comprehension Based on ChatGPT,” IEEE Transactions on Learning Technologies, vol. 17, pp. 1722-1736, 2024, doi: 10.1109/TLT.2024.3405747
- Urban, Marek & Děchtěrenko, Filip & Lukavsky, Jiri & Hein, Veronika & Švácha, Filip & Brom, Cyril & Urban, Kamila. (2023). ChatGPT Improves Creative Problem-Solving Performance in University Students: An Experimental Study. 10.31234/osf.io/9z2tc
- Meyer, Jennifer & Jansen, Thorben & Schiller, Ronja & Liebenow, Lucas & Steinbach, Marlene & Horbach, Andrea & Fleckenstein, Johanna. (2023). Using LLMs to bring evidence-based feedback into the classroom: AI-generated feedback increases secondary students’ text revision, motivation, and positive emotions. Computers and Education: Artificial Intelligence. 6. 100199. 10.1016/j.caeai.2023.100199.

Well-designed AI is MUCH better than nothing. The World Bank ran a large RCT at co-ed public schools in Benin City in Nigeria over six weeks in June–July 2024. 800 first-year senior secondary students attended an after-school program twice a week where they studied English with Microsoft Copilot (a GPT-4 at the time) using prompts deliberately designed to push reasoning rather than answer-extraction. Girls, who lagged at baseline, showed larger gains, and attendance was strongly dose-responsive (more sessions, bigger effect).
- De Simone, Martin Elias; Tiberti, Federico Hernan; Barron Rodriguez, Maria Rebeca; Manolio, Federico Alfredo; Mosuro, Wuraola; Dikoru, Eliot Jolomi. (2025, May) From Chalkboards to Chatbots : Evaluating the Impact of Generative AI on Learning Outcomes in Nigeria. Policy Research Working Paper; RRR;People;Impact Evaluation Series Washington, D.C. : World Bank Group. http://documents.worldbank.org/curated/en/099548105192529324

The study of 1,000 Turkish HS math students (above) found that unguarded AI access actively harmed exam scores, but the guardrailed tutor (that withheld answers and have a hint-based scaffolding) eliminated the harm while improving performance (127%).
- Bastani, H., Bastani, O., Sungu, A., Ge, H., Kabakcı, Ö., & Mariman, R. (2025). Generative AI without guardrails can harm learning: Evidence from high school mathematics. Proceedings of the National Academy of Sciences, 122, Article e2422633122. https://doi.org/10.1073/pnas.2422633122

KEY FINDING: Who does the cognitive work matters.

Further, the way students perceive their interaction with AI also matters; the same interaction can create empowerment or dependency. Users with higher self-confidence do more critical thinking while using AI, while higher confidence in the AI is associated with less critical thinking and less effort. This is task-specific. Both the design of the AI tool and student agency are critical.
- Edwards, H., Edwards, D. (2025) How We Think and Live with AI: Early Patterns of Human Adaptation Artifciality Journal. https://journal.artificialityinstitute.org/how-we-think-and-live-with-ai-early-patterns-of-human-adaptation/
- Lee, H.-P., Sarkar, A., Tankelevitch, L., Drosos, I., Rintel, S., Banks, R., & Wilson, N. (2025). The impact of generative AI on critical thinking: Self-reported reductions in cognitive effort and confidence effects from a survey of knowledge workers. In Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (CHI ’25). ACM. https://doi.org/10.1145/3706598.3713778

Design Principles for Teachers using AI

Classroom time should center human interactions. AI offers an opportunity (maybe even an imperative) to raise standards: What can students do with AI that they could not do on their own? Now is a good moment to refocus assignment design on learning rather than catching cheaters. In the same way that video games offer a customized learning experience, designed to keep every user pleasantly frustrated, AI tools can be customized. The key is to keep students at the right (customized) level of friction. In practice that means teachers designing prompts and secure custom bots rather than allowing students to use open tools. You can find complete templates, examples and instructions at https://weteachwithai.com/creating-ai-simulations-and-custom-bots/

AI to coach, not to answer.

Design AI tools with constraints. For example: Never complete the user’s thinking for them. Ask questions that lead toward the insight rather than delivering it. Ask more than you tell. Your purpose is to create conditions under which students demonstrate, deepen, and reflect on what they have already read. Praise specificity, not correctness. If a student pastes in a plot summary or SparkNotes-style text, then say this…

Scaffold the learning

Specify a clear process for interactions. For example: Begin by doing this. Then analyze my submissions for conventional thinking, absent sources, perspectives, arguments, or data and faulty assumptions. Begin every conversation with this exact sequence. Do not skip steps. Step 1, say this “Hi, I am your Ai tutor.” Step 2… Then choose ONE of the following simulation frames, appropriate to the student’s complexity tier. Then…

Customize for both friction and motivation

AI tools customize easily when provided with context so include both a calibration step (to set the difficulty or friction level) and a customization step (that creates relevance and motivation).

–For motivation: Ask about interests in outside of this class. What do you want to do after graduation?

Last time you told me X; has anything shifted in how you’re thinking about that? Record student’s interests. You will use it to create analogies, frame questions, and build relevance bridges throughout the session. Do this organically.

–For friction and complexity level: Where is one place you have struggled with this material? Can you explain concept X to me? How experienced are you with this topic? How would you interpret or explain Y? Your role is to build a picture of the student’s blind spots, engagement patterns and intellectual strengths (specifically abstract argument or other) and push them into moderate discomfort at least once per session. Do not announce that you are doing this. Simply do it. The user should feel intellectually stretched.

Limit feedback to keep students doing the thinking

These two ideas are connected because too much feedback at once is demotivating. AI is patient so you can focus it to do both at once. For example: Encourage students to engage in self-reflection by providing other perspectives and ideas. Stimulate deeper student thinking by providing thought experiments, second opinions, alternative views, and even contrarian scenarios to broaden horizons. Help students discover nuance, innovative ideas, and new ways of thinking. HOWEVER, give only one suggestion or hint at a time. Focus on where I could most improve. Do not overwhelm me with too much information and guidance at once. Ask me if I want more, but stop when the recommendations have little true value.

Grade the reasoning, not the artifact.

Creating instructions that require the AI to ask about reasoning pushes students to do the thinking, but it also makes their thinking more visible for grading. For example: Ask students to explain their thinking and continue pushing them to go deeper. Ask “why did you choose that strategy?” and “Can you explain each of those steps?” Ask students to provide alternative explanations, sources and counter-arguments. Interrogate students in a way that leads them to distinguish between verifiable facts and speculation. Require commitment: if the student gives a vague or hedging answer, push back: “If you had to bet on ONE interpretation, what would it be? Commit to a reading.” Ask “help me make sense of what you mean.”

Teach calibrated trust and the limits of AI

Since student agency is important for the development of critical thinking, provide opportunities for students to critique AI output. For example: Ask students to dissect your position and tell you why it is wrong. Ask students to verify information you provide. Make “find and fix the AI’s mistake” a recurring task.

Provide human AI-off checkpoints

Students easily come to rely on AI, so it is important to make sure there are unaided work and different (and AI-off) assessments frequently. Some of those could might be AI assessment bots, but then with even more guardrails.

Build access and structure for all.

AI has tremendous potential to increase access but also to widen equity gaps. Do not assume everyone uses AI well. Provide equal access and explicit AI-literacy support. Paid tools are often much better, but AI tools are well-designed then can often run on free models effectively. All of the example simulations here were stressed tested on the best models but set to run and interact with students using only free models.

Designing Learning with AI