TY - JOUR
T1 - Talking tech in education
T2 - a systematic review of chatbot evaluation, methods, communication modalities, and impact
AU - Al-Shamaileh, Ons
AU - Mubin, Omar
AU - Nizamuddin, Nishara
PY - 2025
Y1 - 2025
N2 - This systematic review investigates how chatbots are evaluated and adopted in education; namely the methods used to assess their effectiveness, their communication modalities, and their benefits and challenges. Following the PRISMA reporting guidelines a comprehensive search was conducted across seven academic databases, covering studies published between 2014 and 2024, resulting in 124 empirical studies based on inclusion and exclusion criteria. Thematic analysis focused on evaluation criteria, advantages, and disadvantages, while data collection methods and communication modalities were analyzed descriptively. Eight key themes were identified in the assessment of educational chatbots, including technology acceptance, learning outcomes and usability. Surveys were the most common data collection method followed by interviews. Text modality dominated chatbot communication, while multimodal formats were rarely used. Reported benefits included 24/7 availability, personalized feedback, and improved motivation. However, challenges such as technical issues and academic integrity concerns were noted. Future research should explore longitudinal impacts and more inclusive assessment frameworks.
AB - This systematic review investigates how chatbots are evaluated and adopted in education; namely the methods used to assess their effectiveness, their communication modalities, and their benefits and challenges. Following the PRISMA reporting guidelines a comprehensive search was conducted across seven academic databases, covering studies published between 2014 and 2024, resulting in 124 empirical studies based on inclusion and exclusion criteria. Thematic analysis focused on evaluation criteria, advantages, and disadvantages, while data collection methods and communication modalities were analyzed descriptively. Eight key themes were identified in the assessment of educational chatbots, including technology acceptance, learning outcomes and usability. Surveys were the most common data collection method followed by interviews. Text modality dominated chatbot communication, while multimodal formats were rarely used. Reported benefits included 24/7 availability, personalized feedback, and improved motivation. However, challenges such as technical issues and academic integrity concerns were noted. Future research should explore longitudinal impacts and more inclusive assessment frameworks.
KW - Chatbot
KW - education
KW - evaluation
KW - human-computer interaction
KW - user experience
UR - http://www.scopus.com/inward/record.url?scp=105012587644&partnerID=8YFLogxK
UR - https://go.openathens.net/redirector/westernsydney.edu.au?url=http://doi.org/10.1080/10447318.2025.2532712
U2 - 10.1080/10447318.2025.2532712
DO - 10.1080/10447318.2025.2532712
M3 - Article
AN - SCOPUS:105012587644
SN - 1044-7318
JO - International Journal of Human-Computer Interaction
JF - International Journal of Human-Computer Interaction
ER -