Introduction to the Proceedings
Preface
The University of Palermo is proud to host the 18th International Conference on Educational Data Mining (EDM) in Palermo, Italy, from July 20 to July 23, 2025. EDM is the annual flagship conference of the International Educational Data Mining Society. This year’s theme is New Goals, New Measurements, New Incentives to Learn. The theme highlights the shift toward an era dominated by artificial intelligence - where machines can surpass human performance in numerous cognitive tasks - and highlights the imperative for our educational systems to evolve accordingly.
The scientific program for EDM2025 includes:
Two keynote talks by outstanding researchers in the field:
Jill Burstein (Duolingo, USA).
Chris Piech (Stanford University, USA).
A plenary Prof. Ram Kumar Educational Data Mining Test of Time Award talk given by Ryan S. Baker (University of Pennsylvania, USA).
An invited Data Set Award talk by L Burleigh (The Learning Agency, USA).
Five workshops and three tutorials.
Eight presentation sessions with parallel (2 and 3) tracks.
A poster presentation session.
An industry track session.
A doctoral consortium session.
A cross-conference panel moderated by Ken Koedinger (Carnegie Mellon University, USA):
EDM: Ryan Baker (University of Pennsylvania, USA).
Learning@Scale: Mar Perez-Sanagustin (Université Paul Sabatier Tolouse III, France).
AIED: Olga Santos (Universidad Nacional de Educación a Distancia, Spain).
The workshops are:
9th Educational Data Mining in Computer Science Education (CSEDM) Workshop.
2nd Human-Centric eXplainable AI in Education (HEXED) Workshop.
2nd Workshop on Educational Data Mining in Writing and Literacy Instruction.
Interactive Workshop: Multimodal, Multiparty Learning Analytics (MMLA).
CausalEDM: Linking Innovations in Instructional Design and the Complex Behaviors that Underlie Learning Processes and Outcomes.
The tutorials are:
Bringing Generative Artificial Intelligence (GenAI) to Education.
Leveraging KOALA for Programming Data Collection: A Half-Day Tutorial for Research Application.
Applying DebiasEd: A Package for Mitigating Unfairness in Educational Data
This year, EDM received 105 submissions to the Full Papers track (10 pages), 81 to the Short Papers track (6 pages), and 37 to the Posters and Demos track (4 pages). The program committee accepted 29 full papers (27.6% acceptance rate), 24 short papers (29.6% acceptance rate), 19 poster and demo papers, and 3 industry track papers. Furthermore, EDM 2025 also continued its tradition of providing opportunities for young researchers to present their work and receive feedback from peers and senior researchers. The Doctoral Consortium track this year featured 5 participants. We are also proud to offer travel sponsorships to 24 students who will attend thanks to this support.
We thank the sponsors of EDM 2025 for their generous support. We thank all the authors who submitted their work and the program committee members for their expert inputs. We thank the members of the organization committee for their leadership that made this conference possible. And, a big thank you to the local organizing committee who made this event memorable.
Caitlin Mills | University of Minnesota, USA | Program Chair |
Giora Alexandron | Weizmann Institute of Science, Israel | Program Chair |
Davide Taibi | CNR-ITD, Italy | General Chair |
Giosuè Lo Bosco | University of Palermo, Italy | General Chair |
Luc Paquette | University of Illinois Urbana-Champaign, USA | General Chair |
July 10, 2025
Palermo, Italy
Organizing Committee
General Chairs
Davide Taibi, CNR-ITD, Italy
Giosuè Lo Bosco, University of Palermo, Italy
Luc Paquette, University of Illinois Urbana-Champaign, USA
Program Chairs
Caitlin Mills, University of Minnesota, USA
Giora Alexandron, Weizmann Institute of Science, Israel
Awards Chairs
Arnon Hershkovitz, Tel Aviv University, Israel
Tanja Käser, EPFL, Switzerland
Doctoral Consortium Chairs
Angela Stewart, University of Pittsburgh, USA
Min Chi, North Carolina State University, USA
Neil Heffernan, Worcester Polytechnic Institute, USA
Equity, Diversity, Inclusion, and Accessibility Chairs
Anna Rafferty, Carleton College, USA
Laura Allen, University of Minnesota, USA
Nigel Bosch, University of Illinois Urbana-Champaign, USA
Industry Track Chairs
Avi Segal, Ben-Gurion University of the Negev, Israel
Davide Zucchetti, Randstad, Italy
Yang Jiang, ETS Research Institute, USA
JEDM Track Chairs
Agathe Merceron, Berlin University of Applied Sciences, Germany
Roberto Martinez-Maldonado, Monash University, Australia
Poster & Demo Track Chairs
Heeryung Choi, University of Minnesota, USA
Mingyu Feng, WestEd, USA
Tanya Nazaretsky, EPFL, Switzerland
Proceedings Chairs
Miguel Portaz, UNED, Spain
Mirko Marras, University of Cagliari, Italy
Scholarship Chairs
Ryan Baker, University of Pennsylvania, USA
Antonija Mitrovic, University of Canterbury, New Zealand
Social Media & Publicity Chairs
Gabriella Casalino, University of Bari Aldo Moro, Italy
Jill-Jênn Vie, Inria, France
Sponsorship Chairs
Pier Paolo Corso, University of Palermo, Italy
Simon Woodhead, Eedi, UK
Student Volunteer Chair
Salvatore Calderaro, University of Palermo, Italy
Web Chairs
Daniele Schicchi, CNR-ITD, Italy
Paul Salvador Inventado, California State University Fullerton, USA
Workshop & Tutorial Chairs
Stephen Hutt, University of Denver, USA
Ramkumar Rajendran, IIT Bombay, India
Yang Shi, Utah State University, USA
IEDMS Officers
Luc Paquette | University of Illinois Urbana-Champaign, USA | President |
Anna Rafferty | Carleton College, USA | Treasurer |
IEDMS Board of Directors
Rakesh Agrawal | Data Insights Laboratories, USA |
Tiffany Barnes | North Carolina State University, USA |
Nigel Bosch | University of Illinois Urbana-Champaign, USA |
Neil Heffernan | Worcester Polytechnic Institute, USA |
Sharon Hsiao | Santa Clara University, USA |
Tanja Käser | EPFL, Switzerland |
Kenneth Koedinger | Carnegie Mellon University, USA |
Ramkumar Rajendran | Indian Institute of Technology Bombay, India |
Program Committee
Senior Program Committee Members
Bita Akram | North Carolina State University, USA |
Vincent Aleven | Carnegie Mellon University, USA |
Roger Azevedo | University of Central Florida, USA |
Ryan Baker | University of Pennsylvania, USA |
Tiffany Barnes | North Carolina State University, USA |
Nigel Bosch | University of Illinois Urbana-Champaign, USA |
Anthony F. Botelho | University of Florida, USA |
Jesus G. Boticario | UNED, Spain |
François Bouchet | Sorbonne Université – LIP6, France |
Alex Bowers | Columbia University, USA |
Christopher Brooks | University of Michigan, USA |
Alexandra Cristea | Durham University, UK |
Michel Desmarais | École Polytechnique de Montréal, Canada |
Fabiano Dorça | Universidade Federal de Uberlândia, Brazil |
Michael Eagle | George Mason University, USA |
Vanessa Echeverria | Monash University, Australia |
Yo Ehara | Tokyo Gakugei University, Japan |
Stephen Fancsali | Carnegie Learning, Inc., USA |
Mingyu Feng | WestEd, USA |
Carol Forsyth | Educational Testing Service, USA |
Dragan Gasevic | Monash University, Australia |
Neil Heffernan | Worcester Polytechnic Institute, USA |
Sharon Hsiao | Santa Clara University, USA |
Paul Inventado | California State University Fullerton, USA |
Johan Jeuring | Utrecht University, Netherlands |
Srecko Joksimovic | Education Future, University of South Australia, Australia |
Jelena Jovanovic | University of Belgrade, Serbia |
Kirsty Kitto | University of Technology Sydney, Australia |
Rene Kizilcec | Cornell University, USA |
Simon Knight | University of Technology Sydney, Australia |
Irena Koprinska | The University of Sydney, Australia |
Tanja Käser | EPFL, Switzerland |
Sébastien Lallé | Sorbonne Université, France |
Andrew Lan | University of Massachusetts Amherst, USA |
Jeonghyun Lee | Georgia Institute of Technology, USA |
James Lester | North Carolina State University, USA |
Collin Lynch | North Carolina State University, USA |
Mirko Marras | University of Cagliari, Italy |
Roberto Angel Melendez | Instituto Tecnológico Superior de Misantla, Mexico |
Agathe Merceron | Berlin University of Applied Sciences, Germany |
Roger Nkambou | Université du Québec à Montréal, Canada |
Jaclyn Ocumpaugh | University of Pennsylvania, USA |
Andrew Olney | University of Memphis, USA |
Benjamin Paaßen | Bielefeld University, Germany |
Ranilson Paiva | Universidade Federal de Alagoas, Brazil |
Luc Paquette | University of Illinois Urbana-Champaign, USA |
Zach Pardos | University of California, Berkeley, USA |
Radek Pelánek | Masaryk University Brno, Czechia |
Niels Pinkwart | Humboldt-Universität zu Berlin, Germany |
Agoritsa Polyzou | Florida International University, USA |
Thomas Price | North Carolina State University, USA |
Mar Pérez-Sanagustín | Université Paul Sabatier Toulouse III, France |
Anna Rafferty | Carleton College, USA |
R Rajalakshmi | VIT University, Chennai Campus, India |
Ramkumar Rajendran | IIT Bombay, India |
Maria Mercedes T. Rodrigo | Ateneo de Manila University, Philippines |
Cristobal Romero | University of Cordoba, Spain |
Shazia Sadiq | The University of Queensland, Australia |
Sherry Sahebi | University at Albany – SUNY, USA |
Niels Seidel | FernUniversität in Hagen, Germany |
Atsushi Shimada | Kyushu University, Japan |
Sergey Sosnovsky | Utrecht University, Netherlands |
Tiffany Tang | Wenzhou-Kean University, China |
Stefan Trausan-Matu | University Politehnica of Bucharest, Romania |
Maomi Ueno | The University of Electro-Communications, Japan |
Sebastián Ventura | University of Córdoba, Spain |
Jill-Jênn Vie | Inria Lille, France |
Stephan Weibelzahl | Private University of Applied Sciences, Germany |
Regular Program Committee Members
Faruk Ahmed | The University of Memphis, USA |
Ali Al-Zawqari | Vrije Universitiet Brussel, Belgium |
Nazia Alam | North Carolina State University, USA |
Laia Albó | Universitat de Vic, Spain |
Laura Allen | University of Minnesota, USA |
Claudia Antunes | Universidade de Lisboa, Portugal |
Simón Pedro Arguijo | Tecnológico Nacional de México, Mexico |
Jose Manuel Azevedo | ISCAP, Portugal |
Aicha Bakki | LERIA – Angers University, France |
Ayan Banerjee | Arizona State University, USA |
Denilson Barbosa | University of Alberta, Canada |
Abhinava Barthakur | University of South Australia, Australia |
Marie Bexte | FernUniversität in Hagen, Germany |
Anis Bey | La Rochelle University, France |
Plaban Kumar Bhowmick | Indian Institute of Technology Kharagpur, India |
Nathaniel Blanchard | Colorado State University, USA |
Maria Bolsinova | Tilburg University, Netherlands |
Geoffray Bonnin | Université de Lorraine – LORIA, France |
Conrad Borchers | Carnegie Mellon University, USA |
Mamadou Bousso | Thies University, Senegal |
Keith Brawner | United States Army Research Laboratory, USA |
Julien Broisin | IRIT, Université Toulouse III – Paul Sabatier, France |
Armelle Brun | Université de Lorraine – LORIA, France |
Minghao Cai | University of Alberta, Canada |
Renza Campagni | Università degli Studi di Firenze, Italy |
Meng Cao | Carnegie Mellon University, USA |
Paulo Carvalho | Carnegie Mellon University, USA |
Mehmet Celepkolu | University of Florida, USA |
Guanliang Chen | Monash University, Australia |
Heeryung Choi | University of Minnesota, USA |
Irene-Angelica Chounta | University of Duisburg-Essen, Germany |
Wei Chu | The University of Memphis, USA |
Cheng-Yu Chung | Chunghwa Telecom Laboratories (CHTTL), Taiwan |
Ruth Cobos | Universidad Autónoma de Madrid, Spain |
Maria Constantino González | Tecnológico de Monterrey Campus Laguna, Mexico |
Maria Cutumisu | McGill University, Canada |
Cynthia D’Angelo | University of Illinois at Urbana-Champaign, USA |
Jesper Dannath | Universität Bielefeld, Germany |
Syaamantak Das | Indian Institute of Technology Bombay, India |
M Ali Akber Dewan | Athabasca University, Canada |
Nicholas Diana | Colgate University, USA |
Fahima Djelil | IMT Atlantique, France |
Cristina Dumdumaya | University of Southeastern Philippines, Philippines |
Nghia Duong-Trung | German Research Centre for AI, Germany |
Luke Eglington | Amplify Education Inc., USA |
Effat Farhana | Vanderbilt University, USA |
Márcia Fernandes | Federal University of Uberlândia, Brazil |
Nigel Fernandez | University of Massachusetts Amherst, USA |
Jeremiah Folsom-Kovarik | Soar Technology, Inc., USA |
Davide Fossati | Emory University, USA |
Kazuma Fuchimoto | The University of Electro-Communications, Japan |
Hagit Gabbay | School of Education, Tel Aviv University, Israel |
Wenbin Gan | NIICT, Japan |
Adam Gaweda | North Carolina State University, USA |
Aldo Gordillo | Universidad Politécnica de Madrid (UPM), Spain |
Guher Gorgun | University of Alberta, Canada |
Sabine Graf | Athabasca University, Canada |
Julio Guerra | Universidad Austral de Chile, Chile |
Ella Haig | University of Portsmouth, UK |
Feifei Han | Australian Catholic University, Australia |
Ching Nam Hang | Saint Francis University, Hong Kong |
Jiangang Hao | Educational Testing Service, USA |
Jason Harley | McGill University, Canada |
Carl Haynes-Magyar | Carnegie Mellon University, USA |
Surina He | University of Alberta, Canada |
Sami Heikkinen | LAB University of Applied Sciences, Finland |
Nicolas Hernandez | Nantes Université, France |
Martin Hlosta | Swiss Distance University of Applied Sciences, Switzerland |
Brahim Hmedna | Ibn Zohr University Science Agadir, Morocco |
Anett Hoppe | Leibniz Universität Hannover, Germany |
Paul Hur | Free University of Berlin, Germany |
Stephen Hutt | University of Denver, USA |
Sébastien Iksal | LIUM – Le Mans Université, France |
Vladimir Ivančević | University of Novi Sad, Serbia |
Hyeji Jang | Ewha Womans University, South Korea |
Lan Jiang | University of Illinois Urbana-Champaign, USA |
Yang Jiang | Columbia University, USA |
Jina Kang | University of Illinois Urbana-Champaign, USA |
Mohammad Khalil | University of Bergen, Norway |
Ean Teng Khor | Nanyang Technological University, Singapore |
Ekaterina Kochmar | MBZUAI, United Arab Emirates |
Sotiris Kotsiantis | University of Patras, Greece |
Vitomir Kovanovic | The University of South Australia, Australia |
Milos Kravcik | DFKI GmbH, Germany |
Swathi Krishnaraja | University of Potsdam, Germany |
Amruth Kumar | Ramapo College of New Jersey, USA |
Hatim Lahza | The University of Queensland, Australia |
Hollis Lai | University of Alberta, Canada |
Juan Alfonso Lara Torralbo | Universidad de Córdoba, Spain |
Mikel Larrañaga | University of the Basque Country, Spain |
Tai Le Quy | University of Koblenz, Germany |
Alwyn Vwen Yen Lee | Nanyang Technological University, Singapore |
Marie Lefevre | LIRIS – Université Lyon 1, France |
Arun Balajiee Lekshmi Narayanan | University of Pittsburgh, USA |
Jiawei Li | Nanyang Technological University, China |
Jionghao Lin | Carnegie Mellon University, USA |
Zhexiong Liu | University of Pittsburgh, USA |
Yu Lu | Beijing Normal University, China |
Ivan Luković | University of Belgrade, Serbia |
Nick Lytle | Georgia Institute of Technology, USA |
Sonsoles López-Pernas | University of Eastern Finland, Finland |
Boxuan Ma | Kyushu University, Japan |
Qiang Ma | Kyoto Institute of Technology, Japan |
Qianou Ma | Carnegie Mellon University, USA |
Yael Feldman Maggor | KTH Royal Institute of Technology, Sweden |
Aditi Mallavarapu | University of Illinois at Chicago, USA |
Jeffrey Matayoshi | McGraw Hill ALEKS, USA |
Ahmad Mel | IDLab, Ghent University, Belgium |
Victor Menendez-Dominguez | Universidad Autónoma de Yucatán, Mexico |
Donatella Merlini | Università di Firenze, Italy |
Tsunenori Mine | Kyushu University, Japan |
Tsubasa Minematsu | Kyushu University, Japan |
Sein Minn | INRIA, France |
Phaedra Mohammed | The University of the West Indies, Trinidad and Tobago |
Luis Alberto Morales Rosales | U. Michoacana de San Nicolás de Hidalgo, Mexico |
Pedro Manuel Moreno-Marcos | Universidad Carlos III de Madrid, Spain |
Kousuke Mouri | Tokyo University of Agriculture and Technology, Japan |
Pedro J. Muñoz-Merino | Universidad Carlos III de Madrid, Spain |
Tanya Nazaretsky | EPFL, Israel |
Huy Nguyen | University of Pittsburgh, USA |
Ange Adrienne Nyamen Tato | Université Laval, Canada |
Teresa Ober | Educational Testing Service, USA |
Püren Öncel | University of Minnesota Twin Cities, USA |
Korinn Ostrow | Edmentum, USA |
Tounwendyam F. Ouedraogo | Université Norbert ZONGO, Burkina Faso |
Maciej Pankiewicz | University of Pennsylvania, Poland |
Yancy Vance Paredes | Arizona State University, USA |
Philip I. Pavlik Jr. | University of Memphis, USA |
Jorge Poco | Fundação Getulio Vargas, Brazil |
Paul Stefan Popescu | University of Craiova, Romania |
Oleksandra Poquet | Technical University of Munich, Germany |
David Pritchard | Massachusetts Institute of Technology, USA |
Napol Rachatasumrit | Carnegie Mellon University, USA |
Narjes Rohani | University of Edinburgh, UK |
José Raúl Romero | University of Cordoba, Spain |
Joshua Rosenberg | University of Tennessee, Knoxville, USA |
Daniela Rotelli | University of Pisa, Italy |
Vasile Rus | The University of Memphis, USA |
Mirka Saarela | University of Jyväskylä, Finland |
Asaf Salman | Weizmann Institute of Science, Israel |
Sreecharan Sankaranarayanan | Carnegie Mellon University, USA |
Petra Sauer | Beuth University of Applied Sciences, Germany |
Eliana Scheihing | Universidad Austral de Chile, Chile |
Robin Matthias Schmucker | Carnegie Mellon University, Germany |
Filippo Sciarrone | Universitas Mercatorum, Italy |
Lele Sha | Monash University, Australia |
Lei Shi | Newcastle University, UK |
Aditi Singh | Cleveland State University, USA |
Daevesh Singh | Indian Institute of Technology, India |
Maxim Skryabin | Skryabin.PRO, Russia |
Stefan Slater | Teachers College, USA |
Álvaro Sobrinho | Federal University of the Agreste of Pernambuco, Brazil |
Frank Stinar | University of Illinois Urbana-Champaign, USA |
Jun-Ming Su | National University of Tainan, Taiwan |
Ashwin T S | Vanderbilt University, USA |
Anaïs Tack | KU Leuven, Belgium |
Ling Tan | Australian Council for Educational Research, Australia |
Michelle Taub | University of Central Florida, USA |
Khushboo Thaker | University of Pittsburgh, USA |
Craig Thompson | The University of British Columbia, Canada |
Emiko Tsutsumi | The University of Electro-Communications, Japan |
Maya Usher | Technion, Israel |
Masaki Uto | The University of Electro-Communications, Japan |
Oswaldo Velez-Langs | Universidad de Cordoba, Colombia |
Rémi Venant | LIUM - Le Mans Université, France |
Markel Vigo | The University of Manchester, UK |
Alessandro Vivas | UFVJM, Brazil |
Tuyet-Trinh Vu | SOICT-HUST, Viet Nam |
Deliang Wang | The University of Hong Kong, Hong Kong |
Daniel Weitekamp | Carnegie Mellon University, USA |
Jacob Whitehill | Worcester Polytechnic Institute, USA |
Alistair Willis | The Open University, UK |
Aaron Wong | University of Minnesota, USA |
Beverly Park Woolf | University of Massachusetts, USA |
Peter Wulff | Heidelberg University of Education, Germany |
Elad Yacobson | Weizmann Institute of Science, Israel |
Amelia Zafra Gómez | University of Cordoba, Spain |
Andrew Zamecnik | University of South Australia, Australia |
Alfredo Zapata González | Universidad Autónoma de Yucatán, Mexico |
Diego Zapata-Rivera | Educational Testing Service, USA |
Jiayi Zhang | University of Pennsylvania, USA |
Yingbin Zhang | South China Normal University, China |
Jia Zhu | The Ohio State University, USA |
Craig Zilles | University of Illinois at Urbana-Champaign, USA |
Sponsors
Platinum
Gold
Silver
Bronze
Scholarship and Award Support
Keynote Talks
Growing with AI: A Dynamic Ecosystem for Digital Learning and Assessment
Jill Burstein & Duolingo, USA
The broader “why” in educational data mining is to enhance education with an eye toward improving learner and test-taker experiences and success. To do this, our communities use generative AI, classical AI/machine learning, and statistical and psychometric methods to model learner and test-taker data (such as, data from student-tutor dialogs, writing responses, and clickstream data). Patterns that emerge from the data enable us, for example, to better understand learning behaviors, predict academic performance, and create adaptive learning and assessment experiences that foster student and test-taker success. Thinking about our research, it is important to ask: What is my “why”? And, how does my work create impact within a dynamic learning and assessment ecosystem that includes task design, measurement, security, learner and test-taker experience, and human-centered, responsible AI? Drawing from my assessment background, I will present a theoretical assessment ecosystem, highlighting key components, leverage points, and a human-centered AI approach for accurate and responsible data-driven outcomes. Using concrete examples drawn from language assessment, the talk will examine the intersection of assessment theory and practice. Expanding on this, I will discuss how to adapt the ecosystem for learning, and the importance of the ecosystem as a growth mechanism that supports continuous AI growth and use in digital learning and assessment research and systems development. The talk will offer a perspective about how the ecosystem can help us effectively implement our “Why?”, producing theoretically sound, ethical, and impactful learning and assessment research and systems.
The Next Educational Revolution: Grand Challenges for Learning @ Scale in the Generative AI Era
Chris Piech & Stanford University, USA
The community working on learning at scale has made tremendous progress over the last decade, successfully achieving many of our previously stated grand challenges. As we enter the Generative AI era, what new ambitious milestones should we shoot for to make progress towards the joyful, high-quality education at scale for all learners? How can we get ahead of the curve of the disruption that could come to assessment and jobs? This talk will explore several potential objectives, including scaling human teaching, developing effective generative AI tools, reaching new heights in student understanding, and addressing a major persistent constraint: student motivation.
Prof. Ram Kumar EDM Test of Time Award Talk
Affect Detectors Have Scaled in Research but not in Practice
Ryan S. Baker & University of Pennsylvania, USA
In 2012, we presented one of the first papers demonstrating sensor-free affect detection. This paper established that affect detection was possible using just the interaction logs from a digital learning platform; and it validated the detectors using unseen data from humans conducting classroom observations, establishing a paradigm for the development and validation of these types of EDM models. Since then, dozens of affect detectors have been developed by scholars around the world, providing increasingly high-quality measurements of students’ boredom, frustration, confusion, and engaged concentration/flow, as well as occasionally other affective states such as anxiety and delight. The resultant detectors have been used in a large range of studies investigating affect, from fine-grained studies on affect dynamics, to research on the interaction between affect and behavior or other processes such as self-regulated learning, to research on the longitudinal implications of negative affect. And yet, unlike other EDM algorithms and models — unlike, say, knowledge tracing or dropout prediction — affect detectors are generally not in scaled deployment in real-life learning technologies. Individual papers have demonstrated the potential of affect detection for benefiting learners, but the adaptivity or reporting demonstrated in these papers has not made it into the ongoing functionality of learning systems used at scale.
EDM Data Set Award Talk
PERSUADE (Persuasive Essays for Rating, Selecting, and Understanding Argumentative and Discourse Elements)
L. Burleigh & The Learning Agency, USA
The PERSUADE dataset provides access to comprehensive data such as labels for more than 14,000 essays, including the various argumentative and rhetorical elements contained within each essay response. It also includes the effectiveness rating of these discourse elements, holistic quality scores for the essay responses, and student demographic information that includes grade level, race/ethnicity, economic background, and more. The dataset was developed as a part of the Feedback Prize project, an initiative by Georgia State University and The Learning Agency Lab. The goal of the prize is to spur the development of open-source algorithms in assisted writing feedback tools and help struggling students dramatically improve in writing. The dataset was used in a series of Kaggle challenges, which collectively drew over 10,000 participants. Because the data was publicly shared for research and other uses after the data science competition series, researchers have been able to develop more algorithms for writing feedback outside of competition. Additionally, since the PERSUADE dataset included strong diversity and fairness measures, it has also been sought out by the Gemini AI team at Google to help provide better help to students when students submit essays through their chatbot.
JEDM Talks
Using a Randomized Experiment to Compare Mastery Learning Thresholds
Jeffrey Matayoshi | McGraw Hill ALEKS, USA |
Eric Cosyn | McGraw Hill ALEKS, USA |
Hasan Uzun | McGraw Hill ALEKS, USA |
Eyad Kurd-Misto | McGraw Hill ALEKS, USA |
Many modern adaptive learning and intelligent tutoring systems implement the principles of mastery learning, where a student must demonstrate mastery of core prerequisite material before working on subsequent content within the system. Typically in such cases, a set of rules or algorithms is used to determine if a student has sufficiently mastered the concepts in a topic. In a previous work, we used a quasi-experimental design to investigate the relationship between two different mastery learning thresholds and the forgetting of the learned material. As a follow-up to this initial study, in the present work we analyze the results from a randomized experiment—or, A/B test—directly comparing these two mastery learning thresholds. These latest results seemingly agree with those from our initial study, giving evidence for the validity of the conclusions from our original quasi-experiment. In particular, we find that although students who learn with the higher mastery threshold are less likely to forget the learned knowledge, over time this difference decreases. Additionally, we build on these analyses by looking at how the relationships between the mastery thresholds change based on other factors, such as the amount of struggle students experience while learning or the subject matter being covered.
Evaluating the Effects of Assignment Report Usage on Student Outcomes in an Intelligent Tutoring System: A Randomized-Encouragement Design
Wen Chiang Lim | Worcester Polytechnic Institute, USA |
Neil T. Heffernan | Worcester Polytechnic Institute, USA |
Adam Sales | Worcester Polytechnic Institute, USA |
As online learning platforms become more popular and deeply integrated into education, understanding their effectiveness and what drives that effectiveness becomes increasingly important. While there is extensive prior research illustrating the benefits of intelligent tutoring systems (ITS) for student learning, there is comparatively less focus on how teachers’ use of ITS impacts student outcomes. Much existing research on teachers’ ITS usage relies on qualitative studies, small-scale experiments, or survey data, making it difficult to identify the causal effects of their engagement with these systems. To bridge this gap, we conducted a study using a randomized encouragement design on an online mathematics platform, where teachers were randomly assigned to one of two groups: an encouragement group or a control group. Teachers in the encouragement group received a popup prompt urging them to explore the assignment report after they created an assignment, while those in the control group did not receive any additional prompts. The study focused exclusively on teachers new to the platform, as this group was expected to be most influenced by the encouragement prompt. The findings show that viewing the assignment report did not significantly impact the percentage of students who started the next assignment or their value-added scores. However, it did lead to a notable increase in the percentage of students completing the next assignment. This effect, confirmed using the Anderson-Rubin test (which is robust against weak instruments), demonstrates a measurable causal relationship between teachers’ use of assignment reports and student outcomes. Based on data from 330 teachers, this large-scale study sheds light on the causal effects of teachers engaging with ITS data on student learning and adds to the growing evidence base for effective teaching strategies in online learning environments.
Predicting Perceived Text Complexity: The Role of Person-Related Features in Profile-Based Models
Boris Thome | Heinrich Heine University Düsseldorf, Germany |
Friederike Hertweck | RWI - Leibniz Institute for Economic Research, Germany |
Stefan Conrad | Heinrich Heine University Düsseldorf, Germany |
Text complexity is inherently subjective, as it is not solely determined by linguistic properties but also shaped by the reader’s perception. Factors such as prior knowledge, language proficiency and cognitive abilities influence how individuals assess the difficulty of a text. Existing methods for measuring text complexity commonly rely on quantitative linguistic features and ignore differences in the readers’ backgrounds. In this paper, we evaluate several machine learning models that determine the complexity of texts as perceived by teenagers in high school prior to deciding on their post-secondary pathways. Wecollected and publicly released a dataset from German schools, where 193 students with diverse demographic backgrounds, school grades and language abilities annotated a total of 3,954 German sentences. The text corpus is based on official study guides authored by German governmental authorities. In contrast to existing methods of determining text complexity, we build a model that is specialized to behave like the target audience, thereby accounting for the diverse backgrounds of the readers. The annotations indicate that students generally perceived the texts as significantly simpler than suggested by the Flesch-Reading-Ease score. We show that K-Nearest-Neighbors, Multilayer Perceptron and ensemble models perform well in predicting the subjectively perceived text complexity. Furthermore, SHapley Additive exPlanation (SHAP) values reveal that these perceptions not only differ by the text’s linguistic features but also by the students’ mother tongue, gender and self-estimation of German language skills. We also implement role-play prompting with ChatGPT and Claude and show that state-of-the-art large language models have difficulties in accurately assessing perceived text complexity from a student’s perspective. This work thereby contributes to the growing field of adjusting text complexity to the needs of the target audience that goes beyond quantitative linguistic features.
Developing a Feedback Taxonomy for Math: A Synergy of Perspectives through Data Mining Methods
Seiyon Lee | University of Florida, Gainesville, USA |
Sami Baral | Worcester Polytechnic Institute, USA |
Hongming (Chip) Li | University of Florida, USA |
Li Cheng | University of North Texas, USA |
Shan Zhang | University of Florida, USA |
Carly Siegel Thorp | Worcester Polytechnic Institute, USA |
Jennifer St. John | Worcester Polytechnic Institute, USA |
Tamisha Thompson | Worcester Polytechnic Institute,USA |
Neil Heffernan | Worcester Polytechnic Institute, USA |
Anthony F. Botelho | University of Florida, USA |
Teachers often use open-ended questions to monitor and assess students’ deeper understanding of the content. These questions are particularly useful in mathematics education as they provide more insights into students’ problem-solving processes and strategies, which is not possible through close-ended questions. While open-ended questions has the potential to promote students’ conceptual and procedural knowledge, they also present challenges for teachers to assess students’ responses and providing feedback on all problems they solve. In recent years, a growing body of research has explored methods to automatically grade students’ open responses. However, limited attention has been given to augmenting teachers’ qualitative (textual) feedback for open-ended questions in math. Also, there is a lack of consideration for how perceptions and judgments of feedback content can vary between individuals involved in designing and developing tools for automated assessment, such as teachers and researchers. In this paper, we propose a taxonomy of teacher feedback using several data mining methods (i.e., correlation, factor analysis, cluster analysis), which can serve as an analytical framework to understand the complex structure of teacher feedback. Specifically, we extract diverse perspectives from researchers and teachers who identified types of feedback content by feeding them as distinct features when building each layer of the taxonomy. With a taxonomy developed from a synergy of diverse perspectives and data mining methods, our feedback taxonomy offers a foundation for generating feedback in advanced learning systems, paving the way for tools that can design and deliver scalable, meaningful, and pedagogically effective feedback.