Introduction to the Proceedings

Preface

The University of Palermo is proud to host the 18th International Conference on Educational Data Mining (EDM) in Palermo, Italy, from July 20 to July 23, 2025. EDM is the annual flagship conference of the International Educational Data Mining Society. This year’s theme is New Goals, New Measurements, New Incentives to Learn. The theme highlights the shift toward an era dominated by artificial intelligence - where machines can surpass human performance in numerous cognitive tasks - and highlights the imperative for our educational systems to evolve accordingly.

The scientific program for EDM2025 includes:

The workshops are:

  1. 9th Educational Data Mining in Computer Science Education (CSEDM) Workshop.

  2. 2nd Human-Centric eXplainable AI in Education (HEXED) Workshop.

  3. 2nd Workshop on Educational Data Mining in Writing and Literacy Instruction.

  4. Interactive Workshop: Multimodal, Multiparty Learning Analytics (MMLA).

  5. CausalEDM: Linking Innovations in Instructional Design and the Complex Behaviors that Underlie Learning Processes and Outcomes.

The tutorials are:

  1. Bringing Generative Artificial Intelligence (GenAI) to Education.

  2. Leveraging KOALA for Programming Data Collection: A Half-Day Tutorial for Research Application.

  3. Applying DebiasEd: A Package for Mitigating Unfairness in Educational Data

This year, EDM received 105 submissions to the Full Papers track (10 pages), 81 to the Short Papers track (6 pages), and 37 to the Posters and Demos track (4 pages). The program committee accepted 29 full papers (27.6% acceptance rate), 24 short papers (29.6% acceptance rate), 19 poster and demo papers, and 3 industry track papers. Furthermore, EDM 2025 also continued its tradition of providing opportunities for young researchers to present their work and receive feedback from peers and senior researchers. The Doctoral Consortium track this year featured 5 participants. We are also proud to offer travel sponsorships to 24 students who will attend thanks to this support.

We thank the sponsors of EDM 2025 for their generous support. We thank all the authors who submitted their work and the program committee members for their expert inputs. We thank the members of the organization committee for their leadership that made this conference possible. And, a big thank you to the local organizing committee who made this event memorable.

Caitlin Mills University of Minnesota, USA Program Chair
Giora Alexandron Weizmann Institute of Science, Israel Program Chair
Davide Taibi CNR-ITD, Italy General Chair
Giosuè Lo Bosco University of Palermo, Italy General Chair
Luc Paquette University of Illinois Urbana-Champaign, USA General Chair

July 10, 2025
Palermo, Italy

Organizing Committee

General Chairs

Program Chairs

Awards Chairs

Doctoral Consortium Chairs

Equity, Diversity, Inclusion, and Accessibility Chairs

Industry Track Chairs

JEDM Track Chairs

Poster & Demo Track Chairs

Proceedings Chairs

Scholarship Chairs

Social Media & Publicity Chairs

Sponsorship Chairs

Student Volunteer Chair

Web Chairs

Workshop & Tutorial Chairs

IEDMS Officers

Luc Paquette University of Illinois Urbana-Champaign, USA President
Anna Rafferty Carleton College, USA Treasurer

IEDMS Board of Directors

Rakesh Agrawal Data Insights Laboratories, USA
Tiffany Barnes North Carolina State University, USA
Nigel Bosch University of Illinois Urbana-Champaign, USA
Neil Heffernan Worcester Polytechnic Institute, USA
Sharon Hsiao Santa Clara University, USA
Tanja Käser EPFL, Switzerland
Kenneth Koedinger Carnegie Mellon University, USA
Ramkumar Rajendran Indian Institute of Technology Bombay, India

Program Committee

Senior Program Committee Members

Bita Akram North Carolina State University, USA
Vincent Aleven Carnegie Mellon University, USA
Roger Azevedo University of Central Florida, USA
Ryan Baker University of Pennsylvania, USA
Tiffany Barnes North Carolina State University, USA
Nigel Bosch University of Illinois Urbana-Champaign, USA
Anthony F. Botelho University of Florida, USA
Jesus G. Boticario UNED, Spain
François Bouchet Sorbonne Université – LIP6, France
Alex Bowers Columbia University, USA
Christopher Brooks University of Michigan, USA
Alexandra Cristea Durham University, UK
Michel Desmarais École Polytechnique de Montréal, Canada
Fabiano Dorça Universidade Federal de Uberlândia, Brazil
Michael Eagle George Mason University, USA
Vanessa Echeverria Monash University, Australia
Yo Ehara Tokyo Gakugei University, Japan
Stephen Fancsali Carnegie Learning, Inc., USA
Mingyu Feng WestEd, USA
Carol Forsyth Educational Testing Service, USA
Dragan Gasevic Monash University, Australia
Neil Heffernan Worcester Polytechnic Institute, USA
Sharon Hsiao Santa Clara University, USA
Paul Inventado California State University Fullerton, USA
Johan Jeuring Utrecht University, Netherlands
Srecko Joksimovic Education Future, University of South Australia, Australia
Jelena Jovanovic University of Belgrade, Serbia
Kirsty Kitto University of Technology Sydney, Australia
Rene Kizilcec Cornell University, USA
Simon Knight University of Technology Sydney, Australia
Irena Koprinska The University of Sydney, Australia
Tanja Käser EPFL, Switzerland
Sébastien Lallé Sorbonne Université, France
Andrew Lan University of Massachusetts Amherst, USA
Jeonghyun Lee Georgia Institute of Technology, USA
James Lester North Carolina State University, USA
Collin Lynch North Carolina State University, USA
Mirko Marras University of Cagliari, Italy
Roberto Angel Melendez Instituto Tecnológico Superior de Misantla, Mexico
Agathe Merceron Berlin University of Applied Sciences, Germany
Roger Nkambou Université du Québec à Montréal, Canada
Jaclyn Ocumpaugh University of Pennsylvania, USA
Andrew Olney University of Memphis, USA
Benjamin Paaßen Bielefeld University, Germany
Ranilson Paiva Universidade Federal de Alagoas, Brazil
Luc Paquette University of Illinois Urbana-Champaign, USA
Zach Pardos University of California, Berkeley, USA
Radek Pelánek Masaryk University Brno, Czechia
Niels Pinkwart Humboldt-Universität zu Berlin, Germany
Agoritsa Polyzou Florida International University, USA
Thomas Price North Carolina State University, USA
Mar Pérez-Sanagustín Université Paul Sabatier Toulouse III, France
Anna Rafferty Carleton College, USA
R Rajalakshmi VIT University, Chennai Campus, India
Ramkumar Rajendran IIT Bombay, India
Maria Mercedes T. Rodrigo Ateneo de Manila University, Philippines
Cristobal Romero University of Cordoba, Spain
Shazia Sadiq The University of Queensland, Australia
Sherry Sahebi University at Albany – SUNY, USA
Niels Seidel FernUniversität in Hagen, Germany
Atsushi Shimada Kyushu University, Japan
Sergey Sosnovsky Utrecht University, Netherlands
Tiffany Tang Wenzhou-Kean University, China
Stefan Trausan-Matu University Politehnica of Bucharest, Romania
Maomi Ueno The University of Electro-Communications, Japan
Sebastián Ventura University of Córdoba, Spain
Jill-Jênn Vie Inria Lille, France
Stephan Weibelzahl Private University of Applied Sciences, Germany

Regular Program Committee Members

Faruk Ahmed The University of Memphis, USA
Ali Al-Zawqari Vrije Universitiet Brussel, Belgium
Nazia Alam North Carolina State University, USA
Laia Albó Universitat de Vic, Spain
Laura Allen University of Minnesota, USA
Claudia Antunes Universidade de Lisboa, Portugal
Simón Pedro Arguijo Tecnológico Nacional de México, Mexico
Jose Manuel Azevedo ISCAP, Portugal
Aicha Bakki LERIA – Angers University, France
Ayan Banerjee Arizona State University, USA
Denilson Barbosa University of Alberta, Canada
Abhinava Barthakur University of South Australia, Australia
Marie Bexte FernUniversität in Hagen, Germany
Anis Bey La Rochelle University, France
Plaban Kumar Bhowmick Indian Institute of Technology Kharagpur, India
Nathaniel Blanchard Colorado State University, USA
Maria Bolsinova Tilburg University, Netherlands
Geoffray Bonnin Université de Lorraine – LORIA, France
Conrad Borchers Carnegie Mellon University, USA
Mamadou Bousso Thies University, Senegal
Keith Brawner United States Army Research Laboratory, USA
Julien Broisin IRIT, Université Toulouse III – Paul Sabatier, France
Armelle Brun Université de Lorraine – LORIA, France
Minghao Cai University of Alberta, Canada
Renza Campagni Università degli Studi di Firenze, Italy
Meng Cao Carnegie Mellon University, USA
Paulo Carvalho Carnegie Mellon University, USA
Mehmet Celepkolu University of Florida, USA
Guanliang Chen Monash University, Australia
Heeryung Choi University of Minnesota, USA
Irene-Angelica Chounta University of Duisburg-Essen, Germany
Wei Chu The University of Memphis, USA
Cheng-Yu Chung Chunghwa Telecom Laboratories (CHTTL), Taiwan
Ruth Cobos Universidad Autónoma de Madrid, Spain
Maria Constantino González Tecnológico de Monterrey Campus Laguna, Mexico
Maria Cutumisu McGill University, Canada
Cynthia D’Angelo University of Illinois at Urbana-Champaign, USA
Jesper Dannath Universität Bielefeld, Germany
Syaamantak Das Indian Institute of Technology Bombay, India
M Ali Akber Dewan Athabasca University, Canada
Nicholas Diana Colgate University, USA
Fahima Djelil IMT Atlantique, France
Cristina Dumdumaya University of Southeastern Philippines, Philippines
Nghia Duong-Trung German Research Centre for AI, Germany
Luke Eglington Amplify Education Inc., USA
Effat Farhana Vanderbilt University, USA
Márcia Fernandes Federal University of Uberlândia, Brazil
Nigel Fernandez University of Massachusetts Amherst, USA
Jeremiah Folsom-Kovarik Soar Technology, Inc., USA
Davide Fossati Emory University, USA
Kazuma Fuchimoto The University of Electro-Communications, Japan
Hagit Gabbay School of Education, Tel Aviv University, Israel
Wenbin Gan NIICT, Japan
Adam Gaweda North Carolina State University, USA
Aldo Gordillo Universidad Politécnica de Madrid (UPM), Spain
Guher Gorgun University of Alberta, Canada
Sabine Graf Athabasca University, Canada
Julio Guerra Universidad Austral de Chile, Chile
Ella Haig University of Portsmouth, UK
Feifei Han Australian Catholic University, Australia
Ching Nam Hang Saint Francis University, Hong Kong
Jiangang Hao Educational Testing Service, USA
Jason Harley McGill University, Canada
Carl Haynes-Magyar Carnegie Mellon University, USA
Surina He University of Alberta, Canada
Sami Heikkinen LAB University of Applied Sciences, Finland
Nicolas Hernandez Nantes Université, France
Martin Hlosta Swiss Distance University of Applied Sciences, Switzerland
Brahim Hmedna Ibn Zohr University Science Agadir, Morocco
Anett Hoppe Leibniz Universität Hannover, Germany
Paul Hur Free University of Berlin, Germany
Stephen Hutt University of Denver, USA
Sébastien Iksal LIUM – Le Mans Université, France
Vladimir Ivančević University of Novi Sad, Serbia
Hyeji Jang Ewha Womans University, South Korea
Lan Jiang University of Illinois Urbana-Champaign, USA
Yang Jiang Columbia University, USA
Jina Kang University of Illinois Urbana-Champaign, USA
Mohammad Khalil University of Bergen, Norway
Ean Teng Khor Nanyang Technological University, Singapore
Ekaterina Kochmar MBZUAI, United Arab Emirates
Sotiris Kotsiantis University of Patras, Greece
Vitomir Kovanovic The University of South Australia, Australia
Milos Kravcik DFKI GmbH, Germany
Swathi Krishnaraja University of Potsdam, Germany
Amruth Kumar Ramapo College of New Jersey, USA
Hatim Lahza The University of Queensland, Australia
Hollis Lai University of Alberta, Canada
Juan Alfonso Lara Torralbo Universidad de Córdoba, Spain
Mikel Larrañaga University of the Basque Country, Spain
Tai Le Quy University of Koblenz, Germany
Alwyn Vwen Yen Lee Nanyang Technological University, Singapore
Marie Lefevre LIRIS – Université Lyon 1, France
Arun Balajiee Lekshmi Narayanan University of Pittsburgh, USA
Jiawei Li Nanyang Technological University, China
Jionghao Lin Carnegie Mellon University, USA
Zhexiong Liu University of Pittsburgh, USA
Yu Lu Beijing Normal University, China
Ivan Luković University of Belgrade, Serbia
Nick Lytle Georgia Institute of Technology, USA
Sonsoles López-Pernas University of Eastern Finland, Finland
Boxuan Ma Kyushu University, Japan
Qiang Ma Kyoto Institute of Technology, Japan
Qianou Ma Carnegie Mellon University, USA
Yael Feldman Maggor KTH Royal Institute of Technology, Sweden
Aditi Mallavarapu University of Illinois at Chicago, USA
Jeffrey Matayoshi McGraw Hill ALEKS, USA
Ahmad Mel IDLab, Ghent University, Belgium
Victor Menendez-Dominguez Universidad Autónoma de Yucatán, Mexico
Donatella Merlini Università di Firenze, Italy
Tsunenori Mine Kyushu University, Japan
Tsubasa Minematsu Kyushu University, Japan
Sein Minn INRIA, France
Phaedra Mohammed The University of the West Indies, Trinidad and Tobago
Luis Alberto Morales Rosales U. Michoacana de San Nicolás de Hidalgo, Mexico
Pedro Manuel Moreno-Marcos Universidad Carlos III de Madrid, Spain
Kousuke Mouri Tokyo University of Agriculture and Technology, Japan
Pedro J. Muñoz-Merino Universidad Carlos III de Madrid, Spain
Tanya Nazaretsky EPFL, Israel
Huy Nguyen University of Pittsburgh, USA
Ange Adrienne Nyamen Tato Université Laval, Canada
Teresa Ober Educational Testing Service, USA
Püren Öncel University of Minnesota Twin Cities, USA
Korinn Ostrow Edmentum, USA
Tounwendyam F. Ouedraogo Université Norbert ZONGO, Burkina Faso
Maciej Pankiewicz University of Pennsylvania, Poland
Yancy Vance Paredes Arizona State University, USA
Philip I. Pavlik Jr. University of Memphis, USA
Jorge Poco Fundação Getulio Vargas, Brazil
Paul Stefan Popescu University of Craiova, Romania
Oleksandra Poquet Technical University of Munich, Germany
David Pritchard Massachusetts Institute of Technology, USA
Napol Rachatasumrit Carnegie Mellon University, USA
Narjes Rohani University of Edinburgh, UK
José Raúl Romero University of Cordoba, Spain
Joshua Rosenberg University of Tennessee, Knoxville, USA
Daniela Rotelli University of Pisa, Italy
Vasile Rus The University of Memphis, USA
Mirka Saarela University of Jyväskylä, Finland
Asaf Salman Weizmann Institute of Science, Israel
Sreecharan Sankaranarayanan Carnegie Mellon University, USA
Petra Sauer Beuth University of Applied Sciences, Germany
Eliana Scheihing Universidad Austral de Chile, Chile
Robin Matthias Schmucker Carnegie Mellon University, Germany
Filippo Sciarrone Universitas Mercatorum, Italy
Lele Sha Monash University, Australia
Lei Shi Newcastle University, UK
Aditi Singh Cleveland State University, USA
Daevesh Singh Indian Institute of Technology, India
Maxim Skryabin Skryabin.PRO, Russia
Stefan Slater Teachers College, USA
Álvaro Sobrinho Federal University of the Agreste of Pernambuco, Brazil
Frank Stinar University of Illinois Urbana-Champaign, USA
Jun-Ming Su National University of Tainan, Taiwan
Ashwin T S Vanderbilt University, USA
Anaïs Tack KU Leuven, Belgium
Ling Tan Australian Council for Educational Research, Australia
Michelle Taub University of Central Florida, USA
Khushboo Thaker University of Pittsburgh, USA
Craig Thompson The University of British Columbia, Canada
Emiko Tsutsumi The University of Electro-Communications, Japan
Maya Usher Technion, Israel
Masaki Uto The University of Electro-Communications, Japan
Oswaldo Velez-Langs Universidad de Cordoba, Colombia
Rémi Venant LIUM - Le Mans Université, France
Markel Vigo The University of Manchester, UK
Alessandro Vivas UFVJM, Brazil
Tuyet-Trinh Vu SOICT-HUST, Viet Nam
Deliang Wang The University of Hong Kong, Hong Kong
Daniel Weitekamp Carnegie Mellon University, USA
Jacob Whitehill Worcester Polytechnic Institute, USA
Alistair Willis The Open University, UK
Aaron Wong University of Minnesota, USA
Beverly Park Woolf University of Massachusetts, USA
Peter Wulff Heidelberg University of Education, Germany
Elad Yacobson Weizmann Institute of Science, Israel
Amelia Zafra Gómez University of Cordoba, Spain
Andrew Zamecnik University of South Australia, Australia
Alfredo Zapata González Universidad Autónoma de Yucatán, Mexico
Diego Zapata-Rivera Educational Testing Service, USA
Jiayi Zhang University of Pennsylvania, USA
Yingbin Zhang South China Normal University, China
Jia Zhu The Ohio State University, USA
Craig Zilles University of Illinois at Urbana-Champaign, USA

Sponsors

Platinum


Google logo

Gold


Gates Foundation logo

Silver


Hewlett-Packard Enterprise logo

Bronze


Eedi logo VitalSource logo Duolingo logo

Scholarship and Award Support


Prof. Ram Kumar Memorial Foundation logo

Keynote Talks

Growing with AI: A Dynamic Ecosystem for Digital Learning and Assessment

Jill Burstein & Duolingo, USA

The broader “why” in educational data mining is to enhance education with an eye toward improving learner and test-taker experiences and success. To do this, our communities use generative AI, classical AI/machine learning, and statistical and psychometric methods to model learner and test-taker data (such as, data from student-tutor dialogs, writing responses, and clickstream data). Patterns that emerge from the data enable us, for example, to better understand learning behaviors, predict academic performance, and create adaptive learning and assessment experiences that foster student and test-taker success. Thinking about our research, it is important to ask: What is my “why”? And, how does my work create impact within a dynamic learning and assessment ecosystem that includes task design, measurement, security, learner and test-taker experience, and human-centered, responsible AI? Drawing from my assessment background, I will present a theoretical assessment ecosystem, highlighting key components, leverage points, and a human-centered AI approach for accurate and responsible data-driven outcomes. Using concrete examples drawn from language assessment, the talk will examine the intersection of assessment theory and practice. Expanding on this, I will discuss how to adapt the ecosystem for learning, and the importance of the ecosystem as a growth mechanism that supports continuous AI growth and use in digital learning and assessment research and systems development. The talk will offer a perspective about how the ecosystem can help us effectively implement our “Why?”, producing theoretically sound, ethical, and impactful learning and assessment research and systems.

The Next Educational Revolution: Grand Challenges for Learning @ Scale in the Generative AI Era

Chris Piech & Stanford University, USA

The community working on learning at scale has made tremendous progress over the last decade, successfully achieving many of our previously stated grand challenges. As we enter the Generative AI era, what new ambitious milestones should we shoot for to make progress towards the joyful, high-quality education at scale for all learners? How can we get ahead of the curve of the disruption that could come to assessment and jobs? This talk will explore several potential objectives, including scaling human teaching, developing effective generative AI tools, reaching new heights in student understanding, and addressing a major persistent constraint: student motivation.

Prof. Ram Kumar EDM Test of Time Award Talk

Affect Detectors Have Scaled in Research but not in Practice

Ryan S. Baker & University of Pennsylvania, USA

In 2012, we presented one of the first papers demonstrating sensor-free affect detection. This paper established that affect detection was possible using just the interaction logs from a digital learning platform; and it validated the detectors using unseen data from humans conducting classroom observations, establishing a paradigm for the development and validation of these types of EDM models. Since then, dozens of affect detectors have been developed by scholars around the world, providing increasingly high-quality measurements of students’ boredom, frustration, confusion, and engaged concentration/flow, as well as occasionally other affective states such as anxiety and delight. The resultant detectors have been used in a large range of studies investigating affect, from fine-grained studies on affect dynamics, to research on the interaction between affect and behavior or other processes such as self-regulated learning, to research on the longitudinal implications of negative affect. And yet, unlike other EDM algorithms and models — unlike, say, knowledge tracing or dropout prediction — affect detectors are generally not in scaled deployment in real-life learning technologies. Individual papers have demonstrated the potential of affect detection for benefiting learners, but the adaptivity or reporting demonstrated in these papers has not made it into the ongoing functionality of learning systems used at scale.

EDM Data Set Award Talk

PERSUADE (Persuasive Essays for Rating, Selecting, and Understanding Argumentative and Discourse Elements)

L. Burleigh & The Learning Agency, USA

The PERSUADE dataset provides access to comprehensive data such as labels for more than 14,000 essays, including the various argumentative and rhetorical elements contained within each essay response. It also includes the effectiveness rating of these discourse elements, holistic quality scores for the essay responses, and student demographic information that includes grade level, race/ethnicity, economic background, and more. The dataset was developed as a part of the Feedback Prize project, an initiative by Georgia State University and The Learning Agency Lab. The goal of the prize is to spur the development of open-source algorithms in assisted writing feedback tools and help struggling students dramatically improve in writing. The dataset was used in a series of Kaggle challenges, which collectively drew over 10,000 participants. Because the data was publicly shared for research and other uses after the data science competition series, researchers have been able to develop more algorithms for writing feedback outside of competition. Additionally, since the PERSUADE dataset included strong diversity and fairness measures, it has also been sought out by the Gemini AI team at Google to help provide better help to students when students submit essays through their chatbot.

JEDM Talks

Using a Randomized Experiment to Compare Mastery Learning Thresholds

Jeffrey Matayoshi McGraw Hill ALEKS, USA
Eric Cosyn McGraw Hill ALEKS, USA
Hasan Uzun McGraw Hill ALEKS, USA
Eyad Kurd-Misto McGraw Hill ALEKS, USA

Many modern adaptive learning and intelligent tutoring systems implement the principles of mastery learning, where a student must demonstrate mastery of core prerequisite material before working on subsequent content within the system. Typically in such cases, a set of rules or algorithms is used to determine if a student has sufficiently mastered the concepts in a topic. In a previous work, we used a quasi-experimental design to investigate the relationship between two different mastery learning thresholds and the forgetting of the learned material. As a follow-up to this initial study, in the present work we analyze the results from a randomized experiment—or, A/B test—directly comparing these two mastery learning thresholds. These latest results seemingly agree with those from our initial study, giving evidence for the validity of the conclusions from our original quasi-experiment. In particular, we find that although students who learn with the higher mastery threshold are less likely to forget the learned knowledge, over time this difference decreases. Additionally, we build on these analyses by looking at how the relationships between the mastery thresholds change based on other factors, such as the amount of struggle students experience while learning or the subject matter being covered.

Evaluating the Effects of Assignment Report Usage on Student Outcomes in an Intelligent Tutoring System: A Randomized-Encouragement Design

Wen Chiang Lim Worcester Polytechnic Institute, USA
Neil T. Heffernan Worcester Polytechnic Institute, USA
Adam Sales Worcester Polytechnic Institute, USA

As online learning platforms become more popular and deeply integrated into education, understanding their effectiveness and what drives that effectiveness becomes increasingly important. While there is extensive prior research illustrating the benefits of intelligent tutoring systems (ITS) for student learning, there is comparatively less focus on how teachers’ use of ITS impacts student outcomes. Much existing research on teachers’ ITS usage relies on qualitative studies, small-scale experiments, or survey data, making it difficult to identify the causal effects of their engagement with these systems. To bridge this gap, we conducted a study using a randomized encouragement design on an online mathematics platform, where teachers were randomly assigned to one of two groups: an encouragement group or a control group. Teachers in the encouragement group received a popup prompt urging them to explore the assignment report after they created an assignment, while those in the control group did not receive any additional prompts. The study focused exclusively on teachers new to the platform, as this group was expected to be most influenced by the encouragement prompt. The findings show that viewing the assignment report did not significantly impact the percentage of students who started the next assignment or their value-added scores. However, it did lead to a notable increase in the percentage of students completing the next assignment. This effect, confirmed using the Anderson-Rubin test (which is robust against weak instruments), demonstrates a measurable causal relationship between teachers’ use of assignment reports and student outcomes. Based on data from 330 teachers, this large-scale study sheds light on the causal effects of teachers engaging with ITS data on student learning and adds to the growing evidence base for effective teaching strategies in online learning environments.

Predicting Perceived Text Complexity: The Role of Person-Related Features in Profile-Based Models

Boris Thome Heinrich Heine University Düsseldorf, Germany
Friederike Hertweck RWI - Leibniz Institute for Economic Research, Germany
Stefan Conrad Heinrich Heine University Düsseldorf, Germany

Text complexity is inherently subjective, as it is not solely determined by linguistic properties but also shaped by the reader’s perception. Factors such as prior knowledge, language proficiency and cognitive abilities influence how individuals assess the difficulty of a text. Existing methods for measuring text complexity commonly rely on quantitative linguistic features and ignore differences in the readers’ backgrounds. In this paper, we evaluate several machine learning models that determine the complexity of texts as perceived by teenagers in high school prior to deciding on their post-secondary pathways. Wecollected and publicly released a dataset from German schools, where 193 students with diverse demographic backgrounds, school grades and language abilities annotated a total of 3,954 German sentences. The text corpus is based on official study guides authored by German governmental authorities. In contrast to existing methods of determining text complexity, we build a model that is specialized to behave like the target audience, thereby accounting for the diverse backgrounds of the readers. The annotations indicate that students generally perceived the texts as significantly simpler than suggested by the Flesch-Reading-Ease score. We show that K-Nearest-Neighbors, Multilayer Perceptron and ensemble models perform well in predicting the subjectively perceived text complexity. Furthermore, SHapley Additive exPlanation (SHAP) values reveal that these perceptions not only differ by the text’s linguistic features but also by the students’ mother tongue, gender and self-estimation of German language skills. We also implement role-play prompting with ChatGPT and Claude and show that state-of-the-art large language models have difficulties in accurately assessing perceived text complexity from a student’s perspective. This work thereby contributes to the growing field of adjusting text complexity to the needs of the target audience that goes beyond quantitative linguistic features.

Developing a Feedback Taxonomy for Math: A Synergy of Perspectives through Data Mining Methods

Seiyon Lee University of Florida, Gainesville, USA
Sami Baral Worcester Polytechnic Institute, USA
Hongming (Chip) Li University of Florida, USA
Li Cheng University of North Texas, USA
Shan Zhang University of Florida, USA
Carly Siegel Thorp Worcester Polytechnic Institute, USA
Jennifer St. John Worcester Polytechnic Institute, USA
Tamisha Thompson Worcester Polytechnic Institute,USA
Neil Heffernan Worcester Polytechnic Institute, USA
Anthony F. Botelho University of Florida, USA

Teachers often use open-ended questions to monitor and assess students’ deeper understanding of the content. These questions are particularly useful in mathematics education as they provide more insights into students’ problem-solving processes and strategies, which is not possible through close-ended questions. While open-ended questions has the potential to promote students’ conceptual and procedural knowledge, they also present challenges for teachers to assess students’ responses and providing feedback on all problems they solve. In recent years, a growing body of research has explored methods to automatically grade students’ open responses. However, limited attention has been given to augmenting teachers’ qualitative (textual) feedback for open-ended questions in math. Also, there is a lack of consideration for how perceptions and judgments of feedback content can vary between individuals involved in designing and developing tools for automated assessment, such as teachers and researchers. In this paper, we propose a taxonomy of teacher feedback using several data mining methods (i.e., correlation, factor analysis, cluster analysis), which can serve as an analytical framework to understand the complex structure of teacher feedback. Specifically, we extract diverse perspectives from researchers and teachers who identified types of feedback content by feeding them as distinct features when building each layer of the taxonomy. With a taxonomy developed from a synergy of diverse perspectives and data mining methods, our feedback taxonomy offers a foundation for generating feedback in advanced learning systems, paving the way for tools that can design and deliver scalable, meaningful, and pedagogically effective feedback.