This page holds the proceedings for the 10th International Conference on Educational Data Mining. The conference will be held on June 25 – 28, 2017, in Wuhan, Hubei, China.

Main Proceedings

Citation Information:

Xiangen Hu, Tiffany Barnes, Arnon Hershkovitz and Luc Paquette (eds.) Proceedings of the 10th International Conference on Educational Data Mining.


Workshop Proceedings

Citation Information:

Ran Liu and Michael Eagle (eds.) Proceedings of the EDM 2017 Workshops and Tutorials co-located with the 10th International Conference on Educational Data Mining.

Individual Papers

Invited Talks

Can AI help MOOCs?

Jie Tang

The evolution of virtual tutors, clinicians, and companions: A 20-year perspective on conversational agents in real-world applications

Ronald Cole

JEDM Track Journal Papers

Identifiability of the Bayesian Knowledge Tracing Model

Junchen Feng

RiPLE: Recommendation in Peer-Learning Environments Based on Knowledge Gaps and Interests

Hassan Khosravi, Kendra Cooper and Kirsty Kitto

Modeling Wheel-spinning and Productive Persistence in Skill Builders

Shimin Kai, Ma. Victoria Almeda, Ryan Baker, Nicole Shechtman, Cristina Heffernan and Neil Heffernan

Modeling MOOC Student Behavior With Two-Layer Hidden Markov Models

Chase Geigle and Chengxiang Zhai

Closing the loop: Automated data-driven cognitive model discoveries lead to improved instruction and learning

Ran Liu and Kenneth Koedinger

Full papers

Zone out no more: Mitigating mind wandering during computerized reading

Sidney D’Mello, Caitlin Mills, Robert Bixler and Nigel Bosch

Measuring Similarity of Educational Items Using Data on Learners’ Performance

Jiří Řihák and Radek Pelánek

Adaptive Sequential Recommendation for Discussion Forums on MOOCs using Context Trees

Fei Mi and Boi Faltings

Analysis of problem-solving behavior in open-ended scientific-discovery game challenges

Aaron Bauer, Jeff Flatten and Zoran Popović

The Antecedents of and Associations with Elective Replay in An Educational Game: Is Replay Worth It?

Zhongxiu Liu, Christa Cody, Tiffany Barnes, Collin Lynch and Teomara Rutherford

Grade Prediction with Temporal Course-wise Influence

Zhiyun Ren, Xia Ning and Huzefa Rangwala

Toward the Automatic Labeling of Course Questions for Ensuring their Alignment with Learning Outcomes

S. Supraja, Kevin Hartman, Sivanagaraja Tatinati and Andy Khong

Behavior-Based Latent Variable Model for Learner Engagement

Andrew Lan, Christopher Brinton, Tsung-Yen Yang and Mung Chiang

Efficient Feature Embeddings for Student Classification with Variational Auto-encoders

Severin Klingler, Rafael Wampfler, Tanja Käser, Barbara Solenthaler and Markus Gross

Predicting Short- and Long-Term Vocabulary Learning via Semantic Features of Partial Word Knowledge

Sungjin Nam, Gwen Frishkoff and Kevyn Collins-Thompson

Generalizability of Face-Based Mind Wandering Detection Across Task Contexts

Angela Stewart, Nigel Bosch and Sidney D’Mello

Addressing Student Behavior and Affect with Empathy and Growth Mindset

Shamya Karumbaiah, Rafael Lizarralde, Danielle Allessio, Beverly Woolf and Ivon Arroyo

Epistemic Network Analysis and Topic Modeling for Chat Data from Collaborative Learning Environment

Zhiqiang Cai, Brendan Eagan, Nia Dowell, James Pennebaker, Arthur Graesser and David Shaffer

Towards Closing the Loop: Bridging Machine-induced Pedagogical Policies to Learning Theories

Guojing Zhou, Jianxun Wang, Collin Lynch and Min Chi

On the Influence on Learning of Student Compliance with Prompts Fostering Self-Regulated Learning

Sébastien Lallé, Cristina Conati, Roger Azevedo, Michelle Taub and Nicholas Mudrick

Assessing Computer Literacy of Adults with Low Literacy Skills

Andrew Olney, Dariush Bakhtiari, Daphne Greenberg and Arthur Graesser

Towards reliable and valid measurement of individualized student parameters

Ran Liu and Kenneth Koedinger

The Misidentified Identifiability Problem of Bayesian Knowledge Tracing

Shayan Doroudi and Emma Brunskill

Short papers

An Effective Framework for Automatically Generating and Ranking Topics in MOOC Videos

Jile Zhu, Xiang Li, Zhuo Wang and Ming Zhang

Grouping Students for Maximizing Learning from Peers

Rakesh Agrawal, Sharad Nandanwar and Narasimha Murty Musti

Assessing the Dialogic Properties of Classroom Discourse: Proportion Models for Imbalanced Classes

Andrew Olney, Borhan Samei, Patrick Donnelly and Sidney D’Mello

When and who at risk? Call back at these critical points

Yuntao Li, Chengzhen Fu and Yan Zhang

Characterizing Collaboration in the Pair Program Tracing and Debugging Eye-Tracking Experiment: A Preliminary Analysis

Maureen Villamor and Ma. Mercedes Rodrigo

Linking Language to Math Success in a Blended Course

Scott Crossley, Tiffany Barnes, Collin Lynch and Danielle McNamara

Task and Timing: Separating Procedural and Tactical Knowledge in Student Models

Joshua Cook, Collin Lynch, Andrew Hicks and Behrooz Mostafavi

Evaluation of a Data-driven Feedback Algorithm for Open-ended Programming

Thomas Price, Rui Zhi and Tiffany Barnes

Making the Grade: How Learner Engagement Changes After Passing a Course

David Lang, Ben Domingue, Alex Kindel and Andreas Paepcke

Using a Single Model Trained across Multiple Experiments to Improve the Detection of Treatment Effects

Thanaporn Patikorn, Douglas Selent, Neil Heffernan, Joseph Beck and Jian Zou

Data-Mining Textual Responses to Uncover Misconception Patterns

Joshua Michalenko, Andrew Lan, Andrew Waters, Phillip Grimaldi and Richard Baraniuk

Automated Assessment for Scientific Explanations in On-line Science Inquiry

Haiying Li, Janice Gobert and Rachel Dickler

Can Typical Behaviors Identified in MOOCs be Discovered in Other Courses?

Truong-Sinh An, Christopher Krauss and Agathe Merceron

Gaze-based Detection of Mind Wandering during Lecture Viewing

Stephen Hutt, Jessica Hardey, Robert Bixler, Angela Stewart, Evan Risko and Sidney D’Mello

Sequence Modelling For Analysing Student Interaction with Educational Systems

Christian Hansen, Casper Hansen, Niklas Hjuler, Stephen Alstrup and Christina Lioma

Predicting Prospective Peer Helpers to Provide Just-In-Time Help to Users in Question and Answer Forums

Oluwabukola Ishola and Gordon McCalla

Combining Machine Learning and Natural Language Processing Approach to Assess Literary Text Comprehension

Renu Balyan, Kathryn McCarthy and Danielle McNamara

Predicting Student Retention from Behavior in an Online Orientation Course

Shimin Kai, Juan Miguel Andres, Luc Paquette, Ryan Baker, Kati Molnar, Harriet Watkins and Michael Moore

Inferring Frequently Asked Questions from Student Question Answering Forums

Renuka Sindhgatta, Smit Marvaniya, Tejas Dhamecha and Bikram Sengupta

On the Prevalence of Multiple-Account Cheating in Massive Open Online Learning

Yingying Bao, Guanliang Chen and Claudia Hauff

Clustering Student Sequential Trajectories Using Dynamic Time Wrapping

Shitian Shen and Min Chi

Learner Affect Through the Looking Glass: Characterization and Detection of Confusion in Online Courses

Ziheng Zeng, Snigdha Chaturvedi and Suma Bhat

Modeling Classifiers for Virtual Internships Without Participant Data

Dipesh Gautam, Zachari Swiecki, David Shaffer, Vasile Rus and Arthur Graesser

Convolutional Neural Network for Automatic Detection of Sociomoral Reasoning Level

Ange Adrienne Nyamen Tato, Roger Nkambou and Aude Dufresne

A Latent Factor Model For Instructor Content Preference Analysis

Jack Wang, Andrew Lan, Phillip Grimaldi and Richard Baraniuk

Mining Innovative Augmented Graph Grammars for Argument Diagrams through Novelty Selection

Linting Xue, Collin Lynch and Min Chi

An Extended Learner Modeling Method to Assess Students’ Learning Behaviors

Yi Dong and Gautam Biswas

Estimating Individual Treatment Effect from Educational Studies with Residual Counterfactual Networks

Siyuan Zhao and Neil Heffernan

Online Learning Persistence and Academic Achievement

Ying Fang, Benjamin Nye, Philip Pavlik Jr., Yonghong Xu, Arthur Graesser and Xiangen Hu

Using Temporal Association Rule Mining to Predict Dyadic Rapport in Peer Tutoring

Michael Madaio, Rae Lasko, Justine Cassell and Amy Ogan

Learning to Represent Student Knowledge on Programming Exercises Using Deep Learning

Lisa Wang, Angela Sy, Larry Liu and Chris Piech

Development of a Trajectory Model for Visualizing Teacher ICT Usage Based on Event Segmentation Data

Longwei Zheng, Rui Shi, Xiaoqing Gu, Bingcong Wu and Yuanyuan Feng


Modeling Network Dynamics of MOOC Discussion Interactions at Scale

Jingjing Zhang and Maxim Skryabin

Studying MOOC Completion at Scale Using the MOOC Replication Framework

Juan Miguel Andres, Ryan Baker, George Siemens, Dragan Gašević, Catherine Spann and Scott Crossley

Clustering Students in ASSISTments: Exploring System- and School-Level Traits to Advance Personalization

Seth Adjei, Korinn Ostrow, Erik Erickson and Neil Heffernan

Application of the Dynamic Time Warping Distance for the Student Drop-out Prediction on Time Series Data

Alexander Askinadze and Stefan Conrad

Student Use of Scaffolded Inquiry Simulations in Middle School Science

Elizabeth McBride, Jonathan Vitale and Marcia Linn

Modeling Dormitory Occupancy Using Markov Chains

David Pokrajac, Kimberley Sudler, Diana Yankovich and Teresa Hardee

Improving Models of Peer Grading in SPOC

Yong Han, Wenjun Wu and Xuan Zhou

Personalized Feedback for Open-Response Mathematical Questions using Long Short-Term Memory Networks

Joshua Michalenko, Andrew Lan and Richard Baraniuk

Intelligent Composition of Test Papers based on MOOC Learning Data

Lin Ma and Yuchun Ma

Toward Replicable Predictive Model Evaluation in MOOCs

Josh Gardner and Christopher Brooks

Modeling the Zone of Proximal Development with a Computational Approach

Irene-Angelica Chounta, Bruce McLaren, Patricia Albacete, Pamela Jordan and Sandra Katz

A Prediction and Early Alert Model Using Learning Management System Data and Grounded in Learning Science Theory

Wonjoon Hong and Matthew Bernacki

Cluster Analysis of Real Time Location Data – An Application of Gaussian Mixture Models

Alvaro Ortiz-Vazquez, Xiang Liu, Ching-Fu Lan, Hui Soo Chae and Gary Natriello

A Topic Model and Social Network Analysis of a School Blogging Platform

Xiaoting Kuang, Hui Soo Chae, Brian Hughes and Gary Natriello

Supporting the Encouragement of Forum Participation

Aashna Garg and Andreas Paepcke

Untangling The Program Name Versus The Curriculum: An Investigation of Titles and Curriculum Content

R. Wes Crues

Emerging Patterns in Student’s Learning Attributes through Text Mining

Kejkaew Thanasuan, Warasinee Chaisangmongkon and Chanikarn Wongviriyawong

A Neural Network Approach to Estimate Student Skill Mastery in Cognitive Diagnostic Assessments

Qi Guo, Maria Cutumisu and Ying Cui

Automatic Peer Tutor Matching: Data-Driven Methods to Enable New Opportunities for Help

Nicholas Diana, Michael Eagle, John Stamper, Shuchi Grover, Marie Bienkowski and Satabdi Basu

Short-Answer Responses to STEM Exercises: Measuring Response Validity and Its Impact on Learning

Andrew Waters, Phillip Grimaldi, Andrew Lan and Richard Baraniuk

Using an Additive Factor Model and Performance Factor Analysis to Assess Learning Gains in a Tutoring System to Help Adults with Reading Difficulties

Genghu Shi, Philip Pavlik Jr. and Arthur Graesser

Identifying student communities in blended courses

Niki Gitinabard, Collin Lynch, Sarah Heckman and Tiffany Barnes

Automatic Scoring Method for Descriptive Test Using Recurrent Neural Network

Keiji Yasuda, Izuru Nogaito, Hiroyuki Kawashima, Hiroaki Kimura and Masayuki Hashimoto

Using Graph-based Modelling to explore changes in students’ affective states during exploratory learning tasks

Beate Grawemeyer, Alex Wollenschlaeger, Sergio Gutierrez-Santos, Wayne Holmes, Manolis Mavrikis and Alex Poulovassilis

Predicting Performance in a Small Private Online Course

Han Wan, Jun Ding, Xiaopeng Gao, Qiaoye Yu and Kangxu Liu

Social work in the classroom? A tool to evaluate topical relevance in student writing

Heeryung Choi, Zijian Wang, Christopher Brooks, Kevyn Collins-Thompson, Beth Glover Reed and Dale Fitch

Causal Forest vs. Naive Causal Forest in Detecting Personalization: An Empirical Study in ASSISTments

Biao Yin, Anthony F. Botelho, Thanaporn Patikorn, Neil Heffernan and Jian Zou

An Offline Evaluation Method for Individual Treatment Rules and How to Find Heterogeneous Treatment Effect

Thanaporn Patikorn, Neil Heffernan and Jian Zou

MyCOS Intelligent Teaching Assistant

Jiao Guo, Xinhua Huang and Boqing Wang

Towards Automatic Classification of Learning Objects: Reducing the Number of Used Features

Cristobal Romero, Pedro Gonzalez Espejo, Eva Gibaja, Alfredo Zapata González and Victor Menendez

The Reading Ability of College Freshmen

Andrew Olney, Breya Walker, Raven Davis and Arthur Graesser

Discovering skill prerequisite structure through Bayesian estimation and nested model comparison

Soo-Yun Han, Jiyoung Yoon and Yun Joo Yoo

Text analysis with LIWC and Coh-Metrix: Portraying MOOCs Instructors

Junyi Li, Yun Tang, Lijun Sun and Xiangen Hu

Identifying relationships between students’ questions type and their behavior

Fatima Harrak, François Bouchet and Vanda Luengo

Metacognitive Prompt Overdose: Positive and Negative Effects of Prompts in iSTART

Kathryn McCarthy, Amy Johnson, Aaron Likens, Zachary Martin and Danielle McNamara

Tracking Online Reading of College Students

Andrew Olney, Eric Hosman, Arthur Graesser and Sidney D’Mello

Dropout Prediction in MOOCs using Learners’ Study Habits Features

Han Wan, Jun Ding, Xiaopeng Gao and David Pritchard

Exploring the Relationship Between Student Pre-knowledge and Engagement in MOOC Class Using Polytomous IRT

Jingxuan Liu and Hongli Li

An Analysis of Students’ Questions in MOOCs Forums

Meng Cao, Yun Tang and Xiangen Hu


Real-time programming exercise feedback in MOOCs

Zhenghao Chen, Andy Nguyen, Amory Schlender and Jiquan Ngiam

Why data standards are critical for EDM and AIED

Xiangen Hu, Robby Robson and Avron Barr

Tutorial: Principal Stratification for EDM Experiments

Adam Sales

Whitebox: A Device To Assist Group Work Evaluation

Daisuke Yukita

Understanding Student’s Reviewing and Reflection Behaviors Using Web-based Programming Grading Assistant

Yancy Vance Paredes, Po-Kai Huang and Sharon Hsiao

Doctoral Consortium

A Framework for the Estimation of Students’ Programming Abilities

Ella Albrecht

Student Use of Inquiry Simulations in Middle School Science

Elizabeth McBride

Developing Chinese Automated Essay Scoring Model to Assess College Students’ Essay Quality

Yu-Ju Lu, Bor-Chen Kuo and Kai-Chih Pai

Teaching Informal Logical Fallacy Identification with a Cognitive Tutor

Nicholas Diana, John Stamper and Kenneth Koedinger

Automated Extraction of Results from Full Text Journal Articles

R. Wes Crues

Intelligent Argument Grading System for Student-produced Argument Diagrams

Linting Xue

Industry Track

Dropout Prediction in Home Care Training

Wenjun Zeng, Si-Chi Chin, Brenda Zeimet, Rui Kuang and Chih-Lin Chi

Few hundred parameters outperform few hundred thousand?

Amar Lalwani and Sweety Agrawal

Tell Me More: Digital Eyes to the Physical World for Early Childhood Learning

Vijay Ekambaram, Ruhi Sharma Mittal, Prasenjit Dey, Ravindranath Kokku, Aditya K Sinha and Satya V Nitta

Student Learning Strategies to Predict Success in an Online Adaptive Mathematics Tutoring System

Jun Xie, Shirin Mojarad, Keith Shubeck, Alfred Essa, Ryan Baker and Xiangen Hu

Adaptive Assessment Experiment in a HarvardX MOOC

Ilia Rushkin, Yigal Rosen, Andrew Ang, Colin Fredericks, Dustin Tingley, Mary Jean Blink and Glenn Lopez


Graph-based Educational Data Mining

Collin Lynch, Tiffany Barnes, Linting Xue and Niki Gitinabard

Workshop proposal: deep learning for educational data mining

Joseph Beck, Min Chi and Ryan Baker

Sharing and Reusing Data and Analytic Methods with LearnSphere

Ran Liu, Kenneth Koedinger, John Stamper and Philip Pavlik Jr.