DropoutSeer: Visualizing learning patterns in Massive Open Online Courses for dropout reasoning and prediction

Aiming at massive participation and open access education, Massive Open Online Courses (MOOCs) have attracted millions of learners over the past few years. However, the high dropout rate of learners is considered to be one of the most crucial factors that may hinder the development of MOOCs. To tackle this problem, statistical models have been developed to predict dropout behavior based on learner activity logs. Although predictive models can foresee the dropout behavior, it is still difficult for users to understand the reasons behind the predicted results and further design interventions to prevent dropout. In addition, with a better understanding of dropout, researchers in the area of predictive modeling in turn can improve the models. In this paper, we introduce DropoutSeer, a visual analytics system which not only helps instructors and education experts understand the reasons for dropout, but also allows researchers to identify crucial features which can further improve the performance of the models. Both the heterogeneous data extracted from three different kinds of learner activity logs (i.e., clickstream, forum posts and assignment records) and the predicted results are visualized in the proposed system. Case studies and expert interviews have been conducted to demonstrate the usefulness and effectiveness of DropoutSeer.

[1]  David S. Ebert,et al.  Spatiotemporal social media analytics for abnormal event detection and examination using seasonal-trend decomposition , 2012, 2012 IEEE Conference on Visual Analytics Science and Technology (VAST).

[2]  Linda Corrin,et al.  Visualizing patterns of student engagement and performance in MOOCs , 2014, LAK.

[3]  Patrick Jermann,et al.  Capturing "attrition intensifying" structural traits from didactic interaction sequences of MOOC learners , 2014, EMNLP 2014.

[4]  Mengchen Liu,et al.  A survey on information visualization: recent advances and challenges , 2014, The Visual Computer.

[5]  M. Sheelagh T. Carpendale,et al.  A Visual Backchannel for Large-Scale Events , 2010, IEEE Transactions on Visualization and Computer Graphics.

[6]  Xin Tong,et al.  TextFlow: Towards Better Understanding of Evolving Topics in Text , 2011, IEEE Transactions on Visualization and Computer Graphics.

[7]  Shimei Pan,et al.  TIARA: Interactive, Topic-Based Visual Text Summarization and Analysis , 2012, TIST.

[8]  Yingcai Wu,et al.  Visual Analysis of Topic Competition on Social Media , 2013, IEEE Transactions on Visualization and Computer Graphics.

[9]  Lucy T. Nowell,et al.  ThemeRiver: visualizing theme changes over time , 2000, IEEE Symposium on Information Visualization 2000. INFOVIS 2000. Proceedings.

[10]  Lei Shi,et al.  Understanding text corpora with multiple facets , 2010, 2010 IEEE Symposium on Visual Analytics Science and Technology.

[11]  Lucy T. Nowell,et al.  ThemeRiver: Visualizing Thematic Changes in Large Document Collections , 2002, IEEE Trans. Vis. Comput. Graph..

[12]  Kalyan Veeramachaneni,et al.  Likely to stop? Predicting Stopout in Massive Open Online Courses , 2014, ArXiv.

[13]  M. Sheelagh T. Carpendale,et al.  A Review of Temporal Data Visualizations Based on Space-Time Cube Operations , 2014, EuroVis.

[14]  Qing Chen,et al.  VisMOOC: Visualizing video clickstream data from massive open online courses , 2015, 2014 IEEE Conference on Visual Analytics Science and Technology (VAST).

[15]  Krzysztof Z. Gajos,et al.  Understanding in-video dropouts and interaction peaks inonline lecture videos , 2014, L@S.

[16]  John T. Stasko,et al.  Toward a Deeper Understanding of the Role of Interaction in Information Visualization , 2007, IEEE Transactions on Visualization and Computer Graphics.

[17]  Qing Chen,et al.  PeakVizor: Visual Analytics of Peaks in Video Clickstreams from Massive Open Online Courses , 2016, IEEE Transactions on Visualization and Computer Graphics.

[18]  David J. Spiegelhalter,et al.  Machine Learning, Neural and Statistical Classification , 2009 .

[19]  Danielle S. McNamara,et al.  Good Communities and Bad Communities: Does Membership Affect Performance? , 2015, EDM.

[20]  Chunju Tseng,et al.  Visualization in law enforcement , 2005, DG.O.

[21]  Jim Thomasa,et al.  Challenges for visual analytics , 2009 .

[22]  Pierre Dragicevic,et al.  SpiraClock: a continuous and non-intrusive display for upcoming events , 2002, CHI Extended Abstracts.

[23]  Hal Daumé,et al.  Incorporating Lexical Priors into Topic Models , 2012, EACL.

[24]  Niels Pinkwart,et al.  Predicting MOOC Dropout over Weeks Using Machine Learning Methods , 2014, EMNLP 2014.

[25]  Jane Sinclair,et al.  Dropout rates of massive open online courses : behavioural patterns , 2014 .

[26]  James J. Thomas,et al.  Challenges for Visual Analytics , 2009, Inf. Vis..

[27]  Leonidas J. Guibas,et al.  Syntactic and Functional Variability of a Million Code Submissions in a Machine Learning MOOC , 2013, AIED Workshops.

[28]  Jian Zhao,et al.  egoSlider: Visual Analysis of Egocentric Network Evolution , 2016, IEEE Transactions on Visualization and Computer Graphics.

[29]  Kalyan Veeramachaneni,et al.  Transfer Learning for Predictive Models in Massive Open Online Courses , 2015, AIED.

[30]  Kamran Sedig,et al.  Transactions on Human-computer Interaction Thci Design for Complex Cognitive Activities with Visual Representations: a Pattern-based Approach Sedig and Parsons Interaction Design for Complex Cognitive Activities with Visualizations , 2022 .

[31]  Ben Shneiderman,et al.  LifeLines: visualizing personal histories , 1996, CHI.

[32]  P. Sánchez,et al.  VISUALIZATION METHODS FOR TIME-DEPENDENT DATA-AN OVERVIEW , 2003 .

[33]  Suma Bhat,et al.  Predicting Attrition Along the Way: The UIUC Model , 2014, EMNLP 2014.

[34]  Robert Sanders,et al.  A Process for Predicting MOOC Attrition , 2014, EMNLP 2014.

[35]  Heidrun Schumann,et al.  Visualization of Time-Oriented Data , 2011, Human-Computer Interaction Series.

[36]  Claus Zinn,et al.  Getting to Know Your Student in Distance Learning Contexts , 2006, EC-TEL.

[37]  Marc Alexa,et al.  Visualizing time-series on spirals , 2001, IEEE Symposium on Information Visualization, 2001. INFOVIS 2001..

[38]  Patrick Jermann,et al.  Your click decides your fate: Inferring Information Processing and Attrition Behavior from MOOC Video Clickstream Interactions , 2014, Proceedings of the EMNLP 2014 Workshop on Analysis of Large Scale Social Interaction in MOOCs.

[39]  Franck Dernoncourt,et al.  MoocViz: A Large Scale, Open Access, Collaborative, Data Analytics Platform for MOOCs , 2013 .

[40]  Lise Getoor,et al.  Understanding MOOC Discussion Forums using Seeded LDA , 2014, BEA@ACL.

[41]  J. V. van Wijk,et al.  Cluster and calendar based visualization of time series data , 1999, Proceedings 1999 IEEE Symposium on Information Visualization (InfoVis'99).