Longitudinal Tracking of Students’ Academic Progress using Model-based Clustering
DOI:
https://doi.org/10.61841/pyjm6h93Keywords:
Model-based Clustering, Longitudinal Tracking, Academic ProgressAbstract
Clustering methods have often been used to group similar students’ performance for tracking their academic progress. A typical approach is to group the students by treating the observations as accumulated average marks on several assessed subjects within a period of time. Here, similar characteristics of students are identified based on the overall variation of marks between the subjects but ignore the temporal aspect of students performance even though the assessments are carried out at different time points. Alternatively, such characteristics in the observations could be treated as a set of longitudinal data since the measurements consider time-spaced and repeated events. This paper aims to compare the output between these two different treatments of observations using Model-based Clustering (MBC). Specifically, the average observations are applied to the classical MBC, whereas the longitudinal observations require some adjustment to the covariance matrix to cater for its longitudinal data structure. A synthetic data set generated based on some pre-university students’ marks on four science subjects from three series of continuous assessments is applied to the methods. The results show that the longitudinal observations on adjusted MBC produce a greater number of clusters that could characterize students’ progress with a better internal and external cluster quality compared to the average observations on classical MBC.
Downloads
References
[1] J. O’Flaherty, J. Gleeson. Irish student teachers’ levels of moral reasoning : Context, comparisons, and
contributing influences. Teachers and Teaching: Theory and Practice, Vol. 23, No. 1, pp. 59–77, 2017.
[2] E. Kim, R.P. Corcoran. How engaged are pre-service teachers in the United States? . Teaching and Teacher
Education, Vol. 66, pp. 12–23, 2017.
[3] M. de Morais, M. F. R. Araujo, E. B. Costa, Monitoring Student Performance Using Data Clustering and
Predictive Modelling. IEEE, 2014.
[4] M. Singh, H. Nagar, A. Sant, K-mean and EM Clustering algorithm using attendance performance
improvement Primary school student. International Journal of Scientific Research in Science and Engineering
and Technology, 2016.
[5] P. Jalote, B.N. Jain, S. Sopory. Classification for Research Universities in India. Higher Education, 2019.
[6] W. Hämäläinen, V. Kumpulainen, M. Mozgovoy, Evaluation of clustering methods for adaptive learning
systems. In Artificial Intelligence Applications in Distance Education. IGI Global Book series Advances
in Mobile and Distance Learning, 2015.
[7] J. Xu, K.H. Moon, M. van der Schaar. A Machine Learning Approach for Tracking and Predicting Student
Performance in Degree Programs. IEEE, 2020.
[8] J. Leppink. When Negative Turns Positive and Vice Versa: The Case of Repeated Measurements. Health
Professions Education, Vol. 5, pp. 76–81, 2019.
[9] R. Vilanova, M. Dominguez, J. Vicario, M.A. Prada, M. Barbu, M.J. Varanda, P. Alves, M. Podpora, U. Spagpnolini, A. Paganoni. Data-driven Tool for Monitoring of Students Performance. IFAC PapersOnline, Vol. 52, No. 9, pp. 165–170, 2019.
[10] O. Kwok, M. H. Lai, F. Tong, R. Lara-Alecio, B. Irby, M. Yoon, Y. Yeh. Analyzing Complex Longitudinal Data in Educational Research : A Demonstration With Project English Language and Literacy Acquisition (ELLA) Data Using xxM. Front Psychol. Vol. 9, pp. 790, 2018.
[11] M.S. Yang, S-J. Chang-Chien, Y. Nataliani. Unsupervised Fuzzy Model-based Gaussian Clustering. Information Sciences, Vol. 481, pp. 1–23, 2019.
[12] P.D. McNicholas, Model-Based Clustering. Journal of Classification. Vol. 33, pp. 331-373, 2016.
[13] W.L. Wang, L.M. Castro, V.H. Lachos, T. Lin. Model-based Clustering of Censored Data via Mixtures of Factor Analyzers. Computational Statistics & Data Analysis, Vol. 140, pp. 104–121, 2019.
[14] M. Hoffman, D. Steinley, M.J. Brusco. A Note on Using the Adjusted Rand Index for Link Prediction in Networks. Social Networks, Vol. 42, pp. 72–79, 2015.
[15] S. Manochandar, M. Punniyamoorthy, R.K. Jeyachitra. Development of New Seed with Modified Validity Measures for K-Means Clustering. Computers & Industrial Engineering, Vol. 141, 2020.
[16] A.J. Gates, Y. Ahn. The Impact of Random Models on Clustering Similarity. Journal of Machine Learning Research, Vol. 18, pp. 1–28, 2017.
Downloads
Published
Issue
Section
License
Copyright (c) 2020 AUTHOR

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit , provide a link to the license, and indicate if changes were made . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation .
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.