Article ID: | iaor2016911 |
Volume: | 33 |
Issue: | 2 |
Start Page Number: | 189 |
End Page Number: | 200 |
Publication Date: | Apr 2016 |
Journal: | Expert Systems |
Authors: | Gamulin Jasna, Gamulin Ozren, Kermek Dragutin |
Keywords: | learning, time series: forecasting methods, behaviour, statistics: empirical |
In this work, it is shown that student access time series generated from Moodle log files contain information sufficient for successful prediction of student final results in blended learning courses. It is also shown that if time series is transformed into frequency domain, using discrete Fourier transforms (DFT), the information contained in it will be preserved. Hence, resulting periodogram and its DFT coefficients can be used for generating student performance models with the algorithms commonly used for that purposes. The amount of data extracted from log files, especially for lengthy courses, can be huge. Nevertheless, by using DFT, drastic compression of data is possible. It is experimentally shown, by means of several commonly used modelling algorithms, that if in average all but 5–10% of most intensive and most frequently used DFT coefficients are removed from datasets, the modelling with the remained data will result with the increase of the model accuracy. Resulting accuracy of the calculated models is in accordance with results for student performance models calculated for different dataset types reported in literature. The advantage of this approach is its applicability because the data are automatically collected in Moodle logs.