Future Prediction of L2 Writing Performance: A Machine Learning Approach

Kutay Uzun*
Department of Foreign Languages, Trakya University, Turkey.
Periodicity:July - September'2020
DOI : https://doi.org/10.26634/jet.17.2.16840

Abstract

Writing in L2 is both crucial and difficult for teachers and students since most of the assessment in higher education is in written form. The production of texts as well as providing feedback to them requires time and effort on both sides. For this reason, prediction of future L2 writing performance in advance may prove quite useful both for teachers and students and the information obtained can be used for failure prevention. In this respect, the present study aims to train a machine learning model which used demographic and psychometric data to predict end-of-term L2 writing performance on a Pass/Fail basis. Secondly, the study aimed to find out if synthetically augmenting available data would increase prediction accuracy. The data were collected in the beginning of the semester from 102 students of English Language Teaching in Turkey. L2 writing performance was measured at the end of the semester. The findings indicated that Instance-Bases Learning with Parameter K algorithm could accurately predict L2 writing performance in advance. Moreover, artificial augmentation of the data was seen to increase prediction accuracy. The findings bear implications for the effective prevention of student failure by detecting poor performance in advance.

Keywords

Educational Data Mining, Early Warning Systems, L2 Writing, Machine Learning, Performance Prediction.

How to Cite this Article?

Uzun, K. (2020). Future Prediction of L2 Writing Performance: A Machine Learning Approach. i-manager’s Journal of Educational Technology, 17(2), 1-13. https://doi.org/10.26634/jet.17.2.16840

References

[3]. Angeline, D. M. D. (2013). Association rule generation for student performance analysis using apriori algorithm. The SIJ Transactions on Computer Science Engineering & its Applications (CSEA), 1(1), 12-16.
[5]. Baker, R. S. J. D. (2010). Data mining for education. In B. McGaw, P. Peterson, & E. Baker (Eds.), International Encyclopedia of Education (3rd ed., pp. 112-118). Oxford, UK: Elsevier.
[8]. Bharadwaj, B. K., & Pal, S. (2011). Mining educational data to analyze students' performance. International Journal of Advance Computer Science and Applications, 2(6), 63–69.
[10]. Bydzovska, H. (2016). A comparative analysis of techniques for predicting student performance. In T. Barnes, M. Chi, & M. Feng (Eds.). In Proceedings of the 9th International Conference on Educational Data Mining (pp. 306–311). North Carolina, NC: Raleigh.
[12]. Chen, X., & Meurers, D. (2016, December). CTAP: A web-based tool supporting automatic complexity analysis. In Proceedings of the Workshop on Computational Linguistics for Linguistic Complexity (CL4LC) (pp. 113-119), Osaka, Japan.
[15]. Chun, D. M. (2007). Technological advances in researching and teaching phonology. In Phonology in Context (pp. 274-299). Palgrave Macmillan, London.
[17]. Cotos, E. (2014). Genre-based automated writing evaluation for L2 research writing: From design to evaluation and enhancement. UK: Palgrave Macmillan, Springer.
[21]. Eibe, F., Mark, A. H., & Witten, I. H. (2016). The WEKA Workbench in Online Appendix for Data Mining: Practical Machine Learning Tools and Techniques (4th ed.). Morgan Kaufmann.
[24]. Goggins, S. P., Xing, W., Chen, X., Chen, B., & Wadholm, B. (2015). Learning analytics at small" scale: Exploring a complexity-grounded model for assessment automation. Journal of Universal Computer Science, 21 (1), 66-92.
[27]. Hastie, T., Tibshirani, R., & Friedman, J. (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction (2nd ed.). Springer, New York, NY.
[28]. Heiner, C., Heffernan, N., & Barnes, T. (2007). Educational data mining. In Supplementary Proceedings of the 12th International Conference of Artificial Intelligence in Education.
[30]. Hoyle, D. C. (2008). Automatic PCA dimension selection for high dimensional data and small sample sizes. Journal of Machine Learning Research, 9, 2733-275 9.
[31]. Hwang, G. J., Hsiao, C. L., & Tseng, J. C. (2003). A computer-assisted approach to diagnosing student learning problems in science courses. Journal of Information Science and Engineering, 19(2), 229–248.
[32]. Losifidis, V., & Ntoutsi, E. (2018). Dealing with bias via data augmentation in supervised learning scenarios. In J. Bates, P. D. Clough, R. Jäschke, & J. Otterbacher, (Eds.). In Proceedings of the International Workshop on Bias in Information, Algorithms, and Systems Co-Located with 13th International Conference on Transforming Digital Worlds (pp. 24-29).
[33]. Kabakchieva, D. (2013). Predicting student performance by using data mining methods for classification. Cybernetics and Information Technologies, 13(1), 61-72.
[34]. Kira, K. & Rendell, L. (1992). A practical apprach to feature selection. In D. Sleeman & P. Edwards (Eds.), Machine Learning: Proceedings of International Conference (ICML'92) (pp. 249-256). San Francisco, CA: Morgan Kaufmann.
[36]. Kohavi, R. (1995). A study of cross-validation and bootstrap for accuracy estimation and model selection. In Proceedings of the 14th International Joint Conference on Artificial Intelligence (Vol 2, pp. 1137-1143) San Francisco, CA: Morgan Kaufmann Publishers Inc.
[39]. Luan, J. (2002). Data mining and its applications in higher education. In A. Serban & J. Luan (Eds.). Knowledge management: Building a competitive advantage for higher education (pp. 17-36). San Francisco, CA: Jossey Bass.
[41]. Merceron, A., & Yacef, K. (2004). Mining student data captured from a web-based tutoring tool: Initial exploration and results. Journal of Interactive Learning Research, 15(4), 319–346.
[42]. Merceron, A., & Yacef, K. (2005). Educational data mining: A case study. In C.-K. Looi, G. McCalla, B. Bredeweg & J. Breuker (Eds.). Proceedings of the 12th international conference on artificial intelligence (pp. 467–474). Amsterdam: IOS Press.
[43]. Murphy, K. P. (2012). Machine Learning: A Probabilistic Perspective. Cambridge, MA: MIT Press.
[46]. Noel-Levitz White Paper, (2008). Qualifying Enrollment Success: Maximizing Student Recruitment and Retention through Predictive Modeling. Noel-Levitz, Inc.
[49]. Pratiwi, N. E., Yufrizal, H., & Sukirlan, M. (2017). The relationship between students' self assessment of communicative competence and their actual performance. UNILA Journal of English Teaching, 6(7), 1-10.
[50]. Rai, S., & Jain, A. K. (2013). Students' dropout risk assessment in undergraduate courses of ICT at Residential University – A case study. International Journal of Computer Applications, 84(14), 31-36.
[51]. Ramaswami, M., & Bhaskaran, R. (2010). A CHAID based performance prediction model in educational data mining. IJCSI International Journal of Computer Science Issues, 7(1), 10-18.
[53]. Romero, C., & Ventura, S. (2010). Educational data mining: A review of the state of the art. IEEE Transactions on Systems, Man, and Cybernetics – Part C: Applications and Reviews, 40(6), 601-618.
[56]. Shmueli, G., Patel, N., & Bruce, P. (2007). Data Mining for Business Intelligence. Canada: John Wiley & Sons, Inc.
[57]. Shrivastava, A., Sondhi, J., & Kumar, B. (2017). Machine learning technique for product classification in ecommerce data using Microsoft Azure Cloud. International Research Journal of Engineering & Applied Sciences, 5(2), 11-13.
[58]. Simard, P. Y., Steinkraus, D., & Platt, J. C. (2003). Best practices for convolutional neural networks applied to visual document analysis, "ICDAR, 3, 958-962. " to " In Proceedings of Seventh International Conference on Document Analysis and Recognition, 2003. (pp. 958-963).
[59]. Simon, P. (2013). Too Big to Ignore: The Business Case for Big Data. Hoboken, NJ: Wiley.
[62]. Tsai, Y. R., Chang, Y., & Ouyang, C. S. (2011). Mining error patterns of engineering students' English reading comprehension. In C. Patrick & D. Chunru (Eds.), Proceedings of 2011 International Conference on Machine Learning and Cybernetics (pp. 68–72).
[65]. Villalón, J., Kearney, P., Calvo, R. A., & Reimann, P. (2008). Glosser: Enhanced feedback for student writing tasks. In 22nd Australasian Joint Conference on Artificial Intelligence, Melbourne, Australia.
[66]. West, M. (1994). Discovery sampling and selection models. In S. S. Gupta & J. O. Berger (Eds.). Statistical Decision Theory and Related Topics V (pp. 221-235). New York: Springer-Verlag.
[67]. Witten, I. H., & Frank, E. (2005). Data Mining: Practical Machine Learning Tools and Techniques (2nd ed.). Elsevier, San Francisco, CA.
[68]. Wolfe-Quintero, K., Inagaki, S., & Kim, H. Y. (1998). Second Language Development in Writing: Measures of Fluency, Accuracy, & Complexity. Honolulu, HI: University of Hawaii Press.
[69]. Wu, J., Chang, Y., Mitamura, T., & Chang, J. (2010). Automatic collocation suggestion in academic writing. In Proceedings of the ACL Conference, Short paper track, Uppsala, Finland.
[70]. Yang, X, Kong, X, Hasegawa-Johnson, M, & Xie, Y. (2016). "Landmark-based pronunciation error identification on L2 Mandarin Chinese. Proc. Speech Prosody 2016, 247- 251.
[71]. Yavuz-Erkan, D. (2004). Efficacy of cross-cultural email exchange for enhancing EFL writing: A perspective for tertiary-level Turkish EFL learners (Unpublished doctoral dissertation). Çukurova University, Turkey.
[72]. Yu, P., Own, C., & Lin, L. (2001). On learning behavior analysis of web based interactive environment. In Proceedings of the implementing Curricular Change in Engineering Education (pp. 1-10), Oslo, Norway.
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Online 15 15

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.