Differential Privacy-Enabled TextCNN for MOOCs Fake Review Detection

  • Caiyun Chen School of Information Science and Technology, Tan Kah Kee College, Xiamen University, Zhangzhou 363105, Fujian Province, China
Keywords: DP-TextCNN, Differential Privacy, Fake review, MOOCs

Abstract

The rapid development and widespread adoption of massive open online courses (MOOCs) have indeed had a significant impact on China’s education curriculum. However, the problem of fake reviews and ratings on the platform has seriously affected the authenticity of course evaluations and user trust, requiring effective anomaly detection techniques for screening. The textual characteristics of MOOCs reviews, such as varying lengths and diverse emotional tendencies, have brought complexity to text analysis. Traditional rule-based analysis methods are often inadequate in dealing with such unstructured data. We propose a Differential Privacy-Enabled Text Convolutional Neural Network (DP-TextCNN) framework, aiming to achieve high-precision identification of outliers in MOOCs course reviews and ratings while protecting user privacy. This framework leverages the advantages of Convolutional Neural Networks (CNN) in text feature extraction and combines differential privacy techniques. It balances data privacy protection with model performance by introducing controlled random noise during the data preprocessing stage. By embedding differential privacy into the model training process, we ensure the privacy security of the framework when handling sensitive data, while maintaining a high recognition accuracy. Experimental results indicate that the DP-TextCNN framework achieves an exceptional accuracy of over 95% in identifying fake reviews on the dataset, this outcome not only verifies the applicability of differential privacy techniques in TextCNN but also underscores its potential in handling sensitive educational data. Additionally, we analyze the specific impact of differential privacy parameters on framework performance, offering theoretical support and empirical analysis to strike an optimal balance between privacy protection and framework efficiency.

References

Wei X, Saab N, Admiraal W, 2021, Assessment of Cognitive, Behavioral, and Affective Learning Outcomes in Massive Open Online Courses: A Systematic Literature Review. Computers & Education, 163: 104097.

Wu B, 2021, Influence of MOOC Learners Discussion Forum Social Interactions on Online Reviews of MOOC. Education and Information Technologies, 26: 3483–3496.

Alturkistani A, Lam C, Foley K, et al., 2020, Massive Open Online Course Evaluation Methods: Systematic Review. Journal of Medical Internet Research, 22: e13851.

Alger W, Doan M, Caporusso N, 2024, Student Evaluations of Teaching: Using Big Data Visualization to Explore Challenges and Opportunities. In Proceedings of the 2024 47th MIPRO ICT and Electronics Convention (MIPRO), IEEE, 2024: 508–513.

Sohel A, Hossain MR, Mostofa ZB, et al., 2023, Sentiment Analysis Based on Online Course Feedback Using Textblob and Machine Learning Techniques. In Proceedings of the 2023 26th International Conference on Computer and Information Technology (ICCIT), IEEE, 2023: 1–6.

Shaik T, Tao X, Li Y, et al., 2022, A Review of the Trends and Challenges in Adopting Natural Language Processing Methods for Education Feedback Analysis. IEEE Access, 10: 56720–56739.

Alexandron G, Ruipérez-Valiente JA, Lee S, et al., 2018, Evaluating the Robustness of Learning Analytics Results Against Fake Learners. In Proceedings of the European Conference on Technology Enhanced Learning, Springer, 2018: 74–87.

Graf P, 2024, Making Sense of Today’s Use of Student Evaluations of Teaching (SET). Human Arenas 7: 446–450.

Paul H, Nikolaev A, 2021, Fake Review Detection on Online E-Commerce Platforms: A Systematic Literature Review. Data Mining and Knowledge Discovery, 35: 1830–1881.

Zigomitros A, Casino F, Solanas A, 2020, et al., 2020, Survey on Privacy Properties for Data Publishing of Relational Data. IEEE Access, 8: 51071–51099.

Qin Y, Li M, Zhu J, 2023, Privacy-Preserving Federated Learning Framework in Multimedia Courses Recommendation. Wireless Networks, 29: 1535–1544.

Dong J, Roth A, Su WJ, 2022, Gaussian Differential Privacy. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 84: 3–37.

Shaik T, Tao X, Dann C, et al., 2023, Sentiment Analysis and Opinion Mining on Educational Data: A Survey. Natural Language Processing Journal, 2: 100003.

Zhang T, You F, 2021, Research on Short Text Classification Based on TextCNN. In Proceedings of the Journal of Physics: Conference Series. IOP Publishing, 1757: 012092.

Moore RL, Blackmon SJ, 2022, From the Learner’s Perspective: A Systematic Review of MOOC Learner Experiences (2008–2021). Computers & Education, 190: 104596.

Peng X, Xu Q, 2020, Investigating Learners’ Behaviors and Discourse Content in MOOC Course Reviews. Computers & Education, 143: 103673.

Qi C, Liu S, 2021, Evaluating On-Line Courses via Reviews Mining. IEEE Access, 9: 35439–35451.

Bulusu A, Rao KR, 2021, Sentiment Analysis of Learner Reviews to Improve Efficacy of Massive Open Online Courses (MOOC’s)—A Survey. In Proceedings of the 2021 Fifth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), IEEE, 2021: 933–941.

Kastrati Z, Imran AS, Kurti A, 2020, Weakly Supervised Framework for Aspect-Based Sentiment Analysis on Students’ Reviews of MOOCs. IEEE Access, 8: 106799–106810.

Yang P, Liu Y, Luo Y, et al., 2024, Text Mining and Multi-Attribute Decision-Making-Based Course Improvement in Massive Open Online Courses. Applied Sciences, 14: 3654.

Fan J, Jiang Y, Liu Y, et al., 2022, Interpretable MOOC Recommendation: A Multi-Attention Network for Personalized Learning Behavior Analysis. Internet Research, 32: 588–605.

Gomez MJ, Calderón M, Sánchez V, et al., 2022, Large Scale Analysis of Open MOOC Reviews to Support Learners’ Course Selection. Expert Systems with Applications, 210: 118400.

Li H, Gu H, Hao X, et al., 2024, Data-Driven Analytics for Student Reviews in China’s Higher Vocational Education MOOCs: A Quality Improvement Perspective. PLOS One, 19: e0298675.

Onan A, 2021, Sentiment Analysis on Massive Open Online Course Evaluations: A Text Mining and Deep Learning Approach. Computer Applications in Engineering Education, 29: 572–589.

Wu Y, Ngai EW, Wu P, et al., 2020, Fake Online Reviews: Literature Review, Synthesis, and Directions for Future Research. Decision Support Systems, 132: 113280.

Mohawesh R, Xu S, Tran SN, et al., 2021, Fake Reviews Detection: A Survey. IEEE Access, 9: 65771–65802.

Jain PK, Pamula R, Srivastava G, 2021, A Systematic Literature Review on Machine Learning Applications for Consumer Sentiment Analysis Using Online Reviews. Computer Science Review, 41: 100413.

Salminen J, Kandpal C, Kamel AM, et al., 2022, Creating and Detecting Fake Reviews of Online Products. Journal of Retailing and Consumer Services, 64: 102771.

Chen X, Li Z, Zou D, et al., 2024, Leveraging Deep Learning for Classifying Learner-Generated Course Evaluation Texts. In Proceedings of the International Conference on Blended Learning. Springer, 2024:. 311–321.

Wang J, Xie H, Au OTS, et al., 2020, Attention-Based CNN for Personalized Course Recommendations for MOOC Learners. In Proceedings of the 2020 International Symposium on Educational Technology (ISET). IEEE, 2020: 180–184.

Liu T, Hu W, Liu F, et al., 2021, Sentiment Analysis for MOOC Course Reviews. In Proceedings of the Data Science: 7th International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2021, Springer, 2021: 78–87.

Liu J, Yan Z, Chen S, et al., 2023, Channel Attention TextCNN with Feature Word Extraction for Chinese Sentiment Analysis. ACM Transactions on Asian and Low-Resource Language Information Processing, 22: 1–23.

Bu Z, Dong J, Long Q, et al., 2020, Deep Learning with Gaussian Differential Privacy. Harvard Data Science Review, 2(3).

Ha T, Dang TK, Le H, 2020, Security and Privacy Issues in Deep Learning: A Brief Review. SN Computer Science, 1: 253.

Boulemtafes A, Derhab A, Challal Y, 2020, A Review of Privacy-Preserving Techniques for Deep Learning. Neurocomputing, 384: 21–45.

Doleck T, Lemay DJ, Basnet RB, et al., 2020, Predictive Analytics in Education: A Comparison of Deep Learning Frameworks. Education and Information Technologies, 25: 1951–1963.

Ghazi B, Golowich N, Kumar R, et al., 2021, Deep Learning with Label Differential Privacy. Advances in Neural Information Processing Systems, 34: 27131–27145.

Vasa J, Thakkar A, 2023, Deep Learning: Differential Privacy Preservation in the Era of Big Data. Journal of Computer Information Systems, 63: 608–631.

Liu G, Sun X, Li Y, et al., 2023, An Automatic Privacy-Aware Framework for Text Data in Online Social Network Based on a Multi-Deep Learning Model. International Journal of Intelligent Systems, 2023: 1727285.

Dong M, Li Y, Tang X, et al., 2020, Variable Convolution and Pooling Convolutional Neural Network for Text Sentiment Classification. IEEE Access, 8: 16174–16186.

Published
2025-02-13