Exploring Natural Language Processing Chatbots and Phishing Website Detection: A Literature Perspective

Roshani Talmale*, Harshita Wankhede**, Pranav Lokhande***, Pranay Thakre****, Vrunda Mishra*****, Priya Dhole******, Bhagyshri Balpande*******
*-******* Department of Computer Science and Engineering, S. B. Jain Institute of Technology, Management and Research, Nagpur, India.
Periodicity:July - December'2024

Abstract

The paper proposes a novel approach that integrates machine learning and NLP in order to identify phishing sites and create multilingual interaction by providing better user engagement through the chatbot. It will utilize the XG-Boost algorithm in order to show the phishing detection system with more than 90% accuracy rate in the identification and classification of websites as legitimate or phishing for a set of 10,000 websites. A major contribution of this work is the embedding of a multilingual chatbot, developed on Dialog- flow, with support for English, Hindi, and Marathi, thereby broadening the possible community of users for the system. This paper describes the architecture of the system at its different layers- feature extraction, model training, and its integration with the chatbot. The proposed work fills the gaps in earlier literature as it provides a user interface accompanied by robust detection. This system will also be extended by the provision of language support by adding more languages in the near future and also by increasing the detection accuracy using deep learning models. The results demonstrate that combining machine learning with user-centric design may improve the detection of phishing sites considerably and enhance user engagement.

Keywords

Phishing, Dialog-flow, XG-Boost, Convolutional Neural Network, Chatbot, Cyber Crime, Multilingual.

How to Cite this Article?

Talmale, R., Wankhede, H., Lokhande, P., Thakre, P., Mishra, V., Dhole, P., and Balpande, B. (2024). Exploring Natural Language Processing Chatbots and Phishing Website Detection: A Literature Perspective. i-manager’s Journal on Digital Forensics & Cyber Security, 2(2), 10-19.

References

[4]. Aleedy, M., Shaiba, H., & Bezbradica, M. (2019). Generating and analyzing chatbot responses using natural language processing. International Journal of Advanced Computer Science and Applications, 10(9), 60-68.
[8]. Anil, G. N., Prakash, G. O., Manoj, K. H., Lokesh, M., & Madhusudhan, K. M. (2020). Detection of phishing websites based on feature extraction using machine learning. International Research Journal of Engineering and Technology (IRJET), 7 (7), 476-481.
[9]. Baby, C. J., Khan, F. A., & Swathi, J. N. (2017, April). Home automation using IoT and a chatbot using natural language processing. In 2017 Innovations in Power and Advanced Computing Technologies (i-PACT) (pp. 1-6). IEEE.
[12]. Chawla, A. (2022). Phishing website analysis and detection using Machine Learning. International Journal of Intelligent Systems and Applications in Engineering, 10(1), 10-16.
[15]. Flayh, N. A. (2023). Phishing website detection using machine learning: A review. Wasit Journal for Pure Sciences, 2(2), 270-281.
[16]. Garje, A., Tanwani, N., Kandale, S., Zope, T., & Gore, S. (2021). Detecting phishing websites using machine learning. International Journal of Advances in Engineering and Management (IJAEM), 3 (4), 496-503.
[17]. Kaushik, S., & Rahul. (2023). Chatbot using Natural Language Processing (NLP) techniques. Journal of Emerging Technologies and Innovative Research (JETIR), 10 (9), 1-17.
[19]. Kumar, D. N., Hemanth, N. S. R., Premnath, S., Kumar, V. N., & Uma, S. (2020). Detection of phishing websites using an efficient machine learning framework. International Journal of Engineering Research and Technology, 9(5), 1282-1286.
[21]. Lalwani, T., Bhalotia, S., Pal, A., Bisen, S., & Rathod, V. (2018). Implementation of a Chatbot system using AI and NLP. International Journal of Innovative Research in Computer Science & Technology, 6(3), 26-30.
[22]. Mahajan, R., & Siddavatam, I. (2018). Phishing website detection using machine learning algorithms. International Journal of Computer Applications, 181(23), 45-47.
[23]. Mulik, D. S., Sawant, P., & Bhosale, V. (2021). Application of NLP: Design of Chatbot for new research scholars. Turkish Online Journal of Qualitative Inquiry, 12(8), 2817-2823.
[24]. Patra, B., & Kumar, M. (2020). Natural language processing in Chatbots: A review. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 11(3), 2890-2894.
[26]. Sahingoz, O. K., Buber, E., Demir, O., & Diri, B. (2019). Machine learning based phishing detection from URLs. Expert Systems with Applications, 117, 345-357.
[27]. Sangeetha, S., Sahithya, C., Rasiga, M. R., & Shalini, N. (2021). Chatbot for personal assistant using natural language processing. International Journal of Research in Engineering, Science and Management, 4(3), 96-97.
[28]. Sharma, R. (2012). An analysis of an intelligent chatbot using natural language processing. International Journal of Food and Nutritional Sciences (IJFANS), 11 (6), 936-942.
[30]. Teja, C. S. B., Sasank, T., & Reddy, Y. (2020). Phishing website detection using different machine learning techniques. International Research Journal of Engineering and Technology (IRJET), 7 (10), 607-610.
[31]. Theja, Y. R., & Krishnaveni, R. (2013). Security based phishing website detection. International Journal of Computer Science and Mobile Computing, 2 (4), 523-527.
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Online 15 15

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.