A Comprehensive Survey on AI-Powered Desktop Automation (COSMO)

Amanpreet Kaur Bamrah*, Roshani Talmale**, Ishika Jaiswal***, Jyotleenkaur Saggu****, Tejaswini Patle*****
*-***** Department of Computer Science and Engineering, S. B. Jain Institute of Technology, Management and Research, Nagpur, India.
Periodicity: July - December 2025
DOI: https://doi.org/10.26634/javr.3.2.22788

Abstract

This paper surveys AI-powered desktop automation and its evolution toward intelligent assistants that perform real-time tasks through natural language interaction. COSMO (Conversational Smart Machine Operator) is a lightweight desktop assistant designed to execute system-level operations, manage files, interact with external APIs, and enhance user productivity. We review existing intelligent assistants, robotic process automation (RPA) tools, and NLP-driven automation systems; identify persistent challenges (e.g., UI variability, noise, dependence on cloud APIs); and motivate COSMO’s design choices, which favor local execution, privacy, and modularity.

Keywords

Desktop Automation, Natural Language Processing, Speech Recognition, Virtual Assistant, GUI Automation, COSMO

How to Cite this Article?

Talmale, R., Jaiswal, I., Bamrah, A., Saggu, J., and Patle, T. (2025). A Comprehensive Survey on AI-Powered Desktop Automation (COSMO). i-manager’s Journal on Augmented & Virtual Reality, 3(2), 38-46. https://doi.org/10.26634/javr.3.2.22788
