Abstract: Voice assistants, which use natural language processing and speech recognition to support natural conversation, have substantially changed human-computer interaction. The rise of intelligent personal assistants such as Siri, Alexa, Google Assistant, and Cortana has made it possible to develop AI-powered systems that automate everyday tasks. This paper reviews the literature relevant to the creation of JANI AI (Just in Time Assistant for Necessary Insights), an AI-based virtual assistant that handles a variety of user-centric tasks, including music recommendations, real-time information retrieval, speech recognition, face recognition, optical character recognition (OCR), app automation, and customized voice-based note-taking. The paper examines current voice assistant systems and technologies, such as IoT-based smart home automation (Keerthana et al., 2018) [5], natural language processing (Nil Göksel et al., 2018) [4], and speech recognition (Nguyen et al., 2007) [2]. Combining gTTS (Google Text-to-Speech) with AIML (Artificial Intelligence Markup Language) to build dynamic conversational assistants has greatly improved the interactive capabilities of voice assistants (Gawand et al., 2020) [9]. Applying OCR for text extraction from images and Convolutional Neural Networks (CNNs) for face recognition broadens the capabilities of JANI AI and provides flexible features for both visually impaired and sighted users. To support the creation of a multipurpose AI-powered virtual assistant, this review examines several existing systems, highlights developments in the field, and proposes a hybrid design that makes use of open-source AI models and machine learning techniques.
Keywords: Voice Assistant, Conversational AI, Speech Recognition, Natural Language Processing (NLP), Optical Character Recognition (OCR), Face Recognition, Text-to-Speech (TTS), App Automation, Virtual Personal Assistant, Intelligent Personal Assistant, Human-Computer Interaction (HCI), Information Retrieval, Python Automation, Open-Source LLMs, Smart AI Assistant, Audio-Based Search, Real-time Data Extraction, Multi-Modal AI Systems, AI-Based Notes Management, Document Summarization, AI-Powered Chatbot, Interactive Games.
DOI: 10.17148/IARJSET.2025.12214