International Scientific Journal of Engineering and Management

An International Scholarly || Multidisciplinary || Open Access || Indexing in all major Database & Metadata
The journal follows the UGC Guidelines and is evaluated for inclusion in the Web of Science
ISSN: 2583-6129

Impact Factor: 8.072

Autonomous Navigation AGV Using Vision-Language Models for Natural Language Guided Indoor Navigation

Version
File Size 563.46 KB
Downloads 0
Files 1
Published 16 June 2026
Updated 16 June 2026

Autonomous Navigation AGV Using Vision-Language Models for Natural Language Guided Indoor Navigation

                         Yashwanth Aradhya                                                     Dr. Vishwanath Koti

Department of Robotics and Artificial Intelligence                    Department of Robotics and Artificial Intelligence

     M S Ramaiah Institute of Technology,                                       M S Ramaiah Institute of Technology,
Bangalore – 560054, India                                                              Bangalore – 560054, India

            Yash.aradhya140@gmail.com                                                           vkoti675@msrit.edu

 

Abstract:

Autonomous Guided Vehicles (AGVs) have become increasingly important in warehouse automation, industrial logistics, healthcare transportation, and smart manufacturing systems. Traditional AGV navigation techniques primarily rely on predefined routes, magnetic strips, guide wires, QR-code markers, and coordinate-based path planning methods. Although these approaches provide reliable navigation in structured environments, they often lack the flexibility required for dynamic indoor environments where obstacles and layouts continuously change. Furthermore, conventional AGV systems provide limited capability for understanding human instructions expressed in natural language.

Recent advancements in Artificial Intelligence, Computer Vision, Natural Language Processing, and Vision-Language Models have enabled the development of intelligent robotic systems capable of understanding semantic instructions and performing autonomous decision-making. Vision-Language Models establish relationships between visual observations and textual instructions, allowing robots to identify navigation targets and interpret human commands in a more natural manner.

This paper presents an Autonomous Navigation AGV integrated with Vision-Language Models for natural language-guided indoor navigation. The proposed system utilizes a Raspberry Pi 5 embedded platform integrated with a camera module, ultrasonic sensors, motor driver circuitry, and DC gear motors. Natural Language Processing techniques are employed to interpret navigation commands, while the Vision-Language Model combines visual perception and language understanding to identify target objects and destinations within the environment. Obstacle detection and avoidance are achieved through ultrasonic sensing to ensure safe navigation.

Experimental evaluation was conducted under multiple indoor navigation scenarios involving command interpretation, target identification, obstacle avoidance, and autonomous movement. Performance analysis demonstrated high command recognition accuracy, reliable obstacle avoidance capability, and effective navigation performance. The proposed system provides a low-cost and scalable solution for intelligent indoor navigation and contributes toward the development of next-generation AI-enabled robotic transportation systems.

Keywords: Autonomous Guided Vehicle, Vision-Language Model, Natural Language Processing, Computer Vision, Autonomous Navigation, Raspberry Pi 5, Obstacle Avoidance, Artificial Intelligence, Human-Robot Interaction.

Download
or download free
[changelog]

Categories & Tags

Similar Downloads

No related download found!
ISJEM Journal

Author's Blog

What is the difference between a Research Paper and a Review Paper?

A research paper and a review paper are both scholarly documents, but they serve different purposes and have different characteristics....
Read More
Author's Blog

What is DOI?

A Digital Object Identifier (DOI) is a unique alphanumeric string that is used to identify and provide a persistent link...
Read More
Author's Blog

What do you need to do during production of your Research Paper?

During the production of a research paper, the following steps need to be taken: conducting research, organizing and analyzing data,...
Read More
Author's Blog

What are the advantages of publishing a research paper?

Publishing a research paper can have many advantages for researchers, including: Career advancement, professional recognition, opportunities for collaboration, increased visibility,...
Read More
Author's Blog

Ways to Support your Academic Wellbeing which preparing the Research Paper/Article

To support your academic wellbeing while publishing a research paper, it's important to set realistic goals, manage your time effectively,...
Read More
Author's Blog

How to improve your Research Paper writing Skills?

Read extensively: One of the best ways to improve your research paper skills is to read extensively in your field...
Read More
Author's Blog

Is DOI compulsory to publish a research paper in a Journal?

DOI is not strictly required to publish a research paper, but it is highly recommended. Basically, the International Scientific Journal...
Read More
Author's Blog

In what ways does research paper give weight to career development?

Publishing a research paper can give weight to a researcher's career development in several ways, such as: establishing oneself as...
Read More
Author's Blog

How to develop a Research Paper from Scratch

Developing a research paper involves several steps including: choosing a topic, conducting background research, formulating a research question or hypothesis,...
Read More
Author's Blog

How Plagiarism report plays crucial role in Research Paper Publication?

Plagiarism is a major concern in the academic and research community, as it undermines the integrity of the research and...
Read More