International Scientific Journal of Engineering and Management

An International Scholarly || Multidisciplinary || Open Access || Indexing in all major Database & Metadata
The journal follows the UGC Guidelines and is evaluated for inclusion in the Web of Science
ISSN: 2583-6129

Impact Factor: 7.839

Enhancing Image Captioning Through Augmented Visual Comprehension with CNN

  • Create Date 30 January 2026
  • Last Updated 30 January 2026

R.L. Pavan Kumar 1, T.V.D.S. Sreyanth 2, P. Nithin Sai 3 and G. Surendra 5

1B.tech Student1, Koneru Lakshmaiah Education Foundation, Vaddeswaram, A.P., 522302, India.

1*2100080168@kluniversity.in

1B.tech Student2, Koneru Lakshmaiah Education Foundation, Vaddeswaram, A.P., 522302, India.

1*2100080197@kluniversity.in

1B.tech Student3, Koneru Lakshmaiah Education Foundation, Vaddeswaram, A.P., 522302, India.

1*2100080203@kluniversity.in

2Assistant Professor2, Department of AI & DS, Koneru Lakshmaiah Education Foundation, Vaddeswaram, A.P., 522302, India.

2guntisurendra@kluniversity.in

ABSTRACT:
Deep learning and computer vision are advancing quickly, and the automatic generation of informative image captions has received considerable attention. As new discoveries reshape the artificial intelligence landscape, demand is growing for intelligent systems that can describe visual content in context. Image captioning is a research area at the intersection of computer vision and deep learning. This paper explores the application of deep learning to generating descriptive captions for images. The proposed model integrates YOLO-based object detection into the feature extraction process, which increases the robustness of the image representation. The architecture combines Convolutional Neural Networks (CNNs) for extracting meaningful visual features from images with RNNs (LSTM) for language modeling. Attention methods are used to align linguistic and visual information, enabling the model to concentrate on distinct areas of the image while generating captions.
Keywords: CNN, LSTM, YOLO, BLEU
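
The attention step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how attention scores are computed, so dot-product scoring is assumed here, and the function name `soft_attention`, the toy region features, and the decoder hidden state are all hypothetical. The sketch shows how a decoder state could weight image regions (for example CNN grid cells or YOLO detections) and produce a single context vector for the next word.

```python
import math


def soft_attention(regions, hidden):
    """Return (weights, context) for dot-product attention over image regions.

    regions: list of feature vectors, one per image region (assumed input).
    hidden:  decoder hidden state with the same dimensionality (assumed input).
    """
    # Score each region by its dot product with the decoder hidden state.
    scores = [sum(r * h for r, h in zip(region, hidden)) for region in regions]

    # Numerically stable softmax: weights are positive and sum to 1.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Context vector: attention-weighted sum of the region features.
    dim = len(regions[0])
    context = [
        sum(w * region[d] for w, region in zip(weights, regions))
        for d in range(dim)
    ]
    return weights, context


# Toy data (hypothetical): three 2-D region features and one hidden state.
regions = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
hidden = [2.0, 0.0]
weights, context = soft_attention(regions, hidden)
```

In this toy case the first and third regions align equally well with the hidden state and receive equal weight, while the second region receives the least. In a full captioning model the context vector would typically be fed to the LSTM together with the previous word embedding at each decoding step.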
INTRODUCTION:

