AI-Based Image and Video Detection and Translation using Computer Vision, OCR and NLP
Authors:
Mr. Reddy Santosh Kumar
Assistant Professor
Department of Artificial Intelligence and Machine Learning, Ballari Institute of Technology & Management
Ballari, Karnataka, India
V NagaChetan Reddy
Department of Artificial Intelligence and Machine Learning, Ballari Institute of Technology & Management
Ballari, Karnataka, India
Y Soman Gouda
Department of Artificial Intelligence and Machine Learning, Ballari Institute of Technology & Management
Ballari, Karnataka, India
U Rohith Sagar
Department of Artificial Intelligence and Machine Learning, Ballari Institute of Technology & Management
Ballari, Karnataka, India
Vijay Kumar D
Department of Artificial Intelligence and Machine Learning, Ballari Institute of Technology & Management
Ballari, Karnataka, India
Abstract—Artificial intelligence-based multimedia understanding systems are becoming increasingly important in healthcare, surveillance, smart education, tourism, assistive technologies, and digital communication. Existing systems generally address object detection or OCR extraction in isolation, without multilingual understanding or speech narration support.
This paper presents an AI-Based Image and Video Detection and Translation framework that integrates Computer Vision, Optical Character Recognition (OCR), Natural Language Processing (NLP), Neural Machine Translation, and Text-to-Speech synthesis into a unified intelligent system.
The proposed system automatically detects objects and activities in images and videos, extracts textual information, generates contextual descriptions, translates the generated descriptions into multiple languages, and converts the translations into speech output.
Index Terms—Artificial Intelligence, Computer Vision, OCR, NLP, Translation, Deep Learning, Multimedia Analytics, Text-to-Speech
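As an illustration of how the stages named in the abstract could be chained, the following is a minimal Python sketch, not the implementation used in this work. All names (MultimediaPipeline, PipelineResult, and the stub_* functions) are hypothetical, and each stage is a placeholder callable standing in for a real detector, OCR engine, description generator, translation model, and speech synthesizer.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class PipelineResult:
    """Outputs collected from each stage (hypothetical container)."""
    objects: List[str]
    text: str
    description: str
    translations: Dict[str, str] = field(default_factory=dict)
    audio: Dict[str, bytes] = field(default_factory=dict)


@dataclass
class MultimediaPipeline:
    """Chains the five stages; every stage is an injected callable (assumption)."""
    detect: Callable[[bytes], List[str]]          # frame -> object/activity labels
    ocr: Callable[[bytes], str]                   # frame -> extracted text
    describe: Callable[[List[str], str], str]     # labels + text -> description
    translate: Callable[[str, str], str]          # description + language -> translation
    synthesize: Callable[[str, str], bytes]       # translation + language -> audio bytes

    def run(self, frame: bytes, languages: List[str]) -> PipelineResult:
        objects = self.detect(frame)
        text = self.ocr(frame)
        description = self.describe(objects, text)
        result = PipelineResult(objects, text, description)
        # Translate the description once per target language, then narrate it.
        for lang in languages:
            translated = self.translate(description, lang)
            result.translations[lang] = translated
            result.audio[lang] = self.synthesize(translated, lang)
        return result


# Placeholder stages so the sketch runs without any external models.
def stub_detect(frame: bytes) -> List[str]:
    return ["person", "signboard"]


def stub_ocr(frame: bytes) -> str:
    return "EXIT"


def stub_describe(objects: List[str], text: str) -> str:
    return f"Detected {', '.join(objects)}; visible text: '{text}'."


def stub_translate(text: str, lang: str) -> str:
    return f"[{lang}] {text}"  # tags the text instead of translating it


def stub_synthesize(text: str, lang: str) -> bytes:
    return text.encode("utf-8")  # stands in for synthesized audio


pipeline = MultimediaPipeline(
    stub_detect, stub_ocr, stub_describe, stub_translate, stub_synthesize
)
result = pipeline.run(b"raw-frame-bytes", ["hi", "kn"])
print(result.description)
print(result.translations["kn"])
```

Passing the stages in as callables keeps the sketch independent of any particular model or library; a real system would replace each stub with the corresponding component.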