International Scientific Journal of Engineering and Management

An International Scholarly || Multidisciplinary || Open Access || Indexing in all major Database & Metadata
The journal follows the UGC Guidelines and is evaluated for inclusion in the Web of Science
ISSN: 2583-6129

Impact Factor: 8.072

Real-Time Offline AI Image Describer for Visually Impaired: BLIP-Based Captioning with Hazard Prioritisation, OCR Integration, and Cosine-Similarity Change Detection

Version
File Size 911.85 KB
Downloads 0
Files 1
Published 17 May 2026
Updated 17 May 2026

Real-Time Offline AI Image Describer for Visually Impaired: BLIP-Based Captioning with Hazard Prioritisation, OCR Integration, and Cosine-Similarity Change Detection

Authors:

Kanishka Singhal, Mansi, Dr. Archana Kumar

AbstractMore than 285 million people worldwide live with some form of visual impairment, yet the AI-powered assistive tools most widely available to them—Microsoft Seeing AI, Google Lookout, and similar cloud-hosted applications—share a cluster of practical shortcomings that limit real-world usefulness: they stop working whenever internet connectivity drops, they read every scene element in the same neutral tone regardless of danger, and they repeat themselves needlessly when nothing meaningful has changed. This paper describes a system built specifically to address each of those problems within a single, self-contained pipeline. The approach combines the Salesforce BLIP vision-language model for natural-language scene description, Tesseract OCR v5 for reading text embedded in the scene, cosine-similarity comparison of BLIP image embeddings to decide whether a new description is actually worth saying, and a keyword-based hazard detector that escalates urgent warnings through a faster, more prominent text-to-speech voice—all running offline through pyttsx3 on ordinary laptop hardware.

We evaluated the system on 150 annotated frames drawn from five different scene types. Overall caption accuracy came out to 76.0%, rising to 86.7% in well-lit indoor conditions. OCR reached an F1 score of 80.7% across scene-text categories, hazard recall was 91.3% with a non-hazard precision of 100%, and change detection cut redundant audio output by 55%. A side-by-side comparison with four deployed assistive systems confirms that none of them simultaneously covers all four capability dimensions without requiring GPU hardware. We also provide a formal mathematical treatment of the caption generation objective, the change detection criterion, and the hazard function. Because the system runs entirely on a CPU with 4 GB of RAM, it represents a meaningful step toward genuinely accessible assistive technology in resource-constrained settings.

Index Termsassistive technology, BLIP, cosine similarity, hazard detection, image captioning, OCR, offline AI, text-to-speech, visually impaired, Vision Transformer (ViT).

Download
or download free
[changelog]

Categories & Tags

Similar Downloads

No related download found!
ISJEM Journal

Author's Blog

What is the difference between a Research Paper and a Review Paper?

A research paper and a review paper are both scholarly documents, but they serve different purposes and have different characteristics....
Read More
Author's Blog

What is DOI?

A Digital Object Identifier (DOI) is a unique alphanumeric string that is used to identify and provide a persistent link...
Read More
Author's Blog

What do you need to do during production of your Research Paper?

During the production of a research paper, the following steps need to be taken: conducting research, organizing and analyzing data,...
Read More
Author's Blog

What are the advantages of publishing a research paper?

Publishing a research paper can have many advantages for researchers, including: Career advancement, professional recognition, opportunities for collaboration, increased visibility,...
Read More
Author's Blog

Ways to Support your Academic Wellbeing which preparing the Research Paper/Article

To support your academic wellbeing while publishing a research paper, it's important to set realistic goals, manage your time effectively,...
Read More
Author's Blog

How to improve your Research Paper writing Skills?

Read extensively: One of the best ways to improve your research paper skills is to read extensively in your field...
Read More
Author's Blog

Is DOI compulsory to publish a research paper in a Journal?

DOI is not strictly required to publish a research paper, but it is highly recommended. Basically, the International Scientific Journal...
Read More
Author's Blog

In what ways does research paper give weight to career development?

Publishing a research paper can give weight to a researcher's career development in several ways, such as: establishing oneself as...
Read More
Author's Blog

How to develop a Research Paper from Scratch

Developing a research paper involves several steps including: choosing a topic, conducting background research, formulating a research question or hypothesis,...
Read More
Author's Blog

How Plagiarism report plays crucial role in Research Paper Publication?

Plagiarism is a major concern in the academic and research community, as it undermines the integrity of the research and...
Read More