International Scientific Journal of Engineering and Management

An International Scholarly || Multidisciplinary || Open Access || Indexing in all major Database & Metadata
ISSN: 2583-6129

Impact Factor: 6.674

Smart Image To Text And Text To Speech Recognition Using Machine Learning

  • Version
  • Download 122
  • File Size 532.08 KB
  • File Count 1
  • Create Date 23 June 2023
  • Last Updated 23 June 2023

Smart Image To Text And Text To Speech Recognition Using Machine Learning

Dr.S.C.Wagaj, Mrs.Lata More, Harshada Kamble, Nikita Pakhale

Electronics & Telecommunication
JSPM’S Rajarshi Shahu College of Engineering
Pune, India

Abstract—The optical character recognition (OCR) and text-to-speech (TTS) concepts are combined in this project. By successfully establishing a voice interface connection with computers, this type of framework helps persons who are visually handicapped. Image to text and text to speech conversion is a technique that uses the OCR method to read and scan 20+ different languages and numbers in the image and converts them to voices. The voice processing module and the picture processing module are both implemented in this project. Numerous methods have been used in the past, such as the Edged Based Method, Connected Component Method, Texture-Based Method, and Mathematical Morphology Method, however they have significant limitations when measured by exactness, f-score, and review. These picture texts can be found in magazines, photographs, newspapers, banners, and other media. The development of intelligent systems to enhance quality of life is the focus of current technological developments in the fields of natural language processing and image processing. An efficient method for text recognition, extraction from images, and text-to-speech conversion is proposed in this paper. In this work, a successful method for text detection, extraction from photos, and text to speech conversion is suggested. The incoming image is first improved by using grey scale conversion. Then, using the maximum stable external areas feature detector, the text portions of the improved image are located. The following step is to use geometric filtering along with a stroke width transform to effectively gather and filter text sections in a picture. The geometric properties and stroke width transform are used to remove the non-text maximum stable exterior regions. Individual letters and alphabets are then grouped to find text sequences, which are subsequently broken up into words. In order to digitize the words, optical character recognition (OCR) is used. The text is converted to speech in the final phase by feeding it through our text-to-speech synthesizer (TTS). On images from documents to nature settings, the suggested method is tested. The correctness and robustness of the suggested framework have been demonstrated by promising findings, which promote its practical use in real-world applications.

Keywords— Image Processing, Text Recognition and Extraction, Maximally stable extremal regions, OCR(Optical Character-Recognition),SWT(Stroke-Width-Transform) TTS(Text-to-speech synthesizer)


Download

Author's Blog

What is the difference between a Research Paper and a Review Paper?

A research paper and a review paper are both scholarly documents, but they serve different purposes and have different characteristics.research...
Read More
Author's Blog

What is DOI?

A Digital Object Identifier (DOI) is a unique alphanumeric string that is used to identify and provide a persistent link...
Read More
Author's Blog

What do you need to do during production of your Research Paper?

During the production of a research paper, the following steps need to be taken: conducting research, organizing and analyzing data,...
Read More
Author's Blog

What are the advantages of publishing a research paper?

Publishing a research paper can have many advantages for researchers, including: Career advancement, professional recognition, opportunities for collaboration, increased visibility,...
Read More
Author's Blog

Ways to Support your Academic Wellbeing which preparing the Research Paper/Article

To support your academic wellbeing while publishing a research paper, it's important to set realistic goals, manage your time effectively,...
Read More
Author's Blog

How to improve your Research Paper writing Skills?

Read extensively: One of the best ways to improve your research paper skills is to read extensively in your field...
Read More
Author's Blog

Is DOI compulsory to publish a research paper in a Journal?

DOI is not strictly required to publish a research paper, but it is highly recommended. Basically, the International Scientific Journal...
Read More
Author's Blog

In what ways does research paper give weight to career development?

Publishing a research paper can give weight to a researcher's career development in several ways, such as: establishing oneself as...
Read More
Author's Blog

How to develop a Research Paper from Scratch

Developing a research paper involves several steps including: choosing a topic, conducting background research, formulating a research question or hypothesis,...
Read More
Author's Blog

How Plagiarism report plays crucial role in Research Paper Publication?

Plagiarism is a major concern in the academic and research community, as it undermines the integrity of the research and...
Read More