International Scientific Journal of Engineering and Management

An International Scholarly || Multidisciplinary || Open Access || Indexing in all major Database & Metadata
ISSN: 2583-6129

Impact Factor: 6.674

Text Extraction from Document Image

  • Version
  • Download 14
  • File Size 593.52 KB
  • File Count 1
  • Create Date 9 May 2023
  • Last Updated 9 May 2023

Text Extraction from Document Image

Anusha C, Saket Mishra, Rohit Metre, Harsh Gurawalia

Department of CSE, PES University, Bangalore-79, Karnataka

Email: anusha20c@gmail.com, saketmishra113@gmail.com, rohitmetre2000@gmail.com, harshrocks2442@gmail.com

Contact: +91 7676240083, +91 9731963460, +91 9611910110, +91 9901360787

Guided By:

Dr.SapnaV.M, Assistant Professor, Dept. of CSE, PES UNIVERSITY,Bangalore,Karnataka

Email: sapnavm@pes.edu

Abstract:. We'll be putting together an OCR system pipeline. A Convolutional neural network will be used to classify each individual character. CNN requires less training than a fully linked network because it has fewer parameters. To make this work, we'll first split the lines, then the words, and ultimately the individual characters that will be sent to CNN. The English character dataset that has been acquired will be used to train the CNN. The EMNIST dataset (Extended Modified National Institute of Standards and Technology) has around 8 lakh samples divided into 62 classes (10 digits + 26 lowercase alphabets + 26 uppercase alphabets). We discovered another CHARS74k dataset since this dataset comprises handwritten characters. CHARS74k has 62 classes, identical to EMNIST, and is a normalized dataset with 1016 samples for each character class. To build words, we will merge the expected character label from CNN. It's possible that the prediction is inaccurate and contains some misclassification. As a result, some adjustments are required. To accomplish this, we will utilize an English word spell checker to locate all similar words and select the most appropriate one.

 

 


Download

Author's Blog

What is the difference between a Research Paper and a Review Paper?

A research paper and a review paper are both scholarly documents, but they serve different purposes and have different characteristics.research...
Read More
Author's Blog

What is DOI?

A Digital Object Identifier (DOI) is a unique alphanumeric string that is used to identify and provide a persistent link...
Read More
Author's Blog

What do you need to do during production of your Research Paper?

During the production of a research paper, the following steps need to be taken: conducting research, organizing and analyzing data,...
Read More
Author's Blog

What are the advantages of publishing a research paper?

Publishing a research paper can have many advantages for researchers, including: Career advancement, professional recognition, opportunities for collaboration, increased visibility,...
Read More
Author's Blog

Ways to Support your Academic Wellbeing which preparing the Research Paper/Article

To support your academic wellbeing while publishing a research paper, it's important to set realistic goals, manage your time effectively,...
Read More
Author's Blog

How to improve your Research Paper writing Skills?

Read extensively: One of the best ways to improve your research paper skills is to read extensively in your field...
Read More
Author's Blog

Is DOI compulsory to publish a research paper in a Journal?

DOI is not strictly required to publish a research paper, but it is highly recommended. Basically, the International Scientific Journal...
Read More
Author's Blog

In what ways does research paper give weight to career development?

Publishing a research paper can give weight to a researcher's career development in several ways, such as: establishing oneself as...
Read More
Author's Blog

How to develop a Research Paper from Scratch

Developing a research paper involves several steps including: choosing a topic, conducting background research, formulating a research question or hypothesis,...
Read More
Author's Blog

How Plagiarism report plays crucial role in Research Paper Publication?

Plagiarism is a major concern in the academic and research community, as it undermines the integrity of the research and...
Read More