AI-Based PDF Translator

Notification

Announcement!

ISJEM Invites papers for various areas like engineering, Management, Science & other multi discplinary subjects. Please submit your paper for review.

ISJEM assigns a digital object identifier (DOI) to each published paper, making it easier for the paper to be cited in various major databases like Google Scholar, ResearchGate, Academia.edu, etc…

ISJEM takes 24–48 hours to publish a research paper. Within 24 hours, the submitted paper will be reviewed and notified of its status, and it will be published once the processing fee is successfully received.

AI-Based PDF Translator

Version

File Size 258.48 KB

Downloads 242

Files 1

Published 12 May 2025

Updated 12 May 2025

AI-Based PDF Translator

Authors:

Rajesh*¹, P. Ashok *², M. sai krishna*³

*¹ Assistant Professor of the Department of CSE (AI & ML) of ACE Engineering College, India.

*^2,3 Students of Department of CSE (AI & ML) of ACE Engineering College, India.

ABSTRACT: Automated document translation plays a critical role in overcoming language barriers and facilitating seamless communication across global industries. This project harnesses the power of Natural Language Processing (NLP) and Optical Character Recognition (OCR) to extract, translate, and reconstruct text from PDF documents while preserving their original layout and formatting. By utilizing Transformer-based models such as GPT and the Google Translate API, alongside robust text extraction tools, the system delivers accurate and efficient multilingual translations. The methodology incorporates Python libraries including PyMuPDF, pdfplumber, Tesseract-OCR, and the OpenAI API to manage text recognition, translation, and reformatting processes. This AI-driven solution aims to enhance accessibility, foster global collaboration, and streamline multilingual document workflows across diverse sectors.

Keywords: PDF Translation, Natural Language Processing (NLP), Optical Character Recognition (OCR), Transformer Models, Multilingual Document Processing.