Meri Awaaz Hi Pehchan Hai: A Survey on Multilingual Podcast Processing System

Notification

Announcement!

ISJEM Invites papers for various areas like engineering, Management, Science & other multi discplinary subjects. Please submit your paper for review.

ISJEM assigns a digital object identifier (DOI) to each published paper, making it easier for the paper to be cited in various major databases like Google Scholar, ResearchGate, Academia.edu, etc…

ISJEM takes 24–48 hours to publish a research paper. Within 24 hours, the submitted paper will be reviewed and notified of its status, and it will be published once the processing fee is successfully received.

Meri Awaaz Hi Pehchan Hai: A Survey on Multilingual Podcast Processing System

Version

File Size 416.39 KB

Downloads 34

Files 1

Published 10 May 2026

Updated 10 May 2026

Meri Awaaz Hi Pehchan Hai: A Survey on Multilingual Podcast Processing System

Dr. Vijayalaxmi Mekali, Arpita Rathod, Bhagyashree, Gagana Poojari, Gayana V

Department of Computer Science and Engineering

K. S. Institute of Technology, Bengaluru – 560109, India vijayalaxmimekali@ksit.edu.in

Abstract—Podcast consumption has increased rapidly in re-cent years, making podcasts an important source of education, entertainment, and information sharing. In multilingual countries like India, audiences prefer podcast content in regional languages such as Kannada, Hindi, and Telugu. Processing and summariz-ing multilingual podcasts remains challenging because of regional accents, code-switching, dialectal variations, and long audio content. The survey reviews multilingual podcast processing and summarization techniques, including Automatic Speech Recog-nition (ASR), Natural Language Processing (NLP), extractive and abstractive summarization, machine translation, and Text-to-Speech systems. The study also examines recent advancements in transformer-based models for low-resource Indian languages and discusses datasets, evaluation metrics, research challenges, and future improvements in multilingual podcast summarization.Index Terms—Keywords: Podcast Summarization, Multilin-gual Automatic Speech Recognition (ASR),Natural Language Processing (NLP), Indian Languages, Transformer Models.