Smart Multilingual News Category Classification using NLP And ML
- Version
- Download 16
- File Size 461.45 KB
- File Count 1
- Create Date 21 March 2026
- Last Updated 21 March 2026
Smart Multilingual News Category Classification using NLP And ML
DR. K. SHASHIKANTH, A. BHAVANI, M. CHARANYA, MD. ADNAN KHAN, K. RUTHWIK
1Associate Professor, Department of CSE(AI&ML), Jyothishmathi Institute of Technology and Science,
Telangana, India.
2
UG Student, Department of CSE(AI&ML), Jyothishmathi Institute of Technology and Science,
Telangana, India. bhavaniadvani@gmail.com
3 UG Student, Department of CSE(AI&ML), Jyothishmathi Institute of Technology and Science, Telangana,
India.
charanyamadipally94@gmail.com
4UG Student, Department of CSE(AI&ML), Jyothishmathi Institute of Technology and Science, Telangana,
India adnankhan.work78@gmail.com
5UG Student, Department of CSE(AI&ML), Jyothishmathi Institute of Technology and Science, Telangana,
India. ruthwik2048@gmail.com
ABSTRACT:The increasing availability of online news in essential.multiple languages has created a strong demand for automated news analysis systems. This paper presents a smart multilingual news category classification framework developed using Natural Language Processing (NLP) techniques. The proposed system is capable of analyzing news articles written in English, Hindi, and Telugu. The system performs multi-label classification by matching predefined multilingual keywords to relevant news categories, generates a brief extractive summary of the input article, and identifies important named entities such as persons, organizations, and locations using multilingual Named Entity Recognition (NER). Language identification is handled using the langdetect library, while entity extraction is carried out using the Stanza NLP toolkit. The application is implemented in Python with a Streamlit-based interface to ensure ease of use and interactivity. Experimental observations indicate that the system efficiently processes multilingual news content and provides meaningful analytical outputs, and future work includes incorporating deep learning and transformer-based models to improve accuracy, robustness, and scalability.
Keywords:Multilingual NLP, News Classification, Named Entity Recognition, Text Summarization, Streamlit,Stanza.
Download