Toxic to Positive Comment Rewriting using Supervised Fine-Tuning and Direct Preference Optimization

Notification

Announcement!

ISJEM Invites papers for various areas like engineering, Management, Science & other multi discplinary subjects. Please submit your paper for review.

ISJEM assigns a digital object identifier (DOI) to each published paper, making it easier for the paper to be cited in various major databases like Google Scholar, ResearchGate, Academia.edu, etc…

ISJEM takes 24–48 hours to publish a research paper. Within 24 hours, the submitted paper will be reviewed and notified of its status, and it will be published once the processing fee is successfully received.

Toxic to Positive Comment Rewriting using Supervised Fine-Tuning and Direct Preference Optimization

Version

File Size 342.69 KB

Downloads 67

Files 1

Published 16 April 2026

Updated 16 April 2026

Toxic to Positive Comment Rewriting using Supervised Fine-Tuning and Direct Preference Optimization

M. Vardhan
Dept. of Computer Science Engineering
RGUKT Basar
Basar, India B200242
P. Divya
Dept. of Computer Science
Engineering RGUKT Basar
Basar, India B200535

G. Krishna Reddy
Dept. of Computer Science Engineering
RGUKT Basar
Basar, India B200596

Abstract—The increased use of harmful, offensive, and disre-spectful language online can be attributed to the rapid growth of social media platforms. Although many existing systems focus on detecting and removing harmful content and toxic comments, this approach does not always encourage constructiveText detoxification is a challenging Natural Language Pro- cessing (NLP) task that requires controlled text generation and contextual understanding. In this study, we propose a transformer-based system for rewriting toxic comments that utilizes both Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO).