Analysis of Offensive Data over Multi-Source Social Media Environment Using Modified Random Forest Algorithm

V, Uma Maheswari and R, Priya (2023) Analysis of Offensive Data over Multi-Source Social Media Environment Using Modified Random Forest Algorithm. International Journal of Electronics and Communication Engineering, 10 (9). pp. 63-71. ISSN 23488549

[thumbnail of IJECE-V10I9P107.pdf] Archive
IJECE-V10I9P107.pdf

Download (728kB)

Abstract

The widespread usage of social media platforms has resulted in an increasing volume of offensive content, posing
significant challenges to maintaining a safe and respectful online environment. This research presents an analysis of offensive data over the social media environment using a modified Random Forest algorithm. The proposed modification to the traditional Random Forest algorithm incorporates a Weighted class Random Forest (WRF) to enhance model diversity and robustness. An algorithm utilizes weighted classes during training to address the inherent class imbalance in offensive data. By assigning higher weights to offensive content, the model prioritizes accurately identifying offensive posts, comments, and messages. This paper used the Twitter and Reddit dataset of multi-source social media content, labeled for offensive and nonoffensive content, to train and validate the modified Random Forest model. Our proposed model is compared with Decision
Tree (DT), Extreme-Gradient Boosting (XGBoost), Multi-Layer Perceptron (MLP), K-Nearest Neighbors (KNN), and Traditional Random Forest (RF) algorithms in machine learning. A number of performance metrics are used to assess the model's effectiveness in dealing with offensive data, including accuracy, recall, precision, specificity, and the F1-score. The results demonstrate that the modified Random Forest algorithm outperforms better than other machine learning algorithms. Moreover, the model shows improved resilience to variations in offensive language and context, making it more suitable for real-world applications.

Item Type: Article
Subjects: Computer Science > Database Management System
Divisions: Computer Science
Depositing User: Mr IR Admin
Date Deposited: 13 Sep 2024 07:35
Last Modified: 13 Sep 2024 07:35
URI: https://ir.vistas.ac.in/id/eprint/5817

Actions (login required)

View Item
View Item