Skip to content

Natural Language Processing project for determining whether a social media tweet,post is offensive or not

Notifications You must be signed in to change notification settings

youssef-kishk/NLP-Social-Media-Offensive-language-detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

NLP-Social-Media-Offensive-language-detection

Offensive language is pervasive in social media. Individuals frequently take advantage of the perceived anonymity of computer-mediated communication, using this to engage in behavior that many of them would not consider in real life. Online communities, social media platforms, and technology companies have been investing heavily in ways to cope with offensive language to prevent abusive behavior in social media.

This project in a Natural Language Processing(NLP) application on determining whether a social media tweet,post is offensive or not.

Attached the complete project report,Source code and the training data set used,but notice that the data set is unbalanced.

The project passses through several steps to reach the final goal of determing whether the social media post,tweet is offensive or not. The steps are: 1-Reading the training Dataset 2-Cleaning of the dataset and removing the noise using stop words 3-Lexicon Normalization uding Lemmitization 4-Features extraction on text data using bag of words model 5-Several classifiers are used to notice the difference between them, the used classifiers in the project are: Random Forest Classifier ,Naïve Bayes Classifier ,Decision Tree Classifier ,Logistic Regression Classifier ,K-Nearest Neighbor Classifier. 6-The final step is tunning of the results to improve it using K-Fold cross validation.

About

Natural Language Processing project for determining whether a social media tweet,post is offensive or not

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages