bad-words-removal

Aim

This project aims to provide an API for removing bad words and improving the overall experience of online communication.

Current Features

Detecting bad words and substituting them with food names
Detecting the toxicity of a comment or text

Pipeline Explanation

1. Detecting bad words and substituting food names: Substitution

a. Cleaning the text with texthero: tokenization and punctuation removal
b. POS tagging
c. Rule-based removal of bad words based on POS tags
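
The three steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the word list `BAD_WORDS`, the mapping `FOOD_BY_POS`, and the functions `clean`, `toy_tagger`, and `substitute` are hypothetical names, and a plain regular expression stands in for the texthero cleaning step. The tagger is pluggable, so a real POS tagger can be passed in place of the toy one.

```python
import re
from typing import Callable, Dict, List, Tuple

# Hypothetical placeholder lexicon; a real deployment would load a curated list.
BAD_WORDS = {"darn", "heck"}

# Hypothetical food substitutes keyed by universal POS tag, so the replacement
# keeps roughly the same grammatical role as the word it replaces.
FOOD_BY_POS: Dict[str, str] = {"NOUN": "pancake", "VERB": "whisk", "ADJ": "crispy"}
DEFAULT_FOOD = "muffin"

Tagger = Callable[[List[str]], List[Tuple[str, str]]]


def clean(text: str) -> List[str]:
    """Step a: lowercase, replace punctuation with spaces, split into tokens."""
    return re.sub(r"[^\w\s]", " ", text.lower()).split()


def toy_tagger(tokens: List[str]) -> List[Tuple[str, str]]:
    """Step b (stand-in): tag every token as NOUN.

    Assumption: a real tagger returning (token, tag) pairs would be used here.
    """
    return [(token, "NOUN") for token in tokens]


def substitute(text: str, tagger: Tagger = toy_tagger) -> str:
    """Step c: replace each bad word with a food name chosen by its POS tag."""
    tagged = tagger(clean(text))
    return " ".join(
        FOOD_BY_POS.get(pos, DEFAULT_FOOD) if token in BAD_WORDS else token
        for token, pos in tagged
    )


print(substitute("What the heck, man!"))
```

With NLTK installed and its tagger data downloaded, `lambda tokens: nltk.pos_tag(tokens, tagset="universal")` could be passed as the `tagger` argument instead of `toy_tagger`.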

2. Detection of toxicity [threat, insult, obscene, toxic, severe_toxic]: Classification

a. Preprocessing of text for the BERT model (tokenization using the BERT tokenizer)
b. Adding layers on top of BERT
c. Fine-tuning on the dataset
d. Validation
e. Predicting for a single query
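
As an illustration of the final step, the sketch below shows one way the raw model outputs for a single query could be turned into per-label decisions. It assumes the classifier emits one logit per label and treats the labels as independent (multi-label), which the list above suggests but does not state. The names `LABELS`, `sigmoid`, and `decode`, and the 0.5 threshold, are hypothetical and not taken from the project's code.

```python
import math
from typing import Dict, List

# Label order is an assumption; it must match the order used during training.
LABELS = ["threat", "insult", "obscene", "toxic", "severe_toxic"]


def sigmoid(x: float) -> float:
    """Map a raw logit to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def decode(logits: List[float], threshold: float = 0.5) -> Dict[str, bool]:
    """Flag each label whose sigmoid probability reaches the threshold."""
    if len(logits) != len(LABELS):
        raise ValueError(f"expected {len(LABELS)} logits, got {len(logits)}")
    return {label: sigmoid(z) >= threshold for label, z in zip(LABELS, logits)}


# Example logits, made up for illustration only.
print(decode([-3.0, 2.0, 0.5, 4.0, -1.0]))
```

A sigmoid per label, rather than a softmax across labels, lets a single comment be flagged as, say, both insult and obscene at the same time.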

Simple and Complete Google Colab Demo

[![Open In Colab]

1. Installation

1.1 Libraries

python -m nltk.downloader universal_tagset
python -m spacy download en

2. Running the code

2.1 Removing Bad Words

Show Output

'Boolean Questions': ['Is sachin ramesh tendulkar the highest run scorer in '
                       'cricket?',
                       'Is sachin ramesh tendulkar the highest run scorer in '
                       'cricket?',
                       'Is sachin tendulkar the highest run scorer in '
                       'cricket?']

harshil15999/bad-words-removal
