This repository is part of the Intel Unnati Industrial Training Program 2024. It aims to analyze sentiments from online reviews of Intel products using various natural language processing techniques.
This project involves the extraction, cleaning, and analysis of online reviews for Intel products. We utilize sentiment analysis tools such as VADER and RoBERTa to classify the sentiments of these reviews into positive, negative, or neutral categories.
The overall workflow of the project is as follows:
- Data Collection: Scraping Amazon product reviews for Intel processors.
- Data Preprocessing: Cleaning and preparing the data for analysis.
- Feature Engineering: Creating features such as review length and sentiment scores.
- Sentiment Analysis: Using VADER and RoBERTa for sentiment classification.
- Visualization: Generating visualizations such as word clouds and performance comparisons.
Word clouds are generated to visualize the most common words used in positive, neutral, and negative reviews.
The results section includes a comparison of the performance of different sentiment analysis models.
To run this project locally, please follow these steps:
- Clone the repository:
git clone https:// https://github.com/Moses-Mk/Intel-Products-Sentiment-Analysis-from-Online-Reviews.git
- Navigate to the project directory:
cd Intel-Products-Sentiment-Analysis-from-Online-Reviews
- Install the required packages:
pip install -r requirements.txt
To perform sentiment analysis on the dataset, run the following command:
python main.py
Make sure to update the configuration file (config.json
) with the appropriate paths and settings.
Contributions are welcome! Please fork the repository and submit pull requests for any enhancements or bug fixes.
This project is licensed under the MIT License. See the LICENSE file for more details.