geolocating_tweets

This project was done as part of Social Media Mining course taken at ASU. The project is about predicting the user location based on the tweet posted by him. Please see the 'Tweet Based Geolocation.pdf' file for project details.

Pre Processing

1) Loading data from json to csv Run ‘loadfiles.py path’, Give input location to the folder where the original data is present. 2) Removing all non-alphanumeric characters Run ‘remove_chars.csv filename’ 3) Extract hashtags from raw data and merge with the output of above step for different dataset Run ‘hashtags.py raw_filename merge_filename’

Classification

1) ‘nb_classifier.py’ has the implementation of tf-ids, Inverse cluster frequencies and Naïve Bayes classifier. It has different modes: • Run ‘python nb_classifier.py filepath 2 1000’ for Inverse Cluster Frequency Model, change 1000 to any number to increase number of tweets to process • Run ‘python nb_classifier.py filepath 1 1000’ for unique words model, change 1000 to any number to increase number of tweets to process • Search for below line and Change False to True to enable cross validation def main(path, n =1000, cv=False) ‘python nb_classifier.py filepath 1 1000’ to run cross validation.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.spyderproject		.spyderproject
19_19.json		19_19.json
API key.txt		API key.txt
README.md		README.md
Readme.docx		Readme.docx
Tweet Based Geolocation.pdf		Tweet Based Geolocation.pdf
hashtags.py		hashtags.py
liw.csv		liw.csv
loadFiles.py		loadFiles.py
model.R		model.R
nb_classifier.py		nb_classifier.py
preprocess.py		preprocess.py
remove_chars.py		remove_chars.py
results.txt		results.txt
stopwords.csv		stopwords.csv
tfid.py		tfid.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

geolocating_tweets

Pre Processing

Classification

About

Releases

Packages

Languages

akry1/geolocating_tweets

Folders and files

Latest commit

History

Repository files navigation

geolocating_tweets

Pre Processing

Classification

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages