AsterixDB User Defined Functions for Sentiment Analysis of Tweets by Recurrent Neural Networks

UDF's for AsterixDB created during my master's thesis at the Norwegian University of Science and Technology. Will link to paper here once it is written.

The UDF is not working yet and still under development.

Training Data

The neural network in this model has been trained to process text by converting words to floating numbers and running these numbers through a compact embedding layer. To create these word-to-float conversions (or sentence-to-float-array conversions) I've used the words in the tweets in the Stanford Sentiment140 project, which can be downloaded here. To use the neural network for inference of new tweets one necesarily has to use the same word-to-float conversions as the model was trained on, therefore to run the UDF it is necesary to download the training data and update the path-variable inside of WordVec.java.

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
src		src
.DS_Store		.DS_Store
.classpath		.classpath
.factorypath		.factorypath
.gitignore		.gitignore
.project		.project
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AsterixDB User Defined Functions for Sentiment Analysis of Tweets by Recurrent Neural Networks

Training Data

Contributing

License

About

Releases

Packages

torstenbm/asterixdb-dl4j-sentiment-udf

Folders and files

Latest commit

History

Repository files navigation

AsterixDB User Defined Functions for Sentiment Analysis of Tweets by Recurrent Neural Networks

Training Data

Contributing

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages