Skip to content

gkaradzhov/ClickbaitRANLP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 

Repository files navigation

We Built a Fake News & Click-bait Filter: What Happened Next Will Blow Your Mind!

Paper abstract:

It is completely amazing! Fake news and click-baits have totally invaded the cyber space. Let us face it: everybody hates them for three simple reasons. Reason №2 will absolutely amaze you. What these can achieve at the time of election will completely blow your mind! Now, we all agree, this cannot go on, you know, somebody has to stop it. So, we did this research on fake news/click-bait detection and trust us, it is totally great research, it really is! Make no mistake. This is the best research ever! Seriously, come have a look, we have it all: neural networks, attention mechanism, sentiment lexicons, author profiling, you name it. Lexical features, semantic features, we absolutely have it all. And we have totally tested it, trust us! We have results, and numbers, really big numbers. The best numbers ever! Oh, and analysis, absolutely top notch analysis. Interested? Come read the shocking truth about fake news and click-bait in the Bulgarian cyber space. You won't believe what we have found!

Authors:

Georgi Karadzhov, Pepa Gencheva, Preslav Nakov, Ivan Koychev

Please, cite the following paper if you use the resources below:

@InProceedings{RANLP2017:clickbait,
  author    = {Georgi Karadzhov and Pepa Gencheva and Preslav Nakov and Ivan Koychev},
  title     = {We Built a Fake News \& Click-bait Filter: What Happened Next Will Blow Your Mind!},
  booktitle = {Proceedings of the 2017 International Conference on Recent Advances in Natural Language Processing},
  month     = {September},
  year      = {2017},
  address   = {Varna, Bulgaria},
  series    = {RANLP~'17}
}

Resources

Name Short description Link
News Bulgarian news, each labeled wheter it is factual or not and whether it is a clickbait or not Download
LDA LDA topic models generated with gensim on ~100 000 bulgarian news articles Download
Word2Vec Word2Vec model generated with gensim on ~100 000 bulgarian news articles Download
Stopwords Dictionary with stop words Download
PMI-content-clickbait Calculated PMI scores over article content in regards to clickbait label Download
PMI-content-non-factual Calculated PMI scores over article content in regards to not-factual label Download
PMI-header-clickbait Calculated PMI scores over article header in regards to clickbait label Download
PMI-header-non-factual Calculated PMI scores over article header in regards to not-factual label Download
Typos List of words that are frequently mistyped in Bulgarian Download
Foreign Words List of words with foreign origin used in Bulgarian language. Download
Frequency List Frequency list of Bulgarian words, taken from Wikpedia. Download

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published