It is completely amazing! Fake news and click-baits have totally invaded the cyber space. Let us face it: everybody hates them for three simple reasons. Reason №2 will absolutely amaze you. What these can achieve at the time of election will completely blow your mind! Now, we all agree, this cannot go on, you know, somebody has to stop it. So, we did this research on fake news/click-bait detection and trust us, it is totally great research, it really is! Make no mistake. This is the best research ever! Seriously, come have a look, we have it all: neural networks, attention mechanism, sentiment lexicons, author profiling, you name it. Lexical features, semantic features, we absolutely have it all. And we have totally tested it, trust us! We have results, and numbers, really big numbers. The best numbers ever! Oh, and analysis, absolutely top notch analysis. Interested? Come read the shocking truth about fake news and click-bait in the Bulgarian cyber space. You won't believe what we have found!
Georgi Karadzhov, Pepa Gencheva, Preslav Nakov, Ivan Koychev
Please cite the following paper if you use the resources below:
@InProceedings{RANLP2017:clickbait,
author = {Georgi Karadzhov and Pepa Gencheva and Preslav Nakov and Ivan Koychev},
title = {We Built a Fake News \& Click-bait Filter: What Happened Next Will Blow Your Mind!},
booktitle = {Proceedings of the 2017 International Conference on Recent Advances in Natural Language Processing},
month = {September},
year = {2017},
address = {Varna, Bulgaria},
series = {RANLP~'17}
}
Name | Short description | Link |
---|---|---|
News | Bulgarian news articles, each labeled as factual or not and as clickbait or not | Download |
LDA | LDA topic models generated with gensim on ~100 000 Bulgarian news articles | Download |
Word2Vec | Word2Vec model generated with gensim on ~100 000 Bulgarian news articles | Download |
Stopwords | Dictionary with stop words | Download |
PMI-content-clickbait | PMI scores calculated over article content with respect to the clickbait label | Download |
PMI-content-non-factual | PMI scores calculated over article content with respect to the non-factual label | Download |
PMI-header-clickbait | PMI scores calculated over article headers with respect to the clickbait label | Download |
PMI-header-non-factual | PMI scores calculated over article headers with respect to the non-factual label | Download |
Typos | List of words that are frequently mistyped in Bulgarian | Download |
Foreign Words | List of words of foreign origin used in the Bulgarian language | Download |
Frequency List | Frequency list of Bulgarian words, taken from Wikipedia | Download |
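
For readers unfamiliar with the PMI resources listed in the table, the following is a minimal sketch of how pointwise mutual information between a word and a label can be computed. It is not the code used to build the downloadable files: the function name `pmi_scores`, the use of document-level counts, the base-2 logarithm, and the toy English tokens are all assumptions made for illustration.

```python
import math
from collections import Counter


def pmi_scores(docs, labels, target_label):
    """Compute PMI(word, target_label) from document-level counts.

    docs   -- list of token lists, one per article (hypothetical input format)
    labels -- list of labels, parallel to docs
    Returns a dict mapping each word seen with target_label to its PMI score.
    """
    n_docs = len(docs)
    n_target = sum(1 for y in labels if y == target_label)

    word_df = Counter()         # number of documents containing the word
    word_target_df = Counter()  # same, restricted to target_label documents
    for tokens, y in zip(docs, labels):
        for w in set(tokens):   # count each word once per document
            word_df[w] += 1
            if y == target_label:
                word_target_df[w] += 1

    p_label = n_target / n_docs
    scores = {}
    for w, joint in word_target_df.items():
        p_joint = joint / n_docs
        p_word = word_df[w] / n_docs
        # PMI = log2( P(word, label) / (P(word) * P(label)) )
        scores[w] = math.log2(p_joint / (p_word * p_label))
    return scores


# Toy data, invented for illustration only.
docs = [
    ["shock", "news"],
    ["shock", "truth"],
    ["report", "news"],
    ["report", "data"],
]
labels = ["clickbait", "clickbait", "factual", "factual"]

scores = pmi_scores(docs, labels, "clickbait")
for word, score in sorted(scores.items()):
    print(word, round(score, 3))
```

In this sketch, a positive score means the word appears with the label more often than chance would predict, and a score near zero means no association. Words that never occur with the target label are omitted, since their PMI would be undefined.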