Dataset of articles from The Onion and real but "Onion-like" news articles from the subreddit r/NotTheOnion, along with a Jupyter notebook that extracts the dataset and performs classification. The Onion articles are labeled 1 and the r/NotTheOnion articles are labeled 0.
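As a rough illustration of the labeling scheme, the sketch below reads a small CSV and counts rows per class. The column names `text` and `label`, the helper `load_rows`, and the sample rows are all assumptions made for this example. They are placeholders, not taken from the actual dataset file or the notebook.

```python
import csv
import io
from collections import Counter

# Hypothetical stand-in for the dataset file. The column names
# ("text", "label") and these rows are assumed placeholders, not real data.
SAMPLE_CSV = """text,label
placeholder Onion headline A,1
placeholder Onion headline B,1
placeholder NotTheOnion headline C,0
"""

# Label convention stated in the description: 1 = The Onion, 0 = r/NotTheOnion.
LABEL_NAMES = {1: "The Onion", 0: "r/NotTheOnion"}


def load_rows(handle):
    """Read (text, label) pairs from a CSV handle, converting labels to int."""
    reader = csv.DictReader(handle)
    return [(row["text"], int(row["label"])) for row in reader]


rows = load_rows(io.StringIO(SAMPLE_CSV))

# Count how many rows fall into each class to check the balance.
counts = Counter(label for _, label in rows)
for label, n in sorted(counts.items()):
    print(f"{LABEL_NAMES[label]} (label {label}): {n} rows")
```

To try this on the real data, pass an opened file handle to `load_rows` instead of the in-memory string and adjust the column names to match the file.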