News Headlines Dataset
For research purposes. Available in JSON format.
See headlines.json file for dataset. And script.js file for scraping script written in javascript.
Compiled by the team at Fresh Web Designs
Sentiment: Data includes sentiment score. We use a AFINN-165 wordlist and Emoji Sentiment Ranking to perform sentiment analysis on arbitrary blocks of input text.
Source: Headlines are compiled from Reddit.com every 1 hour. Please note, 'published' data point is when a headline was published on Reddit, not the article's website.
Subreddits: r/news, r/worldnews, r/technology, r/television, r/entertaiment, r/politics, r/sports
Need Custom Scraping?
We're a software company based out of Miami, Florida, US. We work on projects all over the world, large and small. We accept most Crypto. Everything is done on Github. Reach out @ hello@fwd.dev
Donate
We accept Crypto donations at the following addresses:
# Nano
nano_3gf57qk4agze3ozwfhe8w6oap3jmdb4ofe9qo1ra3wcs5jc888rwyt61ymea
# Bitcoin
bc1qcgvew2a7ta3f7xy5999tdwyd8clrvdtpe2uvj5
# Doge
D9U1FLygkMydx3DE2sXgnuFpHm7ePm3Zwe
# Ethereum
0xdD4691Dc9562FB262e4b2076bE255303243f271d