The following dataset contains data on blog posts from MarginalRevolution.com. For posts from Jan. 1, 2010 to 9/17/2016, the following attributes are gathered.
Author Name
Post Title
Post Date
Post content (words)
Number of Words in post
Number of Comments in post
Dummy variable for several commonly used categories
The data was scraped using Python's Beautiful Soup package, and cleaned in R. See my github page (https://github.com/wnowak10/) for the Python and R code.