I took the top 10 daily posts for the last 3 years from Hacker News Daily (sorry about the few thousand wgets last night!) and tallied the occurrence of words following other words (or groups of words). This generator probabilistically creates Hacker News titles by looking at the previous N words (N = lookback) and sampling from the distribution of words that followed them in the collected titles.
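For readers who want the idea in code, here is a minimal sketch of that tally-and-sample approach. It is not the code linked below; the names `build_model`, `generate_title`, and the `START`/`END` padding tokens are assumptions made up for this illustration, and it assumes a lookback of at least 1.

```python
import random
from collections import Counter, defaultdict

# Hypothetical marker tokens (not from the original project): they pad the
# start of each title and mark where a title ends.
START = "<s>"
END = "</s>"


def build_model(titles, lookback=2):
    """Tally which word follows each group of `lookback` preceding words."""
    model = defaultdict(Counter)
    for title in titles:
        words = [START] * lookback + title.split() + [END]
        for i in range(lookback, len(words)):
            context = tuple(words[i - lookback:i])
            model[context][words[i]] += 1
    return model


def generate_title(model, lookback=2, max_words=20, rng=random):
    """Sample one title, picking each word in proportion to its tally."""
    context = (START,) * lookback
    out = []
    while len(out) < max_words:
        followers = model.get(context)
        if not followers:
            break  # context never seen in the training titles
        words, counts = zip(*followers.items())
        next_word = rng.choices(words, weights=counts, k=1)[0]
        if next_word == END:
            break
        out.append(next_word)
        # Slide the window: drop the oldest word, append the new one.
        context = context[1:] + (next_word,)
    return " ".join(out)
```

Something like `generate_title(build_model(titles, lookback=2), lookback=2)` would then produce one title. A smaller lookback tends to give more scrambled results, while a larger one tends to reproduce the original titles more closely.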
Code can be found here.
List of titles can be found here.
- Python, the Web with Emscripten
- Mark Cuban: What Business is Wall Street Journal
- BankSimple: We have an effective vaccine for Malaria
- Reasons to Open Source
- It’s the founding CEO’s job to China'
- Advanced Data Structures MIT
- A list of lists
- Our office is too slow for programming competitions
- The Product is the Internet too much
- More Americans see man who screwed an entire IPv4 /8 that it isn't using
