The World Wide Web is enormous, free, immediately available, and largely linguistic. As we discover, on ever more fronts, that language analysis and generation benefit from big data, so it becomes appealing to use the Web as a data source. The question, then, is how.
Adam Kilgarriff studied this question.