Text this: Large-Scale Pattern-Based Information Extraction from the World Wide Web