Text this: Bootstrapping Information from Corpora in a Cross-Linguistic Perspective