Data
Here are some datasets I've used to train and evaluate machine learning algorithms:
Rexa Coreference
Contact Record Extraction
Wikipedia Relation Extraction
Publication Venue Canonicalization