I am building a Named Entity Recognizer with a Conditional Random Field and am looking for two things:
A) An open source, English NER dataset for Person, Location, and Organization entities
B) A list of English NER features
I have already looked at the CoNLL-2003 corpus and found this is exactly what I want but it is not readily available. I have been unsuccessful in finding a list of NER features; I am trying to avoid having to hand design these features.
Thanks