During my internship at Microsoft Research last summer, Sumit Basu, Lucy Vanderwende and I developed a system for generating quizzes from arbitrary text data. Now that we have presented the paper at NAACL:HLT 2012, Microsoft is making the corpus we compiled to enable these experiments publicly available for download. For more details about the corpus …
Tag Archive: NLP
Oct 25
NLP in the Media: The Copiale Cipher
I’m always excited to see NLP research featured in the mass media because it closes the gap between what my mom thinks I do and what I actually do. The most recent buzz has been for work from Kevin Knight, his students at the Information Sciences Institute (ISI) at the University of Southern California (USC), …
Feb 19
Merantau: a Silat player’s review
After several months of waiting, I finally saw Merantau[1], an Indonesian language, martial arts flick. When watching the trailers it was billed as kind of an Ong Bak, but with Muay Thai swapped for Pencak Silat. Merantau more than delivered on this premise. In terms of story, Merantau does very little to differentiate itself. A …
Jan 08
Named Entity Recognition
Hi Wayne, I’ve been trying to figure out an appropriate information model (most likely XML-based) to correspond to my annotation schema, as I have started to form my notions of how this should look to allow for future expansion, ease of use when annotating, and accessibility for feature extraction, I’m kind of rethinking how annotation …