UT Dallas CS undergrads: If you received an A or A+ in at least one of my courses and are interested in the Ph.D. program, feel free to come talk to me.

Potential Ph.D. applicants: Owing to the large number of emails I receive every day, I will not be able to respond to you unless I am interested in your application. Feel free to send me your CV, transcripts, and standardized test scores; if you do not hear from me within a week, I am probably not interested in your application.

Upcoming Events

  • NAACL HLT 2016 Workshop (June 16): CORBON: Coreference Resolution Beyond OntoNotes (with M. Ogrodniczuk)
  • IJCAI 2016 Tutorial (July 11): Coreference Resolution: Successes and Challenges
  • EMNLP 2016 Tutorial (Nov 2): Advanced Markov Logic Techniques for Scalable Joint Inference in NLP (with D. Venugopal and V. Gogate)
  • COLING 2016 Workshop (Dec 12): 3rd Workshop on NLP Techniques for Educational Applications (with H.-H. Chen, Y.-H. Tseng, X. Lu, and L.-H. Lee)

Research Interests

My general research area is machine learning of natural language. My recent projects have focused primarily on developing unsupervised and semi-supervised machine learning techniques for natural language processing, with the goal of reducing the amount of annotated data needed to build NLP applications and process resource-scarce languages. Areas in which my group has been working include:

More recently, I have collaborated with other research groups and worked on non-NLP problems. I am particularly interested in developing and applying search and learning techniques to computational problems that arise in areas such as:


Professional Activities

  • Local organizer, North American Computational Linguistics Olympiad (NACLO) Dallas site (2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016)
  • National Science Foundation CISE IIS review panel (2008, 2009, 2010, 2014, 2016)
  • Editorial board member, Computational Linguistics (2014-2016)
  • Information officer, SIGHAN (2014-2017)
  • Workshop co-organizer
    • COLING 2016 Workshop on Natural Language Processing Techniques for Educational Applications (NLP-TEA-3)
    • NAACL HLT 2016 Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2016)
  • Tutorials
    • EMNLP 2016 Tutorial (Nov 2): Advanced Markov Logic Techniques for Scalable Joint Inference in NLP (with D. Venugopal and V. Gogate)
    • IJCAI 2016 Tutorial (July 11): Coreference Resolution: Successes and Challenges
  • Journal referee [I am overwhelmed with reviewing and cannot accept journal review requests at this time unless they are from CL or TACL.]
  • Current organizing and program committees
    • Steering Committee, 11th International Workshop on Semantic Evaluation (SemEval 2017)
    • Senior Program Committee, Thirty-First AAAI Conference on Artificial Intelligence (AAAI 2017)
    • Student Scholarship Chair and Student Volunteer Coordinator, 2016 Conference on Empirical Methods in NLP (EMNLP 2016)
    • Area Chair, 26th International Conference on Computational Linguistics (COLING 2016)
    • Senior Program Committee, Thirtieth AAAI Conference on Artificial Intelligence (AAAI 2016)

Teaching

  • CS 4365: Artificial Intelligence (Spring 2006, 2007, 2008, 2009, 2010, 2012, 2013, 2014, 2015, 2016)
  • CS 4365H: Artificial Intelligence Honors (Spring 2010, 2011, 2012, 2013, 2014)
  • CS 4375: Introduction to Machine Learning (Fall 2006, 2007, 2008, 2009, 2011, 2012, 2013, 2014, 2015, 2016; Spring 2016)
  • CS 6364: Artificial Intelligence (Spring 2009, 2011)
  • CS 6375: Machine Learning (Fall 2005, 2006, 2007, 2010; Spring 2014 (co-taught with Yang Liu))
  • CS 2305: Discrete Math for Computing I (Fall 2004)

Miscellaneous Stuff

  • Computer Science Rankings by conference publications
  • Microsoft Academic Search's ranked list of North American organizations in natural language and speech based on field rating in the last 10 years and 5 years.
  • I am the professor of artificial intelligence for the undergrads. I have had some good times with them at events like this and this.
  • Wanna solve some puzzles? Here is one for you.
  • I have never seen someone in NLP who is so genuinely interested in a bag of words.
  • I have been asked many times whether Andrew Ng is my brother. He isn't, but we were introduced to each other by a mutual friend when we were undergrads.
  • In case you want to know whether you are my doctoral cousin, here is my doctoral ancestry (and the years in which they obtained their degrees): Claire Cardie (1994) → Wendy Lehnert (1977) → Roger Schank (1969) → Jacob Mey (1960) → Louis Hjelmslev (1923) → Holger Pedersen (1897).

