Upcoming Events

Friday, 11/13/09
7:30 pm
Dinner at the Columbia Cottage. Join us for food and conversation – all are welcome! RSVP on Facebook.

Friday, 11/20/09
3 pm (location TBA)
A computational linguistics/natural language processing presentation by Nizar Habash of the Center for Computational Learning Systems:

Automatic Diacritization of Arabic Text

Arabic is written without certain orthographic symbols, called diacritics, which represent among other things short vowels. The restoration of diacritics to written Arabic is an important processing step for several computational linguistic applications, including training language models for automatic speech recognition, text-to- speech generation, and so on. We present here a new diacritization system for written Arabic based on a new combination of known techniques: a lexical resource for morphological analysis, a multi-classifier tagger and a lexeme language model. This new diacritization system outperforms the best previously published results by reducing the word error rate to 14.9% and reducing the diacritic error rate to 4.8%. The presentation includes a detailed error analysis classifying the type of errors resolved by each of the different modules used.

Friday, 12/11/09
Time and location TBA
Peter Connor of Barnard College will give a lecture on translation. More details to come.

Advertisements

One Response

  1. Where is Nizar Habash’s presentation going to be? It’s still TBA. Thanks.

    – Benjamin L. Mayersohn

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: