WIT3

Web Inventory of Transcribed and Translated Talks

Home 2014-01 Tools

Linguistic preprocessors for Arabic, Farsi and Hebrew have been developed by experts for cleaning and/or normalize TED texts.
The experts are:

These preprocessors were used for the development and the scoring of baseline systems and will be employed for the official evaluation of submissions.

You can download them from <here>.