IWSLT 2016

Training, development and evaluation sets for Arabic Hebrew

The IWSLT 2016 Evaluation Campaign does not include any task on the Arabic/Hebrew pair. Exceptionally, for both Arabic-to-Hebrew and Hebrew-to-Arabic directions, here you can find training, development and evaluation sets built upon the latest available XML files (April 2016) of the two languages.

The archive with training, development and evaluation sets is available at this link.

Acknowledgments: This release was developed with the valuable contribution of Yonatan Belinkov, MIT (Massachusetts, United States).

If you use this corpus in your work, please cite the paper:

M. Cettolo. 2016. An Arabic-Hebrew parallel corpus of TED talks. In Proc. of the AMTA Workshop on Semitic Machine Translation (SeMaT), Austin, US-TX. pdf, bib.