WIT3

Web Inventory of Transcribed and Translated Talks

Home 2019-01-dees Training and development sets for the MT track

Training, development and test sets for German-Spanish (both directions) are linked to the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of parallel training data.

If you use this corpus in your work, please cite the paper:

M. Cettolo, C. Girardi, and M. Federico. 2012. WIT3: Web Inventory of Transcribed and Translated Talks. In Proc. of EAMT, pp. 261-268, Trento, Italy. pdf, bib.


de

es
de 4.33
es4.09