Out-of-domain corpus

0
ID: 
5.2
Task leader: 
cristina.españa
Assignees: 
cristina.españa
Assignees: 
lluis.marquez
Status: 
Completed

A parallel general purpose corpus compilation. It will be built from public corpora provided for the 2010 Workshop on Machine Translation (WMT 2010).

A selection of 2,000,000 aligned sentences is already available and it will be used to train statistical systems and compare them with the in-domain results.