Description of the final collection of corpora

ID: D5.1
Nature: Regular Publication
Dissemination level: Public
Due date: 1 September 2011

This document reports on the collection of corpora needed for the translation systems developed within the work package. First, it introduces the framework and application domain of the work package, with particular attention to the structure of patents. A description of the methodology and content of the in-domain and out-of-domain corpora follows. Finally, we summarise the current status of the work package with respect to data collection.

Attachment: D51c.pdf (414.09 KB)