Patent Corpora

1 Jun 2010 15:17
Europe/Vienna
ID: 
7.2
Workpackage: 
Case Study: Patents
Task leader: 
meritxell.gonzalez
Assignees: 
cristina.espaƱa
Assignees: 
lluis.marquez
Status: 
Completed
Timeframe: 
Jun 2011 - Oct 2012
Completed on: 
30 November, 2012 - 23:00

Determining and gathering of bilingual and monolingual corpora for the patent case study.

  • SMT system is trained with te MAREC corpus (WP5).
  • EPO dataset is used for testing pourposes (WP5).
  • www-EPO dataset will be used to fill the retrieval databases (WP7)