Patent Corpora
Timeframe:
Jun 2011 - Oct 2012
Completed on:
30 November, 2012 - 23:00
Determining and gathering of bilingual and monolingual corpora for the patent case study.
- SMT system is trained with te MAREC corpus (WP5).
- EPO dataset is used for testing pourposes (WP5).
- www-EPO dataset will be used to fill the retrieval databases (WP7)