warning: Creating default object from empty value in /home/local/www/molto-project.eu/sites/all/modules/i18n/i18ntaxonomy/i18ntaxonomy.pages.inc on line 34.

     

WP5

WP5 Statistical and robust translation - M39

Summary of progress

The main goal of this WP has been to develop a hybrid system between GF and SMT specialised in patent translation and has implied the construction of new resources on the domain and conceiving techniques to integrate both technologies. The WP has also tasks devoted to widen GF and have been focused on building general purpose lexicons.

WP5+WP7 skype meeting minutes 2013.03.25

  • Patents flagship

We will try to integrate the one-click system with the translation tools or a hybrid based on robust parsing if available. We can decide the final architecture after the deliverables.

WP5 Statistical and robust translation - M30

Summary of progress

The milestone MS8 has been achieved in M30 (Translation tool complete), which for WP5 meant to have a complete system integrating the grammar and STM.

WP2, WP3, WP5, WP6, WP7, WP10 Meeting 2012.04.04

Participants: OC, MA, RE, CE, MG

Topic: how to exploit the current MOLTO results and tools in the general panorama of e-science and WDML, and CLARIN and .....

Minutes

Malin and Ramona have worked on developing a Swedish parser based on a wide coverage grammar, Ramona and Txell are handling the patents language, and Jordi the mathematical fragments, plus we have the KRI from Ontotext. It seems that all this should come together now :)

So how hard is the following challenge:

  • to harvest meaningful data that has appeared in the scientific literature?

WP5 Statistical and robust translation - M24

Summary of progress

The milestone MS7 has been achieved in M24 (First prototypes of hybrid combination: The methods are implemented and evaluated on a specific test set).

The work of the fourth semester corresponds to the three last tasks of the WP (T5.4 Baseline systems, T5.5 Hybrid Models and T5.6 Systems evaluation, see http://www.molto-project.eu/workplan/statistical-and-robust-translation). The baseline systems have been improved by extending the GF translator.

Patents WG

This WG is the same as Hybrid Systems & Patents WG .

Hybrid Systems & Patents WG

Minutes

Participants: Aarne, Ramona, Milen, Borislav, Cristina, Meritxell

Decisions taken

Write GF grammars to solve problems in the SMT system: compound and biological names, word reordering, gender agreement.

Biological names are different than compound names and raise different challenges. LauriC can provide a database of compounds and biological names.

Increase parser robustness by chunking the claims and parsing the chunks separately and recombine the results with the help of the grammar.

Syndicate content