krasimir.angelov's blog

First experiments for patent translation from English to German and Bulgarian by using the robust GF parser

Recently I did a major improvement in the efficiency of the robust GF parser and now I am eager to try it out on the patent corpus. The first thing to notice is that although the parser is now usable even for nontrivial sentences, for some long sentences it still fails with "out of memory" error. The borderline is somewhere around sentence length of 20 tokens. There is still room for improvement and I know what must be done but before that I want to do a pilot experiment in translation.

Syndicate content

The views expressed in this blog are personal and do not in any way reflect the view of the MOLTO Consortium