Welcome to the MOLTO website!

Sites

Follow us

To follow us, subscribe to the MOLTO RSS Feed or read our Twitter feed. Occasionally we stream live video from MOLTO events. You can watch us on Ustream and also recorded on YouTube.

Contact us

Let us know what you think by using the MOLTO contact form.

MOLTO Languages

MOLTO translation is based on GF and its resource grammar library, which currently covers the languages shown in the diagram besides here. MOLTO tools will enable the specialization of both the Abstract Syntax and the language fragments to idiomatic usages in new domains of application.

Project Abstract

MOLTO's goal is to develop a set of tools for translating texts between multiple languages in real time with high quality. Languages are separate modules in the tool and can be varied; prototypes covering a majority of the EU's 23 official languages will be built.

As its main technique, MOLTO uses domain-specific semantic grammars and ontology-based interlinguas. These components are implemented in GF (Grammatical Framework), which is a grammar formalism where multiple languages are related by a common abstract syntax. GF has been applied in several small-to-medium size domains, typically targeting up to ten languages but MOLTO will scale this up in terms of productivity and applicability.

A part of the scale-up is to increase the size of domains and the number of languages. A more substantial part is to make the technology accessible for domain experts without GF expertise and minimize the effort needed for building a translator. Ideally, this can be done by just extending a lexicon and writing a set of example sentences.

The most research-intensive parts of MOLTO are the two-way interoperability between ontology standards (OWL) and GF grammars, and the extension of rule-based translation by statistical methods. The OWL-GF interoperability will enable multilingual natural-language-based interaction with machine-readable knowledge. The statistical methods will add robustness to the system when desired. New methods will be developed for combining GF grammars with statistical translation, to the benefit of both.

MOLTO technology will be released as open-source libraries which can be plugged in to standard translation tools and web pages and thereby fit into standard workflows. It will be demonstrated in web-based demos and applied in three case studies: mathematical exercises in 15 languages, patent data in at least 3 languages, and museum object descriptions in 15 languages.

Fact Sheet

Challenge

High-quality translation between many simultaneous languages on the web. We address 15 languages, including 12 of EU's official languages. The tool is directed for producers of documents, rather than consumers. Thus we are not satisfied with browsing quality, but on the other hand we need not have broad coverage.

Goal

Develop tools that enable web content providers to create publishing-quality translations automatically. The tools apply to specific domains, and require an initial adaptation of the translation system to each domain.