WP11: Multilingual Semantic Wiki


Kaarel Kaljurand
Institute of Computational Linguistics, University of Zurich

5th MOLTO Project Meeting, Utrecht
2012-09-20

Presenter Notes

Tasks of WP11

  • multilingual Attempto Controlled English (ACE)
    • D11.1 (2012-06-01)
  • CNL-based multilingual semantic wiki
    • D11.2 (2012-12-01)
    • D11.3 (2013-04-01)

Presenter Notes

Multilingual ACE (D11.1)

Presenter Notes

ACE

  • goal: user-friendly language for formal knowledge engineering
  • subset of natural English
  • translatable into Discourse Representation Structures (DRS)
    • and further into standard first-order logic, OWL, various rule languages
    • enables automatic reasoning, e.g. consistency checking, question answering, ...
  • verbalization of formal languages
    • DRS, OWL
  • end-user documentation: construction and interpretation rules
  • editing environments: AceWiki, ACE Editor, ACE View, ...

Presenter Notes

Multilingual ACE

An ACE grammar in GF/RGL adds multiple natural languages as front-ends to ACE.

Multilinguality

Presenter Notes

ACE in GF

  • implementation of the ACE syntax (i.e. no DRS generation)
    • i.e. targeting an existing precisely defined language
  • extension of Angelov and Ranta (CNL 2009)
  • available in 10 natural languages via the RGL
    • Catalan, Dutch, English, Finnish, French, German, Italian, Spanish, Swedish, Urdu
    • design allows for easy extendability
  • focus on the AceWiki subset
    • almost 100% coverage at almost 0% ambiguity
    • some precision problems, e.g. anaphoric references
  • joint work with UGOT (John J. Camilleri)

Presenter Notes

Example

p -lang=Ace "if a person admires no golfer then the person buys
    at least 2 aquariums that nothing but travelers inspect ." | l

si una persona no admira cap golfista llavors la persona compra
    almenys 2 aquarins que nomÈs viatgers inspeccionen .

als een persoon geen golfer bewondert , dan koopt de persoon
    ten minste 2 aquaria die slechts reizigers inspecteren .

if a person admires no golfer then the person buys
    at least 2 aquariums who nothing but travelers inspect .

jos henkilö ei ihaile mitään golfaajaa niin henkilö ostaa
    vähintään 2 akvaariota jonka vain matkustajat tarkastavat .

si une personne n' admire aucun golfeur alors la personne achète
    au moins 2 aquariums que seulement des voyageurs inspectent .

wenn eine Person keinen Golfer bewundert , dann kauft die Person
    wenigstens 2 Aquariume die nur Reisenden inspizieren .

si una persona non ammira nessuno giocatore di golf allora la persona compra
    almeno 2 acquari che soltanto viaggiatori ispezionano .

si una persona no admira hacia golfista entonces la persona compra
    al menos 2 acuarios que solamente viajeros inspeccionan .

om en person beundrar inget golfspelare så personen köper
    minst 2 akvariumar som bara resenärar avsynar .

اگر اك شخص كوی نہیں گولفر کی تعریفتا ہے تو شخص كم از كم
    ۲ ایکویریم جن صرف مسافر معائتے ہیں كو خریدتا ہے

Presenter Notes

Future work

  • use ACE-in-GF in a multilingual wiki
  • add more languages
    • Danish, Norwegian, Bulgarian, Latvian, Russian (?)
  • extend towards full ACE
  • more evaluation
    • using the Ontograph framework (?)
  • study ambiguity in languages other than ACE

Ambiguity example

p -lang=Dut "John koopt ten minste 2 aquaria , die slechts reizigers inspecteren ."
| l -lang=Ace,Fin

John buys at least 2 aquariums who inspect nothing but travelers .
John ostaa vähintään 2 akvaariota , joka tarkastaa vain matkustajia .

John buys at least 2 aquariums who nothing but travelers inspect .
John ostaa vähintään 2 akvaariota , jonka vain matkustajat tarkastavat .

Presenter Notes

Multilingual CNL-based
Semantic Wiki
(D11.2, D11.3)

Presenter Notes

Existing wiki systems

  • wiki
    • user-friendly collaborative environment for knowledge management
    • content typically unconstrained natural language (NL)
    • powered by software, e.g. MediaWiki
    • e.g. Wikipedia
  • semantic wiki (= wiki + formal semantics)
    • provides: richer query language, consistency checking (via automatic reasoning)
    • content typically NL + typed links (i.e. RDF triples)
    • software: Semantic Mediawiki, ...
  • CNL-based semantic wiki (= semantic wiki using CNL)
    • formal languages hidden (=> can use more expressive formal languages)
    • software: AceWiki
  • multilingual wiki
    • authoring in multiple (natural) languages
    • current systems: only document-level interlinking

Presenter Notes

Multilingual CNL-based Semantic Wiki

  • multiple languages
    • natural: English, German, ACE, ...
    • formal: ACE, Sage, ...
    • languages for content, UI, meta information
  • content
    • viewable/editable/queryable in multiple languages
    • automatically kept in sync
  • CNL-based
    • backed by formal grammar(s), e.g. ACE, Phrasebook, ...
    • formal languages are hidden
  • reasoning and query
    • consistency checking, question answering, ...
    • based on the underlying formal content

Presenter Notes

Use cases

  • multilingual ACE wiki
    • like AceWiki, but in multiple ACE-based CNLs
  • tourist phrasebook
    • book structure (ToC, chapters, index)
    • multiple languages
    • grammar editing
  • catalog of museum objects (paintings, painters)
    • each object on a separate wiki page
    • multiple languages
    • rich queries (e.g. "which Dutch painter painted which French painter?")
  • math exercises
    • multiple user solutions
    • automatically checked
  • collaborative grammar editor
    • dynamically updated regression test sentences

Presenter Notes

Current implementation
(AceWiki-GF)

Presenter Notes

Technologies

  • AceWiki
    • collaborative environment
    • GUI (e.g. look-ahead editor)
    • storage
    • connection to ACE parser
    • connection to OWL reasoners
  • Grammatical Framework (GF)
    • grammars
    • parser (translation, completion, ...)
    • grammar editor

Presenter Notes

AceWiki integration with GF

currently based on the GF Webservice and the GF online editor

  • wiki entry is GF abstract tree set
    • viewed via linearization(s)
    • can represent ambiguity
  • access to multiple online GF grammars
    • provided by GF webservice
    • single grammar per wiki
  • multilingual viewing and editing of wiki content
    • look-ahead editing
  • presentation of the GF-analysis of sentences
    • translations, word alignment diagrams, GF syntax trees, ...
  • grammar editing using the online GF editor

Presenter Notes

Article and Sentence Editor

Presenter Notes

Using the online GF editor

Presenter Notes

Future work

  • work with the MOLTO use cases
  • grammar editing
  • query and reasoning
    • GF-tree based queries
    • ACE-based reasoning (AceWiki OWL reasoning, RACE reasoning)
  • evaluate with real content and real users

Presenter Notes