Blogs

Searching Europeana

Search and retrieval of the records stored at Europeana is a very interesting exercise. Consider the platform first, it is a federated archive where the records are decentralized and consequently managed at the local nodes. How each node implements search is a guess.

Visiting museums almost is becoming a WP8 activity

I have been visiting the Helsinki Ateneum museum last weekend. It is wonderful to be able to spot places where our MOLTO work in cultural heritage could be used. One issue is as usual the spelling mistakes. I read only one or two of these paragraphs written on walls, and in one of them, right in the middle, there is a spelling mistake. One thing is to find a spelling mistake in a paper, but on a wall, for every visitor to see?

If you mail Inbox is especially quiet, then you have a problem

Public amend, namely, I just fetched 133 emails stuck in my account at Chalmers: those of you who answered my emails in the past week, know that I am only reading it now. I should have remembered to change the password also in the mail reader, that is. In any case, I am working on the backlog now.

The job of project manager

This week is one of those in which I seem to spend my time in chasing deliverables and other data that has to go into the progress report. It is not that easy to keep all threads in place and coordinated, one feels like one sends the email and the recipient simply disregards it - just as we do when our conscience calls.

MOLTO cover - Latex sources

Hi all,

Find here the latex sources to add the molto cover to your Deliverables. It consists of two files, the tex and a PDF with the logo. Since it uses the fontspec package, you need to compile using "xelatex". I'm also attaching the needed fonts (Myriad, Myriad Pro and Papyrus).

Enjoy! Meritxell.

First experiments for patent translation from English to German and Bulgarian by using the robust GF parser

Recently I did a major improvement in the efficiency of the robust GF parser and now I am eager to try it out on the patent corpus. The first thing to notice is that although the parser is now usable even for nontrivial sentences, for some long sentences it still fails with "out of memory" error. The borderline is somewhere around sentence length of 20 tokens. There is still room for improvement and I know what must be done but before that I want to do a pilot experiment in translation.

How to add a large lexicon to GF

I am logging some discussion with Ramona, our GF expert developer on possible little GF projects that will benefit GF in the long run.

The goal is to provide a large morphological lexicon for widely used languages based on available dictionary resources such as WordNet. These should be free so that we can refer to them and use them to bootstrap the GF dictionary files. Having such a dictionary makes it much easier to develop domain-specific grammars by having ready-made lexical entries.

Again about those stealth UDP connections

A few days ago I embarked on a crusade to try and figure out why my home computer was being subjected to so many connection attempts, each one needing to be handled by my CPU. It still is, but now I think I figured why. The culprits are all those apps that my contacts (both colleagues and friends) keep open on their machines and require a status update every 10 minutes or so. There is nothing I can do to prevent my computer from being constantly interrupted by pings except use a firewall.

How I analyzed my firewall log file

Since quite some time now, I have been noticing a terrible lack of performance on my Macbook Pro. So yesterday I decided to enable a full firewall, thinking that maybe there was some intruder on my machine. Much to my surprise the logfile showed continuous attack to access some ports, especially 46585 (gtkam?). Hence I decided to notify the providers associated to the IPs that were recorded responsible for the attacks. Since it is a rather long and cumbersome process, I will now here give the few command lines that help a bit in selecting which IPs to investigate further.

will it (mathjax) work?

just enabled using drush.

$ h \leq \frac{1}{2} |\zeta - z| [ |\zeta - z - h| \geq \frac{1}{2} |\zeta - z|] $

implies

$ \left| \frac{1}{\zeta - z - h} - \frac{1}{\zeta - z} \right| = \left| \frac{(\zeta - z) - (\zeta - z - h)}{(\zeta - z - h)(\zeta - z)} \right| \ = \left| \frac{h}{(\zeta - z - h)(\zeta - z)} \right| \ \leq \frac{2 |h|}{|\zeta - z|^2}. $

$ \cos 2\theta = \cos^2 \theta - \sin^2 \theta = 2 \cos^2 \theta - 1.$

Syndicate content