Experiences with Google voice

Following a suggestion from Aarne, I found some Google service for speech input, but the experiments are not encouraging:

  1. I recorded Compute this into a mp4 file using QuickTime Player on the mac

  2. Converted it to flac using:

    sox compute.m4a compute.flac rate 16k

  3. And get into the service by:

    curl -H "Content-Type:audio/x-flac; rate=16000" "https://www.google.com/speech-api/v1/recognize?xjerr=1&client=chromium&lang=en-US" -F "myfile=@compute.flac

But got:

 `{"status":0,"id":"56bdb158dd66b25fc2e221364004e620-1","hypotheses":[{"utterance":"coffee lol","confidence":0.46219563}]}`

Other examples:

  • "I like pickles" ⇒ "I like turtles"

  • "The determinant of x" ⇒ "new york" (with confidence 0.88!)

  • "Compute this" ⇒ "coffee lol"

Of course I'm not a native English speaker, but I expected a better performance.