Boris Smus

interaction engineering

Web-based voice command recognition

[Last time][audio-features] we converted audio buffers into images. This time we'll take those images and train a neural network on them using [deeplearn.js][deeplearn]. The result is [a browser-based demo][infer-yesno] that lets you speak a command ("yes" or "no") and see the classifier's output in real time, like this: