[Last time][audio-features] we converted audio buffers into images. This time
we'll take these images and train a neural network using
[deeplearn.js][deeplearn]. The result is [a browser-based demo][infer-yesno] that
lets you speak a command ("yes" or "no"), and see the output of the classifier
in real-time, like this: