<html><head><meta http-equiv="Content-Type" content="text/html charset=us-ascii"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">Hello all,<div><br></div><div>I am trying to reproduce the Shazam algorithm as outlined in Avery Wang's paper "An Industrial-Strength Audio Search Algorithm" (<a href="http://www.ee.columbia.edu/~dpwe/papers/Wang03-shazam.pdf">http://www.ee.columbia.edu/~dpwe/papers/Wang03-shazam.pdf</a>). One of the step in this is to convert the audio to spectrogram and identify the spectrogram peaks. I am wondering if building a custom audio-filter for ffmpeg would be the correct way to go? If so, does anyone have any pointers on converting the audio data to spectrogram for me? (algorithm to use, things to note, etc?)</div><div><br></div><div><br></div><div>Any help would be appreciated. Thanks.</div><div><br></div><div><br>
<br></div></body></html>