For anyone interested in sound and sound recordings
Developed by FXPAL. Not speech-to-text as such, but interesting.
"TalkMiner aggregates and indexes lecture videos available across the internet. The system processes RSS feeds from a variety of sites to collect lecture videos. The system automatically processes the video to generate metadata describing each talk including the video frames that contain slides, their time offsets, and the text recovered from those frames by optical character recognition. TalkMiner does not maintain a copy of the original videos. When a user plays a lecture, the video is played from the original website on which the lecture video is hosted. As a result, storage requirements for TalkMiner are modest."
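The pipeline described in the quote (read RSS feeds, record slide frames with their time offsets and OCR text, keep only a link back to the hosting site) can be sketched roughly as follows. This is an illustration under stated assumptions, not FXPAL's implementation: the names `Slide`, `parse_feed`, and `TalkIndex` are hypothetical, the feed is assumed to be RSS 2.0 with `enclosure` elements, the `example.org` URL is made up, and slide detection and OCR are not implemented here (the slide text is simply passed in by hand).

```python
import re
import xml.etree.ElementTree as ET
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Slide:
    """Metadata for one slide frame (hypothetical record layout)."""
    video_url: str   # link to the original host; no copy of the video is kept
    offset_s: float  # time offset of the slide frame within the video
    text: str        # text recovered from the frame (OCR output, supplied by caller)


def parse_feed(rss_xml: str) -> list[tuple[str, str]]:
    """Return (title, video_url) pairs from an RSS 2.0 document."""
    root = ET.fromstring(rss_xml)
    talks = []
    for item in root.iter("item"):
        title = item.findtext("title", default="").strip()
        enclosure = item.find("enclosure")
        if enclosure is not None:
            url = enclosure.get("url")
        else:
            url = item.findtext("link", default="")
        if url:
            talks.append((title, url.strip()))
    return talks


class TalkIndex:
    """Tiny inverted index: word -> set of slides whose text contains it."""

    def __init__(self) -> None:
        self._postings = defaultdict(set)

    def add(self, slide: Slide) -> None:
        for word in re.findall(r"[a-z0-9]+", slide.text.lower()):
            self._postings[word].add(slide)

    def search(self, query: str) -> list[Slide]:
        """Return slides containing every query word, ordered by URL and offset."""
        words = re.findall(r"[a-z0-9]+", query.lower())
        if not words:
            return []
        hits = set.intersection(*(self._postings.get(w, set()) for w in words))
        return sorted(hits, key=lambda s: (s.video_url, s.offset_s))


# Made-up feed with a single lecture, for demonstration only.
FEED = """<rss version="2.0"><channel>
<item><title>Intro to OCR</title>
<enclosure url="https://example.org/ocr.mp4" type="video/mp4"/></item>
</channel></rss>"""

talks = parse_feed(FEED)
index = TalkIndex()
# In a real system these texts would come from OCR on detected slide frames.
index.add(Slide(talks[0][1], 0.0, "Optical Character Recognition"))
index.add(Slide(talks[0][1], 95.5, "Character segmentation"))

for hit in index.search("character segmentation"):
    print(hit.video_url, hit.offset_s)
```

Because each record holds only a URL, an offset, and a few words of text, a search hit can send the viewer to the right moment of the video on its original site, which is consistent with the modest storage requirements the quote mentions.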