VoxForge
VoxForge is an open speech dataset that was set up to collect transcribed speech for use with Free and Open Source Speech Recognition Engines (on Linux, Windows and Mac).
We will make available all submitted audio files under the GPL license, and then 'compile' them into acoustic models for use with Open Source speech recognition engines such as Sphinx, ISIP, Julius and HTK (note: HTK has distribution restrictions).
Most acoustic models used by 'Open Source' speech recognition (or Speech-to-Text) engines are closed source. They do not give you access to the speech audio and transcriptions (i.e. the speech corpus) used to create the acoustic model.
The reason is that Free and Open Source ('FOSS') projects have to purchase large speech corpora, and these come with restrictive licensing. A few small FOSS speech corpora exist that could be used to create acoustic models, but the vast majority of corpora (especially the large corpora best suited to building good acoustic models) must be purchased under restrictive licenses.
Record yourself reading some text and upload your recordings to VoxForge.