Speech Recognition
Last updated
Last updated
- Open source Speech-To-Text engine, using a model trained by machine learning techniques, based on Baidu's Deep Speech research paper.
- Fast, open source speech processing toolkit from the Speech team at Facebook AI Research built to facilitate research in end-to-end models for speech recognition.
- Speech Recognition Toolkit.
()
- Clone a voice in 5 seconds to generate arbitrary speech in real-time.
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time.
- PyTorch Implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition.
- Speech recognition framework for Python that makes it convenient to create custom commands to use with speech recognition software.
- Robust yet lenient forced-aligner built on Kaldi. A tool for aligning speech with text.
- On-device wake word detection powered by deep learning.
- End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding.
- Pre-trained STT models and benchmarks made embarrassingly simple.
()
- Neural network for end-to-end speech denoising, as described in: "A Wavenet For Speech Denoising".
- Speech recognition toolkit with state-of-the-art accuracy and low latency in Rust.
- Speech-to-text Platform and APIs. Speech Recognition.