# voice-activity-detection
Here are 95 public repositories matching this topic...
Automagically synchronize subtitles with video.
audio
sync
synchronization
video
ffmpeg
captions
subtitles
caption
alignment
fast-fourier-transform
subtitle
vad
vlc
srt
fft
vlc-media-player
srt-subtitles
voice-activity-detection
speech-detection
string-alignment
Updated May 9, 2022 - Python
Command-line utility to transcribe/translate from video/audio/subtitles to subtitles
subtitles
substation-alpha
audio-segmentation
xfyun
cloud-speech-api
voice-activity-detection
baidu-api
xunfei-api
Updated Apr 20, 2022 - Python
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
tutorial
detection
extraction
citation
pytorch
pretrained-models
speaker-recognition
speaker-verification
speech-processing
speaker-diarization
voice-activity-detection
speech-activity-detection
speaker-change-detection
speaker-embedding
pyannote-audio
overlapped-speech-detection
speaker-diarization-pipeline
Updated May 23, 2022 - Python
text-to-speech
tts
speech-synthesis
voice-recognition
speech-recognition
speech-to-text
stt
speech-processing
voice-activity-detection
speech-separation
speech-emotion-recognition
voice-cloning
Updated Jan 25, 2022
Voice activity detection (VAD) toolkit including DNN-, bDNN-, LSTM-, and ACAM-based VAD. We also provide a dataset that we recorded ourselves.
data
speech
dnn
lstm
speech-recognition
attention
vad
voice-detection
voice-activity-detection
bdnn
acam
speech-activity-detection
Updated Jun 9, 2021 - MATLAB
Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector
voice-commands
pytorch
voice-recognition
voice-control
voice-detection
voice-activity-detection
onnx
language-classifier
Updated Apr 12, 2022 - Python
An audio/acoustic activity detection and audio segmentation tool
Updated Nov 3, 2021 - Python
CNN-based audio segmentation toolkit. Detects speech, music, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
music
speech
audio-analysis
noise
gender-equality
segmentation
gender
praat
gender-classification
male
female
voice-activity-detection
music-detection
mirex
speech-segmentation
speech-music
speaker-gender
speech-detection
Updated Feb 13, 2022 - Python
Python AI assistant 🧠
python
nlp
ai
mongodb
sklearn
pymongo
voice-commands
voice-recognition
nltk
voice-chat
voice-control
python35
nlp-machine-learning
wolfram-language
voice-assistant
google-speech-recognition
voice-activity-detection
voice-recognition-experiment
google-speech-to-text
linux-assistant
Updated Mar 17, 2022 - Python
visualization
security
data
machine-learning
server
voice
python3
voice-recognition
generation
transcription
voice-control
data-cleaning
voice-assistant
encryption-decryption
voice-recording
voice-activity-detection
wake-word-detection
featurization
voice-computing
Updated Mar 12, 2022 - Python
Automatically synchronize and translate subtitles with pretrained deep neural networks, forced alignments and transformers. https://subaligner.readthedocs.io/
scc
transformers
captions
subtitles
alignment
webvtt
substation-alpha
subrip
tmp
sbv
mpl2
sami
ttml
voice-activity-detection
subtitle-conversion
microdvd
subtitle-translation
advanced-substation-alpha
subtitle-synchronization
ebu-stl
Updated May 19, 2022 - Python
Voice Activity Detection based on Deep Learning & TensorFlow
python
machine-learning
deep-neural-networks
deep-learning
time-series
tensorflow
speech
artificial-intelligence
speech-recognition
vad
resnet
deeplearning
time-series-classification
voice-activity-detection
librispeech
speech-detection
librispeech-dataset
mfcc-features
Updated Feb 9, 2022 - Python
Statistical model-based voice activity detection
Updated Nov 30, 2018 - Jupyter Notebook
Voice Activity Detection (VAD) using deep learning.
deep-neural-networks
deep-learning
pytorch
recurrent-neural-networks
densenet
convolutional-neural-networks
voice-activity-detection
focal-loss
Updated Oct 14, 2019 - Jupyter Notebook
Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper
machine-learning
pytorch
voice-activity-detection
speech-activity-detection
noise-robust-asr
sound-activity
Updated Oct 8, 2021 - Python
On-device voice activity detection (VAD) powered by deep learning.
javascript
android
python
c
swift
ios
web
deep-learning
voice-recognition
speech-recognition
vad
voice-activity-detection
Updated May 22, 2022 - JavaScript
PyTorch implementation of Self-Attentive VAD, ICASSP 2021
Updated Oct 26, 2021 - Python
The codebase for Data-driven general-purpose voice activity detection.
Updated Dec 2, 2021 - Python
MATLAB and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as described in the paper "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method".
Updated Apr 28, 2022 - MATLAB
Implementation of logistic regression, MLP, CNN, RNN, and LSTM from scratch in Python. Training of deep learning models for image classification, object detection, and sequence processing (including a transformer implementation) in TensorFlow.
deep-learning
transformers
coursera
named-entity-recognition
neural-networks
question-answering
face-recognition
mlp
transfer-learning
hyperparameter-tuning
optimization-algorithms
audio-processing
andrew-ng
voice-activity-detection
cnn-for-visual-recognition
image-segmentation-tensorflow
rnn-lstm
structuring-ml-projects
Updated May 21, 2021 - Jupyter Notebook
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
android
text-to-speech
nlu
voice
speech
tts
speech-synthesis
voice-recognition
speech-recognition
vad
asr
voice-assistant
natural-language-understanding
voice-as-an-interface
speech-api
voice-activity-detection
voice-synthesis
wakeword
wakeword-activation
Updated Oct 18, 2021 - Java
This VAD library processes audio in real time using a Gaussian mixture model (GMM) to identify the presence of human speech in an audio sample that contains a mixture of speech and noise.
audio
android
real-time
offline
webrtc
gaussian-mixture-models
vad
gmm
audio-processing
voice-activity-detection
Updated Jun 10, 2021 - C
A collection of basic Python modules for spoken natural language processing
natural-language-processing
tokenizer
tts
speech-recognition
phonetics
pulseaudio
voice-activity-detection
Updated Dec 1, 2019 - Python
Spokestack: give your React Native app a voice interface!
android
ios
text-to-speech
react-native
nlu
voice-commands
tts
speech-synthesis
voice-recognition
speech-recognition
speech-to-text
voice-control
hacktoberfest
speech-processing
asr
voice-assistant
speech-api
voice-activity-detection
voice-interface
nlu-engine
Updated Apr 29, 2022 - TypeScript
Python library for a fast, unsupervised method for robust voice activity detection (rVAD), as described in the paper "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method".
Updated Apr 29, 2022 - Python
machine-learning
tutorial
voice
voice-commands
voice-recognition
workshop-materials
voice-control
gender-classification
voice-assistant
machine-learning-modeling
gender-detection
machine-learning-practice
voice-activity-detection
machine-learning-tutorial
voice-computing
machine-learning-model
surveylex
neurolex
Updated Aug 7, 2020 - Python
Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.
voice-recognition
speaker-recognition
speaker-verification
speech-processing
voice-activity-detection
speaker-identification
speaker-embedding
Updated Oct 4, 2019 - Jupyter Notebook
A Python library for voice activity detection (VAD) for speech/non-speech segmentation.
Updated Jun 8, 2021 - Python
PyTorch implementation of automatic speech recognition models.
end-to-end
pytorch
transformer
las
vad
e2e
asr
acoustic-model
voice-activity-detection
deepspeech2
listen-attend-and-spell
Updated Jan 10, 2021 - Python