CNN-based audio segmentation toolkit. Detects speech, music, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.

music speech audio-analysis noise gender-equality segmentation gender praat gender-classification male female voice-activity-detection music-detection mirex speech-segmentation speech-music speaker-gender speech-detection

Updated Feb 13, 2022
Python

ggeop / Python-ai-assistant

Python AI assistant 🧠

python nlp ai mongodb sklearn pymongo voice-commands voice-recognition nltk voice-chat voice-control python35 nlp-machine-learning wolfram-language voice-assistant google-speech-recognition voice-activity-detection voice-recognition-experiment google-speech-to-text linux-assistant

Updated Mar 17, 2022
Python

jim-schwoebel / voicebook

🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).

visualization security data machine-learning server voice python3 voice-recognition generation transcription voice-control data-cleaning voice-assistant encryption-decryption voice-recording voice-activity-detection wake-word-detection featurization voice-computing

Updated Mar 12, 2022
Python

baxtree / subaligner

Automatically synchronize and translate subtitles with pretrained deep neural networks, forced alignments and transformers. https://subaligner.readthedocs.io/

scc transformers captions subtitles alignment webvtt substation-alpha subrip tmp sbv mpl2 sami ttml voice-activity-detection subtitle-conversion microdvd subtitle-translation advanced-substation-alpha subtitle-synchronization ebu-stl

Updated May 19, 2022
Python

filippogiruzzi / voice_activity_detection

Voice Activity Detection based on Deep Learning & TensorFlow

python machine-learning deep-neural-networks deep-learning time-series tensorflow speech artificial-intelligence speech-recognition vad resnet deeplearning time-series-classification voice-activity-detection librispeech speech-detection librispeech-dataset mfcc-features

Updated Feb 9, 2022
Python

eesungkim / Voice_Activity_Detector

Statistical model-based voice activity detection

vad voice-detection voice-activity-detection

Updated Nov 30, 2018
Jupyter Notebook

nicklashansen / voice-activity-detection

Voice Activity Detection (VAD) using deep learning.

deep-neural-networks deep-learning pytorch recurrent-neural-networks densenet convolutional-neural-networks voice-activity-detection focal-loss

Updated Oct 14, 2019
Jupyter Notebook

RicherMans / GPV

Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper

machine-learning pytorch voice-activity-detection speech-activity-detection noise-robust-asr sound-activity

Updated Oct 8, 2021
Python

Picovoice / cobra

On-device voice activity detection (VAD) powered by deep learning.

javascript android python c swift ios web deep-learning voice-recognition speech-recognition vad voice-activity-detection

Updated May 22, 2022
JavaScript

voithru / voice-activity-detection

PyTorch implementation of Self-Attentive VAD (ICASSP 2021)

vad voice-activity-detection

Updated Oct 26, 2021
Python

RicherMans / Datadriven-GPVAD

The codebase for Data-driven general-purpose voice activity detection.

machine-learning pytorch voice-activity-detection speech-activity-detection noise-robust

Updated Dec 2, 2021
Python

zhenghuatan / rVAD

MATLAB and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as described in the paper "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method".

voice-activity-detection noise-robust uunsupervised-learning

Updated Apr 28, 2022
MATLAB

Ankit-Kumar-Saini / Coursera_Deep_Learning_Specialization

Implementation of logistic regression, MLP, CNN, RNN, and LSTM from scratch in Python. Training of deep learning models for image classification, object detection, and sequence processing (including a transformer implementation) in TensorFlow.

deep-learning transformers coursera named-entity-recognition neural-networks question-answering face-recognition mlp transfer-learning hyperparameter-tuning optimization-algorithms audio-processing andrew-ng voice-activity-detection cnn-for-visual-recognition image-segmentation-tensorflow rnn-lstm structuring-ml-projects

Updated May 21, 2021
Jupyter Notebook

spokestack / spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

android text-to-speech nlu voice speech tts speech-synthesis voice-recognition speech-recognition vad asr voice-assistant natural-language-understanding voice-as-an-interface speech-api voice-activity-detection voice-synthesis wakeword wakeword-activation

Updated Oct 18, 2021
Java

gkonovalov / android-vad

This VAD library processes audio in real time, using a Gaussian mixture model (GMM) to identify the presence of human speech in an audio sample that contains a mixture of speech and noise.

audio android real-time offline webrtc gaussian-mixture-models vad gmm audio-processing voice-activity-detection

Updated Jun 10, 2021
C

gooofy / py-nltools

A collection of basic Python modules for spoken natural language processing

natural-language-processing tokenizer tts speech-recognition phonetics pulseaudio voice-activity-detection

Updated Dec 1, 2019
Python

spokestack / react-native-spokestack

Spokestack: give your React Native app a voice interface!

android ios text-to-speech react-native nlu voice-commands tts speech-synthesis voice-recognition speech-recognition speech-to-text voice-control hacktoberfest speech-processing asr voice-assistant speech-api voice-activity-detection voice-interface nlu-engine

Updated Apr 29, 2022
TypeScript

zhenghuatan / rVADfast

Python library for a fast, unsupervised method for robust voice activity detection (rVAD), as described in the paper "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method".

voice-activity-detection