Running 1 Uhm β on-device filler-word detection π¬ 1 Every uh, um, hmm tagged at 20 ms β on device, multilingual
Running 1 Clear β on-device speech enhancement π 1 Twelve recordings, raw vs cleaned. Studio sound, on device.
openai/whisper-tiny.en Automatic Speech Recognition β’ 37.8M β’ Updated Jan 22, 2024 β’ 59.3k β’ 116
pyannote/speaker-diarization Automatic Speech Recognition β’ Updated May 10, 2024 β’ 500k β’ 1.28k
ehcalabres/wav2vec2-lg-xlsr-en-speech-emotion-recognition Audio Classification β’ 0.3B β’ Updated Oct 24, 2024 β’ 17.7k β’ 250