Commit History
livestream : handle ffmpeg errors gracefully and stabilize transcript 520d318 unverified
livestream : minor changes dec4507 unverified
livestream : fix losing words across audio chunk (#195) b3a9b29 unverified
whisper : add mechanism for aborting the whisper_full() computation d311de4
whisper.objc : fix context + broken readme links 2361fbc unverified
whisper.objc : add real-time processing (#97) 11bb554 unverified
whisper.objc : fix build warnings 8d1f7e9 unverified
yt-wsp.sh : script to easily transcribe VODs a7c58c8 unverified
command.wasm : add voice assistant example for the Web (#171) 2ee248a unverified
minor : add comment for using "generate_karaoke.sh" 2512003 unverified
livestream.sh : simple tool to transcribe audio livestreams (#185) 2a7b373 unverified
stream.wasm : add web-based real-time transcription (#112) 936213e unverified
whisper.wasm : do not block page while processing (close #86) d0b1d9e unverified
main : add stereo-channel-based diarization (#64) b5e16ed unverified
command : add demonstration video 64508b4 unverified
command : fix build + fix README + add bold printing f70b793 unverified
examples : add "command" tool (#171) 4d3c293 unverified
refactoring : more readable code 5ef0168 unverified
wasm : refactor wasm example + reuse fetch mechanism 3520198 unverified
talk.wasm : update video link + some minor fixes 33a4590 unverified
Update README.md 75f9881 unverified
talk.wasm : move to https://whisper.ggerganov.com/talk bee1ba7 unverified
main : fix dangling pointer when using stdin for input (#65) 2daf96b unverified
main, stream : remove --verbose flag (#178) 8f1a93e unverified
talk.wasm : add audio pre-processing + bump memory 679d38e unverified
talk.wasm : refactoring + update README.md ff21a60 unverified
minor : updates few prints + fix buttons in whisper.wasm 7c7a4d7 unverified
unicode : fix character replacement (thanks to @tamo) be06a9b unverified
close #109 : add fetching of the model over HTTP (whisper.wasm) 04da8a6 unverified
talk.wasm : final touches 722327a unverified
talk.wasm : polishing + adding many AI personalities b38c009 unverified
stream : "-kc" now enables context keeping from previous segment (#90) 28726dd unverified
Prompt previous tokens for streaming (#163) 8ad3dbf unverified
talk.wasm : update README.md 5cb7243 unverified
talk.wasm : GPT-2 meets Whisper in WebAssembly (#155) 411c667 unverified
stream : add "max_tokens" cli arg 57a7bac
stream : add "audio_ctx" parameter 6adc1fe
stream : add "max_tokens" parameter e48ba5c
stream : add "single_segment" option a265bfa
stream : partial encoder experiments a2015c0
whisper : add option to speed up the audio tempo by x2 bec875e
Adds support for stdin wav input d83eddb
Alan committed on