ParaSpeechCLAP: Dual-Encoder Speech-Text Model - an ajd12342 Collection

updated Apr 14

The ParaSpeechCLAP models and the datasets used to train and evaluate them.

ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining

Paper • 2603.28737 • Published Mar 30
ajd12342/paraspeechclap-intrinsic

Audio Classification • Updated Apr 6
ajd12342/paraspeechclap-situational

Audio Classification • Updated Apr 6
ajd12342/paraspeechclap-combined

Audio Classification • Updated Apr 6
ajd12342/paraspeechcaps-intrinsic-train

Viewer • Updated Apr 6 • 945k • 259
ajd12342/paraspeechcaps-situational-train

Viewer • Updated Apr 6 • 96.2k • 265
ajd12342/paraspeechclap-eval-intrinsic

Viewer • Updated Apr 6 • 9.4k • 53
ajd12342/paraspeechclap-eval-situational

Viewer • Updated Apr 6 • 1.43k • 27
ajd12342/paraspeechclap-eval-combined

Viewer • Updated Apr 6 • 1.43k • 57