charleslwang commited on
Commit
c471e55
·
verified ·
1 Parent(s): 984ab6e

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +73 -0
README.md ADDED
@@ -0,0 +1,73 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ tags:
5
+ - automatic-speech-recognition
6
+ - speech
7
+ - pathological-speech
8
+ - dysarthria
9
+ - huntingtons-disease
10
+ - nemo
11
+ - parakeet
12
+ - multitask-learning
13
+ - articulation
14
+ license: apache-2.0
15
+ pipeline_tag: automatic-speech-recognition
16
+ library_name: nemo
17
+ base_model: charleslwang/parakeet-tdt-0.6b-HD
18
+ model-index:
19
+ - name: parakeet-tdt-0.6b-HD-articulation
20
+ results:
21
+ - task:
22
+ type: automatic-speech-recognition
23
+ name: Automatic Speech Recognition
24
+ dataset:
25
+ name: Huntington Disease clinical speech test set
26
+ type: private
27
+ metrics:
28
+ - type: wer
29
+ value: 6.44
30
+ name: WER (%)
31
+ ---
32
+
33
+ # Parakeet-TDT 0.6B HD Articulation
34
+
35
+ Official checkpoint for the paper **"Towards Robust Automatic Speech Recognition for Huntington Disease."**
36
+
37
+ ## Model description
38
+
39
+ This model is an articulation-aware variant of **Parakeet-TDT 0.6B HD**, tuned for automatic speech recognition on speech affected by **Huntington disease (HD)**. It extends the HD-adapted base model with auxiliary supervision from articulatory biomarker labels.
40
+
41
+ ## What this model does
42
+
43
+ The model transcribes English read / controlled clinical speech from speakers with Huntington disease and healthy controls. It is intended as a research model for studying robust ASR under hyperkinetic motor-speech disruption and for analyzing the effect of articulatory supervision on transcription behavior.
44
+
45
+ ## Training
46
+
47
+ The model was initialized from `charleslwang/parakeet-tdt-0.6b-HD` and further adapted using **parameter-efficient encoder-side adapters** with an auxiliary objective based on articulatory biomarker labels, while keeping the pretrained backbone frozen.
48
+
49
+ ## Evaluation
50
+
51
+ On the reported HD test set, this model achieved:
52
+
53
+ - **WER:** 6.44
54
+ - **Substitutions:** 1.94
55
+ - **Deletions:** 3.21
56
+ - **Insertions:** 1.29
57
+
58
+ ## Intended use
59
+
60
+ This model is intended for:
61
+
62
+ - research on pathological / atypical speech recognition,
63
+ - benchmarking ASR on Huntington disease speech,
64
+ - studying how articulatory auxiliary supervision reshapes error behavior.
65
+
66
+ It is **not** intended for clinical diagnosis, treatment decisions, or standalone medical use.
67
+
68
+ ## Limitations
69
+
70
+ - Trained and evaluated on a relatively small, high-fidelity clinical corpus.
71
+ - Primarily reflects controlled / read speech rather than spontaneous conversational speech.
72
+ - Did not outperform the plain HD-adapted model on overall WER.
73
+ - May not generalize to severe out-of-distribution impairment, other languages, or other recording conditions.