goodsmileduck commited on
Commit
cd3de0d
·
verified ·
1 Parent(s): f7899fd

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +8 -15
README.md CHANGED
@@ -16,26 +16,21 @@ base_model: nvidia/parakeet-tdt-0.6b-v3
16
 
17
  ONNX export of [nvidia/parakeet-tdt-0.6b-v3](https://huggingface.co/nvidia/parakeet-tdt-0.6b-v3) for use with OpenVINO on Intel NPU.
18
 
19
- Includes the bundled NeMo mel spectrogram preprocessor (`nemo128.onnx`) for a self-contained pipeline.
20
 
21
  ## Files
22
 
23
  | File | Size | Description |
24
  |------|------|-------------|
25
- | `encoder-model.onnx` + `.data` | ~2.5 GB | Conformer encoder (runs on NPU) |
26
- | `decoder_joint-model.onnx` | 73 MB | TDT joint decoder (runs on CPU) |
27
- | `nemo128.onnx` | 141 KB | Mel spectrogram preprocessor (onnxruntime CPU) |
28
- | `vocab.txt` | 94 KB | 8193-token vocabulary |
29
- | `config.json` | 97 B | Model metadata |
30
 
31
  ## Pipeline
32
 
33
- ```
34
- Audio (16kHz float32)
35
- -> nemo128.onnx (onnxruntime CPU) -> mel spectrogram [1, 128, T]
36
- -> encoder-model.onnx (OpenVINO NPU) -> encoder features [1, 1024, T/8]
37
- -> decoder_joint-model.onnx (OpenVINO CPU) -> TDT greedy decode -> text
38
- ```
39
 
40
  ## Performance (Intel Core Ultra / Meteor Lake NPU)
41
 
@@ -50,9 +45,7 @@ Audio (16kHz float32)
50
 
51
  Used by [npu-whisper](https://github.com/goodsmileduck/npu-whisper) dictation engine:
52
 
53
- ```bash
54
- python dictation_engine.py --model parakeet
55
- ```
56
 
57
  ## Credits
58
 
 
16
 
17
  ONNX export of [nvidia/parakeet-tdt-0.6b-v3](https://huggingface.co/nvidia/parakeet-tdt-0.6b-v3) for use with OpenVINO on Intel NPU.
18
 
19
+ Includes the bundled NeMo mel spectrogram preprocessor (\) for a self-contained pipeline.
20
 
21
  ## Files
22
 
23
  | File | Size | Description |
24
  |------|------|-------------|
25
+ | \ + \ | ~2.5 GB | Conformer encoder (runs on NPU) |
26
+ | \ | 73 MB | TDT joint decoder (runs on CPU) |
27
+ | \ | 141 KB | Mel spectrogram preprocessor (onnxruntime CPU) |
28
+ | \ | 94 KB | 8193-token vocabulary |
29
+ | \ | 97 B | Model metadata |
30
 
31
  ## Pipeline
32
 
33
+
 
 
 
 
 
34
 
35
  ## Performance (Intel Core Ultra / Meteor Lake NPU)
36
 
 
45
 
46
  Used by [npu-whisper](https://github.com/goodsmileduck/npu-whisper) dictation engine:
47
 
48
+
 
 
49
 
50
  ## Credits
51