Commit 551bc8e
Parent(s): ca5ef0f
Update README.md
README.md CHANGED
|
@@ -73,6 +73,29 @@ torchaudio.save('enhanced.wav', enhanced.unsqueeze(0).cpu(), 16000)
|
|
| 73 |
### Inference on GPU
|
| 74 |
To perform inference on the GPU, add `run_opts={"device":"cuda"}` when calling the `from_hparams` method.
|
| 75 |
|
| 76 |
### Limitations
|
| 77 |
The SpeechBrain team does not provide any warranty on the performance achieved by this model when used on other datasets.
|
| 78 |
|
|
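A minimal sketch of the GPU instruction in this hunk: the helper below builds the keyword arguments for `from_hparams` and adds `run_opts={"device":"cuda"}` only when a GPU is requested. The `WaveformEnhancement` interface, the `speechbrain.pretrained` import path, the `savedir` location, and both helper names are assumptions that do not appear in this diff; check the model card for the interface it documents.

```python
def build_from_hparams_kwargs(use_gpu: bool) -> dict:
    """Return keyword arguments for `from_hparams` (hypothetical helper)."""
    kwargs = {
        "source": "speechbrain/mtl-mimic-voicebank",
        # Assumed local cache directory; any writable path should do.
        "savedir": "pretrained_models/mtl-mimic-voicebank",
    }
    if use_gpu:
        # The option named in the README text for GPU inference.
        kwargs["run_opts"] = {"device": "cuda"}
    return kwargs


def load_enhancer(use_gpu: bool = False):
    """Load the pretrained model; needs speechbrain installed and network access."""
    # Assumption: the model is exposed through the WaveformEnhancement interface.
    from speechbrain.pretrained import WaveformEnhancement

    return WaveformEnhancement.from_hparams(**build_from_hparams_kwargs(use_gpu))
```

Calling `load_enhancer(use_gpu=True)` would then place the model on the CUDA device, provided one is available on the machine.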
@@ -100,7 +123,7 @@ title = {SpeechBrain},
|
|
| 100 |
year = {2021},
|
| 101 |
publisher = {GitHub},
|
| 102 |
journal = {GitHub repository},
|
| 103 |
-
howpublished = {\url{https://github.com/speechbrain/speechbrain}},
|
| 104 |
}
|
| 105 |
```
|
| 106 |
|
|
|
|
| 73 |
### Inference on GPU
|
| 74 |
To perform inference on the GPU, add `run_opts={"device":"cuda"}` when calling the `from_hparams` method.
|
| 75 |
|
| 76 |
+
### Training
|
| 77 |
+
The model was trained with SpeechBrain (150e1890).
|
| 78 |
+
To train it from scratch, follow these steps:
|
| 79 |
+
1. Clone SpeechBrain:
|
| 80 |
+
```bash
|
| 81 |
+
git clone https://github.com/speechbrain/speechbrain/
|
| 82 |
+
```
|
| 83 |
+
2. Install it:
|
| 84 |
+
```
|
| 85 |
+
cd speechbrain
|
| 86 |
+
pip install -r requirements.txt
|
| 87 |
+
pip install -e .
|
| 88 |
+
```
|
| 89 |
+
|
| 90 |
+
3. Run Training:
|
| 91 |
+
```
|
| 92 |
+
cd recipes/Voicebank/MTL/ASR_enhance
|
| 93 |
+
python train.py hparams/enhance_mimic.yaml --data_folder=your_data_folder
|
| 94 |
+
```
|
| 95 |
+
https://drive.google.com/drive/folders/1fcVP52gHgoMX9diNN1JxX_My5KaRNZWs?usp=sharing
|
| 96 |
+
|
| 97 |
+
You can find our training results (models, logs, etc.) [here](https://drive.google.com/drive/folders/1HaR0Bq679pgd1_4jD74_wDRUq-c3Wl4L?usp=sharing).
|
| 98 |
+
|
| 99 |
### Limitations
|
| 100 |
The SpeechBrain team does not provide any warranty on the performance achieved by this model when used on other datasets.
|
| 101 |
|
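The training step added in this hunk can also be driven from Python. The sketch below assumes SpeechBrain has already been cloned and installed as in steps 1 and 2; `build_train_command` and `run_training` are hypothetical helper names, and the data folder is whatever path holds your copy of the dataset.

```python
import subprocess
import sys
from pathlib import Path

# Recipe directory and hyperparameter file as given in the README steps.
RECIPE_DIR = Path("recipes/Voicebank/MTL/ASR_enhance")
HPARAMS_FILE = "hparams/enhance_mimic.yaml"


def build_train_command(data_folder: str) -> list:
    """Build the argument list for the training run (hypothetical helper)."""
    return [
        sys.executable,  # use the interpreter running this script
        "train.py",
        HPARAMS_FILE,
        f"--data_folder={data_folder}",
    ]


def run_training(speechbrain_root: str, data_folder: str) -> None:
    """Launch training from inside the recipe directory of a local clone."""
    subprocess.run(
        build_train_command(data_folder),
        cwd=Path(speechbrain_root) / RECIPE_DIR,
        check=True,  # raise if the training script exits with an error
    )
```

Passing the arguments as a list avoids shell quoting problems when the data folder path contains spaces.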
|
|
|
| 123 |
year = {2021},
|
| 124 |
publisher = {GitHub},
|
| 125 |
journal = {GitHub repository},
|
| 126 |
+
howpublished = {\\url{https://github.com/speechbrain/speechbrain}},
|
| 127 |
}
|
| 128 |
```
|
| 129 |
|