witness-diarize-plda-redimnet2

Big-corpus (921-speaker, LibriSpeech-refit) PLDA fit for the ReDimNet2-B6 x-vector space, in the raw little-endian *.f32 blob layout the witness VBx backend loads (community-1 layout). Pairs with littlebearlabs/witness-redimnet2-b6-mlx for VBx clustering.

Attribution

  • Fit: ours — .research/diarization/fit_resnet293_plda.py + gen_plda_vbx_fixtures.py, refit on a 921-speaker corpus in the ReDimNet2-B6 192-d x-vector space.
  • Method / layout: BUT VBx (variational Bayes HMM x-vector clustering) + the pyannote community-1 PLDA file layout. The ReDimNet2-B6 embedder these x-vectors come from is MIT (PalabraAI); the VBx method and the community-1 layout are the upstream references.
  • License: CC-BY-4.0 (our fit; attribute witness + the BUT VBx / community-1 lineage).

What's in this repo

Six raw little-endian f32 blobs (the full set the VBx backend loads):

  • transform_mean1.f32 (xvec_dim=192)
  • transform_lda.f32 (192·128)
  • transform_mean2.f32 (128)
  • plda_mu.f32 (128)
  • plda_tr.f32 (128·128)
  • plda_psi.f32 (128, the across-class covariance diagonal)

Converted to MLX for witness, an open-source Rust toolkit for on-device system capture on macOS. Generated by .research/diarization/publish_weights.sh.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support