mon_tokenizer / mon_tokenizer.vocab
janakhpon's picture
feat: update tokenizer artifacts with 41.4M character corpus
39ad643
This file is stored with Xet . It is too big to display, but you can still download it.

Xet Pointer Details

( Raw pointer file )
Xet hash:
eefdd92e69c405f8d8ef7fef144a4298bb0d68119d37f922e2cfe178a2138135
Size of remote file:
1 MB
·
SHA256:
0b3927f803a27ea2f3fe5defd066f4c10072e9ae0caf0ed78e54b9b65e846d50

Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.