KokosDev committed on
Commit 0338b62 · 1 Parent(s): d0851b3

Add Qwen2.5-VL-7B CLT transcoders with LFS

README.md ADDED
@@ -0,0 +1,63 @@
+ # Qwen2.5-VL-7B Circuit-Level Transcoders (CLT)
+
+ This repository contains Circuit-Level Transcoders (CLTs) for Qwen2.5-VL-7B, one per layer, trained with TopK sparsity (12% L0).
+
+ ## Model Details
+
+ - **Base Model**: Qwen2.5-VL-7B
+ - **Type**: Circuit-Level Transcoders (CLT)
+ - **Layers**: 27 transcoders (L0-L26)
+ - **Sparsity**: 12% L0 (TopK)
+ - **File Size**: ~117 MB (about 112 MiB) per layer (~3.2 GB total)
+ - **Training**: Each layer trained for 5000 steps
+
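As a rough illustration of what a TopK sparsity constraint does, the sketch below keeps the largest entries of a latent vector and zeroes the rest. It is not taken from this repository's training code: the function names, the toy vector, and the reading of "12% L0" as "about 12% of latents stay active per input" are all assumptions made for illustration.

```python
def topk_activation(pre_acts, fraction=0.12):
    """Keep the largest `fraction` of entries and zero the rest.

    Assumption: '12% L0' means roughly 12% of latents are non-zero per input.
    """
    n = len(pre_acts)
    k = max(1, round(fraction * n))
    # Indices of the k largest values; ties go to the lower index.
    keep = set(sorted(range(n), key=lambda i: (-pre_acts[i], i))[:k])
    return [v if i in keep else 0.0 for i, v in enumerate(pre_acts)]


def l0_fraction(acts):
    """Fraction of entries that are non-zero."""
    return sum(1 for v in acts if v != 0.0) / len(acts)


# Toy pre-activations: 25 distinct values between 0.0 and 2.4.
latents = [0.1 * ((7 * i) % 25) for i in range(25)]
sparse = topk_activation(latents)
```

With 25 toy latents and a fraction of 0.12, three entries stay non-zero and the remaining 22 are set to zero.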
+ ## Files
+
+ This repository contains 27 transcoder checkpoint files:
+ - `transcoder_L0.pt` through `transcoder_L26.pt`
+
+ Each file contains the trained transcoder weights for the corresponding layer.
+
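To fetch the checkpoints programmatically, something like the following should work. The repository id is assumed from the citation URL in this README, `hf_hub_download` is the download function provided by the `huggingface_hub` package, and the helper names are hypothetical.

```python
NUM_LAYERS = 27  # transcoder_L0.pt ... transcoder_L26.pt, per the Files section
REPO_ID = "KokosDev/qwen2p5vl-7b-clt"  # assumption: taken from the citation URL


def checkpoint_filename(layer: int) -> str:
    """Return the checkpoint filename for a layer index in [0, 26]."""
    if not 0 <= layer < NUM_LAYERS:
        raise ValueError(f"layer must be in [0, {NUM_LAYERS - 1}], got {layer}")
    return f"transcoder_L{layer}.pt"


def download_checkpoint(layer: int) -> str:
    """Download one checkpoint from the Hub and return its local path."""
    # Imported lazily so the filename helper works without huggingface_hub.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=REPO_ID, filename=checkpoint_filename(layer))


all_files = [checkpoint_filename(i) for i in range(NUM_LAYERS)]
```

Calling `download_checkpoint(12)` would then return a local path that can be passed to `torch.load`.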
+ ## Usage
+
+ ```python
+ import torch
+
+ # Load the transcoder checkpoint for one layer (valid layers: 0-26)
+ layer_num = 12
+ # map_location="cpu" lets the file load on machines without a GPU
+ transcoder = torch.load(f"transcoder_L{layer_num}.pt", map_location="cpu")
+
+ # The transcoder can be used for circuit discovery and suppression
+ # in conjunction with the Qwen2.5-VL-7B model
+ ```
+
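The README does not document what object `torch.load` returns, so a reasonable first step is to inspect it. The sketch below assumes the checkpoint may be a dict (for example a state dict mapping names to tensors); that is a guess rather than something the repository states, and both helper names are hypothetical.

```python
def describe_checkpoint(obj):
    """Return {key: shape-or-type-name} for a dict-like checkpoint."""
    if not isinstance(obj, dict):
        # Not a dict: report only the type of the top-level object.
        return {"<root>": type(obj).__name__}
    summary = {}
    for key, value in obj.items():
        shape = getattr(value, "shape", None)
        # Tensors expose .shape; anything else is reported by type name.
        summary[key] = tuple(shape) if shape is not None else type(value).__name__
    return summary


def load_and_describe(path):
    """Load a checkpoint on CPU and summarise its top-level contents."""
    import torch  # imported lazily so describe_checkpoint works without torch

    checkpoint = torch.load(path, map_location="cpu")
    return describe_checkpoint(checkpoint)
```

Recent PyTorch releases default `torch.load` to `weights_only=True`, which rejects checkpoints containing arbitrary pickled objects. If loading fails for that reason, pass `weights_only=False` only for files you trust.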
+ ## Training Details
+
+ - **Optimizer**: AdamW
+ - **Training Steps**: 5000 per layer
+ - **Sparsity Target**: 12% L0 (TopK)
+ - **Validation Frequency**: Every 200 steps
+ - **Training Time**: ~5 minutes per layer
+
+ ## Related Models
+
+ - [KokosDev/qwen2p5vl-7b-plt](https://huggingface.co/KokosDev/qwen2p5vl-7b-plt) - Paired Linear Transcoders (PLT) for the same base model
+
+ ## Citation
+
+ If you use these transcoders in your research, please cite:
+
+ ```bibtex
+ @misc{qwen2p5vl-7b-clt,
+   title={Circuit-Level Transcoders for Qwen2.5-VL-7B},
+   author={KokosDev},
+   year={2025},
+   publisher={HuggingFace},
+   howpublished={\url{https://huggingface.co/KokosDev/qwen2p5vl-7b-clt}}
+ }
+ ```
+
+ ## License
+
+ Please refer to the base model's license: [Qwen2.5-VL](https://huggingface.co/Qwen/Qwen2.5-VL-7B)
+
transcoder_L0.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9a523426643bfb7b1ba5fd4e315035947a3124975ab81b17b078cf21e3623e91
+ size 117481949
transcoder_L1.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:011da45eec676ccc6ec499a7646e4e9dd33f63d0b30f093a1c2620734a8820a9
+ size 117481949
transcoder_L10.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:674f155aa2998527432c8fefa80a7d4d1d46441e6faf79e6d9b41088bcd8e116
+ size 117481961
transcoder_L11.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a9dda5894955dbdf4b4ae15ceb15221466024da7d5bda29c176c6a2c896e45e0
+ size 117481961
transcoder_L12.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:14fd40ba52d752cc0a39ef8dea39899851dba8bc6642a73caa8d35b32f71c442
+ size 117481961
transcoder_L13.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f4f1e16d42611571560f84c1b40bc9c0ff63f187cfc05732ba08bbf14b71c9a7
+ size 117481961
transcoder_L14.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5d5e16211dbfc408cdd1633d7299f8a3261a85d9206df9e730a2a2d2bed2fab9
+ size 117481961
transcoder_L15.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:90228d9eabf2c68444ca84bfb2e278352400ccaf735ba7fc04f7c1d1b07c1618
+ size 117481961
transcoder_L16.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a2d54b51b86723f2d12bf084e0c3609e35285a87f574324900be07de576176f2
+ size 117481961
transcoder_L17.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:af541fa055045c78885ce46bb646838330b1f1e88ced882f9a45032abf5ffdd0
+ size 117481961
transcoder_L18.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:deca497b56fb9117f24921229cbc2d13ce6cc1218e8d457ce117fc335cad8e50
+ size 117481961
transcoder_L19.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4c92fed7707738b43f9c2ce8d430e5e7fb003a4352b380a2e6fb9488ee9ebeb1
+ size 117481961
transcoder_L2.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:046eb0ad6499828116ca24ddac56d75b2eafe33231549f3f2536c623bdc6b604
+ size 117481949
transcoder_L20.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b4da499648fa770a0890d9161368687179914dcbc9e5c54fb6d5b8e927cd84a3
+ size 117481961
transcoder_L21.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d1c4a5e6cbcbb544fb37235c462061ac7a927f6613ede5636f92989f4032b00d
+ size 117481961
transcoder_L22.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:187e176450f34386b7239f28b85d8f66f27cd52cb20343ff7ea31edb4189a6ed
+ size 117481961
transcoder_L23.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4a5049a899afe59374f6ae01056156f2cf25568c5ce31073ec52094fdc1a0fcc
+ size 117481961
transcoder_L24.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9d28a88349a9709b25322a47f0a7422fca08c024c1e1d05be69ca928ba129c00
+ size 117481961
transcoder_L25.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:33874f1bb13cf164d57203c121843b856e4c428bc74f3fbf95f3e263f61b757d
+ size 117481961
transcoder_L26.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:bea6d92a8d3c5f18ad95c08c10d759a8e00709097e2337797c49d81069622acc
+ size 117481961
transcoder_L3.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:efa4d029e87e4a07ebb1ed9317e6e6661365f9627c5baac6e4f3980dec7630ce
+ size 117481949
transcoder_L4.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:95956064bcab73f1adb052d2115261d9c5a2ad4b4046931366290841777653bb
+ size 117481949
transcoder_L5.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4c28c60040be909dd0af62b5685595065b7c296b840b649b0a521975e1ba225e
+ size 117481949
transcoder_L6.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:2d09a2f749fd8a208dc5809c10844ba5c1e278819c9cea162835654cd4853dca
+ size 117481949
transcoder_L7.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3b6422712ae2b8c45d7d08b6ddc9851140da93899d238b73564194a591232ab8
+ size 117481949
transcoder_L8.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:7cc127463736aa367af6aa8c6f87497204988e0d6a1fb4ba9fe3a8bdf762482e
+ size 117481949
transcoder_L9.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a40edc62d6f928e852411a7f802f73c1fa8d34c9b6d2dd57f81be206001f5717
+ size 117481949
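
Each `.pt` entry in this commit is a Git LFS pointer (a `version` line, an `oid sha256:` digest, and a `size` in bytes) rather than the weights themselves. A downloaded file can be checked against its pointer with standard-library Python, roughly as sketched below. The helper names are hypothetical, and the pointer text is copied from the `transcoder_L0.pt` entry in this commit.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """True if the file's byte size and SHA-256 digest match the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with open(path, "rb") as handle:
        # Hash in 1 MiB chunks so large checkpoints are not read into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


POINTER_L0 = """\
version https://git-lfs.github.com/spec/v1
oid sha256:9a523426643bfb7b1ba5fd4e315035947a3124975ab81b17b078cf21e3623e91
size 117481949
"""
pointer = parse_lfs_pointer(POINTER_L0)
```

After downloading `transcoder_L0.pt`, `matches_pointer("transcoder_L0.pt", pointer)` should return `True` if the file is intact.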