timm/vit_tiny_patch16_dinov3_qkvb.eupe_lvd1689m Image Feature Extraction • 5.49M • Updated 14 days ago • 61 • 2
NITP: Next Implicit Token Prediction for LLM Pre-training Paper • 2605.24956 • Published 18 days ago • 35
Draft-OPD: On-Policy Distillation for Speculative Draft Models Paper • 2605.29343 • Published 14 days ago • 33