S.F.'s picture

S.F.

search-facility

·

ipv6

AI & ML interests

None yet

Recent Activity

upvoted a paper about 4 hours ago

Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems

upvoted a paper about 4 hours ago

(1D) Ordered Tokens Enable Efficient Test-Time Search

upvoted a paper about 4 hours ago

Qwen3.5-Omni Technical Report

View all activity

Organizations

None yet

upvoted 3 papers about 4 hours ago

Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems

Paper • 2604.04936 • Published Jan 8 • 24

(1D) Ordered Tokens Enable Efficient Test-Time Search

Paper • 2604.15453 • Published 6 days ago • 15

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published 5 days ago • 39

upvoted a paper 5 days ago

Lyra 2.0: Explorable Generative 3D Worlds

Paper • 2604.13036 • Published 8 days ago • 37

upvoted 4 papers 6 days ago

Uni-ViGU: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator

Paper • 2604.08121 • Published 13 days ago • 42

CodeTracer: Towards Traceable Agent States

Paper • 2604.11641 • Published 9 days ago • 39

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music

Paper • 2604.10905 • Published 9 days ago • 28

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Paper • 2604.11297 • Published 9 days ago • 136

upvoted 2 papers 11 days ago

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published 15 days ago • 64

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

Paper • 2604.04746 • Published 14 days ago • 70

upvoted 2 papers 13 days ago

LightThinker++: From Reasoning Compression to Memory Management

Paper • 2604.03679 • Published 18 days ago • 36

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published 16 days ago • 120

upvoted a paper 17 days ago

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

Paper • 2604.02097 • Published 20 days ago • 32

upvoted a paper 18 days ago

GaussianGPT: Towards Autoregressive 3D Gaussian Scene Generation

Paper • 2603.26661 • Published 25 days ago • 26

upvoted 5 papers 24 days ago

Representation Alignment for Just Image Transformers is not Easier than You Think

Paper • 2603.14366 • Published Mar 15 • 13

Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting

Paper • 2603.25745 • Published 26 days ago • 16

AVControl: Efficient Framework for Training Audio-Visual Controls

Paper • 2603.24793 • Published 27 days ago • 26

Voxtral TTS

Paper • 2603.25551 • Published 26 days ago • 59

PixelSmile: Toward Fine-Grained Facial Expression Editing

Paper • 2603.25728 • Published 26 days ago • 117

upvoted a paper 27 days ago

Repurposing Geometric Foundation Models for Multi-view Diffusion

Paper • 2603.22275 • Published 29 days ago • 47