Hugging Face

Team

company

Verified

https://huggingface.co

huggingface

Activity Feed

AI & ML interests

The AI community building the future.

Recent Activity

nielsr updated a Space about 6 hours ago

huggingface/ai-deadlines

adamm-hf new activity about 13 hours ago

huggingface/InferenceSupport:lightx2v/Wan2.1-I2V-14B-720P-StepDistill-CfgDistill-Lightx2v

alvarobartt updated a dataset about 21 hours ago

huggingface/DEH-image-scan-data

View all activity

Papers

Qualixar OS: A Universal Operating System for AI Agent Orchestration

FineVision: Open Data Is All You Need

View all Papers

Articles

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

Jan 27

•

One Year Since the “DeepSeek Moment”

Jan 20

•

On the Shifting Global Compute Landscape

Oct 29, 2025

•

Announcing Hugging Face Fundamentals: A New Learning Track on DataCamp

Oct 16, 2025

•

Yay! Organizations can now publish blog Articles

Jan 20, 2025

•

View all articles

nielsr

updated a Space about 6 hours ago

AI Deadlines

⚡

716

Find upcoming AI conference deadlines in one place

adamm-hf

in huggingface/InferenceSupport about 13 hours ago

lightx2v/Wan2.1-I2V-14B-720P-StepDistill-CfgDistill-Lightx2v

#5192 opened 6 months ago by

Grtfr134567

alvarobartt

updated a dataset about 21 hours ago

huggingface/DEH-image-scan-data

Viewer • Updated about 21 hours ago • 4 • 8.45k • 10

sayakpaul

updated a dataset 2 days ago

huggingface/diffusers-metadata

Viewer • Updated 2 days ago • 96 • 1.38k • 26

enzostvs

in huggingface/documentation-images 2 days ago

Upload feetech-reset-offset.png

#604 opened 2 days ago by

maximellerbach

AdinaY

in huggingface/HuggingDiscussions 2 days ago

[FEEDBACK] Daily Papers

🔥❤️ 21

193

#32 opened almost 2 years ago by

kramp

dlouapre

in huggingface/eleusis-benchmark 3 days ago

Unordered questions and remarks

#1 opened 2 months ago by

qgallouedec

sergiopaniego

posted an update 3 days ago

Post

183

Great experience yesterday at PyTorch Conf Europe in Paris 🇫🇷

We (w/ @kashif ) talked about training LLMs through interaction, using trajectories across games, browsers, or simulators

Room was packed, a clear sign of interest in where RL post-training is heading.

sharing the slides! 🤓
https://drive.google.com/file/d/16k7YRnf5EJEo0XjXGlRJ_hVeLoFWKyNP/view?usp=sharing

tomaarsen

posted an update 3 days ago

Post

252

🌐 I've just published Sentence Transformers v5.4 to make the project fully multimodal for embeddings and reranking. The release also includes a modular CrossEncoder, and automatic Flash Attention 2 input flattening. Details:

You can now use SentenceTransformer and CrossEncoder with text, images, audio, and video, with the same familiar API. That means you can compute embeddings for an image and a text query using model.encode(), compare them with model.similarity(), and it just works. Models like Qwen3-VL-Embedding-2B and jinaai/jina-reranker-m0 are supported out of the box.

Beyond multimodal, I also fully modularized the CrossEncoder class. It's now a torch.nn.Sequential of composable modules, just like SentenceTransformer has been. This unlocked support for generative rerankers (CausalLM-based models like mxbai-rerank-v2 and the Qwen3 rerankers) via a new LogitScore module, which wasn't possible before without custom code.

Also, Flash Attention 2 now automatically skips padding for text-only inputs. If your batch has a mix of short and long texts, this gives you a nice speedup and lower VRAM usage for free.

I wrote a blog post walking through the multimodal features with practical examples. Check it out if you want to get started, or just point your Agent to the URL: https://huggingface.co/blog/multimodal-sentence-transformers

This release has set up the groundwork for more easily introducing late-interaction models (both text-only and multimodal) into Sentence Transformers in the next major release. I'm looking forward to it!

akhaliq

submitted a paper to Daily Papers 9 days ago

MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines

Paper • 2603.06679 • Published 14 days ago • 5

sergiopaniego

posted an update 10 days ago

Post

2699

Gemma 4 💎 is here and it’s strong!

to celebrate, we’re rolling out in TRL:

> support for multimodal tool responses for environments (OpenEnv)
> an example to train it in CARLA for autonomous driving with image-based tool calls

go check it out 🏎️🏎️

blog: https://huggingface.co/blog/gemma4
script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/carla_vlm_gemma.py

sergiopaniego

posted an update 12 days ago

Post

1954

TRL is officially an adult 🥳

excited to announce TRL v1.0❗️

head to the blog to see how we got here and what’s next for this post-training library, designed to keep pace with the field

https://huggingface.co/blog/trl-v1

2 replies

AdinaY

submitted a paper to Daily Papers 12 days ago

KAT-Coder-V2 Technical Report

Paper • 2603.27703 • Published 14 days ago • 9

akhaliq

submitted a paper to Daily Papers 16 days ago

AVO: Agentic Variation Operators for Autonomous Evolutionary Search

Paper • 2603.24517 • Published 18 days ago • 10

mishig

posted an update 17 days ago

Post

461

I like these models nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16 and nvidia/NVIDIA-Nemotron-3-Nano-4B-FP8 and TradingAgents: Multi-Agents LLM Financial Trading Framework (2412.20138) and https://arxiv.org/abs/2412.20138

mlabonne/FineTome-100k

akhaliq

submitted a paper to Daily Papers 25 days ago

V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising

Paper • 2603.16792 • Published 26 days ago • 3

akhaliq

submitted a paper to Daily Papers 28 days ago

Multimodal OCR: Parse Anything from Documents

Paper • 2603.13032 • Published about 1 month ago • 43

sergiopaniego

posted an update about 1 month ago

Post

742

ICYMI, great blog by @kashif and @stas on Ulysses Sequence Parallelism: train with million-token contexts

on 4×H100s: 12x longer sequences, 3.7x throughput

learn how to integrate it with Accelerate, Transformers, and TRL ⤵️
https://huggingface.co/blog/ulysses-sp

AdinaY

submitted a paper to Daily Papers about 1 month ago

Training Language Models via Neural Cellular Automata

Paper • 2603.10055 • Published Mar 9 • 8

sergiopaniego

posted an update about 1 month ago

Post

426

We just released a big blog surveying 16 OSS frameworks for async RL training of LLMs!

We're building a new async GRPO trainer for TRL and as first step, we needed to understand how the ecosystem solves this problem today.

The problem: in synchronous RL training, generation dominates wall-clock time. 32K-token rollouts on a 32B model take hours while training GPUs sit completely idle. With reasoning models and agentic RL making rollouts longer and more variable, this only gets worse.

The ecosystem converged on the same fix: separate inference + training onto different GPU pools, rollout buffer, and async weight sync.

We compared 16 frameworks across 7 axes: orchestration, buffer design, weight sync, staleness management, partial rollouts, LoRA, and MoE support.

This survey is step one. The async GRPO trainer for TRL is next!

https://huggingface.co/blog/async-rl-training-landscape

AI & ML interests

Recent Activity

Papers

Articles

From doctest to runnable Markdown

State of Open Source on Hugging Face: Spring 2026

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

One Year Since the “DeepSeek Moment”

On the Shifting Global Compute Landscape

Announcing Hugging Face Fundamentals: A New Learning Track on DataCamp

Yay! Organizations can now publish blog Articles

Team members 183

huggingface's activity

AI Deadlines

lightx2v/Wan2.1-I2V-14B-720P-StepDistill-CfgDistill-Lightx2v

Upload feetech-reset-offset.png

[FEEDBACK] Daily Papers

Unordered questions and remarks