Weikai Huang's picture

In a Training Loop 🔄

Weikai Huang PRO

weikaih

·

https://weikaih04.github.io/

weikaih04

AI & ML interests

None yet

Recent Activity

authored a paper 6 days ago

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

updated a collection 7 days ago

Spatial Mental Modeling Benchmark

updated a collection 7 days ago

Imaginative Perception Token Data

View all activity

Organizations

upvoted a paper 7 days ago

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

Paper • 2606.03988 • Published 13 days ago • 118

upvoted a paper about 1 month ago

MolmoAct2: Action Reasoning Models for Real-world Deployment

Paper • 2605.02881 • Published May 4 • 348

upvoted a paper about 2 months ago

Vero: An Open RL Recipe for General Visual Reasoning

Paper • 2604.04917 • Published Apr 6 • 34

upvoted a paper 2 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 247

upvoted 2 collections 2 months ago

WildDet3D

This is the collection of WildDet3D artifacts, including demos, model checkpoints and data. https://github.com/allenai/WildDet3D • 8 items • Updated Apr 13 • 20

Molmo2

Artifacts for the Molmo2 release • 5 items • Updated Mar 2 • 37

upvoted 2 papers 3 months ago

VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models

Paper • 2603.24575 • Published Mar 25 • 19

Synthetic Visual Genome 2: Extracting Large-scale Spatio-Temporal Scene Graphs from Videos

Paper • 2602.23543 • Published Feb 26 • 9

upvoted 2 papers 4 months ago

TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics

Paper • 2602.19313 • Published Feb 22 • 26

Experiential Reinforcement Learning

Paper • 2602.13949 • Published Feb 15 • 75

upvoted a collection 4 months ago

XGen-MM-1 models and datasets

A collection of all XGen-MM (Foundation LMM) models! • 15 items • Updated Mar 2 • 40

upvoted 2 collections 5 months ago

Open Coding Agents

13 items • Updated Mar 5 • 53

Molmo2 Data

Artifacts for the Molmo2 data release • 13 items • Updated Mar 2 • 42

upvoted a collection 7 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 171

upvoted 2 papers 7 months ago

BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing

Paper • 2506.17450 • Published Jun 20, 2025 • 64

Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index

Paper • 2506.12229 • Published Jun 13, 2025 • 3

upvoted a collection 10 months ago

DocRAG Datasets

Processed ("Unified") datasets used in DocRAG for training or inference purposes. • 12 items • Updated Jun 14, 2025 • 1

upvoted a paper 12 months ago

Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs

Paper • 2504.15280 • Published Apr 21, 2025 • 25

upvoted 2 collections about 1 year ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Dec 23, 2025 • 310

Synthetic Object Compositions for Det / Seg / Grounding

Dataset Collections for paper: https://github.com/weikaih04/Synthetic-Detection-Segmentation-Grounding-Data • 8 items • Updated Mar 2 • 2