DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation Paper • 2503.18944 • Published Mar 24, 2025
CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents Paper • 2601.09923 • Published Jan 14 • 5
LLM-Guided Task- and Affordance-Level Exploration in Reinforcement Learning Paper • 2509.16615 • Published Sep 20, 2025 • 1
Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects Paper • 2403.16428 • Published Mar 25, 2024
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think Paper • 2409.11355 • Published Sep 17, 2024 • 30
Point2Vec for Self-Supervised Representation Learning on Point Clouds Paper • 2303.16570 • Published Mar 29, 2023
InstrumentGen: Generating Sample-Based Musical Instruments From Text Paper • 2311.04339 • Published Nov 7, 2023
Distortion Audio Effects: Learning How to Recover the Clean Signal Paper • 2202.01664 • Published Feb 3, 2022
Generating Sample-Based Musical Instruments Using Neural Audio Codec Language Models Paper • 2407.15641 • Published Jul 22, 2024
DSP-informed bandwidth extension using locally-conditioned excitation and linear time-varying filter subnetworks Paper • 2407.15624 • Published Jul 22, 2024
The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models Paper • 2406.19999 • Published Jun 28, 2024 • 5
Cascaded Span Extraction and Response Generation for Document-Grounded Dialog Paper • 2106.07275 • Published Jun 14, 2021
ClimateGPT: Towards AI Synthesizing Interdisciplinary Research on Climate Change Paper • 2401.09646 • Published Jan 17, 2024 • 17
On Sampling-Based Training Criteria for Neural Language Modeling Paper • 2104.10507 • Published Apr 21, 2021
Investigation on Data Adaptation Techniques for Neural Named Entity Recognition Paper • 2110.05892 • Published Oct 12, 2021
Adapting Document-Grounded Dialog Systems to Spoken Conversations using Data Augmentation and a Noisy Channel Model Paper • 2112.08844 • Published Dec 16, 2021
Does Joint Training Really Help Cascaded Speech Translation? Paper • 2210.13700 • Published Oct 24, 2022
Controllable Factuality in Document-Grounded Dialog Systems Using a Noisy Channel Model Paper • 2210.17418 • Published Oct 31, 2022