Picture for James Glass

James Glass

MIT Computer Science and Artificial Intelligence Laboratory, MA, USA

DDPO-VC: Speaker De-Identification via Diffusion Denoising Policy Optimization

Add code
Jun 13, 2026
Viaarxiv icon

Overcoming State Inertia in Full-Duplex Spoken Language Models via Activation Steering

Add code
Jun 09, 2026
Viaarxiv icon

USAD 2.0: Scaling Representation Distillation for Universal Audio Understanding

Add code
Jun 04, 2026
Viaarxiv icon

TiCo: Time-Controllable Training for Spoken Dialogue Models

Add code
Mar 23, 2026
Viaarxiv icon

TaigiSpeech: A Low-Resource Real-World Speech Intent Dataset and Preliminary Results with Scalable Data Mining In-the-Wild

Add code
Mar 23, 2026
Viaarxiv icon

Beyond Single-Shot: Multi-step Tool Retrieval via Query Planning

Add code
Jan 12, 2026
Viaarxiv icon

TreePS-RAG: Tree-based Process Supervision for Reinforcement Learning in Agentic RAG

Add code
Jan 11, 2026
Viaarxiv icon

LibriVAD: A Scalable Open Dataset with Deep Learning Benchmarks for Voice Activity Detection

Add code
Dec 19, 2025
Viaarxiv icon

MetaCLIP 2: A Worldwide Scaling Recipe

Add code
Jul 29, 2025
Figure 1 for MetaCLIP 2: A Worldwide Scaling Recipe
Figure 2 for MetaCLIP 2: A Worldwide Scaling Recipe
Figure 3 for MetaCLIP 2: A Worldwide Scaling Recipe
Figure 4 for MetaCLIP 2: A Worldwide Scaling Recipe
Viaarxiv icon

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Add code
Jul 22, 2025
Viaarxiv icon