Scaled


Weight Tying Biases Token Embeddings Towards the Output Space

Mar 27, 2026
Via arXiv

Beyond Language: Grounding Referring Expressions with Hand Pointing in Egocentric Vision

Mar 27, 2026
Via arXiv

Drive-Through 3D Vehicle Exterior Reconstruction via Dynamic-Scene SfM and Distortion-Aware Gaussian Splatting

Mar 27, 2026
Via arXiv

VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward

Mar 27, 2026
Via arXiv

The Limits of Learning from Pictures and Text: Vision-Language Models and Embodied Scene Understanding

Mar 27, 2026
Via arXiv

The Multi-AMR Buffer Storage, Retrieval, and Reshuffling Problem: Exact and Heuristic Approaches

Mar 27, 2026
Via arXiv

JAL-Turn: Joint Acoustic-Linguistic Modeling for Real-Time and Robust Turn-Taking Detection in Full-Duplex Spoken Dialogue Systems

Mar 27, 2026
Via arXiv

UNIFERENCE: A Discrete Event Simulation Framework for Developing Distributed AI Models

Mar 27, 2026
Via arXiv

Near-Field MMSE Channel Estimation for THz RIS-aided Communications with Electromagnetic Interference

Mar 27, 2026
Via arXiv

120 Minutes and a Laptop: Minimalist Image-goal Navigation via Unsupervised Exploration and Offline RL

Mar 27, 2026
Via arXiv