Picture for Siyu Chen

Siyu Chen

Massachusetts Institute of Technology USA

Entropy-Guided GRVQ for Ultra-Low Bitrate Neural Speech Codec

Add code
Mar 02, 2026
Viaarxiv icon

Geometry OR Tracker: Universal Geometric Operating Room Tracking

Add code
Feb 28, 2026
Viaarxiv icon

On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking

Add code
Feb 18, 2026
Viaarxiv icon

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Add code
Feb 11, 2026
Viaarxiv icon

Time2General: Learning Spatiotemporal Invariant Representations for Domain-Generalization Video Semantic Segmentation

Add code
Feb 10, 2026
Viaarxiv icon

Unlocking Out-of-Distribution Generalization in Transformers via Recursive Latent Space Reasoning

Add code
Oct 15, 2025
Viaarxiv icon

SGS-3D: High-Fidelity 3D Instance Segmentation via Reliable Semantic Mask Splitting and Growing

Add code
Sep 05, 2025
Viaarxiv icon

EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events

Add code
Aug 09, 2025
Figure 1 for EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events
Figure 2 for EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events
Figure 3 for EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events
Figure 4 for EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events
Viaarxiv icon

ICM-Fusion: In-Context Meta-Optimized LoRA Fusion for Multi-Task Adaptation

Add code
Aug 06, 2025
Figure 1 for ICM-Fusion: In-Context Meta-Optimized LoRA Fusion for Multi-Task Adaptation
Figure 2 for ICM-Fusion: In-Context Meta-Optimized LoRA Fusion for Multi-Task Adaptation
Figure 3 for ICM-Fusion: In-Context Meta-Optimized LoRA Fusion for Multi-Task Adaptation
Figure 4 for ICM-Fusion: In-Context Meta-Optimized LoRA Fusion for Multi-Task Adaptation
Viaarxiv icon

Step-Audio 2 Technical Report

Add code
Jul 24, 2025
Figure 1 for Step-Audio 2 Technical Report
Figure 2 for Step-Audio 2 Technical Report
Figure 3 for Step-Audio 2 Technical Report
Figure 4 for Step-Audio 2 Technical Report
Viaarxiv icon