Picture for Yi Wang

Yi Wang

NUS

3DGeoDet: General-purpose Geometry-aware Image-based 3D Object Detection

Add code
Jun 11, 2025
Viaarxiv icon

Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning

Add code
Jun 09, 2025
Figure 1 for Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning
Figure 2 for Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning
Figure 3 for Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning
Figure 4 for Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning
Viaarxiv icon

VideoChat-A1: Thinking with Long Videos by Chain-of-Shot Reasoning

Add code
Jun 06, 2025
Viaarxiv icon

TC-GS: A Faster Gaussian Splatting Module Utilizing Tensor Cores

Add code
May 30, 2025
Viaarxiv icon

Beyond the LUMIR challenge: The pathway to foundational registration models

Add code
May 30, 2025
Figure 1 for Beyond the LUMIR challenge: The pathway to foundational registration models
Figure 2 for Beyond the LUMIR challenge: The pathway to foundational registration models
Figure 3 for Beyond the LUMIR challenge: The pathway to foundational registration models
Figure 4 for Beyond the LUMIR challenge: The pathway to foundational registration models
Viaarxiv icon

Synthetic Generation and Latent Projection Denoising of Rim Lesions in Multiple Sclerosis

Add code
May 29, 2025
Viaarxiv icon

PATS: Process-Level Adaptive Thinking Mode Switching

Add code
May 25, 2025
Viaarxiv icon

Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance

Add code
May 24, 2025
Figure 1 for Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance
Figure 2 for Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance
Figure 3 for Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance
Figure 4 for Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance
Viaarxiv icon

V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation

Add code
May 22, 2025
Viaarxiv icon

Large Language Model-Empowered Interactive Load Forecasting

Add code
May 22, 2025
Viaarxiv icon