Shuicheng Yan

NUS

Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs

Oct 02, 2025
Via arXiv

SAIL-VL2 Technical Report

Sep 18, 2025
Via arXiv

AgenTracer: Who Is Inducing Failure in the LLM Agentic Systems?

Sep 04, 2025
Via arXiv

PointDGRWKV: Generalizing RWKV-like Architecture to Unseen Domains for Point Cloud Classification

Aug 29, 2025
Via arXiv

Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation

Aug 07, 2025
Via arXiv

G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems

Jun 09, 2025
Via arXiv

RoboCerebra: A Large-scale Benchmark for Long-horizon Robotic Manipulation Evaluation

Jun 07, 2025
Via arXiv

Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

May 29, 2025
Via arXiv

On Path to Multimodal Generalist: General-Level and General-Bench

May 07, 2025
Via arXiv

Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation

Apr 22, 2025
Via arXiv