Siliang Tang

CORE: Code-based Inverse Self-Training Framework with Graph Expansion for Virtual Agents

Jan 05, 2026
Via arXiv

OmniMoGen: Unifying Human Motion Generation via Learning from Interleaved Text-Motion Instructions

Dec 22, 2025
Via arXiv

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Nov 14, 2025
Via arXiv

What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities

Jun 10, 2025
Via arXiv

MoA: Heterogeneous Mixture of Adapters for Parameter-Efficient Fine-Tuning of Large Language Models

Jun 06, 2025
Via arXiv

FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL

Jun 05, 2025
Via arXiv

On Path to Multimodal Generalist: General-Level and General-Bench

May 07, 2025
Via arXiv

Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens

Apr 20, 2025
Via arXiv

Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program

Apr 09, 2025
Via arXiv

Boosting Virtual Agent Learning and Reasoning: A Step-wise, Multi-dimensional, and Generalist Reward Model with Benchmark

Mar 24, 2025
Via arXiv