Picture for Wei Liu

Wei Liu

Peter

UniHM: Universal Human Motion Generation with Object Interactions in Indoor Scenes

Add code
May 19, 2025
Viaarxiv icon

Enhance Mobile Agents Thinking Process Via Iterative Preference Learning

Add code
May 18, 2025
Figure 1 for Enhance Mobile Agents Thinking Process Via Iterative Preference Learning
Figure 2 for Enhance Mobile Agents Thinking Process Via Iterative Preference Learning
Figure 3 for Enhance Mobile Agents Thinking Process Via Iterative Preference Learning
Figure 4 for Enhance Mobile Agents Thinking Process Via Iterative Preference Learning
Viaarxiv icon

How Reliable is Multilingual LLM-as-a-Judge?

Add code
May 18, 2025
Viaarxiv icon

Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents

Add code
May 17, 2025
Figure 1 for Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents
Figure 2 for Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents
Figure 3 for Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents
Figure 4 for Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents
Viaarxiv icon

XRAG: Cross-lingual Retrieval-Augmented Generation

Add code
May 15, 2025
Viaarxiv icon

DanceGRPO: Unleashing GRPO on Visual Generation

Add code
May 12, 2025
Figure 1 for DanceGRPO: Unleashing GRPO on Visual Generation
Figure 2 for DanceGRPO: Unleashing GRPO on Visual Generation
Figure 3 for DanceGRPO: Unleashing GRPO on Visual Generation
Figure 4 for DanceGRPO: Unleashing GRPO on Visual Generation
Viaarxiv icon

EcoLANG: Efficient and Effective Agent Communication Language Induction for Social Simulation

Add code
May 11, 2025
Viaarxiv icon

Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging

Add code
May 08, 2025
Viaarxiv icon

X-Driver: Explainable Autonomous Driving with Vision-Language Models

Add code
May 08, 2025
Figure 1 for X-Driver: Explainable Autonomous Driving with Vision-Language Models
Figure 2 for X-Driver: Explainable Autonomous Driving with Vision-Language Models
Figure 3 for X-Driver: Explainable Autonomous Driving with Vision-Language Models
Figure 4 for X-Driver: Explainable Autonomous Driving with Vision-Language Models
Viaarxiv icon

DocSpiral: A Platform for Integrated Assistive Document Annotation through Human-in-the-Spiral

Add code
May 06, 2025
Viaarxiv icon