Shengju Qian

PhysEditWorld: A Large-Scale Dataset Toward Physics-Editable World Models

Jun 25, 2026

Orchestra-o1: Omnimodal Agent Orchestration

Jun 10, 2026

OmniGameArena: A Unified UE5 Benchmark for VLM Game Agents with Improvement Dynamics

Jun 08, 2026

Struct-Searcher: Agentic Structural Thinking Advances Multimodal Deep Information Seeking

Jun 05, 2026

Policy and World Modeling Co-Training for Language Agents

Jun 01, 2026

On-Policy Adversarial Flow Distillation for Autoregressive Video Generation

May 25, 2026

SAP: Segment Any 4K Panorama

Mar 13, 2026

CoSMo3D: Open-World Promptable 3D Semantic Part Segmentation through LLM-Guided Canonical Spatial Modeling

Mar 01, 2026

AssetFormer: Modular 3D Assets Generation with Autoregressive Transformer

Feb 12, 2026

StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation

May 26, 2025