Shengqiong Wu

Orthogonal Spatial-temporal Distributional Transfer for 4D Generation

Mar 05, 2026
Via arXiv

UniM: A Unified Any-to-Any Interleaved Multimodal Benchmark

Mar 05, 2026
Via arXiv

Spatial Causal Prediction in Video

Mar 04, 2026
Via arXiv

Modeling Cross-vision Synergy for Unified Large Vision Model

Mar 03, 2026
Via arXiv

Synergizing Understanding and Generation with Interleaved Analyzing-Drafting Thinking

Feb 24, 2026
Via arXiv

JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation

Feb 22, 2026
Via arXiv

Global Commander and Local Operative: A Dual-Agent Framework for Scene Navigation

Feb 21, 2026
Via arXiv

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

Dec 28, 2025
Via arXiv

Training LLMs with LogicReward for Faithful and Rigorous Reasoning

Dec 20, 2025
Via arXiv

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist

Nov 11, 2025
Via arXiv