Shengqiong Wu

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

Dec 28, 2025
Via arXiv

Training LLMs with LogicReward for Faithful and Rigorous Reasoning

Dec 20, 2025
Via arXiv

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist

Nov 11, 2025
Via arXiv

On Path to Multimodal Generalist: General-Level and General-Bench

May 07, 2025
Via arXiv

VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models

Apr 17, 2025
Via arXiv

Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation

Mar 31, 2025
Via arXiv

JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

Mar 30, 2025
Via arXiv

Universal Scene Graph Generation

Mar 19, 2025
Via arXiv

Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene

Mar 19, 2025
Via arXiv

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

Mar 16, 2025
Via arXiv