Jie Shao

ViCO: A Training Strategy towards Semantic Aware Dynamic High-Resolution

Oct 14, 2025
Via arXiv

Factuality Matters: When Image Generation and Editing Meet Structured Visuals

Oct 06, 2025
Via arXiv

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Aug 25, 2025
Via arXiv

Enhanced Influence-aware Group Recommendation for Online Media Propagation

Jul 02, 2025
Via arXiv

CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation

Jun 12, 2025
Via arXiv

Who Reasons in the Large Language Models?

May 27, 2025
Via arXiv

CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design

May 25, 2025
Via arXiv

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Apr 15, 2025
Via arXiv

BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers

Mar 20, 2025
Via arXiv

Enhancing Online Reinforcement Learning with Meta-Learned Objective from Offline Data

Jan 13, 2025
Via arXiv