Hongsheng Li

EnerVerse-AC: Envisioning Embodied Environments with Action Condition

May 14, 2025

Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding

May 08, 2025

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

May 06, 2025

T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

May 01, 2025

From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

Apr 22, 2025

Exposing the Copycat Problem of Imitation-based Planner: A Novel Closed-Loop Simulator, Causal Benchmark and Joint IL-RL Baseline

Apr 20, 2025

High-Fidelity Diffusion Face Swapping with ID-Constrained Facial Conditioning

Mar 28, 2025

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis

Mar 27, 2025

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Mar 27, 2025

UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning

Mar 27, 2025