Picture for Liang Zheng

Liang Zheng

Effective Training Data Synthesis for Improving MLLM Chart Understanding

Add code
Aug 08, 2025
Viaarxiv icon

Vec2Face+ for Face Dataset Generation

Add code
Jul 23, 2025
Viaarxiv icon

DiSA: Diffusion Step Annealing in Autoregressive Image Generation

Add code
May 26, 2025
Viaarxiv icon

REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers

Add code
Apr 14, 2025
Viaarxiv icon

R2E-Gym: Procedural Environments and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Add code
Apr 09, 2025
Viaarxiv icon

ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models

Add code
Mar 04, 2025
Figure 1 for ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models
Figure 2 for ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models
Figure 3 for ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models
Figure 4 for ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models
Viaarxiv icon

Think on your feet: Seamless Transition between Human-like Locomotion in Response to Changing Commands

Add code
Feb 26, 2025
Figure 1 for Think on your feet: Seamless Transition between Human-like Locomotion in Response to Changing Commands
Figure 2 for Think on your feet: Seamless Transition between Human-like Locomotion in Response to Changing Commands
Figure 3 for Think on your feet: Seamless Transition between Human-like Locomotion in Response to Changing Commands
Figure 4 for Think on your feet: Seamless Transition between Human-like Locomotion in Response to Changing Commands
Viaarxiv icon

Learning Camera Movement Control from Real-World Drone Videos

Add code
Dec 12, 2024
Figure 1 for Learning Camera Movement Control from Real-World Drone Videos
Figure 2 for Learning Camera Movement Control from Real-World Drone Videos
Figure 3 for Learning Camera Movement Control from Real-World Drone Videos
Figure 4 for Learning Camera Movement Control from Real-World Drone Videos
Viaarxiv icon

Negative Token Merging: Image-based Adversarial Feature Guidance

Add code
Dec 02, 2024
Figure 1 for Negative Token Merging: Image-based Adversarial Feature Guidance
Figure 2 for Negative Token Merging: Image-based Adversarial Feature Guidance
Figure 3 for Negative Token Merging: Image-based Adversarial Feature Guidance
Figure 4 for Negative Token Merging: Image-based Adversarial Feature Guidance
Viaarxiv icon

Can We Predict Performance of Large Models across Vision-Language Tasks?

Add code
Oct 14, 2024
Figure 1 for Can We Predict Performance of Large Models across Vision-Language Tasks?
Figure 2 for Can We Predict Performance of Large Models across Vision-Language Tasks?
Figure 3 for Can We Predict Performance of Large Models across Vision-Language Tasks?
Figure 4 for Can We Predict Performance of Large Models across Vision-Language Tasks?
Viaarxiv icon