Picture for Zhiyuan Ma

Zhiyuan Ma

FaceEditTalker: Interactive Talking Head Generation with Facial Attribute Editing

Add code
May 28, 2025
Viaarxiv icon

Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation

Add code
May 28, 2025
Viaarxiv icon

Success is in the Details: Evaluate and Enhance Details Sensitivity of Code LLMs through Counterfactuals

Add code
May 20, 2025
Viaarxiv icon

Context-Aware Autoregressive Models for Multi-Conditional Image Generation

Add code
May 18, 2025
Viaarxiv icon

Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data

Add code
Mar 27, 2025
Viaarxiv icon

Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach

Add code
Dec 04, 2024
Figure 1 for Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach
Figure 2 for Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach
Figure 3 for Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach
Figure 4 for Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach
Viaarxiv icon

MVBoost: Boost 3D Reconstruction with Multi-View Refinement

Add code
Nov 26, 2024
Viaarxiv icon

VideoDirector: Precise Video Editing via Text-to-Video Models

Add code
Nov 26, 2024
Figure 1 for VideoDirector: Precise Video Editing via Text-to-Video Models
Figure 2 for VideoDirector: Precise Video Editing via Text-to-Video Models
Figure 3 for VideoDirector: Precise Video Editing via Text-to-Video Models
Figure 4 for VideoDirector: Precise Video Editing via Text-to-Video Models
Viaarxiv icon

DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning

Add code
Oct 16, 2024
Viaarxiv icon

CMAL: A Novel Cross-Modal Associative Learning Framework for Vision-Language Pre-Training

Add code
Oct 16, 2024
Viaarxiv icon