Picture for Zhiyuan Ma

Zhiyuan Ma

Safe-SD: Safe and Traceable Stable Diffusion with Text Prompt Trigger for Invisible Generative Watermarking

Add code
Jul 19, 2024
Viaarxiv icon

Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding

Add code
Jul 13, 2024
Viaarxiv icon

ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation

Add code
Jul 02, 2024
Viaarxiv icon

Neural Residual Diffusion Models for Deep Scalable Vision Generation

Add code
Jun 19, 2024
Viaarxiv icon

One-Step Effective Diffusion Network for Real-World Image Super-Resolution

Add code
Jun 12, 2024
Viaarxiv icon

UltraMedical: Building Specialized Generalists in Biomedicine

Add code
Jun 06, 2024
Viaarxiv icon

Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models

Add code
Mar 26, 2024
Viaarxiv icon

DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer

Add code
Feb 08, 2024
Viaarxiv icon

EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation

Add code
Feb 02, 2024
Viaarxiv icon

Generative Multi-Modal Knowledge Retrieval with Large Language Models

Add code
Jan 16, 2024
Viaarxiv icon