Text


FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models

Add code
Jul 03, 2025
Viaarxiv icon

UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation

Add code
Jul 03, 2025
Viaarxiv icon

APT: Adaptive Personalized Training for Diffusion Models with Limited Data

Add code
Jul 03, 2025
Viaarxiv icon

OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding

Add code
Jul 03, 2025
Viaarxiv icon

Open-Source System for Multilingual Translation and Cloned Speech Synthesis

Add code
Jul 03, 2025
Viaarxiv icon

JoyTTS: LLM-based Spoken Chatbot With Voice Cloning

Add code
Jul 03, 2025
Viaarxiv icon

A robust and versatile deep learning model for prediction of the arterial input function in dynamic small animal $\left[^{18}\text{F}\right]$FDG PET imaging

Add code
Jul 03, 2025
Viaarxiv icon

Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback

Add code
Jul 03, 2025
Viaarxiv icon

Are Synthetic Videos Useful? A Benchmark for Retrieval-Centric Evaluation of Synthetic Videos

Add code
Jul 03, 2025
Viaarxiv icon

Improving Constrained Generation in Language Models via Self-Distilled Twisted Sequential Monte Carlo

Add code
Jul 03, 2025
Viaarxiv icon