Picture for Di Huang

Di Huang

CoVFT: Context-aware Visual Fine-tuning for Multimodal Large Language Models

Add code
Mar 22, 2026
Viaarxiv icon

CTCal: Rethinking Text-to-Image Diffusion Models via Cross-Timestep Self-Calibration

Add code
Mar 21, 2026
Viaarxiv icon

EruDiff: Refactoring Knowledge in Diffusion Models for Advanced Text-to-Image Synthesis

Add code
Mar 21, 2026
Viaarxiv icon

QiMeng-CodeV-SVA: Training Specialized LLMs for Hardware Assertion Generation via RTL-Grounded Bidirectional Data Synthesis

Add code
Mar 15, 2026
Viaarxiv icon

Catalyst4D: High-Fidelity 3D-to-4D Scene Editing via Dynamic Propagation

Add code
Mar 13, 2026
Viaarxiv icon

HumDex:Humanoid Dexterous Manipulation Made Easy

Add code
Mar 12, 2026
Viaarxiv icon

$Ψ_0$: An Open Foundation Model Towards Universal Humanoid Loco-Manipulation

Add code
Mar 12, 2026
Viaarxiv icon

TokenSplat: Token-aligned 3D Gaussian Splatting for Feed-forward Pose-free Reconstruction

Add code
Feb 28, 2026
Viaarxiv icon

GraspLDP: Towards Generalizable Grasping Policy via Latent Diffusion

Add code
Feb 26, 2026
Viaarxiv icon

EgoHumanoid: Unlocking In-the-Wild Loco-Manipulation with Robot-Free Egocentric Demonstration

Add code
Feb 10, 2026
Viaarxiv icon