Hongxiang Li

DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation

May 23, 2025

VideoGen-Eval: Agent-based System for Video Generation Evaluation

Mar 30, 2025

BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

Mar 17, 2025

DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

Dec 13, 2024

Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine

Dec 12, 2024

PAPR Reduction based on Deep Learning Autoencoder in Coherent Optical OFDM Systems

Aug 26, 2024

Textual Inversion and Self-supervised Refinement for Radiology Report Generation

May 31, 2024

Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning

May 31, 2024

Uncertainty-aware sign language video retrieval with probability distribution modeling

May 30, 2024

Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation

Apr 03, 2024