Xiang Wang

Victor

AdaViP: Aligning Multi-modal LLMs via Adaptive Vision-enhanced Preference Optimization

Apr 22, 2025, via arXiv

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Apr 22, 2025, via arXiv

DMPT: Decoupled Modality-aware Prompt Tuning for Multi-modal Object Re-identification

Apr 15, 2025, via arXiv

Taming Consistency Distillation for Accelerated Human Image Animation

Apr 15, 2025, via arXiv

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer

Apr 15, 2025, via arXiv

SafeMLRM: Demystifying Safety in Multi-modal Large Reasoning Models

Apr 09, 2025, via arXiv

Exploring the Evolution of Physics Cognition in Video Generation: A Survey

Mar 27, 2025, via arXiv

Towards Unified Latent Space for 3D Molecular Latent Diffusion Modeling

Mar 19, 2025, via arXiv

Multimodal Language Modeling for High-Accuracy Single Cell Transcriptomics Analysis and Generation

Mar 12, 2025, via arXiv

Route Sparse Autoencoder to Interpret Large Language Models

Mar 11, 2025, via arXiv