Picture for Liang Wang

Liang Wang

Institute of Automation, CAS

A Call for New Recipes to Enhance Spatial Reasoning in MLLMs

Add code
Apr 21, 2025
Viaarxiv icon

Fast Adversarial Training with Weak-to-Strong Spatial-Temporal Consistency in the Frequency Domain on Videos

Add code
Apr 21, 2025
Viaarxiv icon

WT-BCP: Wavelet Transform based Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation

Add code
Apr 20, 2025
Viaarxiv icon

MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

Add code
Apr 07, 2025
Viaarxiv icon

Aligning Multimodal LLM with Human Preference: A Survey

Add code
Mar 18, 2025
Viaarxiv icon

Agents Play Thousands of 3D Video Games

Add code
Mar 17, 2025
Viaarxiv icon

DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models

Add code
Mar 17, 2025
Viaarxiv icon

Personalized Text Generation with Contrastive Activation Steering

Add code
Mar 07, 2025
Figure 1 for Personalized Text Generation with Contrastive Activation Steering
Figure 2 for Personalized Text Generation with Contrastive Activation Steering
Figure 3 for Personalized Text Generation with Contrastive Activation Steering
Figure 4 for Personalized Text Generation with Contrastive Activation Steering
Viaarxiv icon

Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows

Add code
Mar 06, 2025
Viaarxiv icon

A Compact Model for Large-Scale Time Series Forecasting

Add code
Feb 28, 2025
Viaarxiv icon