Picture for Wenbo Li

Wenbo Li

T-Rex: Task-Adaptive Spatial Representation Extraction for Robotic Manipulation with Vision-Language Models

Add code
Jun 24, 2025
Viaarxiv icon

AntiGrounding: Lifting Robotic Actions into VLM Representation Space for Decision Making

Add code
Jun 14, 2025
Viaarxiv icon

FastFLUX: Pruning FLUX with Block-wise Replacement and Sandwich Training

Add code
Jun 10, 2025
Viaarxiv icon

OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation

Add code
Jun 05, 2025
Viaarxiv icon

OSCAR: One-Step Diffusion Codec Across Multiple Bit-rates

Add code
May 22, 2025
Viaarxiv icon

PMQ-VE: Progressive Multi-Frame Quantization for Video Enhancement

Add code
May 18, 2025
Viaarxiv icon

PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment

Add code
May 16, 2025
Viaarxiv icon

Low-bit Model Quantization for Deep Neural Networks: A Survey

Add code
May 08, 2025
Viaarxiv icon

Dual Prompting Image Restoration with Diffusion Transformers

Add code
Apr 24, 2025
Viaarxiv icon

Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis

Add code
Apr 20, 2025
Viaarxiv icon