Bin Liu

Fanny

ALAM: Algebraically Consistent Latent Transitions for Vision-Language-Action Models

May 11, 2026
Via arXiv

D2ACE: Multi-Label Batch Selection Guided by Dual Dynamics and Adaptive Correlation Enhancement

May 10, 2026
Via arXiv

Advancing Aesthetic Image Generation via Composition Transfer

May 06, 2026
Via arXiv

SketchFaceGS: Real-Time Sketch-Driven Face Editing and Generation with Gaussian Splatting

Apr 21, 2026
Via arXiv

MSDS: Deep Structural Similarity with Multiscale Representation

Apr 21, 2026
Via arXiv

MMTalker: Multiresolution 3D Talking Head Synthesis with Multimodal Feature Fusion

Apr 03, 2026
Via arXiv

EchoGen: Cycle-Consistent Learning for Unified Layout-Image Generation and Understanding

Mar 18, 2026
Via arXiv

Multimodal Emotion Regression with Multi-Objective Optimization and VAD-Aware Audio Modeling for the 10th ABAW EMI Track

Mar 14, 2026
Via arXiv

HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising

Mar 09, 2026
Via arXiv

Breaking Coordinate Overfitting: Geometry-Aware WiFi Sensing for Cross-Layout 3D Pose Estimation

Jan 18, 2026
Via arXiv