Hong Li

Cybo-Waiter: A Physical Agentic Framework for Humanoid Whole-Body Locomotion-Manipulation

Mar 11, 2026
Via arXiv

ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning

Mar 10, 2026
Via arXiv

Dual Diffusion Models for Multi-modal Guided 3D Avatar Generation

Mar 04, 2026
Via arXiv

Motion Manipulation via Unsupervised Keypoint Positioning in Face Animation

Mar 04, 2026
Via arXiv

Micro-expression Recognition Based on Dual-branch Feature Extraction and Fusion

Feb 27, 2026
Via arXiv

Silent Inconsistency in Data-Parallel Full Fine-Tuning: Diagnosing Worker-Level Optimization Misalignment

Feb 16, 2026
Via arXiv

MeDocVL: A Visual Language Model for Medical Document Understanding and Parsing

Feb 06, 2026
Via arXiv

Visual Merit or Linguistic Crutch? A Close Look at DeepSeek-OCR

Jan 08, 2026
Via arXiv

Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

Dec 29, 2025
Via arXiv

Vision Transformer for Robust Occluded Person Reidentification in Complex Surveillance Scenes

Oct 31, 2025
Via arXiv