Hongsheng Li

MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control

Apr 07, 2026
Via arXiv

Weather-Conditioned Branch Routing for Robust LiDAR-Radar 3D Object Detection

Apr 07, 2026
Via arXiv

AURA: Always-On Understanding and Real-Time Assistance via Video Streams

Apr 05, 2026
Via arXiv

ReinDriveGen: Reinforcement Post-Training for Out-of-Distribution Driving Scene Generation

Apr 01, 2026
Via arXiv

ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework

Mar 21, 2026
Via arXiv

AR-CoPO: Align Autoregressive Video Generation with Contrastive Policy Optimization

Mar 18, 2026
Via arXiv

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Mar 10, 2026
Via arXiv

PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents

Mar 09, 2026
Via arXiv

MADCrowner: Margin Aware Dental Crown Design with Template Deformation and Refinement

Mar 05, 2026
Via arXiv

From Solver to Tutor: Evaluating the Pedagogical Intelligence of LLMs with KMP-Bench

Mar 03, 2026
Via arXiv