Ke Li

Jack

ProVG: Progressive Visual Grounding via Language Decoupling for Remote Sensing Imagery

Apr 02, 2026
Via arXiv

HMPDM: A Diffusion Model for Driving Video Prediction with Historical Motion Priors

Mar 28, 2026
Via arXiv

LLM-Driven Reasoning for Constraint-Aware Feature Selection in Industrial Systems

Mar 26, 2026
Via arXiv

The Interspeech 2026 Audio Encoder Capability Challenge for Large Audio Language Models

Mar 24, 2026
Via arXiv

Can a Robot Walk the Robotic Dog: Triple-Zero Collaborative Navigation for Heterogeneous Multi-Agent Systems

Mar 23, 2026
Via arXiv

Implicit Maximum Likelihood Estimation for Real-time Generative Model Predictive Control

Mar 14, 2026
Via arXiv

AI Agents, Language, Deep Learning and the Next Revolution in Science

Mar 09, 2026
Via arXiv

Efficient Decoder Scaling Strategy for Neural Routing Solvers

Feb 28, 2026
Via arXiv

SemVideo: Reconstructs What You Watch from Brain Activity via Hierarchical Semantic Guidance

Feb 25, 2026
Via arXiv

No Need For Real Anomaly: MLLM Empowered Zero-Shot Video Anomaly Detection

Feb 22, 2026
Via arXiv