Picture for Ke Li

Ke Li

Jack

Training-Free Open-Vocabulary Visual Grounding for Remote Sensing Images and Videos

Add code
Jun 15, 2026
Viaarxiv icon

Y-BotFrame: An Extensible Embodied Agent Framework for Quadruped Robot Assistants

Add code
Jun 11, 2026
Viaarxiv icon

AerialClaw: An Open-Source Framework for LLM-Driven Autonomous Aerial Agents

Add code
Jun 10, 2026
Viaarxiv icon

Enhancing Multilingual LLM-based ASR with Mixture of Experts and Dynamic Downsampling

Add code
Jun 09, 2026
Viaarxiv icon

RigPAPR: Rig-Based Animation of Static Neural Point Clouds from a Fixed-Viewpoint Video

Add code
Jun 04, 2026
Viaarxiv icon

The Sword, Shield, and Achilles' Heel: Characterizing the Linguistic Inductive Bias of Large Language Models for Spatial Reasoning in Navigation Planning

Add code
May 29, 2026
Viaarxiv icon

InfoQuant: Shaping Activation Distributions for Low-Bit LLM Quantization

Add code
May 25, 2026
Viaarxiv icon

Latent Dynamics for Full Body Avatar Animation

Add code
May 20, 2026
Viaarxiv icon

On the Generation and Mitigation of Harmful Geometry in Image-to-3D Models

Add code
May 10, 2026
Viaarxiv icon

Stego Battlefield: Evaluating Image Steganography Attacks and Steganalysis Defenses

Add code
May 07, 2026
Viaarxiv icon