Ke Li

Jack

On the Generation and Mitigation of Harmful Geometry in Image-to-3D Models

May 10, 2026, via arXiv

X-Voice: Enabling Everyone to Speak 30 Languages via Zero-Shot Cross-Lingual Voice Cloning

May 07, 2026, via arXiv

Stego Battlefield: Evaluating Image Steganography Attacks and Steganalysis Defenses

May 07, 2026, via arXiv

GeoEdit: Local Frames for Fast, Training-Free On-Manifold Editing in Diffusion Models

Apr 27, 2026, via arXiv

The Second Challenge on Cross-Domain Few-Shot Object Detection at NTIRE 2026: Methods and Results

Apr 13, 2026, via arXiv

IceCache: Memory-efficient KV-cache Management for Long-Sequence LLMs

Apr 12, 2026, via arXiv

DIRECT: Video Mashup Creation via Hierarchical Multi-Agent Planning and Intent-Guided Editing

Apr 06, 2026, via arXiv

ProVG: Progressive Visual Grounding via Language Decoupling for Remote Sensing Imagery

Apr 02, 2026, via arXiv

HMPDM: A Diffusion Model for Driving Video Prediction with Historical Motion Priors

Mar 28, 2026, via arXiv

LLM-Driven Reasoning for Constraint-Aware Feature Selection in Industrial Systems

Mar 26, 2026, via arXiv