Kai Ye

The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images without Any 3D Knowledge

Jun 11, 2025
Via arXiv

Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning

May 24, 2025
Via arXiv

More Clear, More Flexible, More Precise: A Comprehensive Oriented Object Detection benchmark for UAV

Apr 28, 2025
Via arXiv

Cross-Frequency Implicit Neural Representation with Self-Evolving Parameters

Apr 15, 2025
Via arXiv

S$^2$Teacher: Step-by-step Teacher for Sparsely Annotated Oriented Object Detection

Apr 15, 2025
Via arXiv

SafeSpeech: Robust and Universal Voice Protection Against Malicious Speech Synthesis

Apr 14, 2025
Via arXiv

Zeus: Zero-shot LLM Instruction for Union Segmentation in Multimodal Medical Imaging

Apr 09, 2025
Via arXiv

Enhancing Large-scale UAV Route Planing with Global and Local Features via Reinforcement Graph Fusion

Dec 20, 2024
Via arXiv

Grasp What You Want: Embodied Dexterous Grasping System Driven by Your Voice

Dec 14, 2024
Via arXiv

An Efficient Dynamic Resource Allocation Framework for Evolutionary Bilevel Optimization

Oct 31, 2024
Via arXiv