Zheng Li

Department of Computer Science, Cornell Tech

VLCache: Computing 2% Vision Tokens and Reusing 98% for Vision-Language Inference

Dec 18, 2025
Via arXiv

UltraDP: Generalizable Carotid Ultrasound Scanning with Force-Aware Diffusion Policy

Nov 19, 2025
Via arXiv

CATS-V2V: A Real-World Vehicle-to-Vehicle Cooperative Perception Dataset with Complex Adverse Traffic Scenarios

Nov 14, 2025
Via arXiv

Compensating Distribution Drifts in Class-incremental Learning of Pre-trained Vision Transformers

Nov 13, 2025
Via arXiv

MEJO: MLLM-Engaged Surgical Triplet Recognition via Inter- and Intra-Task Joint Optimization

Sep 16, 2025
Via arXiv

Hunyuan-MT Technical Report

Sep 05, 2025
Via arXiv

PAUL: Uncertainty-Guided Partition and Augmentation for Robust Cross-View Geo-Localization under Noisy Correspondence

Aug 27, 2025
Via arXiv

Few-shot Unknown Class Discovery of Hyperspectral Images with Prototype Learning and Clustering

Aug 25, 2025
Via arXiv

Quantization Meets Spikes: Lossless Conversion in the First Timestep via Polarity Multi-Spike Mapping

Aug 20, 2025
Via arXiv

SessionIntentBench: A Multi-task Inter-session Intention-shift Modeling Benchmark for E-commerce Customer Behavior Understanding

Jul 27, 2025
Via arXiv