Picture for Kai Zhao

Kai Zhao

MetaSynth: Multi-Agent Metadata Generation from Implicit Feedback in Black-Box Systems

Add code
Oct 01, 2025
Viaarxiv icon

Spatial Reasoning in Foundation Models: Benchmarking Object-Centric Spatial Understanding

Add code
Sep 26, 2025
Viaarxiv icon

Millisecond-Response Tracking and Gazing System for UAVs: A Domestic Solution Based on "Phytium + Cambricon"

Add code
Sep 04, 2025
Viaarxiv icon

PoRe: Position-Reweighted Visual Token Pruning for Vision Language Models

Add code
Aug 25, 2025
Viaarxiv icon

CC-Time: Cross-Model and Cross-Modality Time Series Forecasting

Add code
Aug 17, 2025
Figure 1 for CC-Time: Cross-Model and Cross-Modality Time Series Forecasting
Figure 2 for CC-Time: Cross-Model and Cross-Modality Time Series Forecasting
Figure 3 for CC-Time: Cross-Model and Cross-Modality Time Series Forecasting
Figure 4 for CC-Time: Cross-Model and Cross-Modality Time Series Forecasting
Viaarxiv icon

CodeBoost: Boosting Code LLMs by Squeezing Knowledge from Code Snippets with RL

Add code
Aug 07, 2025
Viaarxiv icon

VL-CLIP: Enhancing Multimodal Recommendations via Visual Grounding and LLM-Augmented CLIP Embeddings

Add code
Jul 22, 2025
Viaarxiv icon

Simple Graph Contrastive Learning via Fractional-order Neural Diffusion Networks

Add code
Apr 24, 2025
Viaarxiv icon

Efficient Federated Split Learning for Large Language Models over Communication Networks

Add code
Apr 20, 2025
Figure 1 for Efficient Federated Split Learning for Large Language Models over Communication Networks
Figure 2 for Efficient Federated Split Learning for Large Language Models over Communication Networks
Figure 3 for Efficient Federated Split Learning for Large Language Models over Communication Networks
Figure 4 for Efficient Federated Split Learning for Large Language Models over Communication Networks
Viaarxiv icon

Zero-Shot Image-Based Large Language Model Approach to Road Pavement Monitoring

Add code
Apr 09, 2025
Viaarxiv icon