Picture for Kai Zhang

Kai Zhang

Victor

ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies

Add code
Jun 15, 2025
Viaarxiv icon

Post-Training Quantization for Video Matting

Add code
Jun 12, 2025
Viaarxiv icon

Test-Time Training Done Right

Add code
May 29, 2025
Figure 1 for Test-Time Training Done Right
Figure 2 for Test-Time Training Done Right
Figure 3 for Test-Time Training Done Right
Figure 4 for Test-Time Training Done Right
Viaarxiv icon

ARM: Adaptive Reasoning Model

Add code
May 26, 2025
Figure 1 for ARM: Adaptive Reasoning Model
Figure 2 for ARM: Adaptive Reasoning Model
Figure 3 for ARM: Adaptive Reasoning Model
Figure 4 for ARM: Adaptive Reasoning Model
Viaarxiv icon

CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists' Diagnostic Logic

Add code
May 26, 2025
Viaarxiv icon

Self-Reflective Planning with Knowledge Graphs: Enhancing LLM Reasoning Reliability for Question Answering

Add code
May 26, 2025
Viaarxiv icon

Route to Reason: Adaptive Routing for LLM and Reasoning Strategy Selection

Add code
May 26, 2025
Figure 1 for Route to Reason: Adaptive Routing for LLM and Reasoning Strategy Selection
Figure 2 for Route to Reason: Adaptive Routing for LLM and Reasoning Strategy Selection
Figure 3 for Route to Reason: Adaptive Routing for LLM and Reasoning Strategy Selection
Figure 4 for Route to Reason: Adaptive Routing for LLM and Reasoning Strategy Selection
Viaarxiv icon

A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking

Add code
May 26, 2025
Figure 1 for A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking
Figure 2 for A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking
Figure 3 for A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking
Figure 4 for A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking
Viaarxiv icon

Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning

Add code
May 23, 2025
Viaarxiv icon

R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search

Add code
May 22, 2025
Viaarxiv icon