Picture for Yao Fu

Yao Fu

Sample-Mean Anchored Thompson Sampling for Offline-to-Online Learning with Distribution Shift

Add code
May 11, 2026
Viaarxiv icon

RAM-H1200: A Unified Evaluation and Dataset on Hand Radiographs for Rheumatoid Arthritis

Add code
May 07, 2026
Viaarxiv icon

Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos

Add code
Mar 23, 2026
Viaarxiv icon

Quantized but Deceptive? A Multi-Dimensional Truthfulness Evaluation of Quantized LLMs

Add code
Aug 26, 2025
Viaarxiv icon

When Truthful Representations Flip Under Deceptive Instructions?

Add code
Jul 29, 2025
Viaarxiv icon

FAEDKV: Infinite-Window Fourier Transform for Unbiased KV Cache Compression

Add code
Jul 26, 2025
Figure 1 for FAEDKV: Infinite-Window Fourier Transform for Unbiased KV Cache Compression
Figure 2 for FAEDKV: Infinite-Window Fourier Transform for Unbiased KV Cache Compression
Figure 3 for FAEDKV: Infinite-Window Fourier Transform for Unbiased KV Cache Compression
Figure 4 for FAEDKV: Infinite-Window Fourier Transform for Unbiased KV Cache Compression
Viaarxiv icon

HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing

Add code
May 18, 2025
Figure 1 for HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing
Figure 2 for HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing
Figure 3 for HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing
Figure 4 for HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing
Viaarxiv icon

MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems

Add code
May 16, 2025
Viaarxiv icon

MoE-CAP: Cost-Accuracy-Performance Benchmarking for Mixture-of-Experts Systems

Add code
Dec 10, 2024
Viaarxiv icon

Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models

Add code
Nov 25, 2024
Figure 1 for Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models
Figure 2 for Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models
Figure 3 for Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models
Figure 4 for Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models
Viaarxiv icon