Picture for Xiaoran Fan

Xiaoran Fan

LifeAgentBench: A Multi-dimensional Benchmark and Agent for Personal Health Assistants in Digital Health

Add code
Jan 20, 2026
Viaarxiv icon

Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment

Add code
Jan 20, 2026
Viaarxiv icon

MHA2MLA-VLM: Enabling DeepSeek's Economical Multi-Head Latent Attention across Vision-Language Models

Add code
Jan 16, 2026
Viaarxiv icon

MDAR: A Multi-scene Dynamic Audio Reasoning Benchmark

Add code
Sep 26, 2025
Viaarxiv icon

Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments

Add code
Aug 12, 2025
Figure 1 for Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments
Figure 2 for Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments
Figure 3 for Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments
Figure 4 for Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments
Viaarxiv icon

Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction

Add code
Jun 14, 2025
Figure 1 for Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction
Figure 2 for Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction
Figure 3 for Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction
Figure 4 for Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction
Viaarxiv icon

Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations

Add code
Mar 19, 2025
Viaarxiv icon

EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection

Add code
Mar 03, 2025
Figure 1 for EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection
Figure 2 for EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection
Figure 3 for EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection
Figure 4 for EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection
Viaarxiv icon

Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training

Add code
Feb 06, 2025
Viaarxiv icon

Distill Visual Chart Reasoning Ability from LLMs to MLLMs

Add code
Oct 24, 2024
Figure 1 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Figure 2 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Figure 3 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Figure 4 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Viaarxiv icon