Picture for Xia Hu

Xia Hu

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Add code
Jun 11, 2025
Viaarxiv icon

AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models

Add code
May 28, 2025
Viaarxiv icon

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

Add code
Apr 15, 2025
Viaarxiv icon

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Add code
Mar 20, 2025
Viaarxiv icon

You Only Debias Once: Towards Flexible Accuracy-Fairness Trade-offs at Inference Time

Add code
Mar 10, 2025
Viaarxiv icon

More for Keys, Less for Values: Adaptive KV Cache Quantization

Add code
Feb 20, 2025
Viaarxiv icon

Confident or Seek Stronger: Exploring Uncertainty-Based On-device LLM Routing From Benchmarking to Generalization

Add code
Feb 06, 2025
Viaarxiv icon

Survey and Improvement Strategies for Gene Prioritization with Large Language Models

Add code
Jan 30, 2025
Figure 1 for Survey and Improvement Strategies for Gene Prioritization with Large Language Models
Figure 2 for Survey and Improvement Strategies for Gene Prioritization with Large Language Models
Figure 3 for Survey and Improvement Strategies for Gene Prioritization with Large Language Models
Viaarxiv icon

Dialogue is Better Than Monologue: Instructing Medical LLMs via Strategical Conversations

Add code
Jan 29, 2025
Viaarxiv icon

AD-LLM: Benchmarking Large Language Models for Anomaly Detection

Add code
Dec 15, 2024
Figure 1 for AD-LLM: Benchmarking Large Language Models for Anomaly Detection
Figure 2 for AD-LLM: Benchmarking Large Language Models for Anomaly Detection
Figure 3 for AD-LLM: Benchmarking Large Language Models for Anomaly Detection
Figure 4 for AD-LLM: Benchmarking Large Language Models for Anomaly Detection
Viaarxiv icon