Huaxiu Yao

Improving Alignment in LVLMs with Debiased Self-Judgment

Aug 28, 2025
Via arXiv

UQ: Assessing Language Models on Unsolved Questions

Aug 25, 2025
Via arXiv

Mimicking the Physicist's Eye: A VLM-centric Approach for Physics Formula Discovery

Aug 24, 2025
Via arXiv

Efficient Long CoT Reasoning in Small Language Models

May 24, 2025
Via arXiv

From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization

May 22, 2025
Via arXiv

CellTypeAgent: Trustworthy cell type annotation with Large Language Models

May 13, 2025
Via arXiv

Anyprefer: An Agentic Framework for Preference Data Synthesis

Apr 27, 2025
Via arXiv

Synergistic Weak-Strong Collaboration by Aligning Preferences

Apr 22, 2025
Via arXiv

Token Level Routing Inference System for Edge Devices

Apr 10, 2025
Via arXiv

Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors

Apr 07, 2025
Via arXiv