Picture for Miao Xiong

Miao Xiong

ConfTuner: Training Large Language Models to Express Their Confidence Verbally

Add code
Aug 26, 2025
Viaarxiv icon

MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research

Add code
May 26, 2025
Viaarxiv icon

Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging

Add code
May 08, 2025
Viaarxiv icon

Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results

Add code
Apr 18, 2025
Figure 1 for Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results
Figure 2 for Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results
Figure 3 for Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results
Figure 4 for Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results
Viaarxiv icon

Do LLMs estimate uncertainty well in instruction-following?

Add code
Oct 18, 2024
Figure 1 for Do LLMs estimate uncertainty well in instruction-following?
Figure 2 for Do LLMs estimate uncertainty well in instruction-following?
Figure 3 for Do LLMs estimate uncertainty well in instruction-following?
Figure 4 for Do LLMs estimate uncertainty well in instruction-following?
Viaarxiv icon

FlipAttack: Jailbreak LLMs via Flipping

Add code
Oct 02, 2024
Figure 1 for FlipAttack: Jailbreak LLMs via Flipping
Figure 2 for FlipAttack: Jailbreak LLMs via Flipping
Figure 3 for FlipAttack: Jailbreak LLMs via Flipping
Figure 4 for FlipAttack: Jailbreak LLMs via Flipping
Viaarxiv icon

ID$^3$: Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition

Add code
Sep 26, 2024
Figure 1 for ID$^3$: Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition
Figure 2 for ID$^3$: Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition
Figure 3 for ID$^3$: Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition
Figure 4 for ID$^3$: Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition
Viaarxiv icon

In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation

Add code
Mar 12, 2024
Figure 1 for In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation
Figure 2 for In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation
Figure 3 for In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation
Figure 4 for In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation
Viaarxiv icon

Prompt-and-Align: Prompt-Based Social Alignment for Few-Shot Fake News Detection

Add code
Sep 28, 2023
Viaarxiv icon

Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs

Add code
Jun 22, 2023
Figure 1 for Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs
Figure 2 for Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs
Figure 3 for Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs
Figure 4 for Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs
Viaarxiv icon