Xiao Yu

REINA: Regularized Entropy Information-Based Loss for Efficient Simultaneous Speech Translation

Aug 07, 2025

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report

Jul 22, 2025

PreFM: Online Audio-Visual Event Parsing via Predictive Future Modeling

May 29, 2025

MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism

Apr 03, 2025

Strategize Globally, Adapt Locally: A Multi-Turn Red Teaming Agent with Dual-Level Learning

Apr 02, 2025

ConFit v2: Improving Resume-Job Matching using Hypothetical Resume Embedding and Runner-Up Hard-Negative Mining

Feb 19, 2025

IPSeg: Image Posterior Mitigates Semantic Drift in Class-Incremental Segmentation

Feb 07, 2025

FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings

Jan 11, 2025

Memory-Reduced Meta-Learning with Guaranteed Convergence

Dec 16, 2024

HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks

Oct 16, 2024