Picture for Yongjin Yang

Yongjin Yang

UniSAFE: A Comprehensive Benchmark for Safety Evaluation of Unified Multimodal Models

Add code
Mar 18, 2026
Viaarxiv icon

Automated Skill Discovery for Language Agents through Exploration and Iterative Feedback

Add code
Jun 04, 2025
Viaarxiv icon

Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness

Add code
May 29, 2025
Figure 1 for Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness
Figure 2 for Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness
Figure 3 for Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness
Figure 4 for Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness
Viaarxiv icon

Self-Training Elicits Concise Reasoning in Large Language Models

Add code
Feb 28, 2025
Viaarxiv icon

Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models

Add code
Oct 14, 2024
Figure 1 for Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models
Figure 2 for Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models
Figure 3 for Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models
Figure 4 for Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models
Viaarxiv icon

MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty

Add code
Aug 13, 2024
Viaarxiv icon

CSRT: Evaluation and Analysis of LLMs using Code-Switching Red-Teaming Dataset

Add code
Jun 17, 2024
Figure 1 for CSRT: Evaluation and Analysis of LLMs using Code-Switching Red-Teaming Dataset
Figure 2 for CSRT: Evaluation and Analysis of LLMs using Code-Switching Red-Teaming Dataset
Figure 3 for CSRT: Evaluation and Analysis of LLMs using Code-Switching Red-Teaming Dataset
Figure 4 for CSRT: Evaluation and Analysis of LLMs using Code-Switching Red-Teaming Dataset
Viaarxiv icon

Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQL

Add code
Apr 29, 2024
Figure 1 for Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQL
Figure 2 for Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQL
Figure 3 for Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQL
Figure 4 for Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQL
Viaarxiv icon

Leveraging Normalization Layer in Adapters With Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning

Add code
Dec 18, 2023
Figure 1 for Leveraging Normalization Layer in Adapters With Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning
Figure 2 for Leveraging Normalization Layer in Adapters With Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning
Figure 3 for Leveraging Normalization Layer in Adapters With Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning
Figure 4 for Leveraging Normalization Layer in Adapters With Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning
Viaarxiv icon

Improving Adaptability and Generalizability of Efficient Transfer Learning for Vision-Language Models

Add code
Nov 27, 2023
Figure 1 for Improving Adaptability and Generalizability of Efficient Transfer Learning for Vision-Language Models
Figure 2 for Improving Adaptability and Generalizability of Efficient Transfer Learning for Vision-Language Models
Figure 3 for Improving Adaptability and Generalizability of Efficient Transfer Learning for Vision-Language Models
Figure 4 for Improving Adaptability and Generalizability of Efficient Transfer Learning for Vision-Language Models
Viaarxiv icon