Picture for Yuxi Huang

Yuxi Huang

ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation

Add code
Oct 09, 2025
Figure 1 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Figure 2 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Figure 3 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Figure 4 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Viaarxiv icon

A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy

Add code
Jan 16, 2025
Figure 1 for A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy
Figure 2 for A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy
Figure 3 for A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy
Figure 4 for A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy
Viaarxiv icon