Alert button
Picture for Jiahao Ying

Jiahao Ying

Alert button

Have Seen Me Before? Automating Dataset Updates Towards Reliable and Timely Evaluation

Add code
Bookmark button
Alert button
Feb 28, 2024
Jiahao Ying, Yixin Cao, Bo Wang, Wei Tang, Yizhe Yang, Shuicheng Yan

Viaarxiv icon

Intuitive or Dependent? Investigating LLMs' Robustness to Conflicting Prompts

Add code
Bookmark button
Alert button
Oct 03, 2023
Jiahao Ying, Yixin Cao, Kai Xiong, Yidong He, Long Cui, Yongbin Liu

Viaarxiv icon

Benchmarking Foundation Models with Language-Model-as-an-Examiner

Add code
Bookmark button
Alert button
Jun 07, 2023
Yushi Bai, Jiahao Ying, Yixin Cao, Xin Lv, Yuze He, Xiaozhi Wang, Jifan Yu, Kaisheng Zeng, Yijia Xiao, Haozhe Lyu, Jiayin Zhang, Juanzi Li, Lei Hou

Figure 1 for Benchmarking Foundation Models with Language-Model-as-an-Examiner
Figure 2 for Benchmarking Foundation Models with Language-Model-as-an-Examiner
Figure 3 for Benchmarking Foundation Models with Language-Model-as-an-Examiner
Figure 4 for Benchmarking Foundation Models with Language-Model-as-an-Examiner
Viaarxiv icon