Huan Sun

AttributionBench: How Hard is Automatic Attribution Evaluation?

Feb 23, 2024
Yifei Li, Xiang Yue, Zeyi Liao, Huan Sun

A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models

Feb 18, 2024
Jaylen Jones, Lingbo Mo, Eric Fosler-Lussier, Huan Sun

LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset

Feb 17, 2024
Botao Yu, Frazier N. Baker, Ziqi Chen, Xia Ning, Huan Sun

When is Tree Search Useful for LLM Planning? It Depends on the Discriminator

Feb 16, 2024
Ziru Chen, Michael White, Raymond Mooney, Ali Payani, Yu Su, Huan Sun

A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents

Feb 15, 2024
Lingbo Mo, Zeyi Liao, Boyuan Zheng, Yu Su, Chaowei Xiao, Huan Sun

eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data

Feb 13, 2024
Bo Peng, Xinyi Ling, Ziru Chen, Huan Sun, Xia Ning

GPT-4V(ision) is a Generalist Web Agent, if Grounded

Jan 03, 2024
Boyuan Zheng, Boyu Gou, Jihyung Kil, Huan Sun, Yu Su

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Nov 27, 2023
Xiang Yue, Yuansheng Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen

How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities

Nov 15, 2023
Lingbo Mo, Boshi Wang, Muhao Chen, Huan Sun
