Alert button
Picture for Xiangru Tang

Xiangru Tang

Alert button

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

Add code
Bookmark button
Alert button
Nov 20, 2023
Zhuosheng Zhang, Yao Yao, Aston Zhang, Xiangru Tang, Xinbei Ma, Zhiwei He, Yiming Wang, Mark Gerstein, Rui Wang, Gongshen Liu, Hai Zhao

Figure 1 for Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Figure 2 for Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Figure 3 for Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Figure 4 for Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Viaarxiv icon

ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks

Add code
Bookmark button
Alert button
Nov 16, 2023
Yuliang Liu, Xiangru Tang, Zefan Cai, Junjie Lu, Yichi Zhang, Yanjun Shao, Zexuan Deng, Helan Hu, Zengxian Yang, Kaikai An, Ruijun Huang, Shuzheng Si, Sheng Chen, Haozhe Zhao, Zhengliang Li, Liang Chen, Yiming Zong, Yan Wang, Tianyu Liu, Zhiwei Jiang, Baobao Chang, Yujia Qin, Wangchunshu Zhou, Yilun Zhao, Arman Cohan, Mark Gerstein

Viaarxiv icon

MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning

Add code
Bookmark button
Alert button
Nov 16, 2023
Xiangru Tang, Anni Zou, Zhuosheng Zhang, Yilun Zhao, Xingyao Zhang, Arman Cohan, Mark Gerstein

Figure 1 for MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning
Figure 2 for MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning
Figure 3 for MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning
Figure 4 for MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning
Viaarxiv icon

DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data

Add code
Bookmark button
Alert button
Nov 16, 2023
Yilun Zhao, Yitao Long, Hongjun Liu, Linyong Nan, Lyuhao Chen, Ryo Kamoi, Yixin Liu, Xiangru Tang, Rui Zhang, Arman Cohan

Viaarxiv icon

Investigating Data Contamination in Modern Benchmarks for Large Language Models

Add code
Bookmark button
Alert button
Nov 16, 2023
Chunyuan Deng, Yilun Zhao, Xiangru Tang, Mark Gerstein, Arman Cohan

Viaarxiv icon

Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity

Add code
Bookmark button
Alert button
Oct 18, 2023
Cunxiang Wang, Xiaoze Liu, Yuanhao Yue, Xiangru Tang, Tianhang Zhang, Cheng Jiayang, Yunzhi Yao, Wenyang Gao, Xuming Hu, Zehan Qi, Yidong Wang, Linyi Yang, Jindong Wang, Xing Xie, Zheng Zhang, Yue Zhang

Figure 1 for Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Figure 2 for Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Figure 3 for Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Figure 4 for Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Viaarxiv icon

Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models

Add code
Bookmark button
Alert button
Oct 11, 2023
Anni Zou, Zhuosheng Zhang, Hai Zhao, Xiangru Tang

Figure 1 for Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
Figure 2 for Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
Figure 3 for Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
Figure 4 for Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
Viaarxiv icon

Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?

Add code
Bookmark button
Alert button
Sep 19, 2023
Xiangru Tang, Yiming Zong, Jason Phang, Yilun Zhao, Wangchunshu Zhou, Arman Cohan, Mark Gerstein

Figure 1 for Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?
Figure 2 for Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?
Figure 3 for Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?
Figure 4 for Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?
Viaarxiv icon

BioCoder: A Benchmark for Bioinformatics Code Generation with Contextual Pragmatic Knowledge

Add code
Bookmark button
Alert button
Sep 05, 2023
Xiangru Tang, Bill Qian, Rick Gao, Jiakang Chen, Xinyun Chen, Mark Gerstein

Figure 1 for BioCoder: A Benchmark for Bioinformatics Code Generation with Contextual Pragmatic Knowledge
Figure 2 for BioCoder: A Benchmark for Bioinformatics Code Generation with Contextual Pragmatic Knowledge
Figure 3 for BioCoder: A Benchmark for Bioinformatics Code Generation with Contextual Pragmatic Knowledge
Figure 4 for BioCoder: A Benchmark for Bioinformatics Code Generation with Contextual Pragmatic Knowledge
Viaarxiv icon