Zhuosheng Zhang

Department of Computer Science and Engineering, Shanghai Jiao Tong University; Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Shanghai Jiao Tong University; MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University

DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems

Jul 15, 2024

Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities

Jul 10, 2024

TrojanRAG: Retrieval-Augmented Generation Can Be Backdoor Driver in Large Language Models

May 22, 2024

Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning

May 22, 2024

Mitigating Misleading Chain-of-Thought Reasoning with Selective Filtering

Mar 28, 2024

Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method

Feb 29, 2024

Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space

Feb 27, 2024

Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models

Feb 21, 2024

Comprehensive Cognitive LLM Agent for Smartphone GUI Automation

Feb 19, 2024

Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models

Feb 19, 2024