Picture for Songyang Gao

Songyang Gao

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Add code
Jun 06, 2024
Figure 1 for AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
Figure 2 for AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
Figure 3 for AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
Figure 4 for AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
Viaarxiv icon

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

Add code
Apr 09, 2024
Figure 1 for Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Figure 2 for Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Figure 3 for Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Figure 4 for Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Viaarxiv icon

The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis

Add code
Apr 01, 2024
Viaarxiv icon

EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models

Add code
Mar 18, 2024
Figure 1 for EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models
Figure 2 for EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models
Figure 3 for EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models
Figure 4 for EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models
Viaarxiv icon

ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages

Add code
Feb 16, 2024
Viaarxiv icon

Navigating the OverKill in Large Language Models

Add code
Jan 31, 2024
Viaarxiv icon

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback

Add code
Jan 21, 2024
Viaarxiv icon

RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning

Add code
Jan 19, 2024
Figure 1 for RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning
Figure 2 for RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning
Figure 3 for RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning
Figure 4 for RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning
Viaarxiv icon

ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios

Add code
Jan 14, 2024
Viaarxiv icon

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Add code
Jan 12, 2024
Figure 1 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Figure 2 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Figure 3 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Figure 4 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Viaarxiv icon