Picture for Aohan Zeng

Aohan Zeng

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Add code
Jun 18, 2024
Figure 1 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Figure 2 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Figure 3 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Figure 4 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Viaarxiv icon

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback

Add code
Apr 03, 2024
Viaarxiv icon

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Add code
Apr 03, 2024
Viaarxiv icon

Understanding Emergent Abilities of Language Models from the Loss Perspective

Add code
Mar 30, 2024
Figure 1 for Understanding Emergent Abilities of Language Models from the Loss Perspective
Figure 2 for Understanding Emergent Abilities of Language Models from the Loss Perspective
Figure 3 for Understanding Emergent Abilities of Language Models from the Loss Perspective
Figure 4 for Understanding Emergent Abilities of Language Models from the Loss Perspective
Viaarxiv icon

APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding

Add code
Jan 12, 2024
Viaarxiv icon

xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein

Add code
Jan 11, 2024
Viaarxiv icon

CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation

Add code
Nov 30, 2023
Figure 1 for CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation
Figure 2 for CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation
Figure 3 for CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation
Figure 4 for CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation
Viaarxiv icon

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Add code
Oct 22, 2023
Figure 1 for AgentTuning: Enabling Generalized Agent Abilities for LLMs
Figure 2 for AgentTuning: Enabling Generalized Agent Abilities for LLMs
Figure 3 for AgentTuning: Enabling Generalized Agent Abilities for LLMs
Figure 4 for AgentTuning: Enabling Generalized Agent Abilities for LLMs
Viaarxiv icon

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Add code
Aug 28, 2023
Figure 1 for LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Figure 2 for LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Figure 3 for LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Figure 4 for LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Viaarxiv icon

AgentBench: Evaluating LLMs as Agents

Add code
Aug 07, 2023
Figure 1 for AgentBench: Evaluating LLMs as Agents
Figure 2 for AgentBench: Evaluating LLMs as Agents
Figure 3 for AgentBench: Evaluating LLMs as Agents
Figure 4 for AgentBench: Evaluating LLMs as Agents
Viaarxiv icon