Picture for Chenyang Zhao

Chenyang Zhao

SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning

Add code
Jul 16, 2024
Viaarxiv icon

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

Add code
Jul 10, 2024
Viaarxiv icon

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Add code
Apr 09, 2024
Figure 1 for MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Figure 2 for MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Figure 3 for MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Figure 4 for MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Viaarxiv icon

YAYI 2: Multilingual Open-Source Large Language Models

Add code
Dec 22, 2023
Viaarxiv icon

Seam-guided local alignment and stitching for large parallax images

Add code
Nov 30, 2023
Viaarxiv icon

Large Language Model is a Good Policy Teacher for Training Reinforcement Learning Agents

Add code
Nov 29, 2023
Figure 1 for Large Language Model is a Good Policy Teacher for Training Reinforcement Learning Agents
Figure 2 for Large Language Model is a Good Policy Teacher for Training Reinforcement Learning Agents
Figure 3 for Large Language Model is a Good Policy Teacher for Training Reinforcement Learning Agents
Figure 4 for Large Language Model is a Good Policy Teacher for Training Reinforcement Learning Agents
Viaarxiv icon

TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise

Add code
Oct 31, 2023
Figure 1 for TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Figure 2 for TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Figure 3 for TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Figure 4 for TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Viaarxiv icon

Prompt2Model: Generating Deployable Models from Natural Language Instructions

Add code
Aug 23, 2023
Figure 1 for Prompt2Model: Generating Deployable Models from Natural Language Instructions
Figure 2 for Prompt2Model: Generating Deployable Models from Natural Language Instructions
Figure 3 for Prompt2Model: Generating Deployable Models from Natural Language Instructions
Figure 4 for Prompt2Model: Generating Deployable Models from Natural Language Instructions
Viaarxiv icon

Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach

Add code
Jun 11, 2023
Figure 1 for Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach
Figure 2 for Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach
Figure 3 for Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach
Figure 4 for Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach
Viaarxiv icon

Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization

Add code
May 06, 2023
Figure 1 for Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization
Figure 2 for Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization
Figure 3 for Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization
Figure 4 for Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization
Viaarxiv icon