Chang Zhou

Qwen2 Technical Report

Jul 16, 2024
Via arXiv

Qwen2-Audio Technical Report

Jul 15, 2024
Via arXiv

Multi-Scenario Combination Based on Multi-Agent Reinforcement Learning to Optimize the Advertising Recommendation System

Jul 03, 2024
Via arXiv

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

Jun 30, 2024
Via arXiv

The Reason behind Good or Bad: Towards a Better Mathematical Verifier with Natural Language Feedback

Jun 20, 2024
Via arXiv

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Jun 19, 2024
Via arXiv

Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones?

Jun 18, 2024
Via arXiv

Deep Learning Powered Estimate of The Extrinsic Parameters on Unmanned Surface Vehicles

Jun 07, 2024
Via arXiv

Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment

May 28, 2024
Via arXiv

TD3 Based Collision Free Motion Planning for Robot Navigation

May 24, 2024
Via arXiv