Picture for Fuzheng Zhang

Fuzheng Zhang

Kuaishou Natural Language Processing Center and Audio Center

TSO: Self-Training with Scaled Preference Optimization

Add code
Aug 31, 2024
Viaarxiv icon

Towards Comprehensive Preference Data Collection for Reward Modeling

Add code
Jun 24, 2024
Figure 1 for Towards Comprehensive Preference Data Collection for Reward Modeling
Figure 2 for Towards Comprehensive Preference Data Collection for Reward Modeling
Figure 3 for Towards Comprehensive Preference Data Collection for Reward Modeling
Figure 4 for Towards Comprehensive Preference Data Collection for Reward Modeling
Viaarxiv icon

Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector

Add code
Jun 17, 2024
Viaarxiv icon

Research on Foundation Model for Spatial Data Intelligence: China's 2024 White Paper on Strategic Development of Spatial Data Intelligence

Add code
May 30, 2024
Viaarxiv icon

Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs

Add code
May 24, 2024
Viaarxiv icon

Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues

Add code
Apr 17, 2024
Figure 1 for Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues
Figure 2 for Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues
Figure 3 for Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues
Figure 4 for Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues
Viaarxiv icon

Chain-of-Specificity: An Iteratively Refining Method for Eliciting Knowledge from Large Language Models

Add code
Feb 20, 2024
Figure 1 for Chain-of-Specificity: An Iteratively Refining Method for Eliciting Knowledge from Large Language Models
Figure 2 for Chain-of-Specificity: An Iteratively Refining Method for Eliciting Knowledge from Large Language Models
Figure 3 for Chain-of-Specificity: An Iteratively Refining Method for Eliciting Knowledge from Large Language Models
Figure 4 for Chain-of-Specificity: An Iteratively Refining Method for Eliciting Knowledge from Large Language Models
Viaarxiv icon

Enhancing Role-playing Systems through Aggressive Queries: Evaluation and Improvement

Add code
Feb 16, 2024
Viaarxiv icon

Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint

Add code
Jan 11, 2024
Viaarxiv icon

Ask One More Time: Self-Agreement Improves Reasoning of Language Models in All Scenarios

Add code
Nov 14, 2023
Figure 1 for Ask One More Time: Self-Agreement Improves Reasoning of Language Models in  All Scenarios
Figure 2 for Ask One More Time: Self-Agreement Improves Reasoning of Language Models in  All Scenarios
Figure 3 for Ask One More Time: Self-Agreement Improves Reasoning of Language Models in  All Scenarios
Figure 4 for Ask One More Time: Self-Agreement Improves Reasoning of Language Models in  All Scenarios
Viaarxiv icon