Xun Zhou

Unlock the Correlation between Supervised Fine-Tuning and Reinforcement Learning in Training Code Large Language Models

Jun 14, 2024
Via arXiv

Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning Enhancement in RLHF and Effective-Merged LLMs

Jun 12, 2024
Via arXiv

Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment

May 28, 2024
Via arXiv

Efficient Text-driven Motion Generation via Latent Consistency Training

May 05, 2024
Via arXiv

NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization

Apr 07, 2024
Via arXiv

Balancing Enhancement, Harmlessness, and General Capabilities: Enhancing Conversational LLMs with Direct RLHF

Mar 04, 2024
Via arXiv

Referee-Meta-Learning for Fast Adaptation of Locational Fairness

Feb 20, 2024
Via arXiv

ICE-GRT: Instruction Context Enhancement by Generative Reinforcement based Transformers

Jan 04, 2024
Via arXiv

SpatialRank: Urban Event Ranking with NDCG Optimization on Spatiotemporal Data

Oct 14, 2023
Via arXiv

GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond

Oct 10, 2023
Via arXiv