Qiang Fu

Tencent Inc

Not All Tasks Are Equally Difficult: Multi-Task Reinforcement Learning with Dynamic Depth Routing

Dec 22, 2023
Via arXiv

Professional Network Matters: Connections Empower Person-Job Fit

Dec 19, 2023
Via arXiv

Diversity from Human Feedback

Oct 10, 2023
Via arXiv

Text-to-Image Generation for Abstract Concepts

Sep 27, 2023
Via arXiv

PSGformer: Enhancing 3D Point Cloud Instance Segmentation via Precise Semantic Guidance

Jul 15, 2023
Via arXiv

RLTF: Reinforcement Learning from Unit Test Feedback

Jul 10, 2023
Via arXiv

Policy Space Diversity for Non-Transitive Games

Jun 29, 2023
Via arXiv

Maximum Entropy Heterogeneous-Agent Mirror Learning

Jun 19, 2023
Via arXiv

On Manipulating Signals of User-Item Graph: A Jacobi Polynomial-based Graph Collaborative Filtering

Jun 06, 2023
Via arXiv

Future-conditioned Unsupervised Pretraining for Decision Transformer

May 26, 2023
Via arXiv