Picture for Weinan Zhang

Weinan Zhang

Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning

Add code
Feb 22, 2024
Viaarxiv icon

Entropy-Regularized Token-Level Policy Optimization for Large Language Models

Add code
Feb 09, 2024
Figure 1 for Entropy-Regularized Token-Level Policy Optimization for Large Language Models
Figure 2 for Entropy-Regularized Token-Level Policy Optimization for Large Language Models
Figure 3 for Entropy-Regularized Token-Level Policy Optimization for Large Language Models
Figure 4 for Entropy-Regularized Token-Level Policy Optimization for Large Language Models
Viaarxiv icon

CityFlowER: An Efficient and Realistic Traffic Simulator with Embedded Machine Learning Models

Add code
Feb 09, 2024
Figure 1 for CityFlowER: An Efficient and Realistic Traffic Simulator with Embedded Machine Learning Models
Figure 2 for CityFlowER: An Efficient and Realistic Traffic Simulator with Embedded Machine Learning Models
Figure 3 for CityFlowER: An Efficient and Realistic Traffic Simulator with Embedded Machine Learning Models
Figure 4 for CityFlowER: An Efficient and Realistic Traffic Simulator with Embedded Machine Learning Models
Viaarxiv icon

Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning

Add code
Feb 06, 2024
Figure 1 for Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning
Figure 2 for Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning
Figure 3 for Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning
Figure 4 for Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning
Viaarxiv icon

DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching

Add code
Feb 04, 2024
Figure 1 for DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching
Figure 2 for DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching
Figure 3 for DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching
Figure 4 for DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching
Viaarxiv icon

ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update

Add code
Feb 01, 2024
Viaarxiv icon

D2K: Turning Historical Data into Retrievable Knowledge for Recommender Systems

Add code
Jan 23, 2024
Figure 1 for D2K: Turning Historical Data into Retrievable Knowledge for Recommender Systems
Figure 2 for D2K: Turning Historical Data into Retrievable Knowledge for Recommender Systems
Figure 3 for D2K: Turning Historical Data into Retrievable Knowledge for Recommender Systems
Figure 4 for D2K: Turning Historical Data into Retrievable Knowledge for Recommender Systems
Viaarxiv icon

InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization

Add code
Jan 23, 2024
Figure 1 for InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization
Figure 2 for InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization
Figure 3 for InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization
Figure 4 for InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization
Viaarxiv icon

GeoGalactica: A Scientific Large Language Model in Geoscience

Add code
Dec 31, 2023
Figure 1 for GeoGalactica: A Scientific Large Language Model in Geoscience
Figure 2 for GeoGalactica: A Scientific Large Language Model in Geoscience
Figure 3 for GeoGalactica: A Scientific Large Language Model in Geoscience
Figure 4 for GeoGalactica: A Scientific Large Language Model in Geoscience
Viaarxiv icon

GFS: Graph-based Feature Synthesis for Prediction over Relational Databases

Add code
Dec 04, 2023
Figure 1 for GFS: Graph-based Feature Synthesis for Prediction over Relational Databases
Figure 2 for GFS: Graph-based Feature Synthesis for Prediction over Relational Databases
Figure 3 for GFS: Graph-based Feature Synthesis for Prediction over Relational Databases
Figure 4 for GFS: Graph-based Feature Synthesis for Prediction over Relational Databases
Viaarxiv icon