Picture for Weinan Zhang

Weinan Zhang

Hammer: Robust Function-Calling for On-Device Language Models via Function Masking

Add code
Oct 06, 2024
Figure 1 for Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
Figure 2 for Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
Figure 3 for Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
Figure 4 for Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
Viaarxiv icon

GenSim2: Scaling Robot Data Generation with Multi-modal and Reasoning LLMs

Add code
Oct 04, 2024
Figure 1 for GenSim2: Scaling Robot Data Generation with Multi-modal and Reasoning LLMs
Figure 2 for GenSim2: Scaling Robot Data Generation with Multi-modal and Reasoning LLMs
Figure 3 for GenSim2: Scaling Robot Data Generation with Multi-modal and Reasoning LLMs
Figure 4 for GenSim2: Scaling Robot Data Generation with Multi-modal and Reasoning LLMs
Viaarxiv icon

Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games

Add code
Oct 02, 2024
Figure 1 for Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games
Figure 2 for Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games
Figure 3 for Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games
Figure 4 for Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games
Viaarxiv icon

LoopSR: Looping Sim-and-Real for Lifelong Policy Adaptation of Legged Robots

Add code
Sep 26, 2024
Viaarxiv icon

World Model-based Perception for Visual Legged Locomotion

Add code
Sep 25, 2024
Figure 1 for World Model-based Perception for Visual Legged Locomotion
Figure 2 for World Model-based Perception for Visual Legged Locomotion
Figure 3 for World Model-based Perception for Visual Legged Locomotion
Figure 4 for World Model-based Perception for Visual Legged Locomotion
Viaarxiv icon

RethinkMCTS: Refining Erroneous Thoughts in Monte Carlo Tree Search for Code Generation

Add code
Sep 15, 2024
Figure 1 for RethinkMCTS: Refining Erroneous Thoughts in Monte Carlo Tree Search for Code Generation
Figure 2 for RethinkMCTS: Refining Erroneous Thoughts in Monte Carlo Tree Search for Code Generation
Figure 3 for RethinkMCTS: Refining Erroneous Thoughts in Monte Carlo Tree Search for Code Generation
Figure 4 for RethinkMCTS: Refining Erroneous Thoughts in Monte Carlo Tree Search for Code Generation
Viaarxiv icon

Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation

Add code
Sep 14, 2024
Figure 1 for Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation
Figure 2 for Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation
Figure 3 for Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation
Figure 4 for Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation
Viaarxiv icon

Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task

Add code
Sep 13, 2024
Viaarxiv icon

A Survey on Diffusion Models for Recommender Systems

Add code
Sep 08, 2024
Figure 1 for A Survey on Diffusion Models for Recommender Systems
Figure 2 for A Survey on Diffusion Models for Recommender Systems
Figure 3 for A Survey on Diffusion Models for Recommender Systems
Figure 4 for A Survey on Diffusion Models for Recommender Systems
Viaarxiv icon

Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models

Add code
Aug 20, 2024
Figure 1 for Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models
Figure 2 for Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models
Figure 3 for Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models
Figure 4 for Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models
Viaarxiv icon