Picture for Bei Peng

Bei Peng

Synthetic Data Generation for Training Diversified Commonsense Reasoning Models

Add code
Mar 18, 2026
Viaarxiv icon

How Does the Lagrangian Guide Safe Reinforcement Learning through Diffusion Models?

Add code
Feb 02, 2026
Viaarxiv icon

Heuristic Transformer: Belief Augmented In-Context Reinforcement Learning

Add code
Nov 13, 2025
Figure 1 for Heuristic Transformer: Belief Augmented In-Context Reinforcement Learning
Figure 2 for Heuristic Transformer: Belief Augmented In-Context Reinforcement Learning
Figure 3 for Heuristic Transformer: Belief Augmented In-Context Reinforcement Learning
Figure 4 for Heuristic Transformer: Belief Augmented In-Context Reinforcement Learning
Viaarxiv icon

FlowCritic: Bridging Value Estimation with Flow Matching in Reinforcement Learning

Add code
Oct 26, 2025
Figure 1 for FlowCritic: Bridging Value Estimation with Flow Matching in Reinforcement Learning
Figure 2 for FlowCritic: Bridging Value Estimation with Flow Matching in Reinforcement Learning
Figure 3 for FlowCritic: Bridging Value Estimation with Flow Matching in Reinforcement Learning
Figure 4 for FlowCritic: Bridging Value Estimation with Flow Matching in Reinforcement Learning
Viaarxiv icon

MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures

Add code
Jun 04, 2025
Figure 1 for MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures
Figure 2 for MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures
Figure 3 for MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures
Figure 4 for MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures
Viaarxiv icon

So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection

Add code
May 24, 2025
Figure 1 for So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection
Figure 2 for So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection
Figure 3 for So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection
Figure 4 for So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection
Viaarxiv icon

A Knowledge-Informed Deep Learning Paradigm for Generalizable and Stability-Optimized Car-Following Models

Add code
Apr 19, 2025
Figure 1 for A Knowledge-Informed Deep Learning Paradigm for Generalizable and Stability-Optimized Car-Following Models
Figure 2 for A Knowledge-Informed Deep Learning Paradigm for Generalizable and Stability-Optimized Car-Following Models
Figure 3 for A Knowledge-Informed Deep Learning Paradigm for Generalizable and Stability-Optimized Car-Following Models
Figure 4 for A Knowledge-Informed Deep Learning Paradigm for Generalizable and Stability-Optimized Car-Following Models
Viaarxiv icon

SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model

Add code
Dec 05, 2024
Figure 1 for SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model
Figure 2 for SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model
Figure 3 for SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model
Figure 4 for SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model
Viaarxiv icon

Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning

Add code
Apr 25, 2024
Figure 1 for Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning
Figure 2 for Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning
Figure 3 for Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning
Figure 4 for Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning
Viaarxiv icon

Gradable ChatGPT Translation Evaluation

Add code
Jan 18, 2024
Viaarxiv icon