Picture for Nan Jiang

Nan Jiang

Faculty of Information Technology, Beijing University of Technology, Beijing, China, Beijing Key Laboratory of Trusted Computing, Beijing, China, National Engineering Laboratory for Critical Technologies of Information Security Classified Protection, Beijing, China

A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning

Add code
May 25, 2025
Viaarxiv icon

From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision Making

Add code
May 15, 2025
Figure 1 for From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision Making
Figure 2 for From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision Making
Figure 3 for From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision Making
Figure 4 for From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision Making
Viaarxiv icon

Joint Graph Convolution and Sequential Modeling for Scalable Network Traffic Estimation

Add code
May 12, 2025
Viaarxiv icon

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

Add code
May 05, 2025
Figure 1 for Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
Figure 2 for Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
Figure 3 for Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
Figure 4 for Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
Viaarxiv icon

Generative Auto-Bidding with Value-Guided Explorations

Add code
Apr 20, 2025
Viaarxiv icon

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Add code
Apr 15, 2025
Figure 1 for A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
Figure 2 for A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
Figure 3 for A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
Figure 4 for A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
Viaarxiv icon

Context-Aware Adaptive Sampling for Intelligent Data Acquisition Systems Using DQN

Add code
Apr 12, 2025
Figure 1 for Context-Aware Adaptive Sampling for Intelligent Data Acquisition Systems Using DQN
Figure 2 for Context-Aware Adaptive Sampling for Intelligent Data Acquisition Systems Using DQN
Figure 3 for Context-Aware Adaptive Sampling for Intelligent Data Acquisition Systems Using DQN
Figure 4 for Context-Aware Adaptive Sampling for Intelligent Data Acquisition Systems Using DQN
Viaarxiv icon

Improving Harmful Text Detection with Joint Retrieval and External Knowledge

Add code
Apr 03, 2025
Viaarxiv icon

Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment

Add code
Mar 27, 2025
Figure 1 for Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment
Figure 2 for Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment
Figure 3 for Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment
Figure 4 for Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment
Viaarxiv icon

Dynamic Motion Blending for Versatile Motion Editing

Add code
Mar 26, 2025
Viaarxiv icon