Picture for Jingqing Ruan

Jingqing Ruan

AMoPO: Adaptive Multi-objective Preference Optimization without Reward Models and Reference Models

Add code
Jun 08, 2025
Viaarxiv icon

When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning

Add code
May 21, 2025
Viaarxiv icon

QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning

Add code
Aug 20, 2024
Viaarxiv icon

GuideLight: "Industrial Solution" Guidance for More Practical Traffic Signal Control Agents

Add code
Jul 15, 2024
Figure 1 for GuideLight: "Industrial Solution" Guidance for More Practical Traffic Signal Control Agents
Figure 2 for GuideLight: "Industrial Solution" Guidance for More Practical Traffic Signal Control Agents
Figure 3 for GuideLight: "Industrial Solution" Guidance for More Practical Traffic Signal Control Agents
Figure 4 for GuideLight: "Industrial Solution" Guidance for More Practical Traffic Signal Control Agents
Viaarxiv icon

CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal Control

Add code
May 27, 2024
Viaarxiv icon

Hummer: Towards Limited Competitive Preference Dataset

Add code
May 21, 2024
Viaarxiv icon

Learning Causal Dynamics Models in Object-Oriented Environments

Add code
May 21, 2024
Figure 1 for Learning Causal Dynamics Models in Object-Oriented Environments
Figure 2 for Learning Causal Dynamics Models in Object-Oriented Environments
Figure 3 for Learning Causal Dynamics Models in Object-Oriented Environments
Figure 4 for Learning Causal Dynamics Models in Object-Oriented Environments
Viaarxiv icon

X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner

Add code
Apr 18, 2024
Figure 1 for X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner
Figure 2 for X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner
Figure 3 for X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner
Figure 4 for X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner
Viaarxiv icon

DuaLight: Enhancing Traffic Signal Control by Leveraging Scenario-Specific and Scenario-Shared Knowledge

Add code
Dec 22, 2023
Viaarxiv icon

Learning Top-k Subtask Planning Tree based on Discriminative Representation Pre-training for Decision Making

Add code
Dec 18, 2023
Viaarxiv icon