Jian Yuan

LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models

May 21, 2025
Via arXiv

Multiple Weaks Win Single Strong: Large Language Models Ensemble Weak Reinforcement Learning Agents into a Supreme One

May 21, 2025
Via arXiv

RL of Thoughts: Navigating LLM Reasoning with Inference-time Reinforcement Learning

May 20, 2025
Via arXiv

DeepSTA: A Spatial-Temporal Attention Network for Logistics Delivery Timely Rate Prediction in Anomaly Conditions

May 01, 2025
Via arXiv

Learning to Estimate Package Delivery Time in Mixed Imbalanced Delivery and Pickup Logistics Services

May 01, 2025
Via arXiv

The Art of Tool Interface Design

Mar 26, 2025
Via arXiv

WeightedKV: Attention Scores Weighted Key-Value Cache Merging for Large Language Models

Mar 03, 2025
Via arXiv

TreeKV: Smooth Key-Value Cache Compression with Tree Structures

Jan 09, 2025
Via arXiv

Noise Matters: Diffusion Model-based Urban Mobility Generation with Collaborative Noise Priors

Dec 06, 2024
Via arXiv

Global Estimation of Building-Integrated Facade and Rooftop Photovoltaic Potential by Integrating 3D Building Footprint and Spatio-Temporal Datasets

Dec 02, 2024
Via arXiv