Picture for Weiming Lu

Weiming Lu

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

Add code
Aug 07, 2025
Viaarxiv icon

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Add code
Aug 07, 2025
Viaarxiv icon

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

Add code
Aug 07, 2025
Viaarxiv icon

TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence

Add code
May 30, 2025
Viaarxiv icon

ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models

Add code
May 27, 2025
Viaarxiv icon

Let LLMs Break Free from Overthinking via Self-Braking Tuning

Add code
May 21, 2025
Viaarxiv icon

Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning

Add code
May 21, 2025
Viaarxiv icon

Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer

Add code
May 13, 2025
Viaarxiv icon

Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training

Add code
Apr 29, 2025
Viaarxiv icon

Bias Beyond English: Evaluating Social Bias and Debiasing Methods in a Low-Resource Setting

Add code
Apr 15, 2025
Viaarxiv icon