Jingbo Shang

CocoaBench: Evaluating Unified Digital Agents in the Wild

Apr 14, 2026
Via arXiv

Simulating Organized Group Behavior: New Framework, Benchmark, and Analysis

Apr 10, 2026
Via arXiv

EvoLen: Evolution-Guided Tokenization for DNA Language Model

Apr 09, 2026
Via arXiv

How Well Does Generative Recommendation Generalize?

Mar 20, 2026
Via arXiv

SynPlanResearch-R1: Encouraging Tool Exploration for Deep Research with Synthetic Plans

Mar 09, 2026
Via arXiv

Stepwise Penalization for Length-Efficient Chain-of-Thought Reasoning

Feb 27, 2026
Via arXiv

WS-GRPO: Weakly-Supervised Group-Relative Policy Optimization for Rollout-Efficient Reasoning

Feb 19, 2026
Via arXiv

AMPS: Adaptive Modality Preference Steering via Functional Entropy

Feb 13, 2026
Via arXiv

Codified Finite-state Machines for Role-playing

Feb 05, 2026
Via arXiv

Test-time Recursive Thinking: Self-Improvement without External Feedback

Feb 03, 2026
Via arXiv