Picture for Bing Yin

Bing Yin

Shopping Reasoning Bench: An Expert-Authored Benchmark for Multi-Turn Conversational Shopping Assistants

Add code
Jun 10, 2026
Viaarxiv icon

Customer-Agent: Overcoming Context Limitations in Ultra-Long Shopping Trajectories via Tool-Augmented Agents and RLVR

Add code
Jun 06, 2026
Viaarxiv icon

Translate-R1: Cost-Aware Translation Tool Use via Reinforcement Learning

Add code
Jun 05, 2026
Viaarxiv icon

Unlocking Latent Value: Taxonomy-Guided Recovery of High-Performing Data from Low-Tier Web Corpora

Add code
Jun 05, 2026
Viaarxiv icon

QUBRIC: Co-Designing Queries and Rubrics for RL Beyond Verifiable Rewards

Add code
Jun 02, 2026
Viaarxiv icon

CoMem: Context Management with A Decoupled Long-Context Model

Add code
May 29, 2026
Viaarxiv icon

Learning with Rare Success but Rich Feedback via Reflection-Enhanced Self-Distillation

Add code
May 12, 2026
Viaarxiv icon

Expert Upcycling: Shifting the Compute-Efficient Frontier of Mixture-of-Experts

Add code
Apr 21, 2026
Viaarxiv icon

Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs

Add code
Apr 10, 2026
Viaarxiv icon

Controllable and Verifiable Tool-Use Data Synthesis for Agentic Reinforcement Learning

Add code
Apr 10, 2026
Viaarxiv icon