Picture for Bing Yin

Bing Yin

Controllable and Verifiable Tool-Use Data Synthesis for Agentic Reinforcement Learning

Add code
Apr 10, 2026
Viaarxiv icon

Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs

Add code
Apr 10, 2026
Viaarxiv icon

MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation

Add code
Mar 26, 2026
Viaarxiv icon

Training LLMs for Multi-Step Tool Orchestration with Constrained Data Synthesis and Graduated Rewards

Add code
Mar 25, 2026
Viaarxiv icon

TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment

Add code
Mar 24, 2026
Viaarxiv icon

Stepwise Penalization for Length-Efficient Chain-of-Thought Reasoning

Add code
Feb 27, 2026
Viaarxiv icon

HeaPA: Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning

Add code
Jan 30, 2026
Viaarxiv icon

Teach Diffusion Language Models to Learn from Their Own Mistakes

Add code
Jan 10, 2026
Viaarxiv icon

Rethinking Popularity Bias in Collaborative Filtering via Analytical Vector Decomposition

Add code
Dec 24, 2025
Figure 1 for Rethinking Popularity Bias in Collaborative Filtering via Analytical Vector Decomposition
Figure 2 for Rethinking Popularity Bias in Collaborative Filtering via Analytical Vector Decomposition
Figure 3 for Rethinking Popularity Bias in Collaborative Filtering via Analytical Vector Decomposition
Figure 4 for Rethinking Popularity Bias in Collaborative Filtering via Analytical Vector Decomposition
Viaarxiv icon

REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation

Add code
Dec 12, 2025
Viaarxiv icon