Bing Yin

HeaPA: Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning

Jan 30, 2026
Via arXiv

Teach Diffusion Language Models to Learn from Their Own Mistakes

Jan 10, 2026
Via arXiv

Rethinking Popularity Bias in Collaborative Filtering via Analytical Vector Decomposition

Dec 24, 2025
Via arXiv

REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation

Dec 12, 2025
Via arXiv

TransLLM: A Unified Multi-Task Foundation Framework for Urban Transportation via Learnable Prompting

Aug 20, 2025
Via arXiv

DocR1: Evidence Page-Guided GRPO for Multi-Page Document Understanding

Aug 10, 2025
Via arXiv

SessionIntentBench: A Multi-task Inter-session Intention-shift Modeling Benchmark for E-commerce Customer Behavior Understanding

Jul 27, 2025
Via arXiv

UniConv: Unifying Retrieval and Response Generation for Large Language Models in Conversations

Jul 09, 2025
Via arXiv

Aligning Large Language Models with Implicit Preferences from User-Generated Content

Jun 04, 2025
Via arXiv

WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

May 22, 2025
Via arXiv