Wei Chu

INF Technology

CTkvr: KV Cache Retrieval for Long-Context LLMs via Centroid then Token Indexing

Dec 17, 2025

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward

Sep 09, 2025

OleSpeech-IV: A Large-Scale Multispeaker and Multilingual Conversational Speech Dataset with Diverse Topics

Sep 04, 2025

Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning

May 30, 2025

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Apr 10, 2025

Guess What I am Thinking: A Benchmark for Inner Thought Reasoning of Role-Playing Language Agents

Mar 11, 2025

AURORA: Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification

Feb 17, 2025

SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain

Jan 26, 2025

An Attentive Dual-Encoder Framework Leveraging Multimodal Visual and Semantic Information for Automatic OSAHS Diagnosis

Dec 25, 2024

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Nov 07, 2024