Picture for Yibo Wang

Yibo Wang

VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning

Add code
Jan 29, 2026
Viaarxiv icon

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Add code
Jan 14, 2026
Viaarxiv icon

Triplets Better Than Pairs: Towards Stable and Effective Self-Play Fine-Tuning for LLMs

Add code
Jan 13, 2026
Viaarxiv icon

Detecting Hallucinations in Graph Retrieval-Augmented Generation via Attention Patterns and Semantic Alignment

Add code
Dec 09, 2025
Viaarxiv icon

Humanoid Whole-Body Badminton via Multi-Stage Reinforcement Learning

Add code
Nov 14, 2025
Viaarxiv icon

QUITE: A Query Rewrite System Beyond Rules with LLM Agents

Add code
Jun 09, 2025
Viaarxiv icon

Response Uncertainty and Probe Modeling: Two Sides of the Same Coin in LLM Interpretability?

Add code
May 24, 2025
Viaarxiv icon

R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO

Add code
May 22, 2025
Viaarxiv icon

R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search

Add code
May 22, 2025
Viaarxiv icon

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Add code
May 15, 2025
Viaarxiv icon