Yibin Wang

Towards Confidential and Efficient LLM Inference with Dual Privacy Protection

Sep 11, 2025
Via arXiv

An U-Net-Based Deep Neural Network for Cloud Shadow and Sun-Glint Correction of Unmanned Aerial System (UAS) Imagery

Sep 10, 2025
Via arXiv

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Aug 28, 2025
Via arXiv

DiCache: Let Diffusion Model Determine Its Own Cache

Aug 24, 2025
Via arXiv

GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization

Jun 08, 2025
Via arXiv

Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay

Jun 05, 2025
Via arXiv

Token-Level Uncertainty Estimation for Large Language Model Reasoning

May 16, 2025
Via arXiv

Efficient Uncertainty Estimation via Distillation of Bayesian Large Language Models

May 16, 2025
Via arXiv

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

May 06, 2025
Via arXiv

EduBot -- Can LLMs Solve Personalized Learning and Programming Assignments?

Apr 23, 2025
Via arXiv