Picture for Zihao Wang

Zihao Wang

May

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Add code
Oct 21, 2025
Viaarxiv icon

Online Rubrics Elicitation from Pairwise Comparisons

Add code
Oct 08, 2025
Viaarxiv icon

FPI-Det: a face--phone Interaction Dataset for phone-use detection and understanding

Add code
Sep 11, 2025
Viaarxiv icon

UQ: Assessing Language Models on Unsolved Questions

Add code
Aug 25, 2025
Viaarxiv icon

Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation

Add code
Aug 13, 2025
Viaarxiv icon

Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs

Add code
Jun 18, 2025
Viaarxiv icon

Hey, That's My Data! Label-Only Dataset Inference in Large Language Models

Add code
Jun 06, 2025
Viaarxiv icon

Losing is for Cherishing: Data Valuation Based on Machine Unlearning and Shapley Value

Add code
May 22, 2025
Viaarxiv icon

RePPL: Recalibrating Perplexity by Uncertainty in Semantic Propagation and Language Generation for Explainable QA Hallucination Detection

Add code
May 21, 2025
Viaarxiv icon

From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery

Add code
May 19, 2025
Viaarxiv icon