Picture for Jingwei Ni

Jingwei Ni

ETH Zürich

GD$^2$PO: Mitigating Multi-Reward Conflicts via Group-Dynamic reward-Decoupled Policy Optimization

Add code
Jun 15, 2026
Viaarxiv icon

ThinkBooster: A Unified Framework for Seamless Test-Time Scaling of LLM Reasoning

Add code
Jun 05, 2026
Viaarxiv icon

Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill

Add code
Jun 02, 2026
Viaarxiv icon

Tackling the Root of Misinformation by Teaching Laypeople about Logical Fallacies via Socratic Questioning and Critical Argumentation

Add code
May 31, 2026
Viaarxiv icon

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Add code
Mar 26, 2026
Viaarxiv icon

When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation

Add code
Feb 18, 2026
Viaarxiv icon

pdfQA: Diverse, Challenging, and Realistic Question Answering over PDFs

Add code
Jan 06, 2026
Viaarxiv icon

Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads

Add code
Nov 11, 2025
Viaarxiv icon

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Add code
Sep 17, 2025
Figure 1 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Figure 2 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Figure 3 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Figure 4 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Viaarxiv icon

Can Large Language Models Capture Human Annotator Disagreements?

Add code
Jun 24, 2025
Viaarxiv icon