Haonan Li

Visual Preference Optimization with Rubric Rewards

Apr 14, 2026

Controllable Reasoning Models Are Private Thinkers

Feb 27, 2026

SimuScene: Training and Benchmarking Code Generation to Simulate Physical Scenarios

Feb 11, 2026

Neural Theorem Proving for Verification Conditions: A Real-World Benchmark

Jan 26, 2026

TorchTraceAP: A New Benchmark Dataset for Detecting Performance Anti-Patterns in Computer Vision Models

Dec 16, 2025

LLMBisect: Breaking Barriers in Bug Bisection with A Comparative Analysis Pipeline

Oct 30, 2025

K2-Think: A Parameter-Efficient Reasoning System

Sep 09, 2025

BALSAM: A Platform for Benchmarking Arabic Large Language Models

Jul 30, 2025

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Jun 17, 2025

The Hitchhiker's Guide to Program Analysis, Part II: Deep Thoughts by LLMs

Apr 16, 2025