Picture for Yinya Huang

Yinya Huang

TreeRPO: Tree Relative Policy Optimization

Add code
Jun 05, 2025
Viaarxiv icon

SeePhys: Does Seeing Help Thinking? -- Benchmarking Vision-Based Physics Reasoning

Add code
May 25, 2025
Viaarxiv icon

LEXam: Benchmarking Legal Reasoning on 340 Law Exams

Add code
May 19, 2025
Viaarxiv icon

FormalAlign: Automated Alignment Evaluation for Autoformalization

Add code
Oct 14, 2024
Viaarxiv icon

Benchmarking LLMs for Optimization Modeling and Enhancing Reasoning via Reverse Socratic Synthesis

Add code
Jul 13, 2024
Figure 1 for Benchmarking LLMs for Optimization Modeling and Enhancing Reasoning via Reverse Socratic Synthesis
Figure 2 for Benchmarking LLMs for Optimization Modeling and Enhancing Reasoning via Reverse Socratic Synthesis
Figure 3 for Benchmarking LLMs for Optimization Modeling and Enhancing Reasoning via Reverse Socratic Synthesis
Figure 4 for Benchmarking LLMs for Optimization Modeling and Enhancing Reasoning via Reverse Socratic Synthesis
Viaarxiv icon

FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving

Add code
Jun 20, 2024
Figure 1 for FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving
Figure 2 for FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving
Figure 3 for FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving
Figure 4 for FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving
Viaarxiv icon

Process-Driven Autoformalization in Lean 4

Add code
Jun 04, 2024
Figure 1 for Process-Driven Autoformalization in Lean 4
Figure 2 for Process-Driven Autoformalization in Lean 4
Figure 3 for Process-Driven Autoformalization in Lean 4
Figure 4 for Process-Driven Autoformalization in Lean 4
Viaarxiv icon

AutoCV: Empowering Reasoning with Automated Process Labeling via Confidence Variation

Add code
May 29, 2024
Viaarxiv icon

Proving Theorems Recursively

Add code
May 23, 2024
Figure 1 for Proving Theorems Recursively
Figure 2 for Proving Theorems Recursively
Figure 3 for Proving Theorems Recursively
Figure 4 for Proving Theorems Recursively
Viaarxiv icon

ATG: Benchmarking Automated Theorem Generation for Generative Language Models

Add code
May 05, 2024
Viaarxiv icon