Picture for Tong Che

Tong Che

Your Reward Function for RL is Your Best PRM for Search: Unifying RL and Search-Based TTS

Add code
Aug 19, 2025
Viaarxiv icon

Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning

Add code
Apr 14, 2025
Viaarxiv icon

LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation

Add code
Feb 04, 2025
Viaarxiv icon

Learning Multiple Initial Solutions to Optimization Problems

Add code
Nov 04, 2024
Figure 1 for Learning Multiple Initial Solutions to Optimization Problems
Figure 2 for Learning Multiple Initial Solutions to Optimization Problems
Figure 3 for Learning Multiple Initial Solutions to Optimization Problems
Figure 4 for Learning Multiple Initial Solutions to Optimization Problems
Viaarxiv icon

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Add code
Oct 03, 2024
Figure 1 for LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
Figure 2 for LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
Figure 3 for LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
Figure 4 for LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
Viaarxiv icon

Parallelized Spatiotemporal Binding

Add code
Feb 26, 2024
Viaarxiv icon

Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate

Add code
Feb 05, 2024
Figure 1 for Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
Figure 2 for Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
Figure 3 for Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
Figure 4 for Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
Viaarxiv icon

EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision

Add code
Nov 03, 2023
Viaarxiv icon

Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models

Add code
May 18, 2023
Figure 1 for Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models
Figure 2 for Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models
Figure 3 for Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models
Figure 4 for Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models
Viaarxiv icon

SPE: Symmetrical Prompt Enhancement for Fact Probing

Add code
Nov 14, 2022
Figure 1 for SPE: Symmetrical Prompt Enhancement for Fact Probing
Figure 2 for SPE: Symmetrical Prompt Enhancement for Fact Probing
Figure 3 for SPE: Symmetrical Prompt Enhancement for Fact Probing
Figure 4 for SPE: Symmetrical Prompt Enhancement for Fact Probing
Viaarxiv icon