Teng Xiao

Incentivizing Strong Reasoning from Weak Supervision

May 28, 2025

Inference-time Alignment in Continuous Space

May 26, 2025

Incentivizing Reasoning from Weak Supervision

May 26, 2025

InfoPO: On Mutual Information Maximization for Large Language Model Alignment

May 13, 2025

A Deep Single Image Rectification Approach for Pan-Tilt-Zoom Cameras

Apr 09, 2025

On a Connection Between Imitation Learning and RLHF

Mar 07, 2025

SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters

Feb 04, 2025

Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment

Dec 19, 2024

Fact-Level Confidence Calibration and Self-Correction

Nov 20, 2024

GeomCLIP: Contrastive Geometry-Text Pre-training for Molecules

Nov 16, 2024