Picture for Wee Sun Lee

Wee Sun Lee

NUS

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Add code
May 19, 2025
Viaarxiv icon

Reasoning-CV: Fine-tuning Powerful Reasoning LLMs for Knowledge-Assisted Claim Verification

Add code
May 18, 2025
Viaarxiv icon

Approximation and Generalization Abilities of Score-based Neural Network Generative Models for Sub-Gaussian Distributions

Add code
May 16, 2025
Viaarxiv icon

Understanding R1-Zero-Like Training: A Critical Perspective

Add code
Mar 26, 2025
Viaarxiv icon

EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning

Add code
Jan 16, 2025
Figure 1 for EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
Figure 2 for EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
Figure 3 for EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
Figure 4 for EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
Viaarxiv icon

Sample-Efficient Alignment for LLMs

Add code
Nov 03, 2024
Figure 1 for Sample-Efficient Alignment for LLMs
Figure 2 for Sample-Efficient Alignment for LLMs
Figure 3 for Sample-Efficient Alignment for LLMs
Figure 4 for Sample-Efficient Alignment for LLMs
Viaarxiv icon

Hierarchical Neural Constructive Solver for Real-world TSP Scenarios

Add code
Aug 07, 2024
Figure 1 for Hierarchical Neural Constructive Solver for Real-world TSP Scenarios
Figure 2 for Hierarchical Neural Constructive Solver for Real-world TSP Scenarios
Figure 3 for Hierarchical Neural Constructive Solver for Real-world TSP Scenarios
Figure 4 for Hierarchical Neural Constructive Solver for Real-world TSP Scenarios
Viaarxiv icon

Differentiable Cluster Graph Neural Network

Add code
May 25, 2024
Figure 1 for Differentiable Cluster Graph Neural Network
Figure 2 for Differentiable Cluster Graph Neural Network
Figure 3 for Differentiable Cluster Graph Neural Network
Figure 4 for Differentiable Cluster Graph Neural Network
Viaarxiv icon

Lightweight Spatial Modeling for Combinatorial Information Extraction From Documents

Add code
May 08, 2024
Viaarxiv icon

On the Empirical Complexity of Reasoning and Planning in LLMs

Add code
Apr 17, 2024
Viaarxiv icon