Xinyu Lu

Equivariant Spherical Transformer for Efficient Molecular Modeling

May 29, 2025
Via arXiv

Scalable Oversight for Superhuman AI via Recursive Self-Critiquing

Feb 07, 2025
Via arXiv

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Nov 18, 2024
Via arXiv

Transferable Post-training via Inverse Value Learning

Oct 28, 2024
Via arXiv

Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?

Oct 08, 2024
Via arXiv

On-Policy Fine-grained Knowledge Feedback for Hallucination Mitigation

Jun 18, 2024
Via arXiv

Towards Scalable Automated Alignment of LLMs: A Survey

Jun 03, 2024
Via arXiv

SoFA: Shielded On-the-fly Alignment via Priority Rule Following

Feb 27, 2024
Via arXiv

Principled and Efficient Transfer Learning of Deep Models via Neural Collapse

Jan 04, 2023
Via arXiv

GraphCoCo: Graph Complementary Contrastive Learning

Mar 24, 2022
Via arXiv