Picture for Austin Xu

Austin Xu

MAS-ZERO: Designing Multi-Agent Systems with Zero Supervision

Add code
May 26, 2025
Viaarxiv icon

Meta-Design Matters: A Self-Design Multi-Agent System

Add code
May 21, 2025
Viaarxiv icon

J4R: Learning to Judge with Equivalent Initial State Group Relative Preference Optimization

Add code
May 19, 2025
Viaarxiv icon

Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators

Add code
Apr 21, 2025
Viaarxiv icon

A Survey of Frontiers in LLM Reasoning: Inference Scaling, Learning to Reason, and Agentic Systems

Add code
Apr 12, 2025
Viaarxiv icon

Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings

Add code
Mar 19, 2025
Viaarxiv icon

Direct Judgement Preference Optimization

Add code
Sep 23, 2024
Figure 1 for Direct Judgement Preference Optimization
Figure 2 for Direct Judgement Preference Optimization
Figure 3 for Direct Judgement Preference Optimization
Figure 4 for Direct Judgement Preference Optimization
Viaarxiv icon

SFR-RAG: Towards Contextually Faithful LLMs

Add code
Sep 16, 2024
Viaarxiv icon

Perceptual adjustment queries and an inverted measurement paradigm for low-rank metric learning

Add code
Sep 08, 2023
Figure 1 for Perceptual adjustment queries and an inverted measurement paradigm for low-rank metric learning
Figure 2 for Perceptual adjustment queries and an inverted measurement paradigm for low-rank metric learning
Figure 3 for Perceptual adjustment queries and an inverted measurement paradigm for low-rank metric learning
Figure 4 for Perceptual adjustment queries and an inverted measurement paradigm for low-rank metric learning
Viaarxiv icon

HandsOff: Labeled Dataset Generation With No Additional Human Annotations

Add code
Dec 24, 2022
Viaarxiv icon