Xinrun Wang

The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants

May 26, 2025

Resolving Latency and Inventory Risk in Market Making with Reinforcement Learning

May 18, 2025

Nondeterministic Polynomial-time Problem Challenge: An Ever-Scaling Reasoning Benchmark for LLMs

Apr 15, 2025

If Multi-Agent Debate is the Answer, What is the Question?

Feb 12, 2025

Evaluating World Models with LLM for Decision Making

Nov 13, 2024

CLR-Bench: Evaluating Large Language Models in College-level Reasoning

Oct 23, 2024

Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models

Oct 07, 2024

Resultant: Incremental Effectiveness on Likelihood for Unsupervised Out-of-Distribution Detection

Sep 05, 2024

In-Context Exploiter for Extensive-Form Games

Aug 10, 2024

MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading

Jun 20, 2024