Xueqing Peng

Can LLM Agents Be CFOs? A Benchmark for Resource Allocation in Dynamic Enterprise Environments

Mar 24, 2026

Conv-FinRe: A Conversational and Longitudinal Benchmark for Utility-Grounded Financial Recommendation

Feb 19, 2026

The CLEF-2026 FinMMEval Lab: Multilingual and Multimodal Evaluation of Financial AI Systems

Feb 11, 2026

Ebisu: Benchmarking Large Language Models in Japanese Finance

Feb 01, 2026

MedViz: An Agent-based, Visual-guided Research Assistant for Navigating Biomedical Literature

Jan 28, 2026

FinCriticalED: A Visual Benchmark for Financial Fact-Level OCR Evaluation

Nov 19, 2025

FinTagging: An LLM-ready Benchmark for Extracting and Structuring Financial Information

May 27, 2025

Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance

Feb 26, 2025

FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading

Feb 19, 2025

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Feb 12, 2025