Jimin Huang

Steve

Concordia: Self-Improving Synthetic Tables for Federated LLMs

May 11, 2026
Via arXiv

Moira: Language-driven Hierarchical Reinforcement Learning for Pair Trading

May 03, 2026
Via arXiv

SAHM: A Benchmark for Arabic Financial and Shari'ah-Compliant Reasoning

Apr 21, 2026
Via arXiv

FinTrace: Holistic Trajectory-Level Evaluation of LLM Tool Calling for Long-Horizon Financial Tasks

Apr 11, 2026
Via arXiv

FinReporting: An Agentic Workflow for Localized Reporting of Cross-Jurisdiction Financial Disclosures

Apr 07, 2026
Via arXiv

Can LLM Agents Be CFOs? A Benchmark for Resource Allocation in Dynamic Enterprise Environments

Mar 24, 2026
Via arXiv

Conv-FinRe: A Conversational and Longitudinal Benchmark for Utility-Grounded Financial Recommendation

Feb 19, 2026
Via arXiv

The CLEF-2026 FinMMEval Lab: Multilingual and Multimodal Evaluation of Financial AI Systems

Feb 11, 2026
Via arXiv

Ebisu: Benchmarking Large Language Models in Japanese Finance

Feb 01, 2026
Via arXiv

All That Glisters Is Not Gold: A Benchmark for Reference-Free Counterfactual Financial Misinformation Detection

Jan 08, 2026
Via arXiv