Jiacheng Liu

FinanceReasoning: Benchmarking Financial Numerical Reasoning More Credible, Comprehensive and Challenging

Jun 06, 2025
Via arXiv

Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents

May 30, 2025
Via arXiv

DenoiseRotator: Enhance Pruning Robustness for LLMs via Importance Concentration

May 29, 2025
Via arXiv

Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only

May 22, 2025
Via arXiv

Large Language Models Are More Persuasive Than Incentivized Human Persuaders

May 14, 2025
Via arXiv

Time's Up! An Empirical Study of LLM Reasoning Ability Under Output Length Constraint

Apr 22, 2025
Via arXiv

ParaPO: Aligning Language Models to Reduce Verbatim Reproduction of Pre-training Data

Apr 20, 2025
Via arXiv

DataDecide: How to Predict Best Pretraining Data with Small Experiments

Apr 15, 2025
Via arXiv

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Apr 09, 2025
Via arXiv

Exploring Reliable PPG Authentication on Smartwatches in Daily Scenarios

Mar 31, 2025
Via arXiv