Picture for Zijun Yao

Zijun Yao

On Predictability of Reinforcement Learning Dynamics for Large Language Models

Add code
Oct 02, 2025
Viaarxiv icon

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Add code
Oct 02, 2025
Viaarxiv icon

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Add code
Aug 08, 2025
Viaarxiv icon

ECG Latent Feature Extraction with Autoencoders for Downstream Prediction Tasks

Add code
Jul 31, 2025
Viaarxiv icon

We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems

Add code
Jun 16, 2025
Viaarxiv icon

Are Reasoning Models More Prone to Hallucination?

Add code
May 29, 2025
Viaarxiv icon

How does Transformer Learn Implicit Reasoning?

Add code
May 29, 2025
Viaarxiv icon

Steering LVLMs via Sparse Autoencoder for Hallucination Mitigation

Add code
May 22, 2025
Viaarxiv icon

Continuous Optimization for Feature Selection with Permutation-Invariant Embedding and Policy-Guided Search

Add code
May 16, 2025
Viaarxiv icon

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

Add code
Apr 26, 2025
Viaarxiv icon