Zijun Yao

Are Reasoning Models More Prone to Hallucination?

May 29, 2025
Via arXiv

How does Transformer Learn Implicit Reasoning?

May 29, 2025
Via arXiv

Steering LVLMs via Sparse Autoencoder for Hallucination Mitigation

May 22, 2025
Via arXiv

Continuous Optimization for Feature Selection with Permutation-Invariant Embedding and Policy-Guided Search

May 16, 2025
Via arXiv

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

Apr 26, 2025
Via arXiv

SafeMLRM: Demystifying Safety in Multi-modal Large Reasoning Models

Apr 09, 2025
Via arXiv

Sparse Auto-Encoder Interprets Linguistic Features in Large Language Models

Feb 27, 2025
Via arXiv

Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Feb 26, 2025
Via arXiv

Iterative Feature Space Optimization through Incremental Adaptive Evaluation

Jan 24, 2025
Via arXiv

Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling

Jan 20, 2025
Via arXiv