Zhiyuan Zeng

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Dec 31, 2025
Via arXiv

Mitigating LLM Hallucination via Behaviorally Calibrated Reinforcement Learning

Dec 22, 2025
Via arXiv

Olmo 3

Dec 15, 2025
Via arXiv

From Scaling to Structured Expressivity: Rethinking Transformers for CTR Prediction

Nov 15, 2025
Via arXiv

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Nov 10, 2025
Via arXiv

FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning

Sep 16, 2025
Via arXiv

Dynamic and Generalizable Process Reward Modeling

Jul 23, 2025
Via arXiv

Precise Information Control in Long-Form Text Generation

Jun 06, 2025
Via arXiv

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Apr 29, 2025
Via arXiv

EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees

Mar 11, 2025
Via arXiv