Zilong Zheng

TongSearch-QR: Reinforced Query Reasoning for Retrieval

Jun 16, 2025

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

Jun 10, 2025

When Large Multimodal Models Confront Evolving Knowledge: Challenges and Pathways

May 30, 2025

Discrete Markov Bridge

May 26, 2025

EuroCon: Benchmarking Parliament Deliberation for Political Consensus Finding

May 26, 2025

ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection

May 22, 2025

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

May 19, 2025

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

May 07, 2025

Probing and Inducing Combinational Creativity in Vision-Language Models

Apr 17, 2025

OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts

Mar 29, 2025