Picture for Mengdi Wang

Mengdi Wang

Detecting Post-generation Edits to Watermarked LLM Outputs via Combinatorial Watermarking

Add code
Oct 02, 2025
Viaarxiv icon

Monte Carlo Tree Diffusion with Multiple Experts for Protein Design

Add code
Sep 19, 2025
Viaarxiv icon

Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

Add code
Sep 08, 2025
Viaarxiv icon

SafeProtein: Red-Teaming Framework and Benchmark for Protein Foundation Models

Add code
Sep 03, 2025
Viaarxiv icon

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Add code
Jul 28, 2025
Viaarxiv icon

ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs

Add code
Jun 23, 2025
Figure 1 for ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs
Figure 2 for ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs
Figure 3 for ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs
Figure 4 for ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs
Viaarxiv icon

AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes

Add code
Jun 17, 2025
Viaarxiv icon

Does Thinking More always Help? Understanding Test-Time Scaling in Reasoning Models

Add code
Jun 04, 2025
Viaarxiv icon

Genome-Bench: A Scientific Reasoning Benchmark from Real-World Expert Discussions

Add code
May 26, 2025
Viaarxiv icon

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Add code
May 26, 2025
Viaarxiv icon