Mengdi Wang

GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators

Dec 23, 2025

From Word to World: Can Large Language Models be Implicit Text-based World Models?

Dec 21, 2025

AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning

Dec 15, 2025

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

Nov 18, 2025

Detecting Post-generation Edits to Watermarked LLM Outputs via Combinatorial Watermarking

Oct 02, 2025

Monte Carlo Tree Diffusion with Multiple Experts for Protein Design

Sep 19, 2025

Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

Sep 08, 2025

SafeProtein: Red-Teaming Framework and Benchmark for Protein Foundation Models

Sep 03, 2025

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Jul 28, 2025

ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs

Jun 23, 2025