Picture for Mengdi Wang

Mengdi Wang

Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models

Add code
Feb 27, 2025
Viaarxiv icon

DISC: Dynamic Decomposition Improves LLM Inference Scaling

Add code
Feb 23, 2025
Viaarxiv icon

Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening

Add code
Feb 17, 2025
Viaarxiv icon

Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models

Add code
Feb 17, 2025
Viaarxiv icon

A First-order Generative Bilevel Optimization Framework for Diffusion Models

Add code
Feb 12, 2025
Viaarxiv icon

MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations

Add code
Feb 10, 2025
Viaarxiv icon

ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates

Add code
Feb 10, 2025
Viaarxiv icon

ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization

Add code
Feb 06, 2025
Viaarxiv icon

RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework

Add code
Jan 05, 2025
Figure 1 for RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework
Figure 2 for RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework
Figure 3 for RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework
Figure 4 for RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework
Viaarxiv icon

On the Statistical Complexity for Offline and Low-Adaptive Reinforcement Learning with Structures

Add code
Jan 03, 2025
Viaarxiv icon