Picture for Dongfu Jiang

Dongfu Jiang

ReviewGrounder: Improving Review Substantiveness with Rubric-Guided, Tool-Integrated Agents

Add code
Apr 15, 2026
Viaarxiv icon

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Apr 14, 2026
Viaarxiv icon

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Add code
Apr 09, 2026
Viaarxiv icon

Watch Before You Answer: Learning from Visually Grounded Post-Training

Add code
Apr 06, 2026
Viaarxiv icon

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Add code
Mar 19, 2026
Viaarxiv icon

EvolveCoder: Evolving Test Cases via Adversarial Verification for Code Reinforcement Learning

Add code
Mar 13, 2026
Viaarxiv icon

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Add code
May 26, 2025
Figure 1 for StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
Figure 2 for StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
Figure 3 for StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
Figure 4 for StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
Viaarxiv icon

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Add code
May 22, 2025
Figure 1 for QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design
Figure 2 for QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design
Figure 3 for QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design
Figure 4 for QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design
Viaarxiv icon

General-Reasoner: Advancing LLM Reasoning Across All Domains

Add code
May 21, 2025
Figure 1 for General-Reasoner: Advancing LLM Reasoning Across All Domains
Figure 2 for General-Reasoner: Advancing LLM Reasoning Across All Domains
Figure 3 for General-Reasoner: Advancing LLM Reasoning Across All Domains
Figure 4 for General-Reasoner: Advancing LLM Reasoning Across All Domains
Viaarxiv icon

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Add code
Feb 03, 2025
Viaarxiv icon