Picture for Yi Dong

Yi Dong

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Add code
May 30, 2025
Viaarxiv icon

HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages

Add code
May 16, 2025
Viaarxiv icon

Nemotron-Research-Tool-N1: Tool-Using Language Models with Reinforced Reasoning

Add code
Apr 25, 2025
Viaarxiv icon

Simultaneous Pre-compensation for Bandwidth Limitation and Fiber Dispersion in Cost-Sensitive IM/DD Transmission Systems

Add code
Apr 02, 2025
Viaarxiv icon

TAIJI: Textual Anchoring for Immunizing Jailbreak Images in Vision Language Models

Add code
Mar 13, 2025
Viaarxiv icon

PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts

Add code
Mar 09, 2025
Viaarxiv icon

Dedicated Feedback and Edit Models Empower Inference-Time Scaling for Open-Ended General-Domain Tasks

Add code
Mar 06, 2025
Viaarxiv icon

Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training

Add code
Feb 06, 2025
Viaarxiv icon

Position: Towards a Responsible LLM-empowered Multi-Agent Systems

Add code
Feb 03, 2025
Figure 1 for Position: Towards a Responsible LLM-empowered Multi-Agent Systems
Figure 2 for Position: Towards a Responsible LLM-empowered Multi-Agent Systems
Figure 3 for Position: Towards a Responsible LLM-empowered Multi-Agent Systems
Figure 4 for Position: Towards a Responsible LLM-empowered Multi-Agent Systems
Viaarxiv icon

FALCON: Fine-grained Activation Manipulation by Contrastive Orthogonal Unalignment for Large Language Model

Add code
Feb 03, 2025
Viaarxiv icon