Picture for Hexuan Deng

Hexuan Deng

NoveltyAgent: Autonomous Novelty Reporting Agent with Point-wise Novelty Analysis and Self-Validation

Add code
Mar 21, 2026
Viaarxiv icon

RouterKGQA: Specialized--General Model Routing for Constraint-Aware Knowledge Graph Question Answering

Add code
Mar 20, 2026
Viaarxiv icon

Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models

Add code
Feb 05, 2026
Viaarxiv icon

AQuilt: Weaving Logic and Self-Inspection into Low-Cost, High-Relevance Data Synthesis for Specialist LLMs

Add code
Jul 24, 2025
Viaarxiv icon

REA-RL: Reflection-Aware Online Reinforcement Learning for Efficient Large Reasoning Models

Add code
May 26, 2025
Figure 1 for REA-RL: Reflection-Aware Online Reinforcement Learning for Efficient Large Reasoning Models
Figure 2 for REA-RL: Reflection-Aware Online Reinforcement Learning for Efficient Large Reasoning Models
Figure 3 for REA-RL: Reflection-Aware Online Reinforcement Learning for Efficient Large Reasoning Models
Figure 4 for REA-RL: Reflection-Aware Online Reinforcement Learning for Efficient Large Reasoning Models
Viaarxiv icon

Dynamic Sampling that Adapts: Iterative DPO for Self-Aware Mathematical Reasoning

Add code
May 22, 2025
Viaarxiv icon

DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization

Add code
Nov 21, 2024
Figure 1 for DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
Figure 2 for DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
Figure 3 for DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
Figure 4 for DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
Viaarxiv icon

NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates

Add code
Oct 28, 2024
Figure 1 for NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates
Figure 2 for NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates
Figure 3 for NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates
Figure 4 for NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates
Viaarxiv icon

Holistic Exploration on Universal Decompositional Semantic Parsing: Architecture, Data Augmentation, and LLM Paradigm

Add code
Jul 25, 2023
Viaarxiv icon

Improving Simultaneous Machine Translation with Monolingual Data

Add code
Dec 02, 2022
Figure 1 for Improving Simultaneous Machine Translation with Monolingual Data
Figure 2 for Improving Simultaneous Machine Translation with Monolingual Data
Figure 3 for Improving Simultaneous Machine Translation with Monolingual Data
Figure 4 for Improving Simultaneous Machine Translation with Monolingual Data
Viaarxiv icon