Jinjie Gu

Perplexity-Aware Data Scaling Law: Perplexity Landscapes Predict Performance for Continual Pre-training

Dec 25, 2025
Via arXiv

Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO

Nov 18, 2025
Via arXiv

GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning

Nov 10, 2025
Via arXiv

HAD: HAllucination Detection Language Models Based on a Comprehensive Hallucination Taxonomy

Oct 22, 2025
Via arXiv

MedResearcher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework

Aug 20, 2025
Via arXiv

AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving

Aug 13, 2025
Via arXiv

DIVER: A Multi-Stage Approach for Reasoning-intensive Information Retrieval

Aug 12, 2025
Via arXiv

Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment

Aug 11, 2025
Via arXiv

FunReason: Enhancing Large Language Models' Function Calling via Self-Refinement Multiscale Loss and Automated Data Refinement

May 26, 2025
Via arXiv

Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking

May 20, 2025
Via arXiv