Jinjie Gu

AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving

Aug 13, 2025

DIVER: A Multi-Stage Approach for Reasoning-intensive Information Retrieval

Aug 12, 2025

Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment

Aug 11, 2025

FunReason: Enhancing Large Language Models' Function Calling via Self-Refinement Multiscale Loss and Automated Data Refinement

May 26, 2025

Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking

May 20, 2025

A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future

Apr 12, 2025

Boosting LLM-based Relevance Modeling with Distribution-Aware Robust Learning

Dec 17, 2024

CSR: Achieving 1 Bit Key-Value Cache via Sparse Representation

Dec 16, 2024

CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search

Dec 03, 2024

Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration

Oct 30, 2024