Picture for Jiajun Shen

Jiajun Shen

Context Training with Active Information Seeking

Add code
May 14, 2026
Viaarxiv icon

MemReranker: Reasoning-Aware Reranking for Agent Memory Retrieval

Add code
May 07, 2026
Viaarxiv icon

Transparentize the Internal and External Knowledge Utilization in LLMs with Trustworthy Citation

Add code
Apr 21, 2025
Viaarxiv icon

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Add code
Jan 30, 2025
Viaarxiv icon

Citekit: A Modular Toolkit for Large Language Model Citation Generation

Add code
Aug 06, 2024
Figure 1 for Citekit: A Modular Toolkit for Large Language Model Citation Generation
Figure 2 for Citekit: A Modular Toolkit for Large Language Model Citation Generation
Figure 3 for Citekit: A Modular Toolkit for Large Language Model Citation Generation
Figure 4 for Citekit: A Modular Toolkit for Large Language Model Citation Generation
Viaarxiv icon

DiPaCo: Distributed Path Composition

Add code
Mar 15, 2024
Viaarxiv icon

Exploring Federated Self-Supervised Learning for General Purpose Audio Understanding

Add code
Feb 05, 2024
Viaarxiv icon

Asynchronous Local-SGD Training for Language Modeling

Add code
Jan 17, 2024
Figure 1 for Asynchronous Local-SGD Training for Language Modeling
Figure 2 for Asynchronous Local-SGD Training for Language Modeling
Figure 3 for Asynchronous Local-SGD Training for Language Modeling
Figure 4 for Asynchronous Local-SGD Training for Language Modeling
Viaarxiv icon

A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames

Add code
Dec 12, 2023
Figure 1 for A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
Figure 2 for A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
Figure 3 for A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
Figure 4 for A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
Viaarxiv icon

DiLoCo: Distributed Low-Communication Training of Language Models

Add code
Nov 14, 2023
Figure 1 for DiLoCo: Distributed Low-Communication Training of Language Models
Figure 2 for DiLoCo: Distributed Low-Communication Training of Language Models
Figure 3 for DiLoCo: Distributed Low-Communication Training of Language Models
Figure 4 for DiLoCo: Distributed Low-Communication Training of Language Models
Viaarxiv icon