Picture for Xinyuan Zhang

Xinyuan Zhang

Unsupervised Denoising of Diffusion-Weighted Images with Bias and Variance Corrected Noise Modeling

Add code
Feb 22, 2026
Viaarxiv icon

GLM-5: from Vibe Coding to Agentic Engineering

Add code
Feb 17, 2026
Viaarxiv icon

MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling

Add code
Feb 12, 2026
Viaarxiv icon

ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search

Add code
Jan 30, 2026
Viaarxiv icon

Unified Multimodal and Multilingual Retrieval via Multi-Task Learning with NLU Integration

Add code
Jan 21, 2026
Viaarxiv icon

Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage

Add code
Oct 02, 2025
Figure 1 for Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage
Figure 2 for Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage
Figure 3 for Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage
Figure 4 for Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage
Viaarxiv icon

SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning

Add code
Oct 02, 2025
Figure 1 for SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning
Figure 2 for SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning
Figure 3 for SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning
Figure 4 for SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning
Viaarxiv icon

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Add code
Aug 08, 2025
Viaarxiv icon

LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs

Add code
May 06, 2025
Figure 1 for LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs
Figure 2 for LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs
Figure 3 for LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs
Figure 4 for LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs
Viaarxiv icon

AttFC: Attention Fully-Connected Layer for Large-Scale Face Recognition with One GPU

Add code
Mar 10, 2025
Viaarxiv icon