Jun Zhang

National Innovation Institute of Defense Technology, Chinese Academy of Military Science

SIT-Graph: State Integrated Tool Graph for Multi-Turn Agents

Dec 08, 2025
Via arXiv

MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping

Nov 19, 2025
Via arXiv

GPR: Towards a Generative Pre-trained One-Model Paradigm for Large-Scale Advertising Recommendation

Nov 13, 2025
Via arXiv

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Oct 16, 2025
Via arXiv

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

Sep 16, 2025
Via arXiv

${C}^{3}$-GS: Learning Context-aware, Cross-dimension, Cross-scale Feature for Generalizable Gaussian Splatting

Aug 28, 2025
Via arXiv

Attention2Probability: Attention-Driven Terminology Probability Estimation for Robust Speech-to-Text System

Aug 26, 2025
Via arXiv

DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing

Aug 20, 2025
Via arXiv

Yan: Foundational Interactive Video Generation

Aug 13, 2025
Via arXiv

Towards Comprehensible Recommendation with Large Language Model Fine-tuning

Aug 11, 2025
Via arXiv