Picture for Ying Zhang

Ying Zhang

Sid

kTULA: A Langevin sampling algorithm with improved KL bounds under super-linear log-gradients

Add code
Jun 05, 2025
Viaarxiv icon

Diversity of Transformer Layers: One Aspect of Parameter Scaling Laws

Add code
May 29, 2025
Viaarxiv icon

PreMoe: Lightening MoEs on Constrained Memory by Expert Pruning and Retrieval

Add code
May 23, 2025
Viaarxiv icon

Understanding Fact Recall in Language Models: Why Two-Stage Training Encourages Memorization but Mixed Training Teaches Knowledge

Add code
May 22, 2025
Viaarxiv icon

Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks

Add code
May 22, 2025
Viaarxiv icon

ProDS: Preference-oriented Data Selection for Instruction Tuning

Add code
May 19, 2025
Viaarxiv icon

Video-GPT via Next Clip Diffusion

Add code
May 18, 2025
Viaarxiv icon

JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation

Add code
May 15, 2025
Viaarxiv icon

FF-PNet: A Pyramid Network Based on Feature and Field for Brain Image Registration

Add code
May 08, 2025
Viaarxiv icon

CSASN: A Multitask Attention-Based Framework for Heterogeneous Thyroid Carcinoma Classification in Ultrasound Images

Add code
May 04, 2025
Viaarxiv icon