Picture for Tong Zhang

Tong Zhang

Nanjing University of Science and Technology, Nanjing, China

AIM: Adaptive Intra-Network Modulation for Balanced Multimodal Learning

Add code
Aug 27, 2025
Viaarxiv icon

StepWiser: Stepwise Generative Judges for Wiser Reasoning

Add code
Aug 27, 2025
Figure 1 for StepWiser: Stepwise Generative Judges for Wiser Reasoning
Figure 2 for StepWiser: Stepwise Generative Judges for Wiser Reasoning
Figure 3 for StepWiser: Stepwise Generative Judges for Wiser Reasoning
Figure 4 for StepWiser: Stepwise Generative Judges for Wiser Reasoning
Viaarxiv icon

P/D-Device: Disaggregated Large Language Model between Cloud and Devices

Add code
Aug 12, 2025
Viaarxiv icon

Q-CLIP: Unleashing the Power of Vision-Language Models for Video Quality Assessment through Unified Cross-Modal Adaptation

Add code
Aug 08, 2025
Viaarxiv icon

Listwise Preference Alignment Optimization for Tail Item Recommendation

Add code
Jul 03, 2025
Figure 1 for Listwise Preference Alignment Optimization for Tail Item Recommendation
Figure 2 for Listwise Preference Alignment Optimization for Tail Item Recommendation
Figure 3 for Listwise Preference Alignment Optimization for Tail Item Recommendation
Figure 4 for Listwise Preference Alignment Optimization for Tail Item Recommendation
Viaarxiv icon

LLM Agent for Hyper-Parameter Optimization

Add code
Jun 18, 2025
Viaarxiv icon

Canonical Latent Representations in Conditional Diffusion Models

Add code
Jun 11, 2025
Figure 1 for Canonical Latent Representations in Conditional Diffusion Models
Figure 2 for Canonical Latent Representations in Conditional Diffusion Models
Figure 3 for Canonical Latent Representations in Conditional Diffusion Models
Figure 4 for Canonical Latent Representations in Conditional Diffusion Models
Viaarxiv icon

Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction

Add code
Jun 09, 2025
Figure 1 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Figure 2 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Figure 3 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Figure 4 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Viaarxiv icon

Towards Better Generalization via Distributional Input Projection Network

Add code
Jun 05, 2025
Viaarxiv icon

MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning

Add code
May 30, 2025
Viaarxiv icon