Picture for Qi Zhang

Qi Zhang

NVIDIA

A cross-modal network for facial expression recognition

Add code
May 06, 2026
Viaarxiv icon

Faithfulness-QA: A Counterfactual Entity Substitution Dataset for Training Context-Faithful RAG Models

Add code
Apr 28, 2026
Viaarxiv icon

Kwai Summary Attention Technical Report

Add code
Apr 27, 2026
Viaarxiv icon

Agri-CPJ: A Training-Free Explainable Framework for Agricultural Pest Diagnosis Using Caption-Prompt-Judge and LLM-as-a-Judge

Add code
Apr 26, 2026
Viaarxiv icon

Multi-view Crowd Tracking Transformer with View-Ground Interactions Under Large Real-World Scenes

Add code
Apr 21, 2026
Viaarxiv icon

DUSG-Tomo-Net: A Deep Unfolded Neural Network for Super-Resolving Gridless Spaceborne SAR Tomography via Learned Toeplitz-Structured Covariance Representation

Add code
Apr 21, 2026
Viaarxiv icon

Physically-Induced Atmospheric Adversarial Perturbations: Enhancing Transferability and Robustness in Remote Sensing Image Classification

Add code
Apr 16, 2026
Viaarxiv icon

Beyond Voxel 3D Editing: Learning from 3D Masks and Self-Constructed Data

Add code
Apr 15, 2026
Viaarxiv icon

Enhancing LLM-based Search Agents via Contribution Weighted Group Relative Policy Optimization

Add code
Apr 15, 2026
Viaarxiv icon

MM-Doc-R1: Training Agents for Long Document Visual Question Answering through Multi-turn Reinforcement Learning

Add code
Apr 15, 2026
Viaarxiv icon