Picture for Kaihao Zhang

Kaihao Zhang

AstroRAG -- A Pagerank-Based Retrieval-Augmented Generation Pipeline for Question Answering in Astronomy

Add code
May 24, 2026
Viaarxiv icon

Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling

Add code
Feb 11, 2026
Viaarxiv icon

Fourier-RWKV: A Multi-State Perception Network for Efficient Image Dehazing

Add code
Dec 09, 2025
Viaarxiv icon

APD-Agents: A Large Language Model-Driven Multi-Agents Collaborative Framework for Automated Page Design

Add code
Nov 18, 2025
Figure 1 for APD-Agents: A Large Language Model-Driven Multi-Agents Collaborative Framework for Automated Page Design
Figure 2 for APD-Agents: A Large Language Model-Driven Multi-Agents Collaborative Framework for Automated Page Design
Figure 3 for APD-Agents: A Large Language Model-Driven Multi-Agents Collaborative Framework for Automated Page Design
Figure 4 for APD-Agents: A Large Language Model-Driven Multi-Agents Collaborative Framework for Automated Page Design
Viaarxiv icon

CorrMoE: Mixture of Experts with De-stylization Learning for Cross-Scene and Cross-Domain Correspondence Pruning

Add code
Jul 16, 2025
Viaarxiv icon

DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution

Add code
Jul 01, 2025
Figure 1 for DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution
Figure 2 for DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution
Figure 3 for DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution
Figure 4 for DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution
Viaarxiv icon

Visual and textual prompts for enhancing emotion recognition in video

Add code
Apr 24, 2025
Viaarxiv icon

MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion

Add code
Mar 13, 2025
Viaarxiv icon

PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models

Add code
Feb 18, 2025
Figure 1 for PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models
Figure 2 for PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models
Figure 3 for PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models
Figure 4 for PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models
Viaarxiv icon

MB-TaylorFormer V2: Improved Multi-branch Linear Transformer Expanded by Taylor Formula for Image Restoration

Add code
Jan 08, 2025
Viaarxiv icon