Picture for Zhiqiang Zhou

Zhiqiang Zhou

Gen-VCoT: Generative Visual Chain-of-Thought Reasoning via Diffusion-Based RGB Intermediate Representations

Add code
Jun 15, 2026
Viaarxiv icon

TriAdReview: Triangular Adversarial Review Architecture for Multi-Model Technical Document Generation

Add code
Jun 13, 2026
Viaarxiv icon

CLP: Collocation-Length Prediction for Zero-Loss Adaptive Multi-Token Inference

Add code
Jun 09, 2026
Viaarxiv icon

Feature Alignment Determines Fusion Strategy: A Comparative Study of Cross-Attention and Concatenation in Multimodal Learning

Add code
May 31, 2026
Viaarxiv icon

Each Prompt Matters: Scaling Reinforcement Learning Without Wasting Rollouts on Hundred-Billion-Scale MoE

Add code
Dec 08, 2025
Viaarxiv icon

Towards Reliable Evaluation of Large Language Models for Multilingual and Multimodal E-Commerce Applications

Add code
Oct 23, 2025
Viaarxiv icon

CoCAViT: Compact Vision Transformer with Robust Global Coordination

Add code
Aug 07, 2025
Viaarxiv icon

GraphSubDetector: Time Series Subsequence Anomaly Detection via Density-Aware Adaptive Graph Neural Network

Add code
Nov 26, 2024
Figure 1 for GraphSubDetector: Time Series Subsequence Anomaly Detection via Density-Aware Adaptive Graph Neural Network
Figure 2 for GraphSubDetector: Time Series Subsequence Anomaly Detection via Density-Aware Adaptive Graph Neural Network
Figure 3 for GraphSubDetector: Time Series Subsequence Anomaly Detection via Density-Aware Adaptive Graph Neural Network
Figure 4 for GraphSubDetector: Time Series Subsequence Anomaly Detection via Density-Aware Adaptive Graph Neural Network
Viaarxiv icon

Infrared and visible Image Fusion with Language-driven Loss in CLIP Embedding Space

Add code
Feb 26, 2024
Figure 1 for Infrared and visible Image Fusion with Language-driven Loss in CLIP Embedding Space
Figure 2 for Infrared and visible Image Fusion with Language-driven Loss in CLIP Embedding Space
Figure 3 for Infrared and visible Image Fusion with Language-driven Loss in CLIP Embedding Space
Figure 4 for Infrared and visible Image Fusion with Language-driven Loss in CLIP Embedding Space
Viaarxiv icon

Semantic Object-level Modeling for Robust Visual Camera Relocalization

Add code
Feb 10, 2024
Figure 1 for Semantic Object-level Modeling for Robust Visual Camera Relocalization
Figure 2 for Semantic Object-level Modeling for Robust Visual Camera Relocalization
Figure 3 for Semantic Object-level Modeling for Robust Visual Camera Relocalization
Figure 4 for Semantic Object-level Modeling for Robust Visual Camera Relocalization
Viaarxiv icon