Gaoyang Zhang

Agentar-Fin-OCR

Mar 11, 2026
Via arXiv

Efficient RGB-D Scene Understanding via Multi-task Adaptive Learning and Cross-dimensional Feature Guidance

Mar 08, 2026
Via arXiv

Real-Time Glottis Detection Framework via Spatial-decoupled Feature Learning for Nasal Transnasal Intubation

Mar 08, 2026
Via arXiv

ReusStdFlow: A Standardized Reusability Framework for Dynamic Workflow Construction in Agentic AI

Feb 16, 2026
Via arXiv

Fault2Flow: An AlphaEvolve-Optimized Human-in-the-Loop Multi-Agent System for Fault-to-Workflow Automation

Nov 17, 2025
Via arXiv

CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models

Dec 17, 2024
Via arXiv