Picture for Fei Yin

Fei Yin

Institute of Automation of Chinese Academy of Science, University of Chinese Academy of Sciences

BiNSGPS: Geometry Problem Solving via Bidirectional Neuro-Symbolic Interaction

Add code
Jun 03, 2026
Viaarxiv icon

IG-Diff: Complex Night Scene Restoration with Illumination-Guided Diffusion Model

Add code
May 14, 2026
Viaarxiv icon

MMCL-Bench: Multimodal Context Learning from Visual Rules, Procedures, and Evidence

Add code
May 12, 2026
Viaarxiv icon

Geoparsing: Diagram Parsing for Plane and Solid Geometry with a Unified Formal Language

Add code
Apr 13, 2026
Viaarxiv icon

PMPBench: A Paired Multi-Modal Pan-Cancer Benchmark for Medical Image Synthesis

Add code
Jan 22, 2026
Viaarxiv icon

Online Handwritten Signature Verification Based on Temporal-Spatial Graph Attention Transformer

Add code
Oct 22, 2025
Viaarxiv icon

FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image

Add code
Apr 21, 2025
Viaarxiv icon

DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning

Add code
Apr 05, 2025
Viaarxiv icon

MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts

Add code
Feb 28, 2025
Figure 1 for MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts
Figure 2 for MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts
Figure 3 for MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts
Figure 4 for MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts
Viaarxiv icon

Do computer vision foundation models learn the low-level characteristics of the human visual system?

Add code
Feb 27, 2025
Figure 1 for Do computer vision foundation models learn the low-level characteristics of the human visual system?
Figure 2 for Do computer vision foundation models learn the low-level characteristics of the human visual system?
Figure 3 for Do computer vision foundation models learn the low-level characteristics of the human visual system?
Figure 4 for Do computer vision foundation models learn the low-level characteristics of the human visual system?
Viaarxiv icon