Picture for Baocai Yin

Baocai Yin

TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment

Add code
Mar 24, 2026
Viaarxiv icon

TagaVLM: Topology-Aware Global Action Reasoning for Vision-Language Navigation

Add code
Mar 03, 2026
Viaarxiv icon

CompEvent: Complex-valued Event-RGB Fusion for Low-light Video Enhancement and Deblurring

Add code
Nov 18, 2025
Viaarxiv icon

Visual and Semantic Prompt Collaboration for Generalized Zero-Shot Learning

Add code
Mar 29, 2025
Viaarxiv icon

Col-OLHTR: A Novel Framework for Multimodal Online Handwritten Text Recognition

Add code
Feb 10, 2025
Figure 1 for Col-OLHTR: A Novel Framework for Multimodal Online Handwritten Text Recognition
Figure 2 for Col-OLHTR: A Novel Framework for Multimodal Online Handwritten Text Recognition
Figure 3 for Col-OLHTR: A Novel Framework for Multimodal Online Handwritten Text Recognition
Figure 4 for Col-OLHTR: A Novel Framework for Multimodal Online Handwritten Text Recognition
Viaarxiv icon

AdvAnchor: Enhancing Diffusion Model Unlearning with Adversarial Anchors

Add code
Dec 28, 2024
Viaarxiv icon

HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation

Add code
Dec 15, 2024
Figure 1 for HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation
Figure 2 for HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation
Figure 3 for HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation
Figure 4 for HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation
Viaarxiv icon

Adapter-Enhanced Semantic Prompting for Continual Learning

Add code
Dec 15, 2024
Figure 1 for Adapter-Enhanced Semantic Prompting for Continual Learning
Figure 2 for Adapter-Enhanced Semantic Prompting for Continual Learning
Figure 3 for Adapter-Enhanced Semantic Prompting for Continual Learning
Figure 4 for Adapter-Enhanced Semantic Prompting for Continual Learning
Viaarxiv icon

RFL: Simplifying Chemical Structure Recognition with Ring-Free Language

Add code
Dec 10, 2024
Figure 1 for RFL: Simplifying Chemical Structure Recognition with Ring-Free Language
Figure 2 for RFL: Simplifying Chemical Structure Recognition with Ring-Free Language
Figure 3 for RFL: Simplifying Chemical Structure Recognition with Ring-Free Language
Figure 4 for RFL: Simplifying Chemical Structure Recognition with Ring-Free Language
Viaarxiv icon

WTDUN: Wavelet Tree-Structured Sampling and Deep Unfolding Network for Image Compressed Sensing

Add code
Nov 25, 2024
Figure 1 for WTDUN: Wavelet Tree-Structured Sampling and Deep Unfolding Network for Image Compressed Sensing
Figure 2 for WTDUN: Wavelet Tree-Structured Sampling and Deep Unfolding Network for Image Compressed Sensing
Figure 3 for WTDUN: Wavelet Tree-Structured Sampling and Deep Unfolding Network for Image Compressed Sensing
Figure 4 for WTDUN: Wavelet Tree-Structured Sampling and Deep Unfolding Network for Image Compressed Sensing
Viaarxiv icon