Picture for Fengying Xie

Fengying Xie

Global Context or Local Detail? Adaptive Visual Grounding for Hallucination Mitigation

Add code
Apr 27, 2026
Viaarxiv icon

V-tableR1: Process-Supervised Multimodal Table Reasoning with Critic-Guided Policy Optimization

Add code
Apr 22, 2026
Viaarxiv icon

Rectification Reimagined: A Unified Mamba Model for Image Correction and Rectangling with Prompts

Add code
Dec 21, 2025
Figure 1 for Rectification Reimagined: A Unified Mamba Model for Image Correction and Rectangling with Prompts
Figure 2 for Rectification Reimagined: A Unified Mamba Model for Image Correction and Rectangling with Prompts
Figure 3 for Rectification Reimagined: A Unified Mamba Model for Image Correction and Rectangling with Prompts
Figure 4 for Rectification Reimagined: A Unified Mamba Model for Image Correction and Rectangling with Prompts
Viaarxiv icon

Adapt CLIP as Aggregation Instructor for Image Dehazing

Add code
Aug 22, 2024
Figure 1 for Adapt CLIP as Aggregation Instructor for Image Dehazing
Figure 2 for Adapt CLIP as Aggregation Instructor for Image Dehazing
Figure 3 for Adapt CLIP as Aggregation Instructor for Image Dehazing
Figure 4 for Adapt CLIP as Aggregation Instructor for Image Dehazing
Viaarxiv icon

Pan-cancer Histopathology WSI Pre-training with Position-aware Masked Autoencoder

Add code
Jul 10, 2024
Figure 1 for Pan-cancer Histopathology WSI Pre-training with Position-aware Masked Autoencoder
Figure 2 for Pan-cancer Histopathology WSI Pre-training with Position-aware Masked Autoencoder
Figure 3 for Pan-cancer Histopathology WSI Pre-training with Position-aware Masked Autoencoder
Figure 4 for Pan-cancer Histopathology WSI Pre-training with Position-aware Masked Autoencoder
Viaarxiv icon

Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction

Add code
Jan 03, 2024
Figure 1 for Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction
Figure 2 for Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction
Figure 3 for Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction
Figure 4 for Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction
Viaarxiv icon

ECL: Class-Enhancement Contrastive Learning for Long-tailed Skin Lesion Classification

Add code
Jul 09, 2023
Figure 1 for ECL: Class-Enhancement Contrastive Learning for Long-tailed Skin Lesion Classification
Figure 2 for ECL: Class-Enhancement Contrastive Learning for Long-tailed Skin Lesion Classification
Figure 3 for ECL: Class-Enhancement Contrastive Learning for Long-tailed Skin Lesion Classification
Figure 4 for ECL: Class-Enhancement Contrastive Learning for Long-tailed Skin Lesion Classification
Viaarxiv icon

TFormer: A throughout fusion transformer for multi-modal skin lesion diagnosis

Add code
Nov 21, 2022
Figure 1 for TFormer: A throughout fusion transformer for multi-modal skin lesion diagnosis
Figure 2 for TFormer: A throughout fusion transformer for multi-modal skin lesion diagnosis
Figure 3 for TFormer: A throughout fusion transformer for multi-modal skin lesion diagnosis
Figure 4 for TFormer: A throughout fusion transformer for multi-modal skin lesion diagnosis
Viaarxiv icon

A Rotation Meanout Network with Invariance for Dermoscopy Image Classification and Retrieval

Add code
Aug 01, 2022
Figure 1 for A Rotation Meanout Network with Invariance for Dermoscopy Image Classification and Retrieval
Figure 2 for A Rotation Meanout Network with Invariance for Dermoscopy Image Classification and Retrieval
Figure 3 for A Rotation Meanout Network with Invariance for Dermoscopy Image Classification and Retrieval
Figure 4 for A Rotation Meanout Network with Invariance for Dermoscopy Image Classification and Retrieval
Viaarxiv icon

Kernel Attention Transformer (KAT) for Histopathology Whole Slide Image Classification

Add code
Jun 27, 2022
Figure 1 for Kernel Attention Transformer (KAT) for Histopathology Whole Slide Image Classification
Figure 2 for Kernel Attention Transformer (KAT) for Histopathology Whole Slide Image Classification
Figure 3 for Kernel Attention Transformer (KAT) for Histopathology Whole Slide Image Classification
Figure 4 for Kernel Attention Transformer (KAT) for Histopathology Whole Slide Image Classification
Viaarxiv icon