Picture for Rainer Stiefelhagen

Rainer Stiefelhagen

ChartFormer: A Large Vision Language Model for Converting Chart Images into Tactile Accessible SVGs

Add code
May 29, 2024
Viaarxiv icon

ACCSAMS: Automatic Conversion of Exam Documents to Accessible Learning Material for Blind and Visually Impaired

Add code
May 29, 2024
Figure 1 for ACCSAMS: Automatic Conversion of Exam Documents to Accessible Learning Material for Blind and Visually Impaired
Figure 2 for ACCSAMS: Automatic Conversion of Exam Documents to Accessible Learning Material for Blind and Visually Impaired
Viaarxiv icon

Alt4Blind: A User Interface to Simplify Charts Alt-Text Creation

Add code
May 29, 2024
Figure 1 for Alt4Blind: A User Interface to Simplify Charts Alt-Text Creation
Figure 2 for Alt4Blind: A User Interface to Simplify Charts Alt-Text Creation
Viaarxiv icon

AltChart: Enhancing VLM-based Chart Summarization Through Multi-Pretext Tasks

Add code
May 22, 2024
Viaarxiv icon

Rethinking Annotator Simulation: Realistic Evaluation of Whole-Body PET Lesion Interactive Segmentation Methods

Add code
Apr 02, 2024
Figure 1 for Rethinking Annotator Simulation: Realistic Evaluation of Whole-Body PET Lesion Interactive Segmentation Methods
Figure 2 for Rethinking Annotator Simulation: Realistic Evaluation of Whole-Body PET Lesion Interactive Segmentation Methods
Figure 3 for Rethinking Annotator Simulation: Realistic Evaluation of Whole-Body PET Lesion Interactive Segmentation Methods
Figure 4 for Rethinking Annotator Simulation: Realistic Evaluation of Whole-Body PET Lesion Interactive Segmentation Methods
Viaarxiv icon

RoDLA: Benchmarking the Robustness of Document Layout Analysis Models

Add code
Mar 21, 2024
Figure 1 for RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
Figure 2 for RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
Figure 3 for RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
Figure 4 for RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
Viaarxiv icon

Skeleton-Based Human Action Recognition with Noisy Labels

Add code
Mar 15, 2024
Figure 1 for Skeleton-Based Human Action Recognition with Noisy Labels
Figure 2 for Skeleton-Based Human Action Recognition with Noisy Labels
Figure 3 for Skeleton-Based Human Action Recognition with Noisy Labels
Figure 4 for Skeleton-Based Human Action Recognition with Noisy Labels
Viaarxiv icon

EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving

Add code
Feb 28, 2024
Figure 1 for EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving
Figure 2 for EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving
Figure 3 for EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving
Figure 4 for EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving
Viaarxiv icon

Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation

Add code
Jan 30, 2024
Figure 1 for Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation
Figure 2 for Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation
Figure 3 for Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation
Figure 4 for Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation
Viaarxiv icon

C-BEV: Contrastive Bird's Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation

Add code
Dec 13, 2023
Viaarxiv icon