Picture for Chu-Song Chen

Chu-Song Chen

Document-Level Numerical Reasoning across Single and Multiple Tables in Financial Reports

Add code
Apr 04, 2026
Viaarxiv icon

Customized Visual Storytelling with Unified Multimodal LLMs

Add code
Mar 29, 2026
Viaarxiv icon

Relation-Rich Visual Document Generator for Visual Information Extraction

Add code
Apr 14, 2025
Viaarxiv icon

Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model

Add code
Dec 25, 2024
Viaarxiv icon

ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning

Add code
Oct 10, 2024
Figure 1 for ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Figure 2 for ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Figure 3 for ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Figure 4 for ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Viaarxiv icon

Information-Theoretical Principled Trade-off between Jailbreakability and Stealthiness on Vision Language Models

Add code
Oct 02, 2024
Figure 1 for Information-Theoretical Principled Trade-off between Jailbreakability and Stealthiness on Vision Language Models
Figure 2 for Information-Theoretical Principled Trade-off between Jailbreakability and Stealthiness on Vision Language Models
Figure 3 for Information-Theoretical Principled Trade-off between Jailbreakability and Stealthiness on Vision Language Models
Figure 4 for Information-Theoretical Principled Trade-off between Jailbreakability and Stealthiness on Vision Language Models
Viaarxiv icon

Defending Against Repetitive-based Backdoor Attacks on Semi-supervised Learning through Lens of Rate-Distortion-Perception Trade-off

Add code
Jul 14, 2024
Viaarxiv icon

RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images

Add code
May 14, 2024
Figure 1 for RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images
Figure 2 for RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images
Viaarxiv icon

LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models

Add code
Nov 28, 2023
Viaarxiv icon

D4AM: A General Denoising Framework for Downstream Acoustic Models

Add code
Nov 28, 2023
Viaarxiv icon