Picture for Zhen Zeng

Zhen Zeng

CrossCult-KIBench: A Benchmark for Cross-Cultural Knowledge Insertion in MLLMs

Add code
May 07, 2026
Viaarxiv icon

Scalable Secure Biometric Authentication without Auxiliary Identifiers

Add code
Apr 27, 2026
Viaarxiv icon

Continual Learning of Domain Knowledge from Human Feedback in Text-to-SQL

Add code
Nov 10, 2025
Viaarxiv icon

SlideAgent: Hierarchical Agentic Framework for Multi-Page Visual Document Understanding

Add code
Oct 30, 2025
Figure 1 for SlideAgent: Hierarchical Agentic Framework for Multi-Page Visual Document Understanding
Figure 2 for SlideAgent: Hierarchical Agentic Framework for Multi-Page Visual Document Understanding
Figure 3 for SlideAgent: Hierarchical Agentic Framework for Multi-Page Visual Document Understanding
Figure 4 for SlideAgent: Hierarchical Agentic Framework for Multi-Page Visual Document Understanding
Viaarxiv icon

ChartAgent: A Multimodal Agent for Visually Grounded Reasoning in Complex Chart Question Answering

Add code
Oct 06, 2025
Viaarxiv icon

TADACap: Time-series Adaptive Domain-Aware Captioning

Add code
Apr 15, 2025
Figure 1 for TADACap: Time-series Adaptive Domain-Aware Captioning
Figure 2 for TADACap: Time-series Adaptive Domain-Aware Captioning
Figure 3 for TADACap: Time-series Adaptive Domain-Aware Captioning
Figure 4 for TADACap: Time-series Adaptive Domain-Aware Captioning
Viaarxiv icon

On Creating a Causally Grounded Usable Rating Method for Assessing the Robustness of Foundation Models Supporting Time Series

Add code
Feb 17, 2025
Viaarxiv icon

LAW: Legal Agentic Workflows for Custody and Fund Services Contracts

Add code
Dec 15, 2024
Figure 1 for LAW: Legal Agentic Workflows for Custody and Fund Services Contracts
Figure 2 for LAW: Legal Agentic Workflows for Custody and Fund Services Contracts
Figure 3 for LAW: Legal Agentic Workflows for Custody and Fund Services Contracts
Figure 4 for LAW: Legal Agentic Workflows for Custody and Fund Services Contracts
Viaarxiv icon

AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations

Add code
Nov 20, 2024
Figure 1 for AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations
Figure 2 for AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations
Figure 3 for AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations
Figure 4 for AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations
Viaarxiv icon

Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models

Add code
Nov 19, 2024
Figure 1 for Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models
Figure 2 for Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models
Figure 3 for Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models
Figure 4 for Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models
Viaarxiv icon