Picture for Kai-Wei Chang

Kai-Wei Chang

Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report Generation

Add code
Jul 02, 2024
Viaarxiv icon

The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention

Add code
Jun 29, 2024
Viaarxiv icon

MetaKP: On-Demand Keyphrase Generation

Add code
Jun 28, 2024
Viaarxiv icon

MACAROON: Training Vision-Language Models To Be Your Engaged Partners

Add code
Jun 20, 2024
Viaarxiv icon

LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning

Add code
Jun 20, 2024
Viaarxiv icon

Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation

Add code
Jun 19, 2024
Viaarxiv icon

VDebugger: Harnessing Execution Feedback for Debugging Visual Programs

Add code
Jun 19, 2024
Viaarxiv icon

SparseCL: Sparse Contrastive Learning for Contradiction Retrieval

Add code
Jun 15, 2024
Viaarxiv icon

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Add code
Jun 13, 2024
Figure 1 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 2 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 3 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 4 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Viaarxiv icon

VideoPhy: Evaluating Physical Commonsense for Video Generation

Add code
Jun 05, 2024
Viaarxiv icon