Picture for Kai Wang

Kai Wang

Refer to the report for detailed contributions

MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification

Add code
Mar 16, 2025
Figure 1 for MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
Figure 2 for MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
Figure 3 for MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
Figure 4 for MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
Viaarxiv icon

ProbDiffFlow: An Efficient Learning-Free Framework for Probabilistic Single-Image Optical Flow Estimation

Add code
Mar 16, 2025
Viaarxiv icon

PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models

Add code
Mar 16, 2025
Figure 1 for PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models
Figure 2 for PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models
Figure 3 for PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models
Figure 4 for PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models
Viaarxiv icon

Integrating Chain-of-Thought and Retrieval Augmented Generation Enhances Rare Disease Diagnosis from Clinical Notes

Add code
Mar 15, 2025
Viaarxiv icon

Make Optimization Once and for All with Fine-grained Guidance

Add code
Mar 14, 2025
Viaarxiv icon

Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models

Add code
Mar 12, 2025
Figure 1 for Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models
Figure 2 for Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models
Figure 3 for Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models
Figure 4 for Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models
Viaarxiv icon

X-Field: A Physically Grounded Representation for 3D X-ray Reconstruction

Add code
Mar 11, 2025
Viaarxiv icon

A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis

Add code
Mar 10, 2025
Viaarxiv icon

Optimizing for the Shortest Path in Denoising Diffusion Model

Add code
Mar 06, 2025
Viaarxiv icon

Learning Object Placement Programs for Indoor Scene Synthesis with Iterative Self Training

Add code
Mar 06, 2025
Figure 1 for Learning Object Placement Programs for Indoor Scene Synthesis with Iterative Self Training
Figure 2 for Learning Object Placement Programs for Indoor Scene Synthesis with Iterative Self Training
Figure 3 for Learning Object Placement Programs for Indoor Scene Synthesis with Iterative Self Training
Figure 4 for Learning Object Placement Programs for Indoor Scene Synthesis with Iterative Self Training
Viaarxiv icon