Kai Wang

Refer to the report for detailed contributions

Slow-Fast Architecture for Video Multi-Modal Large Language Models

Apr 02, 2025

ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion

Mar 31, 2025

Free-Lunch Color-Texture Disentanglement for Stylized Image Generation

Mar 21, 2025

Safety Evaluation and Enhancement of DeepSeek Models in Chinese Contexts

Mar 18, 2025

AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction

Mar 17, 2025

MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification

Mar 16, 2025

PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models

Mar 16, 2025

ProbDiffFlow: An Efficient Learning-Free Framework for Probabilistic Single-Image Optical Flow Estimation

Mar 16, 2025

Integrating Chain-of-Thought and Retrieval Augmented Generation Enhances Rare Disease Diagnosis from Clinical Notes

Mar 15, 2025

Make Optimization Once and for All with Fine-grained Guidance

Mar 14, 2025