Picture for Kongming Liang

Kongming Liang

DriveRX: A Vision-Language Reasoning Model for Cross-Task Autonomous Driving

Add code
May 27, 2025
Viaarxiv icon

CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation

Add code
May 21, 2025
Viaarxiv icon

Harnessing Caption Detailness for Data-Efficient Text-to-Image Generation

Add code
May 21, 2025
Viaarxiv icon

ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer

Add code
Apr 03, 2025
Viaarxiv icon

FakeReasoning: Towards Generalizable Forgery Detection and Reasoning

Add code
Mar 27, 2025
Viaarxiv icon

OCORD: Open-Campus Object Removal Dataset

Add code
Jan 13, 2025
Viaarxiv icon

PGP-SAM: Prototype-Guided Prompt Learning for Efficient Few-Shot Medical Image Segmentation

Add code
Jan 12, 2025
Viaarxiv icon

From Simple to Professional: A Combinatorial Controllable Image Captioning Agent

Add code
Dec 15, 2024
Viaarxiv icon

Detailed Object Description with Controllable Dimensions

Add code
Nov 28, 2024
Viaarxiv icon

Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing

Add code
Oct 22, 2024
Figure 1 for Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing
Figure 2 for Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing
Figure 3 for Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing
Figure 4 for Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing
Viaarxiv icon