Picture for Qingjie Liu

Qingjie Liu

ContextQFormer: A New Context Modeling Method for Multi-Turn Multi-Modal Conversations

Add code
May 29, 2025
Viaarxiv icon

Towards Robust and Controllable Text-to-Motion via Masked Autoregressive Diffusion

Add code
May 16, 2025
Viaarxiv icon

GODBench: A Benchmark for Multimodal Large Language Models in Video Comment Art

Add code
May 16, 2025
Viaarxiv icon

SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding

Add code
Apr 30, 2025
Viaarxiv icon

SkeletonX: Data-Efficient Skeleton-based Action Recognition via Cross-sample Feature Aggregation

Add code
Apr 16, 2025
Viaarxiv icon

Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation

Add code
Apr 13, 2025
Viaarxiv icon

A Survey on Remote Sensing Foundation Models: From Vision to Multimodality

Add code
Mar 28, 2025
Viaarxiv icon

KwaiChat: A Large-Scale Video-Driven Multilingual Mixed-Type Dialogue Corpus

Add code
Mar 10, 2025
Viaarxiv icon

OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images

Add code
Mar 08, 2025
Viaarxiv icon

PACF: Prototype Augmented Compact Features for Improving Domain Adaptive Object Detection

Add code
Jan 15, 2025
Viaarxiv icon