Picture for Qing Liu

Qing Liu

Data61, CSIRO

Multi-domain Multi-modal Document Classification Benchmark with a Multi-level Taxonomy

Add code
May 11, 2026
Viaarxiv icon

FairEnc: A Fair Vision-Language Model with Fair Vision and Text Encoders for Glaucoma Detection

Add code
May 06, 2026
Viaarxiv icon

CC-OCR V2: Benchmarking Large Multimodal Models for Literacy in Real-world Document Processing

Add code
May 05, 2026
Viaarxiv icon

AlbumFill: Album-Guided Reasoning and Retrieval for Personalized Image Completion

Add code
May 04, 2026
Viaarxiv icon

Tri-Prompting: Video Diffusion with Unified Control over Scene, Subject, and Motion

Add code
Mar 16, 2026
Viaarxiv icon

Controllable Layered Image Generation for Real-World Editing

Add code
Jan 21, 2026
Viaarxiv icon

Taxon: Hierarchical Tax Code Prediction with Semantically Aligned LLM Expert Guidance

Add code
Jan 13, 2026
Viaarxiv icon

PrivGemo: Privacy-Preserving Dual-Tower Graph Retrieval for Empowering LLM Reasoning with Memory Augmentation

Add code
Jan 13, 2026
Viaarxiv icon

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Add code
Dec 19, 2025
Viaarxiv icon

Ensembling LLM-Induced Decision Trees for Explainable and Robust Error Detection

Add code
Dec 08, 2025
Viaarxiv icon