Weiming Hu

MMhops-R1: Multimodal Multi-hop Reasoning

Dec 16, 2025

Online Segment Any 3D Thing as Instance Tracking

Dec 08, 2025

Integer Binary-Range Alignment Neuron for Spiking Neural Networks

Jun 06, 2025

DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval

May 23, 2025

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment

May 22, 2025

PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments

Feb 24, 2025

PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC

Feb 21, 2025

mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA

Nov 22, 2024

SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning

Nov 15, 2024

HSTrack: Bootstrap End-to-End Multi-Camera 3D Multi-object Tracking with Hybrid Supervision

Nov 11, 2024