Qibin Hou

Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment

Aug 12, 2025
Via arXiv

Depth Anything at Any Condition

Jul 02, 2025
Via arXiv

Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning

May 18, 2025
Via arXiv

DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation

Apr 07, 2025
Via arXiv

Re-Aligning Language to Visual Objects with an Agentic Workflow

Mar 30, 2025
Via arXiv

KAC: Kolmogorov-Arnold Classifier for Continual Learning

Mar 27, 2025
Via arXiv

AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction

Mar 17, 2025
Via arXiv

K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs

Feb 25, 2025
Via arXiv

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Feb 10, 2025
Via arXiv

LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding

Jan 09, 2025
Via arXiv