Picture for Weili Guan

Weili Guan

Object-Shot Enhanced Grounding Network for Egocentric Video

Add code
May 07, 2025
Viaarxiv icon

Handling Imbalanced Pseudolabels for Vision-Language Models with Concept Alignment and Confusion-Aware Calibrated Margin

Add code
May 04, 2025
Viaarxiv icon

Few-Shot Vision-Language Action-Incremental Policy Learning

Add code
Apr 22, 2025
Viaarxiv icon

BMRL: Bi-Modal Guided Multi-Perspective Representation Learning for Zero-Shot Deepfake Attribution

Add code
Apr 19, 2025
Viaarxiv icon

Curriculum Coarse-to-Fine Selection for High-IPC Dataset Distillation

Add code
Mar 24, 2025
Viaarxiv icon

MegaSR: Mining Customized Semantics and Expressive Guidance for Image Super-Resolution

Add code
Mar 11, 2025
Viaarxiv icon

Embodied Crowd Counting

Add code
Mar 11, 2025
Viaarxiv icon

PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models

Add code
Feb 18, 2025
Viaarxiv icon

FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers

Add code
Jan 27, 2025
Figure 1 for FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers
Figure 2 for FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers
Figure 3 for FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers
Figure 4 for FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers
Viaarxiv icon

Content-aware Balanced Spectrum Encoding in Masked Modeling for Time Series Classification

Add code
Dec 17, 2024
Viaarxiv icon