Picture for Weifeng Ge

Weifeng Ge

Adaptive Attention Distillation for Robust Few-Shot Segmentation under Environmental Perturbations

Add code
Jan 07, 2026
Viaarxiv icon

Audio-VLA: Adding Contact Audio Perception to Vision-Language-Action Model for Robotic Manipulation

Add code
Nov 13, 2025
Viaarxiv icon

GaussianMorphing: Mesh-Guided 3D Gaussians for Semantic-Aware Object Morphing

Add code
Oct 02, 2025
Viaarxiv icon

GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction

Add code
Jul 28, 2025
Viaarxiv icon

Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning

Add code
May 20, 2025
Viaarxiv icon

StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant

Add code
May 08, 2025
Viaarxiv icon

Enhancing Environmental Robustness in Few-shot Learning via Conditional Representation Learning

Add code
Feb 03, 2025
Figure 1 for Enhancing Environmental Robustness in Few-shot Learning via Conditional Representation Learning
Figure 2 for Enhancing Environmental Robustness in Few-shot Learning via Conditional Representation Learning
Figure 3 for Enhancing Environmental Robustness in Few-shot Learning via Conditional Representation Learning
Figure 4 for Enhancing Environmental Robustness in Few-shot Learning via Conditional Representation Learning
Viaarxiv icon

DeTrack: In-model Latent Denoising Learning for Visual Object Tracking

Add code
Jan 05, 2025
Figure 1 for DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
Figure 2 for DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
Figure 3 for DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
Figure 4 for DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
Viaarxiv icon

Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions

Add code
Nov 15, 2024
Figure 1 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 2 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 3 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 4 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Viaarxiv icon

Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models

Add code
Oct 04, 2024
Figure 1 for Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Figure 2 for Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Figure 3 for Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Figure 4 for Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Viaarxiv icon