Picture for Weifeng Ge

Weifeng Ge

GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction

Add code
Jul 28, 2025
Viaarxiv icon

Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning

Add code
May 20, 2025
Viaarxiv icon

StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant

Add code
May 08, 2025
Viaarxiv icon

Enhancing Environmental Robustness in Few-shot Learning via Conditional Representation Learning

Add code
Feb 03, 2025
Viaarxiv icon

DeTrack: In-model Latent Denoising Learning for Visual Object Tracking

Add code
Jan 05, 2025
Figure 1 for DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
Figure 2 for DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
Figure 3 for DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
Figure 4 for DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
Viaarxiv icon

Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions

Add code
Nov 15, 2024
Figure 1 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 2 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 3 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 4 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Viaarxiv icon

Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models

Add code
Oct 04, 2024
Figure 1 for Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Figure 2 for Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Figure 3 for Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Figure 4 for Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Viaarxiv icon

Hierarchical Visual Categories Modeling: A Joint Representation Learning and Density Estimation Framework for Out-of-Distribution Detection

Add code
Aug 28, 2024
Figure 1 for Hierarchical Visual Categories Modeling: A Joint Representation Learning and Density Estimation Framework for Out-of-Distribution Detection
Figure 2 for Hierarchical Visual Categories Modeling: A Joint Representation Learning and Density Estimation Framework for Out-of-Distribution Detection
Figure 3 for Hierarchical Visual Categories Modeling: A Joint Representation Learning and Density Estimation Framework for Out-of-Distribution Detection
Figure 4 for Hierarchical Visual Categories Modeling: A Joint Representation Learning and Density Estimation Framework for Out-of-Distribution Detection
Viaarxiv icon

TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning

Add code
Aug 28, 2024
Figure 1 for TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Figure 2 for TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Figure 3 for TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Figure 4 for TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Viaarxiv icon

Reading Relevant Feature from Global Representation Memory for Visual Object Tracking

Add code
Feb 26, 2024
Viaarxiv icon