Picture for Yi Bin

Yi Bin

GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation

Add code
Oct 02, 2025
Figure 1 for GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
Figure 2 for GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
Figure 3 for GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
Figure 4 for GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
Viaarxiv icon

More Than One Teacher: Adaptive Multi-Guidance Policy Optimization for Diverse Exploration

Add code
Oct 02, 2025
Viaarxiv icon

Explore Briefly, Then Decide: Mitigating LLM Overthinking via Cumulative Entropy Regulation

Add code
Oct 02, 2025
Viaarxiv icon

Multimodal Mathematical Reasoning with Diverse Solving Perspective

Add code
Jul 03, 2025
Viaarxiv icon

SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs

Add code
Apr 17, 2025
Viaarxiv icon

Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation

Add code
Dec 10, 2024
Figure 1 for Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation
Figure 2 for Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation
Figure 3 for Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation
Figure 4 for Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation
Viaarxiv icon

Multi-Scale Contrastive Learning for Video Temporal Grounding

Add code
Dec 10, 2024
Figure 1 for Multi-Scale Contrastive Learning for Video Temporal Grounding
Figure 2 for Multi-Scale Contrastive Learning for Video Temporal Grounding
Figure 3 for Multi-Scale Contrastive Learning for Video Temporal Grounding
Figure 4 for Multi-Scale Contrastive Learning for Video Temporal Grounding
Viaarxiv icon

Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping

Add code
Oct 11, 2024
Figure 1 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Figure 2 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Figure 3 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Figure 4 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Viaarxiv icon

PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs

Add code
Oct 07, 2024
Figure 1 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 2 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 3 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 4 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Viaarxiv icon

MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models

Add code
Aug 08, 2024
Viaarxiv icon