Picture for Limin Wang

Limin Wang

CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding

Add code
Dec 16, 2024
Figure 1 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Figure 2 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Figure 3 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Figure 4 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Viaarxiv icon

Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel

Add code
Dec 11, 2024
Figure 1 for Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
Figure 2 for Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
Figure 3 for Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
Figure 4 for Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
Viaarxiv icon

p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay

Add code
Dec 05, 2024
Figure 1 for p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
Figure 2 for p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
Figure 3 for p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
Figure 4 for p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
Viaarxiv icon

SA-GNAS: Seed Architecture Expansion for Efficient Large-scale Graph Neural Architecture Search

Add code
Dec 03, 2024
Figure 1 for SA-GNAS: Seed Architecture Expansion for Efficient Large-scale Graph Neural Architecture Search
Figure 2 for SA-GNAS: Seed Architecture Expansion for Efficient Large-scale Graph Neural Architecture Search
Figure 3 for SA-GNAS: Seed Architecture Expansion for Efficient Large-scale Graph Neural Architecture Search
Figure 4 for SA-GNAS: Seed Architecture Expansion for Efficient Large-scale Graph Neural Architecture Search
Viaarxiv icon

Taming Scalable Visual Tokenizer for Autoregressive Image Generation

Add code
Dec 03, 2024
Figure 1 for Taming Scalable Visual Tokenizer for Autoregressive Image Generation
Figure 2 for Taming Scalable Visual Tokenizer for Autoregressive Image Generation
Figure 3 for Taming Scalable Visual Tokenizer for Autoregressive Image Generation
Figure 4 for Taming Scalable Visual Tokenizer for Autoregressive Image Generation
Viaarxiv icon

Graph-Enhanced EEG Foundation Model

Add code
Nov 29, 2024
Figure 1 for Graph-Enhanced EEG Foundation Model
Figure 2 for Graph-Enhanced EEG Foundation Model
Figure 3 for Graph-Enhanced EEG Foundation Model
Figure 4 for Graph-Enhanced EEG Foundation Model
Viaarxiv icon

Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning

Add code
Nov 21, 2024
Figure 1 for Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
Figure 2 for Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
Figure 3 for Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
Figure 4 for Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
Viaarxiv icon

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Add code
Nov 20, 2024
Figure 1 for VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models
Figure 2 for VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models
Figure 3 for VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models
Figure 4 for VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models
Viaarxiv icon

FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution

Add code
Oct 30, 2024
Figure 1 for FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
Figure 2 for FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
Figure 3 for FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
Figure 4 for FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
Viaarxiv icon

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Add code
Oct 25, 2024
Figure 1 for TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
Figure 2 for TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
Figure 3 for TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
Figure 4 for TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
Viaarxiv icon