Picture for Yu Qiao

Yu Qiao

ShenZhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, SIAT Branch, Shenzhen Institute of Artificial Intelligence and Robotics for Society

Federated Hybrid Training and Self-Adversarial Distillation: Towards Robust Edge Networks

Add code
Dec 26, 2024
Figure 1 for Federated Hybrid Training and Self-Adversarial Distillation: Towards Robust Edge Networks
Figure 2 for Federated Hybrid Training and Self-Adversarial Distillation: Towards Robust Edge Networks
Figure 3 for Federated Hybrid Training and Self-Adversarial Distillation: Towards Robust Edge Networks
Figure 4 for Federated Hybrid Training and Self-Adversarial Distillation: Towards Robust Edge Networks
Viaarxiv icon

Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

Add code
Dec 26, 2024
Viaarxiv icon

HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding

Add code
Dec 20, 2024
Viaarxiv icon

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Add code
Dec 12, 2024
Figure 1 for InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Figure 2 for InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Figure 3 for InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Figure 4 for InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Viaarxiv icon

Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel

Add code
Dec 11, 2024
Figure 1 for Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
Figure 2 for Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
Figure 3 for Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
Figure 4 for Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
Viaarxiv icon

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models

Add code
Dec 10, 2024
Figure 1 for Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models
Figure 2 for Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models
Figure 3 for Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models
Figure 4 for Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models
Viaarxiv icon

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Add code
Dec 06, 2024
Figure 1 for Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
Figure 2 for Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
Figure 3 for Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
Figure 4 for Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
Viaarxiv icon

GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

Add code
Dec 01, 2024
Viaarxiv icon

SyncVIS: Synchronized Video Instance Segmentation

Add code
Dec 01, 2024
Figure 1 for SyncVIS: Synchronized Video Instance Segmentation
Figure 2 for SyncVIS: Synchronized Video Instance Segmentation
Figure 3 for SyncVIS: Synchronized Video Instance Segmentation
Figure 4 for SyncVIS: Synchronized Video Instance Segmentation
Viaarxiv icon

OASIS: Open Agent Social Interaction Simulations with One Million Agents

Add code
Nov 26, 2024
Figure 1 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Figure 2 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Figure 3 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Figure 4 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Viaarxiv icon