Picture for Chen Sun

Chen Sun

Potential Based Diffusion Motion Planning

Add code
Jul 08, 2024
Viaarxiv icon

Text-Aware Diffusion for Policy Learning

Add code
Jul 02, 2024
Viaarxiv icon

Multi-Beam Integrated Sensing and Communication: State-of-the-Art, Challenges and Opportunities

Add code
May 31, 2024
Figure 1 for Multi-Beam Integrated Sensing and Communication: State-of-the-Art, Challenges and Opportunities
Figure 2 for Multi-Beam Integrated Sensing and Communication: State-of-the-Art, Challenges and Opportunities
Figure 3 for Multi-Beam Integrated Sensing and Communication: State-of-the-Art, Challenges and Opportunities
Figure 4 for Multi-Beam Integrated Sensing and Communication: State-of-the-Art, Challenges and Opportunities
Viaarxiv icon

Pre-trained Vision-Language Models Learn Discoverable Visual Concepts

Add code
Apr 19, 2024
Figure 1 for Pre-trained Vision-Language Models Learn Discoverable Visual Concepts
Figure 2 for Pre-trained Vision-Language Models Learn Discoverable Visual Concepts
Figure 3 for Pre-trained Vision-Language Models Learn Discoverable Visual Concepts
Figure 4 for Pre-trained Vision-Language Models Learn Discoverable Visual Concepts
Viaarxiv icon

Precoder Design for User-Centric Network Massive MIMO with Matrix Manifold Optimization

Add code
Apr 11, 2024
Figure 1 for Precoder Design for User-Centric Network Massive MIMO with Matrix Manifold Optimization
Figure 2 for Precoder Design for User-Centric Network Massive MIMO with Matrix Manifold Optimization
Figure 3 for Precoder Design for User-Centric Network Massive MIMO with Matrix Manifold Optimization
Figure 4 for Precoder Design for User-Centric Network Massive MIMO with Matrix Manifold Optimization
Viaarxiv icon

Self-Correcting Self-Consuming Loops for Generative Model Training

Add code
Feb 11, 2024
Viaarxiv icon

Pixel Aligned Language Models

Add code
Dec 14, 2023
Figure 1 for Pixel Aligned Language Models
Figure 2 for Pixel Aligned Language Models
Figure 3 for Pixel Aligned Language Models
Figure 4 for Pixel Aligned Language Models
Viaarxiv icon

Spacewalk-18: A Benchmark for Multimodal and Long-form Procedural Video Understanding in Novel Domains

Add code
Nov 30, 2023
Viaarxiv icon

Vamos: Versatile Action Models for Video Understanding

Add code
Nov 22, 2023
Figure 1 for Vamos: Versatile Action Models for Video Understanding
Figure 2 for Vamos: Versatile Action Models for Video Understanding
Figure 3 for Vamos: Versatile Action Models for Video Understanding
Figure 4 for Vamos: Versatile Action Models for Video Understanding
Viaarxiv icon

Towards Generic Anomaly Detection and Understanding: Large-scale Visual-linguistic Model (GPT-4V) Takes the Lead

Add code
Nov 16, 2023
Figure 1 for Towards Generic Anomaly Detection and Understanding: Large-scale Visual-linguistic Model (GPT-4V) Takes the Lead
Figure 2 for Towards Generic Anomaly Detection and Understanding: Large-scale Visual-linguistic Model (GPT-4V) Takes the Lead
Figure 3 for Towards Generic Anomaly Detection and Understanding: Large-scale Visual-linguistic Model (GPT-4V) Takes the Lead
Figure 4 for Towards Generic Anomaly Detection and Understanding: Large-scale Visual-linguistic Model (GPT-4V) Takes the Lead
Viaarxiv icon