Kevin Zhang

MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception

Jun 22, 2024

Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

Apr 12, 2024

Z-Splat: Z-Axis Gaussian Splatting for Camera-Sonar Fusion

Apr 06, 2024

Self-Healing Effects in OAM Beams Observed on a 28 GHz Experimental Link

Feb 07, 2024

AONeuS: A Neural Rendering Framework for Acoustic-Optical Sensor Fusion

Feb 05, 2024

Cloud-Device Collaborative Learning for Multimodal Large Language Models

Dec 26, 2023

ConVRT: Consistent Video Restoration Through Turbulence with Test-time Optimization of Neural Video Representations

Dec 07, 2023

Towards Autonomous Crop Monitoring: Inserting Sensors in Cluttered Environments

Nov 07, 2023

A Scalable Training Strategy for Blind Multi-Distribution Noise Removal

Oct 30, 2023

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Oct 17, 2023