Wei Han

Fuxi-DA: A Generalized Deep Learning Data Assimilation Framework for Assimilating Satellite Observations

Apr 12, 2024
Xiaoze Xu, Xiuyu Sun, Wei Han, Xiaohui Zhong, Lei Chen, Hao Li

INSTRAUG: Automatic Instruction Augmentation for Multimodal Instruction Fine-tuning

Feb 22, 2024
Wei Han, Hui Chen, Soujanya Poria

Retrieval Augmented End-to-End Spoken Dialog Models

Feb 02, 2024
Mingqiu Wang, Izhak Shafran, Hagen Soltau, Wei Han, Yuan Cao, Dian Yu, Laurent El Shafey

Localization and Discrete Beamforming with a Large Reconfigurable Intelligent Surface

Dec 19, 2023
Baojia Luo, Yili Deng, Miaomiao Dong, Zhongyi Huang, Xiang Chen, Wei Han, Bo Bai

Extending Context Window of Large Language Models via Semantic Compression

Dec 15, 2023
Weizhi Fei, Xueyan Niu, Pingyi Zhou, Lu Hou, Bo Bai, Lei Deng, Wei Han

RoboVQA: Multimodal Long-Horizon Reasoning for Robotics

Nov 01, 2023
Pierre Sermanet, Tianli Ding, Jeffrey Zhao, Fei Xia, Debidatta Dwibedi, Keerthana Gopalakrishnan, Christine Chan, Gabriel Dulac-Arnold, Sharath Maddineni, Nikhil J Joshi, Pete Florence, Wei Han, Robert Baruch, Yao Lu, Suvir Mirchandani, Peng Xu, Pannag Sanketi, Karol Hausman, Izhak Shafran, Brian Ichter, Yuan Cao

SLM: Bridge the thin gap between speech and text foundation models

Sep 30, 2023
Mingqiu Wang, Wei Han, Izhak Shafran, Zelin Wu, Chung-Cheng Chiu, Yuan Cao, Yongqiang Wang, Nanxin Chen, Yu Zhang, Hagen Soltau, Paul Rubenstein, Lukas Zilka, Dian Yu, Zhong Meng, Golan Pundak, Nikhil Siddhartha, Johan Schalkwyk, Yonghui Wu

High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models

Sep 27, 2023
Selim F. Yilmaz, Xueyan Niu, Bo Bai, Wei Han, Lei Deng, Deniz Gunduz

Multimodal Modeling For Spoken Language Identification

Sep 19, 2023
Shikhar Bharadwaj, Min Ma, Shikhar Vashishth, Ankur Bapna, Sriram Ganapathy, Vera Axelrod, Siddharth Dalmia, Wei Han, Yu Zhang, Daan van Esch, Sandy Ritchie, Partha Talukdar, Jason Riesa

SAS Video-QA: Self-Adaptive Sampling for Efficient Video Question-Answering

Aug 01, 2023
Wei Han, Hui Chen, Min-Yen Kan, Soujanya Poria
