Alert button
Picture for Jialu Li

Jialu Li

Alert button

SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data

Add code
Bookmark button
Alert button
Mar 11, 2024
Jialu Li, Jaemin Cho, Yi-Lin Sung, Jaehong Yoon, Mohit Bansal

Figure 1 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Figure 2 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Figure 3 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Figure 4 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Viaarxiv icon

Analysis of Self-Supervised Speech Models on Children's Speech and Infant Vocalizations

Add code
Bookmark button
Alert button
Feb 10, 2024
Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain

Viaarxiv icon

VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation

Add code
Bookmark button
Alert button
Feb 07, 2024
Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme, Mohit Bansal

Viaarxiv icon

Every Node is Different: Dynamically Fusing Self-Supervised Tasks for Attributed Graph Clustering

Add code
Bookmark button
Alert button
Jan 12, 2024
Pengfei Zhu, Qian Wang, Yu Wang, Jialu Li, Qinghua Hu

Viaarxiv icon

DCHT: Deep Complex Hybrid Transformer for Speech Enhancement

Add code
Bookmark button
Alert button
Oct 30, 2023
Jialu Li, Junhui Li, Pu Wang, Youshan Zhang

Figure 1 for DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
Figure 2 for DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
Figure 3 for DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
Figure 4 for DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
Viaarxiv icon

DPATD: Dual-Phase Audio Transformer for Denoising

Add code
Bookmark button
Alert button
Oct 30, 2023
Junhui Li, Pu Wang, Jialu Li, Xinzhe Wang, Youshan Zhang

Viaarxiv icon

Complex Image Generation SwinTransformer Network for Audio Denoising

Add code
Bookmark button
Alert button
Oct 24, 2023
Youshan Zhang, Jialu Li

Viaarxiv icon

Multimodal Large Language Model for Visual Navigation

Add code
Bookmark button
Alert button
Oct 12, 2023
Yao-Hung Hubert Tsai, Vansh Dhar, Jialu Li, Bowen Zhang, Jian Zhang

Figure 1 for Multimodal Large Language Model for Visual Navigation
Figure 2 for Multimodal Large Language Model for Visual Navigation
Figure 3 for Multimodal Large Language Model for Visual Navigation
Figure 4 for Multimodal Large Language Model for Visual Navigation
Viaarxiv icon

Enhancing Child Vocalization Classification in Multi-Channel Child-Adult Conversations Through Wav2vec2 Children ASR Features

Add code
Bookmark button
Alert button
Sep 13, 2023
Jialu Li, Mark Hasegawa-Johnson, Karrie Karahalios

Viaarxiv icon

Scaling Data Generation in Vision-and-Language Navigation

Add code
Bookmark button
Alert button
Aug 09, 2023
Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal, Stephen Gould, Hao Tan, Yu Qiao

Figure 1 for Scaling Data Generation in Vision-and-Language Navigation
Figure 2 for Scaling Data Generation in Vision-and-Language Navigation
Figure 3 for Scaling Data Generation in Vision-and-Language Navigation
Figure 4 for Scaling Data Generation in Vision-and-Language Navigation
Viaarxiv icon