Alert button
Picture for Serena Yeung-Levy

Serena Yeung-Levy

Alert button

Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging

Add code
Bookmark button
Alert button
Mar 20, 2024
Juan Manuel Zambrano Chaves, Shih-Cheng Huang, Yanbo Xu, Hanwen Xu, Naoto Usuyama, Sheng Zhang, Fei Wang, Yujia Xie, Mahmoud Khademi, Ziyi Yang, Hany Awadalla, Julia Gong, Houdong Hu, Jianwei Yang, Chunyuan Li, Jianfeng Gao, Yu Gu, Cliff Wong, Mu Wei, Tristan Naumann, Muhao Chen, Matthew P. Lungren, Serena Yeung-Levy, Curtis P. Langlotz, Sheng Wang, Hoifung Poon

Figure 1 for Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging
Figure 2 for Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging
Figure 3 for Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging
Figure 4 for Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging
Viaarxiv icon

Depth-guided NeRF Training via Earth Mover's Distance

Add code
Bookmark button
Alert button
Mar 19, 2024
Anita Rau, Josiah Aklilu, F. Christopher Holsinger, Serena Yeung-Levy

Figure 1 for Depth-guided NeRF Training via Earth Mover's Distance
Figure 2 for Depth-guided NeRF Training via Earth Mover's Distance
Figure 3 for Depth-guided NeRF Training via Earth Mover's Distance
Figure 4 for Depth-guided NeRF Training via Earth Mover's Distance
Viaarxiv icon

Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models

Add code
Bookmark button
Alert button
Mar 19, 2024
Elaine Sui, Xiaohan Wang, Serena Yeung-Levy

Figure 1 for Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models
Figure 2 for Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models
Figure 3 for Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models
Figure 4 for Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models
Viaarxiv icon

VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Add code
Bookmark button
Alert button
Mar 15, 2024
Xiaohan Wang, Yuhui Zhang, Orr Zohar, Serena Yeung-Levy

Figure 1 for VideoAgent: Long-form Video Understanding with Large Language Model as Agent
Figure 2 for VideoAgent: Long-form Video Understanding with Large Language Model as Agent
Figure 3 for VideoAgent: Long-form Video Understanding with Large Language Model as Agent
Figure 4 for VideoAgent: Long-form Video Understanding with Large Language Model as Agent
Viaarxiv icon

Multi-Human Mesh Recovery with Transformers

Add code
Bookmark button
Alert button
Feb 26, 2024
Zeyu Wang, Zhenzhen Weng, Serena Yeung-Levy

Viaarxiv icon

Revisiting Active Learning in the Era of Vision Foundation Models

Add code
Bookmark button
Alert button
Jan 25, 2024
Sanket Rajan Gupte, Josiah Aklilu, Jeffrey J. Nirschl, Serena Yeung-Levy

Viaarxiv icon

Single-View 3D Human Digitalization with Large Reconstruction Models

Add code
Bookmark button
Alert button
Jan 22, 2024
Zhenzhen Weng, Jingyuan Liu, Hao Tan, Zhan Xu, Yang Zhou, Serena Yeung-Levy, Jimei Yang

Viaarxiv icon

Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data

Add code
Bookmark button
Alert button
Jan 16, 2024
Yuhui Zhang, Elaine Sui, Serena Yeung-Levy

Viaarxiv icon

Describing Differences in Image Sets with Natural Language

Add code
Bookmark button
Alert button
Dec 05, 2023
Lisa Dunlap, Yuhui Zhang, Xiaohan Wang, Ruiqi Zhong, Trevor Darrell, Jacob Steinhardt, Joseph E. Gonzalez, Serena Yeung-Levy

Viaarxiv icon