Alert button
Picture for Shuo Zhang

Shuo Zhang

Alert button

Latent CLAP Loss for Better Foley Sound Synthesis

Mar 18, 2024
Tornike Karchkhadze, Hassan Salami Kavaki, Mohammad Rasool Izadi, Bryce Irvin, Mikolaj Kegler, Ari Hertz, Shuo Zhang, Marko Stamenovic

Viaarxiv icon

TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document

Mar 15, 2024
Yuliang Liu, Biao Yang, Qiang Liu, Zhang Li, Zhiyin Ma, Shuo Zhang, Xiang Bai

Figure 1 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Figure 2 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Figure 3 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Figure 4 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Viaarxiv icon

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

Feb 09, 2024
Huaiyuan Ying, Shuo Zhang, Linyang Li, Zhejian Zhou, Yunfan Shao, Zhaoye Fei, Yichuan Ma, Jiawei Hong, Kuikun Liu, Ziyi Wang, Yudong Wang, Zijian Wu, Shuaibin Li, Fengzhe Zhou, Hongwei Liu, Songyang Zhang, Wenwei Zhang, Hang Yan, Xipeng Qiu, Jiayu Wang, Kai Chen, Dahua Lin

Viaarxiv icon

Protein Language Model-Powered 3D Ligand Binding Site Prediction from Protein Sequence

Dec 05, 2023
Shuo Zhang, Lei Xie

Viaarxiv icon

CoLLiE: Collaborative Training of Large Language Models in an Efficient Way

Dec 01, 2023
Kai Lv, Shuo Zhang, Tianle Gu, Shuhao Xing, Jiawei Hong, Keyu Chen, Xiaoran Liu, Yuqing Yang, Honglin Guo, Tengxiao Liu, Yu Sun, Qipeng Guo, Hang Yan, Xipeng Qiu

Viaarxiv icon

Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Nov 24, 2023
Zhang Li, Biao Yang, Qiang Liu, Zhiyin Ma, Shuo Zhang, Jingxu Yang, Yabo Sun, Yuliang Liu, Xiang Bai

Figure 1 for Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Figure 2 for Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Figure 3 for Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Figure 4 for Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Viaarxiv icon

A Universal Framework for Accurate and Efficient Geometric Deep Learning of Molecular Systems

Nov 19, 2023
Shuo Zhang, Yang Liu, Lei Xie

Viaarxiv icon

Time-efficient combined morphologic and quantitative joint MRI based on clinical image contrasts -- An exploratory in-situ study of standardized cartilage defects

Nov 14, 2023
Teresa Lemainque, Nicola Pridöhl, Shuo Zhang, Marc Huppertz, Manuel Post, Can Yüksel, Masami Yoneyama, Andreas Prescher, Christiane Kuhl, Daniel Truhn, Sven Nebelung

Viaarxiv icon

TempTabQA: Temporal Question Answering for Semi-Structured Tables

Nov 14, 2023
Vivek Gupta, Pranshu Kandoi, Mahek Bhavesh Vora, Shuo Zhang, Yujie He, Ridho Reinanda, Vivek Srikumar

Viaarxiv icon