Shikun Zhang

MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models

Oct 18, 2023
Dingyao Yu, Kaitao Song, Peiling Lu, Tianyu He, Xu Tan, Wei Ye, Shikun Zhang, Jiang Bian

BUS: Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization

Jul 17, 2023
Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Fei Huang, Songfang Huang

EmoGen: Eliminating Subjective Bias in Emotional Music Generation

Jul 03, 2023
Chenfei Kang, Peiling Lu, Botao Yu, Xu Tan, Wei Ye, Shikun Zhang, Jiang Bian

PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization

Jun 08, 2023
Yidong Wang, Zhuohao Yu, Zhengran Zeng, Linyi Yang, Cunxiang Wang, Hao Chen, Chaoya Jiang, Rui Xie, Jindong Wang, Xing Xie, Wei Ye, Shikun Zhang, Yue Zhang

Mode Approximation Makes Good Multimodal Prompts

May 23, 2023
Haixin Wang, Xinlong Yang, Jianlong Chang, Dian Jin, Jinan Sun, Shikun Zhang, Xiao Luo, Qi Tian

3D Registration with Maximal Cliques

May 18, 2023
Xiyu Zhang, Jiaqi Yang, Shikun Zhang, Yanning Zhang

GETMusic: Generating Any Music Tracks with a Unified Representation and Diffusion Framework

May 18, 2023
Ang Lv, Xu Tan, Peiling Lu, Wei Ye, Shikun Zhang, Jiang Bian, Rui Yan

Mode Approximation Makes Good Vision-Language Prompts

May 15, 2023
Haixin Wang, Xinlong Yang, Jianlong Chang, Dian Jin, Jinan Sun, Shikun Zhang, Xiao Luo, Qi Tian

Exploiting Pseudo Image Captions for Multimodal Summarization

May 09, 2023
Chaoya Jiang, Rui Xie, Wei Ye, Jinan Sun, Shikun Zhang

Vision Language Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation

May 09, 2023
Chaoya Jiang, Wei Ye, Haiyang Xu, Miang yan, Shikun Zhang, Jie Zhang, Fei Huang
