Alert button
Picture for Qin Jin

Qin Jin

Alert button

Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation

Add code
Bookmark button
Alert button
Jun 27, 2023
Zihao Yue, Anwen Hu, Liang Zhang, Qin Jin

Viaarxiv icon

Movie101: A New Movie Understanding Benchmark

Add code
Bookmark button
Alert button
May 20, 2023
Zihao Yue, Qi Zhang, Anwen Hu, Liang Zhang, Ziheng Wang, Qin Jin

Figure 1 for Movie101: A New Movie Understanding Benchmark
Figure 2 for Movie101: A New Movie Understanding Benchmark
Figure 3 for Movie101: A New Movie Understanding Benchmark
Figure 4 for Movie101: A New Movie Understanding Benchmark
Viaarxiv icon

Edit As You Wish: Video Description Editing with Multi-grained Commands

Add code
Bookmark button
Alert button
May 15, 2023
Linli Yao, Yuanmeng Zhang, Ziheng Wang, Xinglin Hou, Tiezheng Ge, Yuning Jiang, Qin Jin

Figure 1 for Edit As You Wish: Video Description Editing with Multi-grained Commands
Figure 2 for Edit As You Wish: Video Description Editing with Multi-grained Commands
Figure 3 for Edit As You Wish: Video Description Editing with Multi-grained Commands
Figure 4 for Edit As You Wish: Video Description Editing with Multi-grained Commands
Viaarxiv icon

InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation

Add code
Bookmark button
Alert button
May 10, 2023
Anwen Hu, Shizhe Chen, Liang Zhang, Qin Jin

Figure 1 for InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation
Figure 2 for InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation
Figure 3 for InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation
Figure 4 for InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation
Viaarxiv icon

Knowledge Enhanced Model for Live Video Comment Generation

Add code
Bookmark button
Alert button
Apr 28, 2023
Jieting Chen, Junkai Ding, Wenping Chen, Qin Jin

Figure 1 for Knowledge Enhanced Model for Live Video Comment Generation
Figure 2 for Knowledge Enhanced Model for Live Video Comment Generation
Figure 3 for Knowledge Enhanced Model for Live Video Comment Generation
Figure 4 for Knowledge Enhanced Model for Live Video Comment Generation
Viaarxiv icon

Rethinking Benchmarks for Cross-modal Image-text Retrieval

Add code
Bookmark button
Alert button
Apr 21, 2023
Weijing Chen, Linli Yao, Qin Jin

Figure 1 for Rethinking Benchmarks for Cross-modal Image-text Retrieval
Figure 2 for Rethinking Benchmarks for Cross-modal Image-text Retrieval
Figure 3 for Rethinking Benchmarks for Cross-modal Image-text Retrieval
Figure 4 for Rethinking Benchmarks for Cross-modal Image-text Retrieval
Viaarxiv icon

MPMQA: Multimodal Question Answering on Product Manuals

Add code
Bookmark button
Alert button
Apr 19, 2023
Liang Zhang, Anwen Hu, Jing Zhang, Shuo Hu, Qin Jin

Figure 1 for MPMQA: Multimodal Question Answering on Product Manuals
Figure 2 for MPMQA: Multimodal Question Answering on Product Manuals
Figure 3 for MPMQA: Multimodal Question Answering on Product Manuals
Figure 4 for MPMQA: Multimodal Question Answering on Product Manuals
Viaarxiv icon

PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution Predictor

Add code
Bookmark button
Alert button
Mar 15, 2023
Yuning Wu, Jiatong Shi, Tao Qian, Dongji Gao, Qin Jin

Figure 1 for PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution Predictor
Figure 2 for PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution Predictor
Figure 3 for PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution Predictor
Figure 4 for PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution Predictor
Viaarxiv icon

Accommodating Audio Modality in CLIP for Multimodal Processing

Add code
Bookmark button
Alert button
Mar 12, 2023
Ludan Ruan, Anwen Hu, Yuqing Song, Liang Zhang, Sipeng Zheng, Qin Jin

Figure 1 for Accommodating Audio Modality in CLIP for Multimodal Processing
Figure 2 for Accommodating Audio Modality in CLIP for Multimodal Processing
Figure 3 for Accommodating Audio Modality in CLIP for Multimodal Processing
Figure 4 for Accommodating Audio Modality in CLIP for Multimodal Processing
Viaarxiv icon