Picture for Ning Shi

Ning Shi

Cross-Modal Consistency in Multimodal Large Language Models

Add code
Nov 14, 2024
Figure 1 for Cross-Modal Consistency in Multimodal Large Language Models
Figure 2 for Cross-Modal Consistency in Multimodal Large Language Models
Figure 3 for Cross-Modal Consistency in Multimodal Large Language Models
Figure 4 for Cross-Modal Consistency in Multimodal Large Language Models
Viaarxiv icon

MIO: A Foundation Model on Multimodal Tokens

Add code
Sep 26, 2024
Figure 1 for MIO: A Foundation Model on Multimodal Tokens
Figure 2 for MIO: A Foundation Model on Multimodal Tokens
Figure 3 for MIO: A Foundation Model on Multimodal Tokens
Figure 4 for MIO: A Foundation Model on Multimodal Tokens
Viaarxiv icon

Action Controlled Paraphrasing

Add code
May 18, 2024
Figure 1 for Action Controlled Paraphrasing
Figure 2 for Action Controlled Paraphrasing
Figure 3 for Action Controlled Paraphrasing
Figure 4 for Action Controlled Paraphrasing
Viaarxiv icon

Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation

Add code
Mar 12, 2024
Figure 1 for Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation
Figure 2 for Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation
Figure 3 for Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation
Figure 4 for Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation
Viaarxiv icon

Lost in Translation: When GPT-4V Can't See Eye to Eye with Text. A Vision-Language-Consistency Analysis of VLLMs and Beyond

Add code
Oct 19, 2023
Viaarxiv icon

UAlberta at SemEval-2023 Task 1: Context Augmentation and Translation for Multilingual Visual Word Sense Disambiguation

Add code
Jun 24, 2023
Figure 1 for UAlberta at SemEval-2023 Task 1: Context Augmentation and Translation for Multilingual Visual Word Sense Disambiguation
Figure 2 for UAlberta at SemEval-2023 Task 1: Context Augmentation and Translation for Multilingual Visual Word Sense Disambiguation
Figure 3 for UAlberta at SemEval-2023 Task 1: Context Augmentation and Translation for Multilingual Visual Word Sense Disambiguation
Figure 4 for UAlberta at SemEval-2023 Task 1: Context Augmentation and Translation for Multilingual Visual Word Sense Disambiguation
Viaarxiv icon

From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework

Add code
May 29, 2023
Figure 1 for From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework
Figure 2 for From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework
Figure 3 for From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework
Figure 4 for From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework
Viaarxiv icon

Don't Trust GPT When Your Question Is Not In English

Add code
May 24, 2023
Figure 1 for Don't Trust GPT When Your Question Is Not In English
Figure 2 for Don't Trust GPT When Your Question Is Not In English
Figure 3 for Don't Trust GPT When Your Question Is Not In English
Figure 4 for Don't Trust GPT When Your Question Is Not In English
Viaarxiv icon

Interactive Natural Language Processing

Add code
May 22, 2023
Figure 1 for Interactive Natural Language Processing
Figure 2 for Interactive Natural Language Processing
Figure 3 for Interactive Natural Language Processing
Figure 4 for Interactive Natural Language Processing
Viaarxiv icon

RoChBert: Towards Robust BERT Fine-tuning for Chinese

Add code
Oct 28, 2022
Figure 1 for RoChBert: Towards Robust BERT Fine-tuning for Chinese
Figure 2 for RoChBert: Towards Robust BERT Fine-tuning for Chinese
Figure 3 for RoChBert: Towards Robust BERT Fine-tuning for Chinese
Figure 4 for RoChBert: Towards Robust BERT Fine-tuning for Chinese
Viaarxiv icon